Fast and Space-Efficient Entity Linking in Queries


Entity linking deals with identifying entities from a knowledge base in a given piece of text and has become a fundamental building block for web search engines, enabling numerous downstream improvements from better document ranking to enhanced search results pages. A key problem in the context of web search queries

Time-Aware Rank Aggregation for Microblog Search


We tackle the problem of searching microblog posts and frame it as a rank aggregation problem where we merge result lists generated by separate rankers so as to produce a final ranking to be returned to the user. We propose a rank aggregation method, TimeRA, that is able to infer the


WSDM 2014, a recap


WSDM is wrapping up today, with only the workshops left for tomorrow. All in all, it was an exciting WSDM with lots of interesting talks and discussions. And of course Times Square cannot be beaten as a conference venue. Some papers/talks that caught my (semantic search) eye at WSDM, in no particular

Linking queries to entities


I'm happy to announce we're releasing a new test collection for entity linking for web queries (within user sessions) to Wikipedia. About half of the queries in this dataset are sampled from Yahoo search logs, the other half comes from the TREC Session track. Check out the L24 dataset

Entity Linking and Retrieval for Semantic Search (WSDM 2014)


This morning, we presented the last edition of our tutorial series on Entity Linking and Retrieval, entitled "Entity Linking and Retrieval for Semantic Search" (with Krisztian Balog and Daan Odijk) at WSDM 2014! This final edition of the series builds upon our earlier tutorials at WWW 2013 and SIGIR 2013. The focus of this

RepLab 2014


RepLab is a competitive evaluation exercise for Online Reputation Management systems. In 2012 and 2013, RepLab focused on the problem of monitoring the reputation of (company) entities on Twitter, and dealt with the tasks of entity linking ("Is the tweet about the entity?"), reputation polarity ("Does the tweet have positive

Using Temporal Bursts for Query Modeling


In this paper, we present an approach to query modeling that leverages the temporal distribution of documents in an initially retrieved set of documents.

We’re now hiring next year’s interns!


I'm happy to announce that we have just opened up our applications for next year's internships at Yahoo Labs in Barcelona. So, if you're a PhD student in a related field, do consider applying. Especially if you're interested in spending some time in sunny Barcelona and gaining research experience along


Entity Linking and Retrieval Tutorial @ SIGIR 2013 – Slides, Code, and Bibliography


The material for our “Entity Linking and Retrieval” tutorial (with Krisztian Balog and Daan Odijk) for SIGIR 2013 has been updated and is available online on GitHub (slides), Dropbox (slides), Mendeley, and Codecademy. All material is summarized on the webpage for the tutorial. See my other blog post for a brief summary.

Multilingual Semantic Linking for Video Streams: Making “Ideas Worth Sharing” More Accessible


This paper describes our (winning!) submission to the Developers Challenge at WoLE2013, "Doing Good by Linking Entities." We present a fully automatic system – called "Semantic TED" – which provides intelligent suggestions in the form of links to Wikipedia articles for video streams in multiple

