ECIR 2012

ECIR preprints published

The camera-ready ver­sion of the ECIR papers, A Framework for Unsupervised Spam Detection in Social Networking Sites (with Maarten Bosma and Wouter Weerkamp) and Adaptive Temporal Query Modeling (with Hendrike Peetz, Wouter Weerkamp, and Maarten de Rijke) are available now. In the first paper, we report on the effectiveness of an unsupervised…
Twitter standing

A comparison of five semantic linking algorithms on tweets

Late last December, Yahoo! released a new version of their Content Analysis service and they announced that the initial version will be deprecated in 2012. Inspired by a recent post by Tony Hirst, entitled A Quick Peek at Three Content Analysis Services, this seemed like a perfect opportunity to test…
Research on Twitter

Dataset for “Adding Semantics to Microblog Posts”

As promised, I’m releasing the dataset used for my WSDM paper, Adding Semantics to Microblog Posts (with Wouter Weerkamp and Maarten de Rijke). In the paper, we evaluate various methods for automatically identifying concepts (in the form of Wikipedia articles) that are contained in or meant by a tweet. This…