Archive for year 2011
Preprint of my WSDM paper, ‘Adding Semantics to Microblog Posts’ available now
0The camera-ready version of my WSDM paper, Adding Semantics to Microblog Posts (with Wouter Weerkamp and Maarten de Rijke) can be found here. I will release the associated data files (such as the manual annotations) soon, stay tuned.
Adaptive Temporal Query Modeling
0We present an approach to query modeling that uses the temporal distribution of documents in an initially retrieved set of documents. Such distributions tend to exhibit bursts, especially in news related document collections. We hypothesize that documents in those bursts are more likely to be relevant than others. Predicated on More >
A Framework for Unsupervised Spam Detection in Social Networking Sites
0Social networking sites offer users the option to submit user spam reports for a given message, indicating this message is inappropriate. In this paper we present a framework that uses these user spam reports for spam detection. The framework is based on the HITS web link analysis framework and More >
Hello world!
0Hello world! A new project, a new website. And this is the first post.
Wij-woorden op websites: Zoekmachines voor geesteswetenschappers
0Volgens velen in onze samenleving zijn we onszelf in het proces van integratie en multi-culti finaal voorbijgelopen. Sinds tien jaar is de toon van het debat in de media en op internet volslagen veranderd. De regering verkondigt dat de multiculturele samenleving is mislukt en dus wordt afgeschaft. Etnische achterstandsgroepen moeten More >
People searching for people: analysis of a people search engine log
0Recent years show an increasing interest in vertical search: searching within a particular type of information. Understanding what people search for in these “verticals” gives direction to research and provides pointers for the search engines themselves. In this paper we analyze the search logs of one particular vertical: people
Online Religious Studies
0Data transitions have revolutionized many scientific disciplines, starting with the exact sciences, then the life sciences, and now the social sciences and humanities are in the process of making the transition to becoming data intensive sciences, with descriptions through quantitative measurements. New analysis tools, and publicly accessible utterances, opinions, transactions More >
Classifying Queries Submitted to a Vertical Search Engine
0We propose and motivate a scheme for classifying queries submitted to a people search engine. We specify a number of features for automatically classifying people queries into the proposed classes and examine the effectiveness of these features. Our main finding is that classification is feasible and that using information from More >
Dir 2011: the eleventh Dutch-Belgian information retrieval workshop
0The 11th edition of the annual Dutch-Belgian Information Retrieval workshop (DIR 2011) took place on February 4 in Amsterdam. It was organized by the University of Amsterdam and the Centrum Wiskunde & Informatica. The focus of this year’s workshop was on interaction, with the goal of facilitating and increasing More >