Publications
A list of my scientific publications. You can use the menu above to filter on a specific type or use the term cloud to look for publications on a certain topic. Alternatively, you can use the search box if you know what you’re looking for. A more succinct version of this list can be found on the ILPS website.
Zoekmachines van de toekomst (Search engines of the future)
There is some debate about what the logical successor will be to Web 2.0, which centered on user-generated content, information sharing, and interoperability. Although several ideas are circulating, there is broad support for equating Web 3.0 with the semantic web. The guiding idea ...
The University of Amsterdam at the TREC 2011 Session Track
We describe the participation of the University of Amsterdam’s ILPS group in the Session track at TREC 2011.
The stream of interactions created by a user engaging with a search system contains a wealth of information. For retrieval purposes, previous interactions can help inform us about a user’s current ...
Team COMMIT at TREC 2011
We describe the participation of Team COMMIT in this year’s Microblog and Entity tracks.
In our participation in the Microblog track, we used a feature-based approach. Specifically, we pursued a precision-oriented, recency-aware retrieval approach for tweets. Among other things, we used various types of external data. In particular, we ...
DutchHatTrick: Semantic query modeling, ConText, section detection, and match score maximization.
This report discusses the collaborative work of ErasmusMC, the University of Twente, and the University of Amsterdam on the TREC 2011 Medical track. Here, the task is to retrieve patient visits from the University of Pittsburgh NLP Repository for 35 topics. The repository consists of 101,711 patient reports, ...
Adaptive Temporal Query Modeling
To appear
We present an approach to query modeling that uses the temporal distribution of documents in an initially retrieved set of documents. Such distributions tend to exhibit bursts, especially in news-related document collections. We hypothesize that documents in those bursts are more likely to be relevant than others. ...
A Framework for Unsupervised Spam Detection in Social Networking Sites
To appear
Social networking sites offer users the option to submit spam reports for a given message, indicating that the message is inappropriate. In this paper we present a framework that uses these user spam reports for spam detection. The framework is based on the HITS web link analysis ...
Adding Semantics to Microblog Posts
To appear
Microblogs have become an important source of information for marketing, intelligence, and reputation management purposes. Streams of microblogs are of great value because of their direct and real-time nature. Determining what an individual microblog post is about, however, can be non-trivial because of creative language usage, the highly ...
Wij-woorden op websites: Zoekmachines voor geesteswetenschappers (We-words on websites: Search engines for humanities scholars)
According to many in our society, we have badly overreached ourselves in the process of integration and multiculturalism. Over the past ten years, the tone of the debate in the media and on the internet has changed completely. The government proclaims that the multicultural society has failed and is therefore being abolished. Disadvantaged ethnic groups must ...
People searching for people: analysis of a people search engine log
Recent years show an increasing interest in vertical search: searching within a particular type of information. Understanding what people search for in these “verticals” gives direction to research and provides pointers for the search engines themselves. In this paper we analyze the search logs of one particular vertical: people