Category: Publications
A list of my scientific publications. You can use the menu above to filter on a specific type or use the term cloud to look for publications on a certain topic. Alternatively, you can use the search box if you know what you’re looking for. Another version of this list can be found on Google Scholar.
Feeding the Second Screen: Semantic Linking based on Subtitles
Television is changing. Increasingly, broadcasts are consumed interactively. This allows broadcasters to provide consumers with additional background information that they may bookmark for later consumption.
Een bril die alles weet
Dit is het artikel uit het NRC van 9 februari 2013, getiteld “Een bril die alles weet.” De fantasie van Star Trek en The Matrix komt dichterbij. Computers gaan u steeds beter begrijpen. Straks kunt u gewoon met zoekmachines praten.
Personalized Time-Aware Tweets Summarization
To appear as full paper at SIGIR 2013. In this paper we focus on selecting meaningful tweets given a user’s interests. Specifically, we consider the task of time-aware tweets summarization, based on a user’s history and collaborative social influences from “social circles.”
The University of Amsterdam at TREC 2012
This year the Information and Language Processing Systems (ILPS) group of the University of Amsterdam participated in the Microblog and the Knowledge Base Acceleration (KBA) tracks.
Overview of RepLab 2012: Evaluating Online Reputation Management Systems
This paper summarizes the goals, organization and results of the first RepLab competitive evaluation campaign for Online Reputation Management Systems (RepLab 2012). RepLab focused on the reputation of companies, and asked participant systems to annotate different types of information on tweets containing the names of several companies. Two tasks were proposed: a proling task, where…
Generating Pseudo Test Collections for Learning to Rank Scientific Articles
Pseudo test collections are automatically generated to provide training material for learning to rank methods. We propose a method for generating pseudo test collections in the domain of digital libraries, where data is relatively sparse, but comes with rich annotations. Our intuition is that documents are annotated to make them…
OpenGeist: Insight in the Stream of Page Views on Wikipedia
We present a RESTful interface that captures insights into the zeitgeist of Wikipedia users. In recent years many so-called zeitgeist applications have been launched. Such applications are used to gain insights into the current gist of society and actual affairs. Several news sources run zeitgeist applications for popular and trending news.…
Identifying Entity Aspects in Microblog Posts
Online reputation management is about monitoring and handling the public image of entities (such as companies) on the Web. An important task in this area is identifying aspects of the entity of interest (such as products, services, competitors, key people, etc.) given a stream of microblog posts referring to the…