
Learning Semantic Query Suggestions

An important application of semantic web technology is recognizing human-defined concepts in text. Query transformation is a strategy often used in search engines to derive queries that return more useful search results than the original query, and most popular search engines provide facilities that let users complete, specify, or reformulate their queries. We study the problem of semantic query suggestion, a special type of query transformation based on identifying semantic concepts contained in user queries. We use a feature-based approach in conjunction with supervised machine learning, augmenting term-based features with search history-based and concept-specific features. We apply our method to the task of linking queries from real-world query logs (the transaction logs of the Netherlands Institute for Sound and Vision) to the DBpedia knowledge base. We evaluate the utility of different machine learning algorithms, features, and feature types in identifying semantic concepts using a manually developed test bed, and show significant improvements over an already high baseline. The resources developed for this paper, i.e., queries, human assessments, and extracted features, are available for download.

  • [PDF] E. Meij, M. Bron, B. Huurnink, L. Hollink, and M. de Rijke, “Learning semantic query suggestions,” in Proceedings of the 8th international conference on the semantic web, 2009.
    [Bibtex]
    @inproceedings{ISWC:2009:Meij,
Author = {Meij, Edgar and Bron, Marc and Huurnink, Bouke and Hollink, Laura and de Rijke, Maarten},
    Booktitle = {Proceedings of the 8th International Conference on The Semantic Web},
    Series = {ISWC 2009},
    Title = {Learning Semantic Query Suggestions},
    Year = {2009}}

Investigating the Semantic Gap through Query Log Analysis

In the past years, significant efforts have focused on bringing large amounts of metadata online, and the success of these efforts can be seen in the impressive number of web sites exposing data in RDFa or RDF/XML. However, little is known about the extent to which this data fits the needs of ordinary web users with everyday information needs. In this paper we study what we perceive as the semantic gap between the supply of data on the Semantic Web and the needs of web users as expressed in the queries submitted to a major Web search engine. We perform our analysis on both the level of instances and the level of ontologies. First, we look at how much data is actually relevant to Web queries and what kind of data it is. Second, we provide a generic method to extract the attributes that Web users are searching for regarding particular classes of entities. This method allows us to contrast class definitions found in Semantic Web vocabularies with the attributes of objects that users are interested in. Our findings are crucial to measuring the potential of semantic search, but also speak to the state of the Semantic Web in general.

  • [PDF] P. Mika, E. Meij, and H. Zaragoza, “Investigating the semantic gap through query log analysis,” in Proceedings of the 8th international semantic web conference, 2009.
    [Bibtex]
    @inproceedings{ISWC:2009:mika,
    Author = {Peter Mika and Edgar Meij and Hugo Zaragoza},
    Booktitle = {Proceedings of the 8th International Semantic Web Conference},
    Series = {ISWC 2009},
Title = {Investigating the Semantic Gap through Query Log Analysis},
    Year = {2009},
    Bdsk-Url-1 = {http://dblp.uni-trier.de/db/conf/semweb/iswc2009.html#MikaMZ09}}

An evaluation of entity and frequency based query completion methods

From the days of Boolean search on library catalogues, users have reformulated their queries after an inspection of initial search results. Traditional information retrieval studies this in frameworks such as query expansion, relevance feedback, interactive retrieval, etc. These methods mostly exploit document contents because that is typically all the information that is available. The situation is very different in web search engines because of the large numbers of users whose queries are collected in query logs. Query logs reflect how large numbers of users express their queries and can be a rich source of information when optimizing search results or determining query suggestions.

In this paper we study a special case of query suggestion: query completion, which aims to help users complete their queries. In particular, we are interested in comparing a commonly adopted frequency-based approach with methods that exploit an understanding of the type of entities in queries. Our intuition is that completion for rare queries can be improved by understanding the type of entity being sought. For example, if we know that “LX354” is a kind of digital camera, we can generate sensible completions by choosing them from the set of completions used with other digital cameras. Besides suggesting queries, the obtained completions can also function as facets for faceted browsing or as input for ontology engineering since they represent query refinements common to a class of entities. In this paper, we address the following questions: (i) How can we recognize entities and their types in queries? (ii) How can we rank possible completions given an entity type? (iii) How can our methods be evaluated and how do they perform? To address (iii), we propose a novel method which evaluates the prediction of real web queries. We show that a purely frequency-based approach without any entity type information works quite well for more frequent queries, but is surpassed by type-based methods for rare queries.
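
As a rough illustration of the contrast studied here, the frequency-based and type-based completion strategies can be sketched as follows. The query log, the entities, and the type lexicon below are invented, and the simple back-off rule is a simplification of the methods compared in the paper, not the paper's actual algorithm.

```python
from collections import Counter, defaultdict

# Toy query log of (entity, completion) pairs; invented for illustration.
log = [
    ("lx354", "review"), ("lx354", "price"),
    ("powershot a40", "review"), ("powershot a40", "manual"),
    ("ixus 60", "review"),
    ("madonna", "tickets"), ("madonna", "lyrics"),
]

# Hypothetical entity-type lexicon mapping each entity to its type.
types = {"lx354": "camera", "powershot a40": "camera",
         "ixus 60": "camera", "madonna": "artist"}

# Frequency-based model: completions observed with the exact entity.
per_entity = defaultdict(Counter)
# Type-based model: completions pooled over all entities of the same type.
per_type = defaultdict(Counter)
for entity, completion in log:
    per_entity[entity][completion] += 1
    per_type[types[entity]][completion] += 1

def suggest(entity, k=3):
    model = per_entity[entity]
    if sum(model.values()) < 2:  # rare entity: back off to its type
        model = per_type[types[entity]]
    return [c for c, _ in model.most_common(k)]
```

For the frequent entity "lx354" the frequency-based model suffices, while the rare "ixus 60" borrows completions ("price", "manual") observed with other cameras.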

  • [PDF] E. Meij, P. Mika, and H. Zaragoza, “An evaluation of entity and frequency based query completion methods,” in Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, 2009.
    [Bibtex]
    @inproceedings{SIGIR:2009:meij,
    Author = {Meij, Edgar and Mika, Peter and Zaragoza, Hugo},
    Booktitle = {Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval},
    Series = {SIGIR 2009},
    Title = {An evaluation of entity and frequency based query completion methods},
    Year = {2009},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/1571941.1572074}}

A Generative Language Modeling Approach for Ranking Entities

We describe our participation in the INEX 2008 Entity Ranking track. We develop a generative language modeling approach for the entity ranking and list completion tasks. Our framework comprises the following components: (i) entity and (ii) query language models, (iii) entity prior, (iv) the probability of an entity for a given category, and (v) the probability of an entity given another entity. We explore various ways of estimating these components, and report on our results. We find that improving the estimation of these components has very positive effects on performance, yet, there is room for further improvements.
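
A minimal sketch of the generative idea, keeping only components (i)-(iii): an entity is scored by its prior times the smoothed likelihood of the query under its language model. The entity models, prior values, and smoothing weight below are invented, and the category and entity-entity components of the full framework are omitted.

```python
import math

def score(query_terms, entity_lm, prior, bg_lm, mu=0.5):
    # log p(e) + sum_t log p(t | theta_e), with linear smoothing against
    # a background model so unseen query terms do not zero out the score.
    s = math.log(prior)
    for t in query_terms:
        p = (1 - mu) * entity_lm.get(t, 0.0) + mu * bg_lm.get(t, 1e-9)
        s += math.log(p)
    return s

# Two toy entity language models and a background model.
miles = {"jazz": 0.5, "trumpet": 0.5}
pele = {"football": 0.6, "brazil": 0.4}
bg = {"jazz": 0.01, "trumpet": 0.01, "football": 0.02, "brazil": 0.01}
```

Under this sketch, an entity whose language model actually generates the query terms outranks one that relies on the background model alone.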

  • [PDF] W. Weerkamp, K. Balog, and E. Meij, “A generative language modeling approach for ranking entities,” in Advances in focused retrieval, 2009.
    [Bibtex]
    @inproceedings{INEX:2008:weerkamp,
    Author = {Weerkamp, W. and Balog, K. and Meij, E.},
    Booktitle = {Advances in Focused Retrieval},
    Publisher = {Springer},
    Title = {A Generative Language Modeling Approach for Ranking Entities},
    Year = {2009}}

Concept models for domain-specific search

We describe our participation in the 2008 CLEF Domain-specific track. We evaluate blind relevance feedback models and concept models on the CLEF domain-specific test collection. Applying relevance modeling techniques is found to have a positive effect on the 2008 topic set, in terms of mean average precision and precision@10. Applying concept models for blind relevance feedback results in even bigger improvements over a query-likelihood baseline, in terms of mean average precision and early precision.

  • [PDF] E. Meij and M. de Rijke, “Concept models for domain-specific search,” in Evaluating systems for multilingual and multimodal information access, 9th workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, revised selected papers, 2009.
    [Bibtex]
    @inproceedings{CLEF:2008:meij,
    Author = {Meij, Edgar and de Rijke, Maarten},
    Booktitle = {Evaluating Systems for Multilingual and Multimodal Information Access, 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, Revised Selected Papers},
    Title = {Concept models for domain-specific search},
    Year = {2009}}

Towards a combined model for search and navigation of annotated documents

Documents whose textual content is complemented with annotations of one kind or another are ubiquitous. Examples include biomedical documents (annotated with MeSH terms) and news articles (annotated with IPTC terms). Such annotations—or concepts—have typically been used for query expansion, to suggest alternative or related query formulations, and to facilitate browsing of the document collection. In recent years, we have seen two important developments in this area: (i) a renewed interest in the knowledge sources underlying the annotations, mainly inspired by semantic web initiatives and (ii) the creation of social annotations, as part of web 2.0 developments. These developments motivate a renewed interest in models and methods for accessing annotated documents.

The theme of my proposed research is to capture two aspects in a single, unified model: retrieval and navigation. Given a query, this entails using both term-based and concept-based evidence to locate relevant information (retrieval) and suggesting useful browsing suggestions (navigation). I imagine this to be a “two-way” process, i.e., the user can browse the document collection using concepts and the relations between concepts, but she can also navigate the knowledge structure using the (vocabulary) terms from the documents. Such information seeking behavior is witnessed in an increasing number of applications and domains (e.g., suggesting related tags in Bibsonomy or Flickr), providing a solid motivation for my research agenda. In order to accomplish this unification, I will first need to address three separate, but intertwined issues. First, a way of “bridging the gap” between concepts and (vocabulary) terms is needed, since concepts are not directly observable. Second, relations between concepts need to be modeled in some way. Finally, the concepts and relations thus modeled should be integrated in the information seeking process, thereby improving both retrieval and navigation.

So far, I have formulated concept modeling as a form of text classification, by representing concepts as distributions over vocabulary terms. In the context of a digital library setting, I have shown that integrating conceptual knowledge in this way benefits both retrieval performance and navigation. More recently, I have taken these experiments a step further by creating parsimonious concept models. In these experiments, integrating concepts in the query model estimation delivers significantly better results, compared both to a query-likelihood run and to a run based on relevance models.

To determine the strength of relations between concepts, I have looked at the divergence between concept models. The estimates are based on differences in language use, as measured by the cross-entropy reduction between concept models. Experimental results show that this approach outperforms both path-based and information content-based methods on two separate test sets. While this approach measures the similarity between concepts, it does not explicitly take the relation type into consideration. Thus, any explicit link structure present in the underlying knowledge structure disappears. Whether this is a reasonable assumption for my work is still unclear and something I intend to find an answer to.

In future work, I would also like to address the question of how the retrieval-oriented models I have introduced so far may be used to further aid navigation. To some extent, I have already used the TREC Genomics test collections for the evaluation of navigational effectiveness, but future work—possibly observing users directly in a user study or indirectly through log analysis—should indicate what the model’s impact, if any, is on navigational effectiveness.

  • [PDF] E. Meij, “Towards a combined model for search and navigation of annotated documents,” in Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008.
    [Bibtex]
    @inproceedings{SIGIR:2008:meij-doctcons,
    Author = {Meij, Edgar},
    Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval},
    Series = {SIGIR 2008},
    Title = {Towards a combined model for search and navigation of annotated documents},
    Year = {2008},
    Bdsk-Url-1 = {http://dx.doi.org/10.1145/1390334.1390573}}

Measuring Concept Relatedness Using Language Models

Over the years, the notion of concept relatedness has attracted considerable attention. A variety of approaches, based on ontology structure, information content, association, or context, have been proposed to indicate the relatedness of abstract ideas. In this paper we present a novel context-based measure of concept relatedness: the cross-entropy reduction between language models of concepts, which are estimated from document-concept assignments. After introducing our method, we compare it to earlier methods by evaluating the results against relatedness judgments provided by human assessors. The approach shows improved or competitive results compared to state-of-the-art methods on two test sets in the biomedical domain.

  • [PDF] D. Trieschnigg, E. Meij, M. de Rijke, and W. Kraaij, “Measuring concept relatedness using language models,” in Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008.
    [Bibtex]
    @inproceedings{SIGIR:2008:trieschnigg,
    Author = {Trieschnigg, Dolf and Meij, Edgar and de Rijke, Maarten and Kraaij, Wessel},
    Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval},
    Series = {SIGIR 2008},
    Title = {Measuring concept relatedness using language models},
    Year = {2008},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390523}}

Parsimonious Relevance Models

Relevance feedback is often applied to better capture a user’s information need. Automatically reformulating queries (or blind relevance feedback) entails looking at the terms in some set of (pseudo-)relevant documents and selecting the most informative ones with respect to the set or the collection. These terms may then be reweighted based on information pertinent to the query or the documents and—in a language modeling setting—be used to estimate a query model, P(t|θQ), i.e., a distribution over terms t for a given query Q.

Not all of the terms obtained using blind relevance feedback are equally informative given the query, even after reweighting. Some may be common terms, whilst others may describe the general domain of interest. We hypothesize that refining the results of blind relevance feedback, using a technique called parsimonious language modeling, will improve retrieval effectiveness. Hiemstra et al. already provide a mechanism for incorporating (parsimonious) blind relevance feedback, by viewing it as a three-component mixture model of document, set of feedback documents, and collection. Our approach is more straightforward, since it considers each feedback document separately and, hence, does not require the additional mixture model parameter. To create parsimonious language models we use an EM algorithm to update the maximum-likelihood (ML) estimates. Zhai and Lafferty already proposed an approach which uses a similar EM algorithm; it differs, however, in the way the set of feedback documents is handled. Whereas we parsimonize each individual document, they apply their EM algorithm to the entire set of feedback documents.
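
The EM re-estimation described here can be sketched for a single feedback document as follows, in the spirit of the parsimonious language models of Hiemstra et al. The mixture weight, pruning threshold, and toy counts are illustrative choices, not the paper's settings.

```python
def parsimonize(tf, p_coll, lam=0.1, iters=20, threshold=1e-4):
    # Iteratively re-estimate p(t|D) so that probability mass moves away
    # from terms the collection model p(t|C) already explains well.
    total = sum(tf.values())
    p_doc = {t: n / total for t, n in tf.items()}  # maximum-likelihood start
    for _ in range(iters):
        # E-step: expected term counts attributed to the document model.
        e = {t: tf[t] * lam * p_doc[t] / (lam * p_doc[t] + (1 - lam) * p_coll[t])
             for t in p_doc}
        norm = sum(e.values())
        # M-step: renormalize and prune terms with negligible mass.
        p_doc = {t: v / norm for t, v in e.items() if v / norm > threshold}
    z = sum(p_doc.values())
    return {t: v / z for t, v in p_doc.items()}
```

On a toy document where "the" is frequent but common in the collection and "dna" is rarer but distinctive, parsimonizing shifts probability mass from "the" to "dna" relative to the maximum-likelihood estimate.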

To verify our hypothesis, we use a specific instance of blind relevance feedback, namely relevance modeling (RM). We choose this particular method because it has been shown to achieve state-of-the-art retrieval performance. Relevance modeling assumes that the query and the set of documents are samples from an underlying term distribution—the relevance model. Lavrenko and Croft formulate two ways of approaching the estimation of the parameters of this model. We build upon their work and compare the results of our proposed parsimonious relevance models with RMs as well as with a query-likelihood baseline. To measure the effects in different contexts, we employ five test collections taken from the TREC-7, TREC Robust, Genomics, Blog, and Enterprise tracks and show that our proposed model improves performance in terms of mean average precision on all the topic sets over both a query-likelihood baseline as well as a run based on relevance models. Moreover, although blind relevance feedback is mainly a recall enhancing technique, we observe that parsimonious relevance models (unlike their non-parsimonized counterparts) can also improve early precision and reciprocal rank of the first relevant result. Thus, our parsimonious relevance models (i) improve retrieval effectiveness in terms of MAP on all collections, (ii) significantly outperform their non-parsimonious counterparts on most measures, and (iii) have a precision enhancing effect, unlike other blind relevance feedback methods.

  • [PDF] E. Meij, W. Weerkamp, K. Balog, and M. de Rijke, “Parsimonious relevance models,” in Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008.
    [Bibtex]
    @inproceedings{SIGIR:2008:Meij-prm,
    Author = {Meij, Edgar and Weerkamp, Wouter and Balog, Krisztian and de Rijke, Maarten},
    Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval},
    Series = {SIGIR 2008},
    Title = {Parsimonious relevance models},
    Year = {2008},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390520}}

Parsimonious concept modeling

In many collections, documents are annotated using concepts from a structured knowledge source such as an ontology or thesaurus. Examples include the news domain, where each news item is categorized according to the nature of the event that took place, and Wikipedia, with its per-article categories. These categorizing systems originally stem from the cataloging systems used in libraries and conceptual search is commonly used in digital library environments at the front-end to support search and navigation. In this paper we want to employ the explicit knowledge used for annotation at the back-end, not just to improve retrieval performance, but also to generate high-quality term and concept suggestions. To do so, we use the dual document representation—concepts and terms—to create a generative language model for each concept, which bridges the gap between vocabulary terms and concepts. Related work has also used textual representations to represent concepts, however, there are two important differences. First, we use statistical language modeling techniques to parametrize the concept models, by leveraging the dual representation of the documents. Second, we found that simple maximum likelihood estimation assigns too much probability mass to terms and concepts which may not be relevant to each document. Thus we apply an EM algorithm to “parsimonize” the document models.
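
The maximum-likelihood version of such a concept model, before the parsimonization step, can be sketched as follows: each concept's language model pools the terms of all documents annotated with that concept. The documents and concept labels below are invented examples.

```python
from collections import Counter, defaultdict

# Toy annotated collection: text plus a set of assigned concepts.
docs = {
    "d1": ("the heart pumps blood", {"Heart"}),
    "d2": ("cardiac muscle tissue of the heart", {"Heart", "Muscle"}),
    "d3": ("skeletal muscle fibers", {"Muscle"}),
}

# Pool term counts over the documents assigned to each concept.
concept_tf = defaultdict(Counter)
for text, concepts in docs.values():
    for c in concepts:
        concept_tf[c].update(text.split())

def concept_model(c):
    # p(t | c): maximum-likelihood estimate over the pooled term counts.
    tf = concept_tf[c]
    total = sum(tf.values())
    return {t: n / total for t, n in tf.items()}
```

The same parsimonization idea described above can then be applied to each document model before pooling, so that generic terms like "the" contribute less to the concept models.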

The research questions we address are twofold: (i) what are the results of applying our model as compared to a query-likelihood baseline as well as compared to a run based on relevance models and (ii) what is the influence of parsimonizing? To answer these questions, we use the TREC Genomics track test collections in conjunction with MEDLINE. MEDLINE contains over 16 million bibliographic records of publications from the life sciences domain and each abstract therein has been manually indexed by trained curators, who use concepts from the MeSH (Medical Subject Headings) thesaurus. We show that our approach is able to achieve similar or better performance than relevance models, whilst at the same time providing high-quality concepts to facilitate navigation. Examples show that our parsimonious concept models generate terms that are more specific than those acquired through maximum likelihood estimates.

  • [PDF] E. Meij, D. Trieschnigg, M. de Rijke, and W. Kraaij, “Parsimonious concept modeling,” in Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008.
    [Bibtex]
    @inproceedings{SIGIR:2008:Meij-cm,
    Author = {Meij, Edgar and Trieschnigg, Dolf and de Rijke, Maarten and Kraaij, Wessel},
    Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval},
    Series = {SIGIR 2008},
    Title = {Parsimonious concept modeling},
    Year = {2008},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390519}}