
Parsimonious Relevance Models

Relevance feedback is often applied to better capture a user’s information need. Automatically reformulating queries (or blind relevance feedback) entails looking at the terms in some set of (pseudo-)relevant documents and selecting the most informative ones with respect to the set or the collection. These terms may then be reweighted based on information pertinent to the query or the documents and, in a language modeling setting, be used to estimate a query model, P(t|θQ), i.e., a distribution over terms t for a given query Q.
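For background, such a query model is typically plugged into a KL-divergence ranking function; this standard usage (not spelled out in the abstract itself) can be written as:

    \[
      \mathrm{Score}(Q, D) \;=\; -\,\mathrm{KL}\bigl(\theta_Q \,\|\, \theta_D\bigr)
      \;\stackrel{\mathrm{rank}}{=}\; \sum_{t} P(t \mid \theta_Q)\, \log P(t \mid \theta_D),
    \]

where θD is a (smoothed) document model; plain query-likelihood retrieval is recovered when P(t|θQ) is simply the maximum-likelihood estimate over the query terms.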

Not all of the terms obtained using blind relevance feedback are equally informative given the query, even after reweighting. Some may be common terms, whilst others may describe the general domain of interest. We hypothesize that refining the results of blind relevance feedback using a technique called parsimonious language modeling will improve retrieval effectiveness. Hiemstra et al. already provide a mechanism for incorporating (parsimonious) blind relevance feedback, viewing it as a three-component mixture model of the document, the set of feedback documents, and the collection. Our approach is more straightforward: it considers each feedback document separately and hence does not require the additional mixture model parameter. To create parsimonious language models we use an EM algorithm to update the maximum-likelihood (ML) estimates. Zhai and Lafferty have proposed an approach that uses a similar EM algorithm; it differs, however, in the way the set of feedback documents is handled. Whereas we parsimonize each individual document, they apply their EM algorithm to the entire set of feedback documents.
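To make the parsimonization step concrete, here is a minimal Python sketch of the EM updates for a single document; the mixing weight lam and all names are illustrative, not taken from the paper:

    def parsimonize(doc_tf, collection_prob, lam=0.1, iters=50):
        """EM estimation of a parsimonious document model.

        doc_tf          : dict term -> term frequency in the document
        collection_prob : dict term -> P(t|C), the background model
        lam             : weight of the document-specific component
        """
        # Initialize with the maximum-likelihood estimate.
        total = sum(doc_tf.values())
        p_doc = {t: tf / total for t, tf in doc_tf.items()}
        for _ in range(iters):
            # E-step: expected document-specific occurrences of each term.
            e = {}
            for t, tf in doc_tf.items():
                doc_part = lam * p_doc[t]
                bg_part = (1 - lam) * collection_prob.get(t, 1e-12)
                e[t] = tf * doc_part / (doc_part + bg_part)
            # M-step: renormalize the expected counts.
            norm = sum(e.values())
            p_doc = {t: v / norm for t, v in e.items()}
        return p_doc

After convergence, the probability mass of common terms has largely been “explained away” by the background model, leaving a compact, document-specific term distribution (terms with near-zero probability can be pruned).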

To verify our hypothesis, we use a specific instance of blind relevance feedback, namely relevance modeling (RM). We choose this particular method because it has been shown to achieve state-of-the-art retrieval performance. Relevance modeling assumes that the query and the set of relevant documents are samples from an underlying term distribution, the relevance model. Lavrenko and Croft formulate two ways of estimating the parameters of this model. We build upon their work and compare the results of our proposed parsimonious relevance models with RMs as well as with a query-likelihood baseline. To measure the effects in different contexts, we employ five test collections taken from the TREC-7, TREC Robust, Genomics, Blog, and Enterprise tracks, and show that our proposed model improves performance in terms of mean average precision on all the topic sets over both a query-likelihood baseline and a run based on relevance models. Moreover, although blind relevance feedback is mainly a recall-enhancing technique, we observe that parsimonious relevance models (unlike their non-parsimonized counterparts) can also improve early precision and the reciprocal rank of the first relevant result. Thus, our parsimonious relevance models (i) improve retrieval effectiveness in terms of MAP on all collections, (ii) significantly outperform their non-parsimonious counterparts on most measures, and (iii) have a precision-enhancing effect, unlike other blind relevance feedback methods.
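For illustration, a minimal sketch of an RM1-style estimate built from the top-ranked feedback documents; in the parsimonious variant, each document model would simply be replaced by its parsimonized counterpart (all names are our own):

    def relevance_model(doc_models, query_likelihood):
        """RM1-style estimate: P(t|R) ~ sum over D of P(t|theta_D) * P(D|Q).

        doc_models       : dict doc_id -> {term: P(t|theta_D)} for the top-k docs
        query_likelihood : dict doc_id -> P(Q|D), may be unnormalized
        """
        # P(D|Q) is proportional to P(Q|D) under a uniform document prior.
        z = sum(query_likelihood.values())
        p_rel = {}
        for doc_id, term_probs in doc_models.items():
            w = query_likelihood[doc_id] / z
            for t, p in term_probs.items():
                p_rel[t] = p_rel.get(t, 0.0) + w * p
        return p_rel  # a distribution over terms: the relevance model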

  • [PDF] E. Meij, W. Weerkamp, K. Balog, and M. de Rijke, “Parsimonious relevance models,” in Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008.
    [Bibtex]
    @inproceedings{SIGIR:2008:Meij-prm,
    Author = {Meij, Edgar and Weerkamp, Wouter and Balog, Krisztian and de Rijke, Maarten},
    Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval},
    Date-Added = {2011-10-12 18:31:55 +0200},
    Date-Modified = {2012-10-30 08:47:44 +0000},
    Series = {SIGIR 2008},
    Title = {Parsimonious relevance models},
    Year = {2008},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390520}}

Parsimonious concept modeling

In many collections, documents are annotated using concepts from a structured knowledge source such as an ontology or thesaurus. Examples include the news domain, where each news item is categorized according to the nature of the event that took place, and Wikipedia, with its per-article categories. These categorizing systems originally stem from the cataloging systems used in libraries, and conceptual search is commonly used in digital library environments at the front-end to support search and navigation. In this paper we want to employ the explicit knowledge used for annotation at the back-end, not just to improve retrieval performance, but also to generate high-quality term and concept suggestions. To do so, we use the dual document representation (concepts and terms) to create a generative language model for each concept, which bridges the gap between vocabulary terms and concepts. Related work has also used textual representations to represent concepts; there are, however, two important differences. First, we use statistical language modeling techniques to parametrize the concept models, by leveraging the dual representation of the documents. Second, we found that simple maximum-likelihood estimation assigns too much probability mass to terms and concepts which may not be relevant to each document. Thus we apply an EM algorithm to “parsimonize” the document models.
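As a sketch of the bridging idea: under the simplifying assumption of a uniform P(D|c), a concept’s language model can be estimated from the documents annotated with it, plugging in the parsimonized document models for P(t|θD); all names are illustrative:

    def concept_models(annotations, doc_models):
        """Estimate a textual model P(t|c) for each concept c.

        annotations : dict concept -> list of doc_ids annotated with it
        doc_models  : dict doc_id -> {term: P(t|theta_D)}, parsimonized
        """
        models = {}
        for concept, doc_ids in annotations.items():
            weight = 1.0 / len(doc_ids)  # uniform P(D|c)
            p = {}
            for doc_id in doc_ids:
                for t, prob in doc_models[doc_id].items():
                    p[t] = p.get(t, 0.0) + weight * prob
            models[concept] = p  # bridges vocabulary terms and the concept
        return models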

The research questions we address are twofold: (i) what are the results of applying our model, compared to a query-likelihood baseline as well as to a run based on relevance models, and (ii) what is the influence of parsimonization? To answer these questions, we use the TREC Genomics track test collections in conjunction with MEDLINE. MEDLINE contains over 16 million bibliographic records of publications from the life sciences domain, and each abstract therein has been manually indexed by trained curators, who use concepts from the MeSH (Medical Subject Headings) thesaurus. We show that our approach is able to achieve similar or better performance than relevance models, whilst at the same time providing high-quality concepts to facilitate navigation. Examples show that our parsimonious concept models generate terms that are more specific than those acquired through maximum-likelihood estimates.

  • [PDF] E. Meij, D. Trieschnigg, M. de Rijke, and W. Kraaij, “Parsimonious concept modeling,” in Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008.
    [Bibtex]
    @inproceedings{SIGIR:2008:Meij-cm,
    Author = {Meij, Edgar and Trieschnigg, Dolf and de Rijke, Maarten and Kraaij, Wessel},
    Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval},
    Date-Added = {2011-10-12 18:31:55 +0200},
    Date-Modified = {2012-10-30 08:46:38 +0000},
    Series = {SIGIR 2008},
    Title = {Parsimonious concept modeling},
    Year = {2008},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390519}}

Bootstrapping Language Associated with Biomedical Entities

The TREC Genomics 2007 task included recognizing topic-specific entities in the returned passages. To address this task, we have designed and implemented a novel data-driven approach that combines information extraction with language modeling techniques. Instead of using an exhaustive list of all possible instances for an entity type, we look at the language usage around each entity type and use that as a classifier to determine whether or not a piece of text discusses such an entity type. We do so by comparing it with language models of the passages. For example, given the entity type “genes”, our approach can measure the gene-iness of a piece of text.
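A minimal sketch of the scoring idea: build a language model for the entity type and measure how well it explains a passage, here via cross-entropy against a smoothed passage model (the smoothing scheme and all names are our own):

    import math

    def typeness(passage_tf, type_model, background, mu=0.9):
        """Score how strongly a passage discusses an entity type,
        e.g. its gene-iness for the entity type genes.
        """
        total = sum(passage_tf.values())
        score = 0.0
        for t, p_type in type_model.items():
            # Jelinek-Mercer-style smoothing of the passage model.
            p_passage = (mu * passage_tf.get(t, 0) / total
                         + (1 - mu) * background.get(t, 1e-12))
            score += p_type * math.log(p_passage)
        return score  # higher = more characteristic of the entity type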

Our algorithm works as follows. Given an entity type, it first uses Hearst patterns to extract instances of the type. To extract more instances, we look for new contextual patterns around the instances and use them as input for a bootstrapping method, in which new instances and patterns are discovered iteratively. Afterwards, all discovered instances and patterns are used to find the sentences in the collection that are most characteristic of the requested entity type. A language model is then generated from these sentences and, at retrieval time, we use this model to rerank retrieved passages.
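A condensed sketch of the bootstrapping loop; real contextual patterns are more flexible than the literal sentence templates used here, and in practice both sets are scored and pruned each round to limit semantic drift:

    def bootstrap(sentences, seed_instances, rounds=5):
        """Iteratively grow instance and pattern sets for one entity type.

        sentences      : list of sentence strings from the collection
        seed_instances : instances extracted with Hearst patterns
        """
        instances, patterns = set(seed_instances), set()
        for _ in range(rounds):
            # Step 1: harvest contextual patterns around known instances.
            for s in sentences:
                for inst in instances:
                    if inst in s:
                        patterns.add(s.replace(inst, "<X>", 1))
            # Step 2: match the patterns elsewhere to find new instances.
            for s in sentences:
                for pat in patterns:
                    pre, _, post = pat.partition("<X>")
                    if (s.startswith(pre) and s.endswith(post)
                            and len(s) > len(pre) + len(post)):
                        instances.add(s[len(pre):len(s) - len(post)])
        return instances, patterns

Sentences that match many of the resulting instances and patterns then serve as the training material for the entity-type language model used in reranking.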

As to the results of our submitted runs, we find that our baseline run performs well above the median of all participants’ scores. Additionally, we find that our proposed method helps most for entity types with unambiguous patterns and numerous instances.

  • [PDF] E. Meij and S. Katrenko, “Bootstrapping language associated with biomedical entities,” in The Sixteenth Text REtrieval Conference, 2008.
    [Bibtex]
    @inproceedings{TREC:2008:meij,
    Author = {Meij, E. and Katrenko, S.},
    Booktitle = {The Sixteenth Text REtrieval Conference},
    Date-Added = {2011-10-16 10:24:41 +0200},
    Date-Modified = {2012-10-30 09:23:12 +0000},
    Series = {TREC 2007},
    Title = {Bootstrapping Language Associated with Biomedical Entities},
    Year = {2008}}

Integrating Conceptual Knowledge into Relevance Models: A Model and Estimation Method

We address the issue of combining explicit background knowledge with pseudo-relevance feedback from within a document collection. To this end, we use document-level annotations in tandem with generative language models to generate terms from pseudo-relevant documents and to bias the probability estimates of expansion terms in a principled manner. By applying the knowledge inherent in document annotations, we aim to control query drift and reap the benefits of automatic query expansion in terms of recall without losing precision. We consider the parameters associated with our model and describe ways of estimating them automatically. We then evaluate our model and estimation methods on two test collections, both provided by the TREC Genomics track.
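One way to read the estimation idea, as a hedged sketch only (the paper’s model is more refined): expansion-term probabilities are generated through the annotations of the pseudo-relevant documents, here with a uniform P(c|D):

    def biased_expansion(query_likelihood, doc_concepts, concept_models):
        """Bias expansion-term estimates through document annotations.

        query_likelihood : dict doc_id -> P(Q|D) for pseudo-relevant docs
        doc_concepts     : dict doc_id -> list of annotated concepts
        concept_models   : dict concept -> {term: P(t|c)}
        """
        z = sum(query_likelihood.values())
        p_exp = {}
        for doc_id, ql in query_likelihood.items():
            concepts = doc_concepts.get(doc_id, [])
            if not concepts:
                continue
            w_doc = ql / z
            for c in concepts:
                w = w_doc / len(concepts)  # uniform P(c|D)
                for t, p in concept_models[c].items():
                    p_exp[t] = p_exp.get(t, 0.0) + w * p
        return p_exp  # expansion terms biased by the background knowledge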

  • [PDF] E. Meij and M. de Rijke, “Integrating conceptual knowledge into relevance models: a model and estimation method,” in Proceedings of the 1st international conference on theory of information retrieval, 2007.
    [Bibtex]
    @inproceedings{ICTIR:2007:meij,
    Author = {E. Meij and de Rijke, M.},
    Booktitle = {Proceedings of the 1st International Conference on Theory of Information Retrieval},
    Date-Added = {2011-10-12 18:31:55 +0200},
    Date-Modified = {2012-10-30 08:50:30 +0000},
    Series = {ICTIR 2007},
    Title = {{Integrating Conceptual Knowledge into Relevance Models: A Model and Estimation Method}},
    Year = {2007}}

Thesaurus-Based Feedback to Support Mixed Search and Browsing Environments

We propose and evaluate a query expansion mechanism that supports searching and browsing in collections of annotated documents. Based on generative language models, our feedback mechanism uses document-level annotations to bias the generation of expansion terms and to generate browsing suggestions in the form of concepts selected from a controlled vocabulary (as typically used in digital library settings). We provide a detailed formalization of our feedback mechanism and evaluate its effectiveness using the TREC 2006 Genomics track test set. As to retrieval effectiveness, we find a 20% improvement in mean average precision over a query-likelihood baseline, whilst also increasing precision at 10. When we base the parameter estimation and feedback generation of our algorithm on a large corpus, we also find an improvement over state-of-the-art relevance models. The browsing suggestions are assessed along two dimensions: relevance and specificity. We present an account of per-topic results, which helps us understand for which types of queries our feedback mechanism is particularly helpful.
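The browsing-suggestion side admits a particularly simple sketch: rank controlled-vocabulary concepts by how strongly the pseudo-relevant documents vote for them (uniform per-document concept weights assumed; the paper gives the full formalization):

    def suggest_concepts(query_likelihood, doc_concepts, k=10):
        """Rank the concepts annotated on pseudo-relevant documents.

        query_likelihood : dict doc_id -> P(Q|D)
        doc_concepts     : dict doc_id -> list of concept annotations
        """
        z = sum(query_likelihood.values())
        scores = {}
        for doc_id, ql in query_likelihood.items():
            for c in doc_concepts.get(doc_id, []):
                scores[c] = scores.get(c, 0.0) + ql / z
        # The top-k concepts double as browsing suggestions.
        return sorted(scores, key=scores.get, reverse=True)[:k]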

  • [PDF] E. Meij and M. de Rijke, “Thesaurus-based feedback to support mixed search and browsing environments,” in Research and advanced technology for digital libraries, 11th European Conference, ECDL 2007, 2007.
    [Bibtex]
    @inproceedings{ECDL:2007:meij,
    Author = {Edgar Meij and Maarten de Rijke},
    Booktitle = {Research and Advanced Technology for Digital Libraries, 11th European Conference, ECDL 2007},
    Date-Added = {2011-10-12 18:31:55 +0200},
    Date-Modified = {2012-10-28 23:04:22 +0000},
    Title = {Thesaurus-Based Feedback to Support Mixed Search and Browsing Environments},
    Year = {2007}}

Using Prior Information Derived from Citations in Literature Search

Researchers spend a large amount of their time searching through an ever-increasing number of scientific articles. Although users of scientific literature search engines prefer results to be ranked according to the number of citations a publication has received, it is unknown whether this notion of authoritativeness can also benefit more traditional and objective measures. Is it also an indicator of relevance, given an information need? In this paper, we examine the relationship between the citation features of a scientific article and its prior probability of actually being relevant to an information need. We propose various ways of modeling this relationship and show how this kind of contextual information can be incorporated within a language modeling framework. We experiment with three document priors, which we evaluate on three distinct sets of queries and two document collections from the TREC Genomics track. Empirical results show that two of the proposed priors can significantly improve retrieval effectiveness, measured in terms of mean average precision.
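A hedged sketch of how such a prior enters the framework: document scores become log P(Q|D) + log P(D). The log-scaled citation count below is one plausible instantiation, not necessarily one of the three priors evaluated in the paper:

    import math

    def score_with_citation_prior(query_loglik, citations):
        """Combine query likelihood with a citation-based document prior:
        log P(D|Q) is rank-equivalent to log P(Q|D) + log P(D).
        """
        # One plausible prior: P(D) proportional to 1 + log(1 + citations).
        raw = {d: 1.0 + math.log1p(citations.get(d, 0)) for d in query_loglik}
        z = sum(raw.values())
        return {d: ll + math.log(raw[d] / z) for d, ll in query_loglik.items()}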

  • [PDF] E. Meij and M. de Rijke, “Using prior information derived from citations in literature search,” in RIAO 2007, 2007.
    [Bibtex]
    @inproceedings{RIAO:2007:Meij,
    Author = {Meij, E. and de Rijke, M.},
    Booktitle = {RIAO 2007},
    Date-Added = {2011-10-13 09:05:34 +0200},
    Date-Modified = {2012-10-30 08:49:59 +0000},
    Title = {Using Prior Information Derived from Citations in Literature Search},
    Year = {2007}}