• Publications
    • Conference Papers
    • Workshop Papers
    • Journal Papers
    • Publicity
    • Books
    • Theses
    • Submitted
  • Professional Activities
  • Teaching
  • About
  • Contact

Edgar Meij

semantic search research ッ

  • Publications
    • Conference Papers
    • Workshop Papers
    • Journal Papers
    • Publicity
    • Books
    • Theses
    • Submitted
  • Professional Activities
  • Teaching
  • About
  • Contact
Example time series

Using Temporal Bursts for Query Modeling

01/02/2014 Blog Journal Papers Publications No Comments

In this paper, we present an approach to query modeling that leverages the temporal distribution of documents in an initially retrieved set of documents. Query modeling is often used to better capture a user’s information need and help bridge the lexical gap between a query and the documents to be retrieved. Typical approaches consider terms in some set of documents and select the most informative ones. These terms may then be reweighted and–in a language modeling setting–be used to estimate a query model, i.e., a distribution over terms for a query. Temporal distributions tend to exhibit bursts, especially in news-related document collections.

Time series for the query "grammys"
Time series for the query grammys.

We define a burst to be a time period where unusually many documents were published; in our approach we detect bursts in result lists returned for a query. We then model the bursts using a reduced result list and select its most descriptive terms. Finally, we merge the term sets so as to arrive at a reformulation of the original query.

The main contributions we make in this paper are novel temporal query models and an analysis of their effectiveness, both for time-aware queries and for arbitrary queries. For query sets that consist of both temporal and non-temporal queries, our model is able to find the balance between performing query modeling or not: only if there are bursts and only if some of the top ranked documents are in the burst, the query is remodeled based on the bursts. We consistently improve over various baselines such as relevance models, often significantly so.

For query sets that consist of both temporal and non-temporal queries, our query modeling approach incorporates an effective selection method of terms. We consistently and significantly improve over various baselines, such as relevance models, on both news collections and a collection of blog posts.

See the Springer website for the online version, the PDF can be found below.

  • [PDF] M. Peetz, E. Meij, and M. Rijke, “Using temporal bursts for query modeling,” Information retrieval, vol. 17, iss. 1, pp. 74-108, 2014.
    [Bibtex]
    @article{IRJ:2013:Peetz,
    Author = {Peetz, Maria-Hendrike and Meij, Edgar and Rijke, Maarten},
    Date-Added = {2014-05-16 06:22:13 +0000},
    Date-Modified = {2014-05-16 06:22:42 +0000},
    Journal = {Information Retrieval},
    Number = {1},
    Pages = {74-108},
    Title = {Using temporal bursts for query modeling},
    Volume = {17},
    Year = {2014},
    Bdsk-Url-1 = {http://dx.doi.org/10.1007/s10791-013-9227-2}}
burst-information-retrievalhttpedgar-meij-prohttpedgar-meij-protaglogo-penerbit-buku-internasionalidentify-temporal-burst-for-query-modelingInformation retrievalLanguage modelinglogo-penerbit-buku-internasionalmeijpenerbit-buku-internasionalQuery modelingquery-model-consists-oftemporal-distribution-of-documentsTime seriesTREC Blogusing-temporal-bursts-for-query-modeling

We're now hiring next year's interns!

RepLab 2014

Leave a Reply Cancel reply

Time limit is exhausted. Please reload CAPTCHA.

Edgar Meij logo

Welcome!

This is the website of Edgar Meij. I lead several groups of researchers and engineers at Bloomberg working on knowledge graphs, question answering, information retrieval, machine learning, and more…

Search

Tweets by @edgarmeij

Tags

AIDA Artificial Intelligence CLEF DBpedia Document priors edgar-meij entity-linking-and-retrieval entity-linking-and-retrieval-tutorial entity-linking-tutorial Entity finding Entity linking Information retrieval Knowledge base population Knowledge Graph Language modeling Linking Open Data LOD logo-penerbit-buku-internasional Lucene Machine learning meij MeSH Microblogs penerbit-buku-internasional Query log analysis Query modeling Relevance modeling Semanticizing Semantic linking Semantic query analysis Semantic search Teaching Text mining TREC Blog TREC Enterprise TREC Genomics TREC KBA TREC Microblog TREC Relevance Feedback Tutorial Twitter Web services Wikipedia Workflows Workshop
Proudly powered by WordPress | Theme: Doo by ThemeVS.