Adding Semantics to Microblog Posts

Research on Twitter

Microblogs have become an important source of information for marketing, intelligence, and reputation management purposes. Streams of microblogs are of great value because of their direct and real-time nature. Determining what an individual microblog post is about, however, can be non-trivial because of creative language usage, the highly contextualized and informal nature of microblog posts, and the limited length of this form of communication.

We propose a solution to the problem of determining what a microblog post is about through semantic linking: we add semantics to posts by automatically identifying concepts that are semantically related to them and generating links to the corresponding Wikipedia articles. The identified concepts can subsequently be used for, e.g., social media mining, thereby reducing the need for manual inspection and selection. Using a purpose-built test collection of tweets, we show that recently proposed approaches for semantic linking do not perform well, mainly due to the idiosyncratic nature of microblog posts. We propose a novel method based on machine learning with a set of innovative features and show that it is able to achieve significant improvements over all other methods, especially in terms of precision.
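To make the idea of linking a post to Wikipedia concepts more concrete, here is a minimal sketch of the candidate-generation step that such a pipeline could start from. It is not the method from the paper: it only matches word n-grams of a post against a lexicon of anchor texts and ranks the target articles by "commonness" (the fraction of times an anchor text links to a given article), a baseline commonly used in entity linking. The lexicon, its counts, and all function names are hypothetical and made up for illustration.

```python
import re
from collections import defaultdict

# Hypothetical toy lexicon: anchor text -> {Wikipedia article title: link count}.
# The counts are invented; a real lexicon would be extracted from a Wikipedia dump.
ANCHOR_LEXICON = {
    "obama": {"Barack Obama": 95, "Obama, Fukui": 5},
    "white house": {"White House": 90, "White House (Moscow)": 10},
    "house": {"House": 60, "House (TV series)": 40},
}


def tokenize(post):
    """Lowercase the post and keep alphanumeric tokens ('#' and '@' are dropped)."""
    return re.findall(r"[a-z0-9]+", post.lower())


def ngrams(tokens, max_n=3):
    """Yield every word n-gram of length 1..max_n as a space-joined string."""
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            yield " ".join(tokens[i:i + n])


def link_candidates(post, lexicon=ANCHOR_LEXICON, max_n=3):
    """Return (article title, commonness) pairs, highest commonness first."""
    scores = defaultdict(float)
    for gram in ngrams(tokenize(post), max_n):
        targets = lexicon.get(gram)
        if not targets:
            continue
        total = sum(targets.values())
        for title, count in targets.items():
            # Keep the best commonness seen for each article across all n-grams.
            scores[title] = max(scores[title], count / total)
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    for title, score in link_candidates("Obama speaks at the White House today"):
        print(f"{score:.2f}  {title}")
```

In a learning-based setup like the one described above, candidates of this kind would presumably be passed to a trained model that decides which ones to keep, using additional features; that part is not sketched here.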

  • E. Meij, W. Weerkamp, and M. de Rijke, “Adding semantics to microblog posts,” in Proceedings of the Fifth ACM International Conference on Web Search and Data Mining (WSDM 2012), 2012.

    Adding Semantics to Microblog Posts | Follow the Crowd

    […] For more, see our full paper, Adding Semantics to Microblog Posts. […]

    Context-based Entity Linking | research | GRAUS.NU

    […] The Text Analysis Conference is a yearly ‘benchmark event’, where a dataset is provided (lots of documents, a knowledge base, and a list of queries, words or ‘entity mentions’ that occur in the documents). I describe the task in more detail here. We participated in this track, by building and modifying a system that was created for entity-linking tweets. […]


    I am working with your approach from “Adding Semantics to Microblog Posts,” but I am having problems at the implementation and testing level. Can you tell me what software you used, please?

      Edgar Meij

      Not sure what you mean?


    Thank you for the post. I am interested in your paper and was wondering if it is possible to get access to the used in your paper.

      Edgar Meij

      Sure! Please send me an email…
