Weakly-supervised Contextualization of Knowledge Graph Facts

Knowledge graphs (KGs) model facts about the world; they consist of nodes (entities such as companies and people) that are connected by edges (relations such as founderOf ). Facts encoded in KGs are frequently used by search applications to augment result pages. When presenting a KG fact to the user, providing other facts that are pertinent to that main fact can enrich the user experience and support exploratory information needs. KG fact contextualization is the task of augmenting a given KG fact with additional and useful KG facts. The task is challenging because of the large size of KGs; discovering other relevant facts even in a small neighborhood of the given fact results in an enormous amount of candidates. We introduce a neural fact contextualization method (NFCM) to address the KG fact contextualization task. NFCM first generates a set of candidate facts in the neighborhood of a given fact and then ranks the candidate facts using a supervised learning to rank model. The ranking model combines features that we automatically learn from data and that represent the query-candidate facts with a set of hand-crafted features we devised or adjusted for this task. In order to obtain the annotations required to train the learning to rank model at scale, we generate training data automatically using distant supervision on a large entity-tagged text corpus. We show that ranking functions learned on this data are effective at contextualizing KG facts. Evaluation using human assessors shows that it significantly outperforms several competitive baselines.

  • [PDF] [DOI] N. Voskarides, E. Meij, R. Reinanda, A. Khaitan, M. Osborne, G. Stefanoni, P. Kambadur, and M. de Rijke, “Weakly-supervised contextualization of knowledge graph facts,” in The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, New York, NY, USA, 2018, p. 765–774.
    [Bibtex]
    @inproceedings{SIGIR:2018:Voskarides,
    Acmid = {3210031},
    Address = {New York, NY, USA},
    Author = {Voskarides, Nikos and Meij, Edgar and Reinanda, Ridho and Khaitan, Abhinav and Osborne, Miles and Stefanoni, Giorgio and Kambadur, Prabhanjan and de Rijke, Maarten},
    Booktitle = {The 41st {International ACM SIGIR Conference on Research} \& {Development in Information Retrieval}},
    Date-Added = {2018-07-26 18:23:41 +0000},
    Date-Modified = {2018-09-27 21:55:17 +0100},
    Doi = {10.1145/3209978.3210031},
    Isbn = {978-1-4503-5657-2},
    Keywords = {distant supervision, fact contextualization, knowledge graphs},
    Location = {Ann Arbor, MI, USA},
    Numpages = {10},
    Pages = {765--774},
    Publisher = {ACM},
    Series = {SIGIR '18},
    Title = {Weakly-supervised Contextualization of Knowledge Graph Facts},
    Url = {http://doi.acm.org/10.1145/3209978.3210031},
    Year = {2018},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210031},
    Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210031}}

The Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR)

Semantic technologies such as controlled vocabularies, thesauri, and knowledge graphs have been used throughout the history of information retrieval for a variety of tasks. Recent advances in knowledge acquisition, alignment, and utilization have given rise to a body of new approaches for utilizing knowledge graphs in text retrieval tasks and it is therefore time to consolidate the community efforts and study how such technologies can be employed in information retrieval systems in the most effective way. It is also time to start and deepen the dialogue between researchers and practitioners in order to ensure that breakthroughs, technologies, and algorithms in this space are widely disseminated. The goal of this workshop, co-located with SIGIR 2018, is to bring together and grow a community of researchers and practitioners who are interested in using, aligning, and constructing knowledge graphs and similar semantic resources for information retrieval applications. See https://kg4ir.github.io/ for more info.

  • [PDF] [DOI] L. Dietz, C. Xiong, J. Dalton, and E. Meij, “The second workshop on knowledge graphs and semantics for text retrieval, analysis, and understanding (kg4ir),” in The 41st international acm sigir conference on research & development in information retrieval, New York, NY, USA, 2018, p. 1423–1426.
    [Bibtex]
    @inproceedings{SIGIR:2018:Dietz-WS,
    Acmid = {3210196},
    Address = {New York, NY, USA},
    Author = {Dietz, Laura and Xiong, Chenyan and Dalton, Jeff and Meij, Edgar},
    Booktitle = {The 41st International ACM SIGIR Conference on Research \& Development in Information Retrieval},
    Date-Added = {2018-07-26 18:25:34 +0000},
    Date-Modified = {2018-07-26 18:31:50 +0000},
    Doi = {10.1145/3209978.3210196},
    Isbn = {978-1-4503-5657-2},
    Keywords = {entity linking, entity retrieval, entity-oriented search, information retrieval, knowledge graphs},
    Location = {Ann Arbor, MI, USA},
    Numpages = {4},
    Pages = {1423--1426},
    Publisher = {ACM},
    Series = {SIGIR '18},
    Title = {The Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR)},
    Url = {http://doi.acm.org/10.1145/3209978.3210196},
    Year = {2018},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210196},
    Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210196}}

Utilizing Knowledge Graphs for Text-Centric Information Retrieval

The past decade has witnessed the emergence of several publicly available and proprietary knowledge graphs (KGs). The depth and breadth of content in these KGs made them not only rich sources of structured knowledge by themselves, but also valuable resources for search systems. A surge of recent developments in entity linking and entity retrieval methods gave rise to a new line of research that aims at utilizing KGs for text-centric retrieval applications. This tutorial is the first to summarize and disseminate the progress in this emerging area to industry practitioners and researchers.

  • [PDF] [DOI] L. Dietz, A. Kotov, and E. Meij, “Utilizing knowledge graphs for text-centric information retrieval,” in The 41st international acm sigir conference on research & development in information retrieval, New York, NY, USA, 2018, p. 1387–1390.
    [Bibtex]
    @inproceedings{SIGIR:2018:Dietz-Tut,
    Acmid = {3210187},
    Address = {New York, NY, USA},
    Author = {Dietz, Laura and Kotov, Alexander and Meij, Edgar},
    Booktitle = {The 41st International ACM SIGIR Conference on Research \& Development in Information Retrieval},
    Date-Added = {2018-07-26 18:24:31 +0000},
    Date-Modified = {2018-07-26 18:31:50 +0000},
    Doi = {10.1145/3209978.3210187},
    Isbn = {978-1-4503-5657-2},
    Keywords = {entity linking, entity retrieval, information retrieval, knowledge graphs},
    Location = {Ann Arbor, MI, USA},
    Numpages = {4},
    Pages = {1387--1390},
    Publisher = {ACM},
    Series = {SIGIR '18},
    Title = {Utilizing Knowledge Graphs for Text-Centric Information Retrieval},
    Url = {http://doi.acm.org/10.1145/3209978.3210187},
    Year = {2018},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210187},
    Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210187}}

The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR)

Knowledge graphs have been used throughout the history of information retrieval for a variety of tasks. Advances in knowledge acquisition and alignment technology in the last few years have given rise to a body of new approaches for utilizing knowledge graphs in text retrieval tasks. This report presents the motivation, output, and outlook of the first workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis which was co-located with SIGIR 2017 in Tokyo, Japan. We aim to assess where we stand today, what future directions are, and which preconditions could lead to further performance increases. See https://kg4ir.github.io/ for more info.

  • [PDF] [DOI] L. Dietz, C. Xiong, and E. Meij, “The first workshop on knowledge graphs and semantics for text retrieval and analysis (kg4ir),” in Proceedings of the 40th international acm sigir conference on research and development in information retrieval, New York, NY, USA, 2017, p. 1427–1428.
    [Bibtex]
    @inproceedings{SIGIR:2017:Dietz,
    Acmid = {3084371},
    Address = {New York, NY, USA},
    Author = {Dietz, Laura and Xiong, Chenyan and Meij, Edgar},
    Booktitle = {Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval},
    Date-Added = {2018-07-26 18:17:39 +0000},
    Date-Modified = {2018-07-26 18:17:51 +0000},
    Doi = {10.1145/3077136.3084371},
    Isbn = {978-1-4503-5022-8},
    Keywords = {entities, information retrieval, knowledge graphs},
    Location = {Shinjuku, Tokyo, Japan},
    Numpages = {2},
    Pages = {1427--1428},
    Publisher = {ACM},
    Series = {SIGIR '17},
    Title = {The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR)},
    Url = {http://doi.acm.org/10.1145/3077136.3084371},
    Year = {2017},
    Bdsk-Url-1 = {http://doi.acm.org/10.1145/3077136.3084371},
    Bdsk-Url-2 = {https://doi.org/10.1145/3077136.3084371}}