OpenGeist: Insight in the Stream of Page Views on Wikipedia
We present a RESTful interface that captures insights into the zeitgeist of Wikipedia users. In recent years many so-called zeitgeist applications have been launched. Such applications are used to gain insights into the current gist of society and actual affairs. Several news sources run zeitgeist applications for popular and trending news. In addition, there are zeitgeist applications that report on trending publications such as LibraryThing, and trending topics, such as Google Zeitgeist. There is an interesting open data source from which a stream of people’s changing interests can be observed across a very broad spectrum of areas: the Wikimedia access logs. These logs contain the number of requests made to any Wikimedia domain, sorted by subdomain, and aggregated on an hourly basis. Since they are a log of the actual requests, they are noisy and can also contain non-existing pages. They are also quite large, yielding 60 GB worth of compressed textual data per month. Currently, we update the data on a daily basis and filter the raw source data by matching the URLs of all English Wikipedia articles and their redirects.
In this paper we describe an API that facilitates easy access to the access logs. We have identified the following requirements our system should have:
- The user must have access to the raw time series data for a concept.
- The user must be able to find the N most temporally similar concepts.
- The user must be able to group concepts and their data, based either on the categorial system of Wikipedia or on similarity between concepts.
- The system must return either a textual or a visual representation.
- The user should be able to apply time series filters to extract trends and (recurring) events.
The API is an interface for clustering and comparing concepts based on the time series of the number of views of their Wikipedia page.
See http://www.opengeist.org for more info and examples.
-
A. Saravanou, G. Stefanoni, and E. Meij, “Identifying notable news stories,” in Advances in information retrieval, Cham, 2020, p. 352–358.
[Bibtex]@inproceedings{ECIR:2020:Saravanou, Abstract = {The volume of news content has increased significantly in recent years and systems to process and deliver this information in an automated fashion at scale are becoming increasingly prevalent. One critical component that is required in such systems is a method to automatically determine how notable a certain news story is, in order to prioritize these stories during delivery. One way to do so is to compare each story in a stream of news stories to a notable event. In other words, the problem of detecting notable news can be defined as a ranking task; given a trusted source of notable events and a stream of candidate news stories, we aim to answer the question: ``Which of the candidate news stories is most similar to the notable one?''. We employ different combinations of features and learning to rank (LTR) models and gather relevance labels using crowdsourcing. In our approach, we use structured representations of candidate news stories (triples) and we link them to corresponding entities. Our evaluation shows that the features in our proposed method outperform standard ranking methods, and that the trained model generalizes well to unseen news stories.}, Address = {Cham}, Author = {Saravanou, Antonia and Stefanoni, Giorgio and Meij, Edgar}, Booktitle = {Advances in Information Retrieval}, Date-Added = {2020-06-03 06:36:13 +0100}, Date-Modified = {2020-06-03 06:47:12 +0100}, Editor = {Jose, Joemon M. and Yilmaz, Emine and Magalh{\~a}es, Jo{\~a}o and Castells, Pablo and Ferro, Nicola and Silva, M{\'a}rio J. and Martins, Fl{\'a}vio}, Isbn = {978-3-030-45442-5}, Pages = {352--358}, Publisher = {Springer International Publishing}, Title = {Identifying Notable News Stories}, Year = {2020}}
-
T. Safavi, D. Koutra, and E. Meij, Improving the utility of knowledge graph embeddings with calibration, 2020.
[Bibtex]@misc{ARXIV:2020:Safavi, Archiveprefix = {arXiv}, Author = {Tara Safavi and Danai Koutra and Edgar Meij}, Date-Added = {2020-06-03 06:34:40 +0100}, Date-Modified = {2020-06-03 06:47:20 +0100}, Eprint = {2004.01168}, Primaryclass = {cs.AI}, Title = {Improving the Utility of Knowledge Graph Embeddings with Calibration}, Year = {2020}}
-
S. Zhang, E. Meij, K. Balog, and R. Reinanda, “Novel entity discovery from web tables,” in Proceedings of the web conference 2020, New York, NY, USA, 2020, p. 1298–1308.
[Bibtex]@inproceedings{WWW:2020:Zhang, Address = {New York, NY, USA}, Author = {Zhang, Shuo and Meij, Edgar and Balog, Krisztian and Reinanda, Ridho}, Booktitle = {Proceedings of The Web Conference 2020}, Date-Added = {2020-06-03 06:23:41 +0100}, Date-Modified = {2020-06-03 06:24:53 +0100}, Doi = {10.1145/3366423.3380205}, Isbn = {9781450370233}, Keywords = {tabular data extraction, Novel entity discovery, entity linking, KBP}, Location = {Taipei, Taiwan}, Numpages = {11}, Pages = {1298--1308}, Publisher = {Association for Computing Machinery}, Series = {WWW '20}, Title = {Novel Entity Discovery from Web Tables}, Url = {https://doi.org/10.1145/3366423.3380205}, Year = {2020}, Bdsk-Url-1 = {https://doi.org/10.1145/3366423.3380205}}
-
L. Dietz, C. Xiong, J. Dalton, and E. Meij, “Special issue on knowledge graphs and semantics in text analysis and retrieval,” Information retrieval journal, 2019.
[Bibtex]@article{IRJ:2019:Dietz, Author = {Dietz, Laura and Xiong, Chenyan and Dalton, Jeff and Meij, Edgar}, Date-Added = {2019-03-12 20:19:31 +0000}, Date-Modified = {2019-03-12 20:19:39 +0000}, Day = {04}, Doi = {10.1007/s10791-019-09354-z}, Issn = {1573-7659}, Journal = {Information Retrieval Journal}, Month = {Mar}, Title = {Special issue on knowledge graphs and semantics in text analysis and retrieval}, Url = {https://doi.org/10.1007/s10791-019-09354-z}, Year = {2019}, Bdsk-Url-1 = {https://doi.org/10.1007/s10791-019-09354-z}}
-
R. Reinanda, E. Meij, J. Pantony, and D. Jonathan, “Related entity finding on highly-heterogeneous knowledge graphs,” in Asonam, 2018.
[Bibtex]@inproceedings{ASONAM:2018:Reinanda, Author = {Reinanda, Ridho and Meij, Edgar and Pantony, Joshua and Dorando Jonathan}, Booktitle = {ASONAM}, Date-Added = {2018-09-27 21:43:39 +0100}, Date-Modified = {2018-09-27 21:55:03 +0100}, Series = {{ASONAM} '18}, Title = {Related Entity Finding on Highly-heterogeneous Knowledge Graphs}, Year = {2018}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210031}, Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210031}}
-
L. Dietz, C. Xiong, J. Dalton, and E. Meij, “The second workshop on knowledge graphs and semantics for text retrieval, analysis, and understanding (kg4ir),” in The 41st international acm sigir conference on research & development in information retrieval, New York, NY, USA, 2018, p. 1423–1426.
[Bibtex]@inproceedings{SIGIR:2018:Dietz-WS, Acmid = {3210196}, Address = {New York, NY, USA}, Author = {Dietz, Laura and Xiong, Chenyan and Dalton, Jeff and Meij, Edgar}, Booktitle = {The 41st International ACM SIGIR Conference on Research \& Development in Information Retrieval}, Date-Added = {2018-07-26 18:25:34 +0000}, Date-Modified = {2018-07-26 18:31:50 +0000}, Doi = {10.1145/3209978.3210196}, Isbn = {978-1-4503-5657-2}, Keywords = {entity linking, entity retrieval, entity-oriented search, information retrieval, knowledge graphs}, Location = {Ann Arbor, MI, USA}, Numpages = {4}, Pages = {1423--1426}, Publisher = {ACM}, Series = {SIGIR '18}, Title = {The Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR)}, Url = {http://doi.acm.org/10.1145/3209978.3210196}, Year = {2018}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210196}, Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210196}}
-
L. Dietz, A. Kotov, and E. Meij, “Utilizing knowledge graphs for text-centric information retrieval,” in The 41st international acm sigir conference on research & development in information retrieval, New York, NY, USA, 2018, p. 1387–1390.
[Bibtex]@inproceedings{SIGIR:2018:Dietz-Tut, Acmid = {3210187}, Address = {New York, NY, USA}, Author = {Dietz, Laura and Kotov, Alexander and Meij, Edgar}, Booktitle = {The 41st International ACM SIGIR Conference on Research \& Development in Information Retrieval}, Date-Added = {2018-07-26 18:24:31 +0000}, Date-Modified = {2018-07-26 18:31:50 +0000}, Doi = {10.1145/3209978.3210187}, Isbn = {978-1-4503-5657-2}, Keywords = {entity linking, entity retrieval, information retrieval, knowledge graphs}, Location = {Ann Arbor, MI, USA}, Numpages = {4}, Pages = {1387--1390}, Publisher = {ACM}, Series = {SIGIR '18}, Title = {Utilizing Knowledge Graphs for Text-Centric Information Retrieval}, Url = {http://doi.acm.org/10.1145/3209978.3210187}, Year = {2018}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210187}, Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210187}}
-
N. Voskarides, E. Meij, R. Reinanda, A. Khaitan, M. Osborne, G. Stefanoni, P. Kambadur, and M. de Rijke, “Weakly-supervised contextualization of knowledge graph facts,” in The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, New York, NY, USA, 2018, p. 765–774.
[Bibtex]@inproceedings{SIGIR:2018:Voskarides, Acmid = {3210031}, Address = {New York, NY, USA}, Author = {Voskarides, Nikos and Meij, Edgar and Reinanda, Ridho and Khaitan, Abhinav and Osborne, Miles and Stefanoni, Giorgio and Kambadur, Prabhanjan and de Rijke, Maarten}, Booktitle = {The 41st {International ACM SIGIR Conference on Research} \& {Development in Information Retrieval}}, Date-Added = {2018-07-26 18:23:41 +0000}, Date-Modified = {2018-09-27 21:55:17 +0100}, Doi = {10.1145/3209978.3210031}, Isbn = {978-1-4503-5657-2}, Keywords = {distant supervision, fact contextualization, knowledge graphs}, Location = {Ann Arbor, MI, USA}, Numpages = {10}, Pages = {765--774}, Publisher = {ACM}, Series = {SIGIR '18}, Title = {Weakly-supervised Contextualization of Knowledge Graph Facts}, Url = {http://doi.acm.org/10.1145/3209978.3210031}, Year = {2018}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/3209978.3210031}, Bdsk-Url-2 = {https://doi.org/10.1145/3209978.3210031}}
-
L. Dietz, C. Xiong, and E. Meij, “Overview of the first workshop on knowledge graphs and semantics for text retrieval and analysis (kg4ir),” Sigir forum, vol. 51, iss. 3, p. 139–144, 2018.
[Bibtex]@article{Forum:2018:Dietz, Acmid = {3190601}, Address = {New York, NY, USA}, Author = {Dietz, Laura and Xiong, Chenyan and Meij, Edgar}, Date-Added = {2018-07-26 18:22:37 +0000}, Date-Modified = {2018-07-26 18:22:48 +0000}, Doi = {10.1145/3190580.3190601}, Issn = {0163-5840}, Issue_Date = {December 2017}, Journal = {SIGIR Forum}, Month = 2, Number = {3}, Numpages = {6}, Pages = {139--144}, Publisher = {ACM}, Title = {Overview of The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR)}, Url = {http://doi.acm.org/10.1145/3190580.3190601}, Volume = {51}, Year = {2018}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/3190580.3190601}, Bdsk-Url-2 = {https://doi.org/10.1145/3190580.3190601}}
-
L. Dietz, C. Xiong, and E. Meij, “The first workshop on knowledge graphs and semantics for text retrieval and analysis (kg4ir),” in Proceedings of the 40th international acm sigir conference on research and development in information retrieval, New York, NY, USA, 2017, p. 1427–1428.
[Bibtex]@inproceedings{SIGIR:2017:Dietz, Acmid = {3084371}, Address = {New York, NY, USA}, Author = {Dietz, Laura and Xiong, Chenyan and Meij, Edgar}, Booktitle = {Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval}, Date-Added = {2018-07-26 18:17:39 +0000}, Date-Modified = {2018-07-26 18:17:51 +0000}, Doi = {10.1145/3077136.3084371}, Isbn = {978-1-4503-5022-8}, Keywords = {entities, information retrieval, knowledge graphs}, Location = {Shinjuku, Tokyo, Japan}, Numpages = {2}, Pages = {1427--1428}, Publisher = {ACM}, Series = {SIGIR '17}, Title = {The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR)}, Url = {http://doi.acm.org/10.1145/3077136.3084371}, Year = {2017}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/3077136.3084371}, Bdsk-Url-2 = {https://doi.org/10.1145/3077136.3084371}}
-
L. Dietz, A. Kotov, and E. Meij, “Utilizing knowledge bases in text-centric information retrieval,” in Proceedings of the 2016 acm international conference on the theory of information retrieval, 2016, p. 5–5.
[Bibtex]@inproceedings{ICTIR:2016:dietz, Author = {Dietz, Laura and Kotov, Alexander and Meij, Edgar}, Booktitle = {Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval}, Date-Added = {2017-01-10 21:28:50 +0000}, Date-Modified = {2017-01-10 21:29:16 +0000}, Pages = {5--5}, Series = {ICTIR '16}, Title = {Utilizing Knowledge Bases in Text-centric Information Retrieval}, Year = {2016}}
-
N. Voskarides, E. Meij, and M. de Rijke, “Generating descriptions of entity relationships,” in Ecir 2017: 39th european conference on information retrieval, 2017.
[Bibtex]@inproceedings{ECIR:2017:voskarides, Author = {Voskarides, Nikos and Meij, Edgar and de Rijke, Maarten}, Booktitle = {ECIR 2017: 39th European Conference on Information Retrieval}, Date-Added = {2017-01-10 21:27:37 +0000}, Date-Modified = {2017-01-10 21:27:58 +0000}, Month = {April}, Publisher = {Springer}, Series = {LNCS}, Title = {Generating descriptions of entity relationships}, Year = {2017}}
-
R. Reinanda, E. Meij, and M. de Rijke, “Document filtering for long-tail entities,” in Cikm 2016: 25th acm conference on information and knowledge management, 2016.
[Bibtex]@inproceedings{CIKM:2016:Reinanda, Author = {Reinanda, Ridho and Meij, Edgar and de Rijke, Maarten}, Booktitle = {CIKM 2016: 25th ACM Conference on Information and Knowledge Management}, Date-Added = {2016-09-05 18:55:21 +0000}, Date-Modified = {2016-09-05 19:00:33 +0000}, Month = {October}, Publisher = {ACM}, Title = {Document filtering for long-tail entities}, Year = {2016}}
-
D. Graus, M. Tsagkias, W. Weerkamp, E. Meij, and M. de Rijke, “Dynamic collective entity representations for entity ranking,” in Proceedings of the ninth acm international conference on web search and data mining, 2016.
[Bibtex]@inproceedings{WSDM:2016:Graus, Author = {Graus, David and Tsagkias, Manos and Weerkamp, Wouter and Meij, Edgar and de Rijke, Maarten}, Booktitle = {Proceedings of the ninth ACM international conference on Web search and data mining}, Date-Added = {2016-01-07 17:24:16 +0000}, Date-Modified = {2016-01-07 17:25:55 +0000}, Series = {WSDM 2016}, Title = {Dynamic Collective Entity Representations for Entity Ranking}, Year = {2016}, Bdsk-Url-1 = {http://aclweb.org/anthology/P15-1055}}
-
D. Odijk, E. Meij, I. Sijaranamual, and M. de Rijke, “Dynamic query modeling for related content finding,” in SIGIR 2015: 38th international ACM SIGIR conference on Research and development in information retrieval, 2015.
[Bibtex]@inproceedings{SIGIR:2015:Odijk, Author = {Odijk, Daan and Meij, Edgar and Sijaranamual, Isaac and de Rijke, Maarten}, Booktitle = {{SIGIR 2015: 38th international ACM SIGIR conference on Research and development in information retrieval}}, Date-Added = {2015-08-06 13:14:13 +0000}, Date-Modified = {2015-08-06 13:39:24 +0000}, Month = {August}, Publisher = {ACM}, Title = {Dynamic query modeling for related content finding}, Year = {2015}}
-
R. Reinanda, E. Meij, and M. de Rijke, “Mining, ranking and recommending entity aspects,” in SIGIR 2015: 38th international ACM SIGIR conference on Research and development in information retrieval, 2015.
[Bibtex]@inproceedings{SIGIR:2015:Reinanda, Author = {Reinanda, Ridho and Meij, Edgar and de Rijke, Maarten}, Booktitle = {{SIGIR 2015: 38th international ACM SIGIR conference on Research and development in information retrieval}}, Date-Added = {2015-08-06 13:12:53 +0000}, Date-Modified = {2015-08-06 13:39:33 +0000}, Month = {August}, Publisher = {ACM}, Title = {Mining, ranking and recommending entity aspects}, Year = {2015}}
-
N. Voskarides, E. Meij, M. Tsagkias, M. de Rijke, and W. Weerkamp, “Learning to explain entity relationships in knowledge graphs,” in Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (volume 1: long papers), 2015, p. 564–574.
[Bibtex]@inproceedings{ACL:2015:Voskarides, Author = {Voskarides, Nikos and Meij, Edgar and Tsagkias, Manos and de Rijke, Maarten and Weerkamp, Wouter}, Booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)}, Date-Added = {2015-08-06 13:08:02 +0000}, Date-Modified = {2015-08-06 13:08:14 +0000}, Location = {Beijing, China}, Pages = {564--574}, Publisher = {Association for Computational Linguistics}, Title = {Learning to Explain Entity Relationships in Knowledge Graphs}, Url = {http://aclweb.org/anthology/P15-1055}, Year = {2015}, Bdsk-Url-1 = {http://aclweb.org/anthology/P15-1055}}
-
E. Meij, W. Weerkamp, and M. de Rijke, “Adding semantics to microblog posts,” in Proceedings of the fifth acm international conference on web search and data mining, 2012.
[Bibtex]@inproceedings{WSDM:2012:meij, Author = {Meij, Edgar and Weerkamp, Wouter and de Rijke, Maarten}, Booktitle = {Proceedings of the fifth ACM international conference on Web search and data mining}, Date-Added = {2015-01-20 20:28:31 +0000}, Date-Modified = {2015-01-20 20:28:31 +0000}, Series = {WSDM 2012}, Title = {Adding Semantics to Microblog Posts}, Year = {2012}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1935826.1935842}}
-
S. Liang, Z. Ren, W. Weerkamp, E. Meij, and M. de Rijke, “Time-aware rank aggregation for microblog search,” in Proceedings of the 23th acm conference on information and knowledge management, 2014.
[Bibtex]@inproceedings{CIKM:2014:liang, Author = {Liang, Shangsong and Ren, Zhaochun and Weerkamp, Wouter and Meij, Edgar and de Rijke, Maarten}, Booktitle = {Proceedings of the 23th ACM conference on Information and knowledge management}, Date-Added = {2014-08-24 01:12:25 +0000}, Date-Modified = {2014-08-24 01:13:40 +0000}, Series = {CIKM 2014}, Title = {Time-Aware Rank Aggregation for Microblog Search}, Year = {2014}}
-
Z. Ren, S. Liang, E. Meij, and M. de Rijke, “Personalized time-aware tweets summarization,” in Proceedings of the 36th international acm sigir conference on research and development in information retrieval, 2013, p. 513–522.
[Bibtex]@inproceedings{SIGIR:2013:Ren, Author = {Ren, Zhaochun and Liang, Shangsong and Meij, Edgar and de Rijke, Maarten}, Booktitle = {Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval}, Date-Added = {2014-05-16 06:24:55 +0000}, Date-Modified = {2014-05-16 06:25:35 +0000}, Pages = {513--522}, Series = {SIGIR '13}, Title = {Personalized Time-aware Tweets Summarization}, Year = {2013}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/2484028.2484052}, Bdsk-Url-2 = {http://dx.doi.org/10.1145/2484028.2484052}}
-
M. Peetz, E. Meij, and M. Rijke, “Using temporal bursts for query modeling,” Information retrieval, vol. 17, iss. 1, pp. 74-108, 2014.
[Bibtex]@article{IRJ:2013:Peetz, Author = {Peetz, Maria-Hendrike and Meij, Edgar and Rijke, Maarten}, Date-Added = {2014-05-16 06:22:13 +0000}, Date-Modified = {2014-05-16 06:22:42 +0000}, Journal = {Information Retrieval}, Number = {1}, Pages = {74-108}, Title = {Using temporal bursts for query modeling}, Volume = {17}, Year = {2014}, Bdsk-Url-1 = {http://dx.doi.org/10.1007/s10791-013-9227-2}}
- E. Meij, “Een bril die alles weet,” Nrc handelsblad, 2013.
[Bibtex]@article{NRC:2013:meij, Author = {Meij, E.}, Date-Added = {2013-05-22 11:41:00 +0000}, Date-Modified = {2013-05-22 11:41:38 +0000}, Journal = {NRC Handelsblad}, Month = {February}, Title = {Een bril die alles weet}, Year = {2013}}
-
D. Odijk, E. Meij, D. Graus, and T. Kenter, “Multilingual semantic linking for video streams: making “ideas worth sharing” more accessible,” in Proceedings of the 2nd international workshop on web of linked entities (wole 2013), 2013.
[Bibtex]@inproceedings{WOLE:2013:Odijk, Author = {Odijk, Daan and Meij, Edgar and Graus, David and Kenter, Tom}, Booktitle = {Proceedings of the 2nd International Workshop on Web of Linked Entities (WoLE 2013)}, Date-Added = {2013-05-15 14:09:58 +0000}, Date-Modified = {2013-05-15 14:11:37 +0000}, Title = {Multilingual Semantic Linking for Video Streams: Making "Ideas Worth Sharing" More Accessible}, Year = {2013}}
-
D. Odijk, E. Meij, and M. de Rijke, “Feeding the second screen: semantic linking based on subtitles,” in OAIR ’13, 2013.
[Bibtex]@inproceedings{OAIR:2013:Odijk, Author = {Odijk, D. and Meij, E. and de Rijke, M.}, Booktitle = {{OAIR '13}}, Date-Added = {2013-02-18 11:26:22 +0000}, Date-Modified = {2013-03-19 13:45:33 +0000}, Title = {Feeding the Second Screen: Semantic Linking based on Subtitles}, Year = {2013}}
- D. Graus, T. Kenter, M. Bron, E. Meij, and M. de Rijke, “Context-based entity linking – University of Amsterdam at TAC 2012,” in Tac 2012 working notes, 2012.
[Bibtex]@inproceedings{TAC:2012:wn, Author = {Graus, D. and Kenter, T. and Bron, M. and Meij, E. and de Rijke, M.}, Booktitle = {TAC 2012 Working Notes}, Date-Added = {2012-10-28 22:16:50 +0000}, Date-Modified = {2013-05-22 11:44:03 +0000}, Month = {November}, Title = {Context-Based Entity Linking -- {University of Amsterdam at TAC 2012}}, Year = {2012}}
-
R. Berendsen, E. Meij, D. Odijk, M. de Rijke, and W. Weerkamp, “The University of Amsterdam at TREC 2012,” in Trec 2012 working notes, 2012.
[Bibtex]@inproceedings{TREC:2012:wn, Author = {Berendsen, R. and Meij, E. and Odijk, D. and de Rijke, M. and Weerkamp, W.}, Booktitle = {TREC 2012 Working Notes}, Date-Added = {2012-10-28 22:15:47 +0000}, Date-Modified = {2013-05-22 11:43:55 +0000}, Month = {November}, Series = {TREC 2012}, Title = {{The University of Amsterdam at TREC} 2012}, Year = {2012}}
- K. Balog, M. de Rijke, and E. Meij, Statistical language modeling for information access, Springer, In preparation.
[Bibtex]@book{book:meij, Author = {Balog, Krisztian and de Rijke, Maarten and Meij, Edgar}, Date-Added = {2012-10-28 21:56:33 +0000}, Date-Modified = {2012-10-28 21:57:34 +0000}, Publisher = {Springer}, Title = {Statistical language modeling for information access}, Year = {In preparation}}
-
M-H. Peetz, E. Meij, and M. de Rijke, “OpenGeist: insight in the stream of page views on Wikipedia,” in Sigir 2012 workshop on time-aware information access, 2012.
[Bibtex]@inproceedings{SIGIR-WS:2012:Peetz, Author = {Peetz, M-H. and Meij, E. and de Rijke, M.}, Booktitle = {SIGIR 2012 Workshop on Time-aware Information Access}, Date-Added = {2012-10-28 16:35:47 +0000}, Date-Modified = {2012-10-31 10:48:46 +0000}, Title = {{OpenGeist}: Insight in the Stream of Page Views on {Wikipedia}}, Year = {2012}}
-
E. Amigó, A. Corujo, J. Gonzalo, E. Meij, and M. de Rijke, “Overview of RepLab 2012: evaluating online reputation management systems,” in Clef (online working notes/labs/workshop), 2012.
[Bibtex]@inproceedings{CLEF:2012:replab, Author = {Enrique Amig{\'o} and Adolfo Corujo and Julio Gonzalo and Edgar Meij and Maarten de Rijke}, Booktitle = {CLEF (Online Working Notes/Labs/Workshop)}, Date-Added = {2012-09-20 12:48:33 +0000}, Date-Modified = {2012-10-30 09:30:49 +0000}, Title = {Overview of {RepLab} 2012: Evaluating Online Reputation Management Systems}, Year = {2012}}
-
R. Berendsen, M. Tsagkias, M. de Rijke, and E. Meij, “Generating pseudo test collections for learning to rank scientific articles,” in Information access evaluation. multilinguality, multimodality, and visual analytics – third international conference of the clef initiative, clef 2012, 2012.
[Bibtex]@inproceedings{CLEF:2012:berendsen, Author = {Berendsen, Richard and Tsagkias, Manos and de Rijke, Maarten and Meij, Edgar}, Booktitle = {Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics - Third International Conference of the CLEF Initiative, CLEF 2012}, Date-Added = {2012-07-03 13:44:06 +0200}, Date-Modified = {2012-10-30 08:37:52 +0000}, Title = {Generating Pseudo Test Collections for Learning to Rank Scientific Articles}, Year = {2012}}
-
E. Meij and M. de Rijke, “Supervised query modeling using Wikipedia,” in Proceedings of the 33rd international acm sigir conference on research and development in information retrieval, 2010.
[Bibtex]@inproceedings{SIGIR:2010:meij, Author = {Meij, Edgar and de Rijke, Maarten}, Booktitle = {Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval}, Date-Added = {2012-05-03 22:16:10 +0200}, Date-Modified = {2012-10-30 08:40:21 +0000}, Series = {SIGIR 2010}, Title = {Supervised query modeling using {Wikipedia}}, Year = {2010}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1835449.1835660}}
-
D. Spina, E. Meij, A. Oghina, B. M. Thuong, M. Breuss, and M. de Rijke, “A corpus for entity profiling in microblog posts,” in Lrec 2012 workshop on language engineering for online reputation management, 2012.
[Bibtex]@inproceedings{LEROM:2012:spina, Author = {Damiano Spina and Edgar Meij and Andrei Oghina and Bui Minh Thuong and Mathias Breuss and Maarten de Rijke}, Booktitle = {LREC 2012 Workshop on Language Engineering for Online Reputation Management}, Date-Added = {2012-03-29 12:18:51 +0200}, Date-Modified = {2012-03-29 12:20:09 +0200}, Title = {A Corpus for Entity Profiling in Microblog Posts}, Year = {2012}}
- E. Meij, “Zoekmachines van de toekomst,” Informatie professional, vol. 11, p. 16–20, 2011.
[Bibtex]@article{IP:2011:meij, Author = {Meij, E.}, Date-Added = {2012-02-12 10:34:00 +0100}, Date-Modified = {2012-02-12 10:37:39 +0100}, Journal = {Informatie Professional}, Month = {November}, Pages = {16--20}, Title = {Zoekmachines van de toekomst}, Volume = {11}, Year = {2011}}
-
E. Meij, M. Bron, L. Hollink, B. Huurnink, and M. de Rijke, “Mapping queries to the Linking Open Data cloud: a case study using DBpedia,” Web semantics: science, services and agents on the world wide web, vol. 9, iss. 4, pp. 418-433, 2011.
[Bibtex]@article{JWS:2011:meij, Abstract = {We introduce the task of mapping search engine queries to DBpedia, a major linking hub in the Linking Open Data cloud. We propose and compare various methods for addressing this task, using a mixture of information retrieval and machine learning techniques. Specifically, we present a supervised machine learning-based method to determine which concepts are intended by a user issuing a query. The concepts are obtained from an ontology and may be used to provide contextual information, related concepts, or navigational suggestions to the user submitting the query. Our approach first ranks candidate concepts using a language modeling for information retrieval framework. We then extract query, concept, and search-history feature vectors for these concepts. Using manual annotations we inform a machine learning algorithm that learns how to select concepts from the candidates given an input query. Simply performing a lexical match between the queries and concepts is found to perform poorly and so does using retrieval alone, i.e., omitting the concept selection stage. Our proposed method significantly improves upon these baselines and we find that support vector machines are able to achieve the best performance out of the machine learning algorithms evaluated.}, Author = {Edgar Meij and Marc Bron and Laura Hollink and Bouke Huurnink and Maarten de Rijke}, Date-Added = {2011-11-25 08:45:19 +0100}, Date-Modified = {2012-10-28 21:59:08 +0000}, Doi = {10.1016/j.websem.2011.04.001}, Issn = {1570-8268}, Journal = {Web Semantics: Science, Services and Agents on the World Wide Web}, Keywords = {Information retrieval}, Number = {4}, Pages = {418 - 433}, Title = {Mapping queries to the {Linking Open Data} cloud: A case study using {DBpedia}}, Url = {http://www.sciencedirect.com/science/article/pii/S1570826811000187}, Volume = {9}, Year = {2011}, Bdsk-Url-1 = {http://www.sciencedirect.com/science/article/pii/S1570826811000187}, Bdsk-Url-2 = {http://dx.doi.org/10.1016/j.websem.2011.04.001}}
-
M. Peetz, E. Meij, M. de Rijke, and W. Weerkamp, “Adaptive temporal query modeling,” in Advances in information retrieval – 34th european conference on ir research, ecir 2012, 2012.
[Bibtex]@inproceedings{ECIR:2012:peetz, Author = {Peetz, Maria-Hendrike and Meij, Edgar and de Rijke, Maarten and Weerkamp, Wouter}, Booktitle = {Advances in Information Retrieval - 34th European Conference on IR Research, ECIR 2012}, Date-Added = {2011-11-23 18:10:40 +0100}, Date-Modified = {2012-10-28 23:01:12 +0000}, Title = {Adaptive Temporal Query Modeling}, Year = {2012}}
-
M. Bosma, E. Meij, and W. Weerkamp, “A framework for unsupervised spam detection in social networking sites,” in Advances in information retrieval – 34th european conference on ir research, ecir 2012, 2012.
[Bibtex]@inproceedings{ECIR:2012:bosma, Author = {Maarten Bosma and Meij, Edgar and Weerkamp, Wouter}, Booktitle = {Advances in Information Retrieval - 34th European Conference on IR Research, ECIR 2012}, Date-Added = {2011-11-23 18:10:33 +0100}, Date-Modified = {2012-10-28 23:00:37 +0000}, Title = {A Framework for Unsupervised Spam Detection in Social Networking Sites}, Year = {2012}}
-
R. Blanco, G. Ottaviano, and E. Meij, “Fast and space-efficient entity linking in queries,” in Proceedings of the eighth acm international conference on web search and data mining, 2015.
[Bibtex]@inproceedings{WSDM:2015:blanco, Author = {Blanco, Roi and Ottaviano, Giuseppe and Meij, Edgar}, Booktitle = {Proceedings of the eighth ACM international conference on Web search and data mining}, Date-Added = {2011-10-26 11:21:51 +0200}, Date-Modified = {2015-01-20 20:29:19 +0000}, Series = {WSDM 2015}, Title = {Fast and Space-Efficient Entity Linking in Queries}, Year = {2015}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1935826.1935842}}
-
M. Bron, E. Meij, M. Peetz, M. Tsagkias, and M. de Rijke, “Team COMMIT at TREC 2011,” in The twentieth text retrieval conference, 2012.
[Bibtex]@inproceedings{TREC:2011:commit, Author = {Bron, Marc and Meij, Edgar and Peetz, Maria-Hendrike and Tsagkias, Manos and de Rijke, Maarten}, Booktitle = {The Twentieth Text REtrieval Conference}, Date-Added = {2011-10-22 12:22:19 +0200}, Date-Modified = {2012-10-30 09:26:12 +0000}, Series = {TREC 2011}, Title = {Team {COMMIT} at {TREC 2011}}, Year = {2012}}
-
B. Huurnink, R. Berendsen, K. Hofmann, E. Meij, and M. de Rijke, “The University of Amsterdam at the TREC 2011 session track,” in The twentieth text retrieval conference, 2012.
[Bibtex]@inproceedings{TREC:2011:huurnink, Author = {Huurnink, Bouke and Berendsen, Richard and Hofmann, Katja and Meij, Edgar and de Rijke, Maarten}, Booktitle = {The Twentieth Text REtrieval Conference}, Date-Added = {2011-10-22 12:22:18 +0200}, Date-Modified = {2013-05-22 11:44:53 +0000}, Month = {January}, Series = {TREC 2011}, Title = {The {University of Amsterdam} at the {TREC} 2011 Session Track}, Year = {2012}}
-
M. Schuemie, D. Trieschnigg, and E. Meij, “DutchHatTrick: semantic query modeling, ConText, section detection, and match score maximization,” in The twentieth text retrieval conference, 2012.
[Bibtex]@inproceedings{TREC:2011:schuemie, Author = {Schuemie, M. and Trieschnigg, Dolf and Meij, Edgar}, Booktitle = {The Twentieth Text REtrieval Conference}, Date-Added = {2011-10-22 12:14:30 +0200}, Date-Modified = {2013-05-22 11:44:30 +0000}, Month = {January}, Series = {TREC 2011}, Title = {{DutchHatTrick:} Semantic query modeling, {ConText}, section detection, and match score maximization}, Year = {2012}}
-
M. Bron, J. He, K. Hofmann, E. Meij, M. de Rijke, E. Tsagkias, and W. Weerkamp, “The University of Amsterdam at TREC 2010: session, entity, and relevance feedback,” in The nineteenth text retrieval conference, 2011.
[Bibtex]@inproceedings{TREC:2011:bron, Abstract = {We describe the participation of the University of Amsterdam's Intelligent Systems Lab in the web track at TREC 2009. We participated in the adhoc and diversity task. We find that spam is an important issue in the ad hoc task and that Wikipedia-based heuristic optimization approaches help to boost the retrieval performance, which is assumed to potentially reduce spam in the top ranked results. As for the diversity task, we explored different methods. Clustering and a topic model-based approach have a similar performance and both are relatively better than a query log based approach.}, Author = {M. Bron and He, J. and Hofmann, K. and Meij, E. and de Rijke, M. and Tsagkias, E. and Weerkamp, W.}, Booktitle = {The Nineteenth Text REtrieval Conference}, Date-Added = {2011-10-20 11:18:35 +0200}, Date-Modified = {2012-10-30 09:25:06 +0000}, Series = {TREC 2010}, Title = {{The University of Amsterdam at TREC 2010}: Session, Entity, and Relevance Feedback}, Year = {2011}}
-
W. Weerkamp, R. Berendsen, B. Kovachev, E. Meij, K. Balog, and M. de Rijke, “People searching for people: analysis of a people search engine log,” in Proceedings of the 34th international acm sigir conference on research and development in information, 2011.
[Bibtex]@inproceedings{sigir:2011:weerkamp, Author = {Weerkamp, Wouter and Berendsen, Richard and Kovachev, Bogomil and Meij, Edgar and Balog, Krisztian and de Rijke, Maarten}, Booktitle = {Proceedings of the 34th international ACM SIGIR conference on Research and development in Information}, Date-Added = {2011-10-20 10:50:25 +0200}, Date-Modified = {2012-10-30 08:41:27 +0000}, Series = {SIGIR 2011}, Title = {People searching for people: analysis of a people search engine log}, Year = {2011}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/2009916.2009927}}
-
J. Bekkenkamp, E. Meij, and M. de Rijke, “Online religious studies,” in Web science 2011, Koblenz, 2011.
[Bibtex]@inproceedings{websci:2011:meij, Abstract = {Data transitions have revolutionized many scientific disciplines, starting with the exact sciences, then the life sciences, and now the social sciences and humanities are in the process of making the transition to becoming data intensive sciences, with descriptions through quantitative measurements. New analysis tools and publicly accessible utterances, opinions, transactions and interactions resulting from widespread internet and social media usage facilitate new, data-intensive research methods in disciplines that have so far relied on small-scale literature and/or panel-based studies. To illustrate the new possibilities, we report on a pilot carried out by a cross-disciplinary team consisting of computer scientists and researchers in religious studies. In the latter area, research is often focused on mapping out the convictions, hopes, and beliefs of groups of people, be it within certain religions or within any other group, such as those defined by a political party. In the pilot, religious scholars examined the core keywords in a left-wing political party in order to determine their hopes and beliefs. Rather than following their standard way-of- working, they were equipped with a search engine with an index of content crawled from discussion forums, the party‚{\"A}{\^o}s web site plus a range of online publications relating to the party and going back to 1990. In this paper we focus on lessons learned and on methodological innovations for religious scholars as well as for computer scientists building the enabling technology.}, Address = {Koblenz}, Author = {Bekkenkamp, J. and Meij, E. and de Rijke, M.}, Booktitle = {Web Science 2011}, Date-Added = {2011-10-20 10:49:41 +0200}, Date-Modified = {2012-10-30 08:39:02 +0000}, Title = {Online Religious Studies}, Year = {2011}}
-
R. Berendsen, B. Kovachev, E. Meij, M. de Rijke, and W. Weerkamp, “Classifying queries submitted to a vertical search engine,” in Web science 2011, Koblenz, 2011.
[Bibtex]@inproceedings{websci:2011:berendsen, Address = {Koblenz}, Author = {Berendsen, R. and Kovachev, B. and Meij, E. and de Rijke, M. and Weerkamp, W.}, Booktitle = {Web Science 2011}, Date-Added = {2011-10-20 10:49:24 +0200}, Date-Modified = {2012-10-30 08:39:05 +0000}, Title = {Classifying Queries Submitted to a Vertical Search Engine}, Year = {2011}}
-
C. Boscarino, K. Hofmann, V. B. Jijkoun, E. Meij, M. de Rijke, and W. Weerkamp, “Workshop report: dutch-belgian information retrieval,” Sigir forum, vol. 45, iss. 1, pp. 42-44, 2011.
[Bibtex]@article{forum:2011:dir, Author = {Boscarino, C. and Hofmann, K. and Jijkoun, V.B. and Meij, E. and de Rijke, M. and Weerkamp, W.}, Chapter = {42}, Date-Added = {2011-10-20 10:48:47 +0200}, Date-Modified = {2011-10-20 10:48:52 +0200}, Journal = {SIGIR Forum}, Number = {1}, Pages = {42-44}, Title = {Workshop report: Dutch-Belgian Information Retrieval}, Volume = {45}, Year = {2011}}
-
Dir 2011 – dutch-belgian information retrieval workshop, 2011.
[Bibtex]@proceedings{DIR:2011, Date-Added = {2011-10-20 10:47:15 +0200}, Date-Modified = {2012-10-28 22:47:50 +0000}, Editor = {Boscarino, C. and Hofmann, K. and Jijkoun, V.B. and Meij, E. and de Rijke, M. and Weerkamp, W.}, Title = {DIR 2011 - Dutch-Belgian Information Retrieval Workshop}, Year = {2011}}
-
J. He, E. Meij, and M. de Rijke, “Result diversification based on query-specific cluster ranking,” J. am. soc. inf. sci., vol. 62, iss. 3, p. 550–571, 2011.
[Bibtex]@article{JASIST:2011:he, Abstract = {Result diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking, in which diversification is restricted to documents belonging to clusters that potentially contain a high percentage of relevant documents. Empirical results show that the proposed framework improves the performance of several existing diversification methods. The framework also gives rise to a simple yet effective cluster-based approach to result diversification that selects documents from different clusters to be included in a ranked list in a round robin fashion. We describe a set of experiments aimed at thoroughly analyzing the behavior of the two main components of the proposed diversification framework, ranking and selecting clusters for diversification. Both components have a crucial impact on the overall performance of our framework, but ranking clusters plays a more important role than selecting clusters. We also examine properties that clusters should have in order for our diversification framework to be effective. Most relevant documents should be contained in a small number of high-quality clusters, while there should be no dominantly large clusters. Also, documents from these high-quality clusters should have a diverse content. These properties are strongly correlated with the overall performance of the proposed diversification framework.}, Address = {New York, NY, USA}, Author = {He, Jiyin and Meij, Edgar and de Rijke, Maarten}, Citeulike-Article-Id = {9425102}, Citeulike-Linkout-0 = {http://portal.acm.org/citation.cfm?id=1952338}, Citeulike-Linkout-1 = {http://dx.doi.org/10.1002/asi.21468}, Date-Added = {2011-10-20 10:40:50 +0200}, Date-Modified = {2012-10-28 21:59:28 +0000}, Doi = {10.1002/asi.21468}, Issn = {1532-2882}, Journal = {J. Am. Soc. Inf. Sci.}, Keywords = {todo}, Number = {3}, Pages = {550--571}, Posted-At = {2011-10-20 09:40:35}, Priority = {2}, Publisher = {Wiley Subscription Services, Inc., A Wiley Company}, Title = {Result diversification based on query-specific cluster ranking}, Url = {http://dx.doi.org/10.1002/asi.21468}, Volume = {62}, Year = {2011}, Bdsk-Url-1 = {http://dx.doi.org/10.1002/asi.21468}}
-
E. Meij, “Combining concepts and language models for information access,” PhD Thesis, 2010.
[Bibtex]@phdthesis{2010:meij, Author = {Meij, Edgar}, Date-Added = {2011-10-20 10:18:00 +0200}, Date-Modified = {2011-10-22 12:23:33 +0200}, School = {University of Amsterdam}, Title = {Combining Concepts and Language Models for Information Access}, Year = {2010}}
-
M. de Rijke, K. Balog, M. Bron, J. He, B. Huurnink, V. B. Jijkoun, F. Laan, E. Meij, E. Tsagkias, A. Vishneuski, and W. Weerkamp, “Archieven linken met semantische zoekmachines,” Dixit (tijdschrift over toegepaste taal- en spraaktechnologie), vol. 7, iss. 1, pp. 7-9, 2010.
[Bibtex]@article{DIXIT:2010:rijke, Author = {de Rijke, M. and Balog, K. and Bron, M. and He, J. and Huurnink, B. and Jijkoun, V.B. and Laan, F. and Meij, E. and Tsagkias, E. and Vishneuski, A. and Weerkamp, W.}, Date-Added = {2011-10-20 10:17:50 +0200}, Date-Modified = {2011-10-20 10:17:50 +0200}, Journal = {DIXIT (Tijdschrift over toegepaste taal- en spraaktechnologie)}, Number = {1}, Pages = {7-9}, Title = {Archieven Linken met Semantische Zoekmachines}, Volume = {7}, Year = {2010}}
-
K. Balog, E. Meij, and M. de Rijke, “Entity search: building bridges between two worlds,” in Proceedings of the 3rd international semantic search workshop, 2010.
[Bibtex]@inproceedings{semsearch:2010:balog, Author = {Balog, Krisztian and Meij, Edgar and de Rijke, Maarten}, Booktitle = {Proceedings of the 3rd International Semantic Search Workshop}, Date-Added = {2011-10-20 10:07:31 +0200}, Date-Modified = {2012-10-30 08:41:54 +0000}, Series = {SEMSEARCH 2010}, Title = {Entity search: building bridges between two worlds}, Year = {2010}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1863879.1863888}}
-
S. Koulouzis, E. Meij, and A. Belloum, “Enabling large data transfers between web services,” in 5th egee user forum, 2010.
[Bibtex]@inproceedings{EGEE:2010:koulouzis, Author = {Koulouzis, S. and Meij, E. and Belloum, A.}, Booktitle = {5th EGEE User Forum}, Date-Added = {2011-10-20 10:00:08 +0200}, Date-Modified = {2011-10-20 10:00:08 +0200}, Title = {Enabling Large Data Transfers Between Web Services}, Year = {2010}}
-
J. He, K. Balog, K. Hofmann, E. Meij, M. de Rijke, E. Tsagkias, and W. Weerkamp, “Heuristic ranking and diversification of web documents,” in The eighteenth text retrieval conference, 2010.
[Bibtex]@inproceedings{TREC:2010:he, Abstract = {We describe the participation of the University of Amsterdam's Intelligent Systems Lab in the web track at TREC 2009. We participated in the adhoc and diversity task. We find that spam is an important issue in the ad hoc task and that Wikipedia-based heuristic optimization approaches help to boost the retrieval performance, which is assumed to potentially reduce spam in the top ranked results. As for the diversity task, we explored different methods. Clustering and a topic model-based approach have a similar performance and both are relatively better than a query log based approach.}, Author = {He, J. and Balog, K. and Hofmann, K. and Meij, E. and de Rijke, M. and Tsagkias, E. and Weerkamp, W.}, Booktitle = {The Eighteenth Text REtrieval Conference}, Date-Added = {2011-10-20 09:45:15 +0200}, Date-Modified = {2012-10-30 09:24:20 +0000}, Series = {TREC 2009}, Title = {Heuristic Ranking and Diversification of Web Documents}, Year = {2010}}
-
E. Meij, J. He, W. Weerkamp, and M. de Rijke, “Topical diversity and relevance feedback,” in The eighteenth text retrieval conference, 2010.
[Bibtex]@inproceedings{TREC:2010:meij, Author = {Meij, E. and He, J. and Weerkamp, W. and de Rijke, M.}, Booktitle = {The Eighteenth Text REtrieval Conference}, Date-Added = {2011-10-20 09:43:29 +0200}, Date-Modified = {2012-10-30 09:24:33 +0000}, Series = {TREC 2009}, Title = {Topical Diversity and Relevance Feedback}, Year = {2010}}
- E. Meij, M. Bron, L. Hollink, B. Huurnink, and M. de Rijke, “Learning semantic query suggestions (abstract),” in Dir ’10, 2010.
[Bibtex]@inproceedings{DIR:2010:meij, Author = {Meij, E. and Bron, M. and Hollink, L. and Huurnink, B. and de Rijke, M.}, Booktitle = {DIR '10}, Date-Added = {2011-10-20 09:42:57 +0200}, Date-Modified = {2011-10-20 09:42:57 +0200}, Title = {Learning Semantic Query Suggestions (Abstract)}, Year = {2010}}
-
M. Roos, M. S. Marshall, A. P. Gibson, M. Schuemie, E. Meij, S. Katrenko, W. R. van Hage, K. Krommydas, and P. W. Adriaans, “Structuring and extracting knowledge for the support of hypothesis generation in molecular biology,” Bmc bioinformatics, vol. 10, iss. 10, 2009.
[Bibtex]@article{BMC:2009:roos, Author = {Roos, M. and Marshall, M.S. and Gibson, A.P. and Schuemie, M. and Meij, E. and Katrenko, S. and van Hage, W.R. and Krommydas, K. and Adriaans, P.W.}, Date-Added = {2011-10-19 12:09:11 +0200}, Date-Modified = {2011-10-19 12:09:11 +0200}, Journal = {BMC Bioinformatics}, Number = {10}, Title = {Structuring and extracting knowledge for the support of hypothesis generation in molecular biology}, Volume = {10}, Year = {2009}}
-
K. Hofmann, M. de Rijke, B. Huurnink, and E. Meij, “A semantic perspective on query log analysis,” in Working notes for the clef 2009 workshop, 2009.
[Bibtex]@inproceedings{CLEF:2009:hofmann, Abstract = {We present our views on the CLEF log file analysis task. We argue for a task definition that focuses on the semantic enrichment of query logs. In addition, we discuss how additional information about the context in which queries are being made could further our understanding of users' information seeking and how to better facilitate this process. }, Author = {Hofmann, K. and de Rijke, M. and Huurnink, B. and Meij, E.}, Booktitle = {Working Notes for the CLEF 2009 Workshop}, Date-Added = {2011-10-17 09:46:16 +0200}, Date-Modified = {2011-10-17 09:46:16 +0200}, Title = {A Semantic Perspective on Query Log Analysis}, Year = {2009}}
-
E. Meij, W. Weerkamp, J. He, and M. de Rijke, “Incorporating non-relevance information in the estimation of query models,” in The seventeenth text retrieval conference, 2009.
[Bibtex]@inproceedings{TREC:2009:meij, Abstract = {We describe the participation of the University of Amsterdam's ILPS group in the relevance feedback track at TREC 2008. We introduce a new model which incorporates information from relevant and non-relevant documents to improve the estimation of query models. Our main findings are twofold: (i) in terms of statMAP, a larger number of judged non-relevant documents improves retrieval effectiveness and (ii) on the TREC Terabyte topics, we can effectively replace the estimates on the judged non-relevant documents with estimations on the document collection.}, Author = {Meij, E. and Weerkamp, W. and He, J. and de Rijke, M.}, Booktitle = {The Seventeenth Text REtrieval Conference}, Date-Added = {2011-10-16 16:03:56 +0200}, Date-Modified = {2012-10-30 09:23:32 +0000}, Series = {TREC 2008}, Title = {Incorporating Non-Relevance Information in the Estimation of Query Models}, Year = {2009}}
-
M. S. Marshall, M. Roos, E. Meij, S. Katrenko, W. R. van Hage, and P. W. Adriaans, “De AIDA toolbox: een gecombineerde aanpak voor het beheren van kennis,” Agro informatica, vol. 21, iss. 4, p. 5–7, 2009.
[Bibtex]@article{AGRO:2009:marshall, Author = {Marshall, M.S. and Roos, M. and Meij, Edgar and Katrenko, S. and van Hage, W.R. and Adriaans, P.W.}, Date-Added = {2011-10-16 15:55:36 +0200}, Date-Modified = {2012-10-28 23:04:41 +0000}, Edition = {1}, Journal = {Agro Informatica}, Number = {4}, Pages = {5--7}, Title = {De {AIDA} toolbox: Een gecombineerde aanpak voor het beheren van kennis}, Volume = {21}, Year = {2009}}
-
M. S. Marshall, M. Roos, E. Meij, S. Katrenko, W. R. van Hage, and P. W. Adriaans, “Semantic disclosure in an e-science environment,” in Semantic e-science (springer annals of information systems aois), 2009.
[Bibtex]@inproceedings{AIS:2009:marshall, Author = {Marshall, M.S. and Roos, M. and Meij, E. and Katrenko, S. and van Hage, W.R. and Adriaans, P.W.}, Booktitle = {Semantic e-Science (Springer Annals of Information Systems AoIS)}, Date-Added = {2011-10-16 15:03:17 +0200}, Date-Modified = {2012-10-28 17:21:26 +0000}, Publisher = {Springer}, Series = {Annals of Information Systems}, Title = {Semantic disclosure in an e-Science environment}, Volume = {11}, Year = {2009}}
- E. Meij and M. de Rijke, “Wij-woorden op websites: zoekmachines voor geesteswetenschappers,” in Onszelf voorbij. over de grenzen van verbondenheid, 2011.
[Bibtex]@inproceedings{chapter:2011:meij, Author = {Meij, E. and de Rijke, M.}, Booktitle = {Onszelf voorbij. Over de grenzen van verbondenheid}, Date-Added = {2011-10-16 12:56:05 +0200}, Date-Modified = {2012-10-28 17:22:31 +0000}, Editor = {Joris Verheijen en Jonneke Bekkenkamp (red.)}, Isbn = {9789079578 276}, Publisher = {Parthenon}, Title = {Wij-woorden op websites: Zoekmachines voor geesteswetenschappers}, Year = {2011}}
-
W. Weerkamp, K. Balog, and E. Meij, “A generative language modeling approach for ranking entities,” in Advances in focused retrieval, 2009.
[Bibtex]@inproceedings{INEX:2008:weerkamp, Abstract = {We describe our participation in the INEX 2008 Entity Ranking track. We develop a generative language modeling approach for the entity ranking and list completion tasks. Our framework comprises the following components: (i) entity and (ii) query language models, (iii) entity prior, (iv) the probability of an entity for a given category, and (v) the probability of an entity given another entity. We explore various ways of estimating these components, and report on our results. We find that improving the estimation of these components has very positive effects on performance, yet, there is room for further improvements.}, Author = {Weerkamp, W. and Balog, K. and Meij, E.}, Booktitle = {Advances in Focused Retrieval}, Date-Added = {2011-10-16 12:29:08 +0200}, Date-Modified = {2011-10-16 12:29:08 +0200}, Organization = {Springer}, Publisher = {Springer}, Title = {A Generative Language Modeling Approach for Ranking Entities}, Year = {2009}}
- M. Roos, S. M. Marshall, P. T. de Boer, K. van den Berg, S. Katrenko, E. Meij, W. R. van Hage, and P. W. Adriaans, “Biological applications of AIDA knowledge management components,” in Ismb ’08, 2008.
[Bibtex]@inproceedings{ISMB:2008:roos, Author = {Marco Roos and M. Scott Marshall and Piter T. de Boer and Kasper van den Berg and Sophia Katrenko and Edgar Meij and Willem R. van Hage and Pieter W. Adriaans}, Booktitle = {ISMB '08}, Date-Added = {2011-10-16 10:45:35 +0200}, Date-Modified = {2012-10-28 23:04:46 +0000}, Title = {Biological applications of {AIDA} knowledge management components}, Year = {2008}}
-
W. Weerkamp, J. He, K. Balog, and E. Meij, “The University of Amsterdam (ILPS) at INEX 2008,” in Inex 2008 workshop pre-proceedings, Dagstuhl, 2008.
[Bibtex]@inproceedings{INEX-WS:2008:weerkamp, Abstract = {We describe our participation in the INEX 2008 Entity Ranking and Link-the-Wiki tracks. We provide a detailed account of the ideas underlying our approaches to these tasks. For the Link-the-Wiki track, we also report on the results and findings so far.}, Address = {Dagstuhl}, Author = {Weerkamp, W. and He, J. and Balog, K. and Meij, E.}, Booktitle = {INEX 2008 Workshop Pre-Proceedings}, Date-Added = {2011-10-16 10:36:58 +0200}, Date-Modified = {2012-10-28 17:30:53 +0000}, Title = {{The University of Amsterdam (ILPS) at INEX 2008}}, Year = {2008}}
-
K. Balog, E. Meij, W. Weerkamp, J. He, and M. de Rijke, “The University of Amsterdam at TREC 2008: Blog, Enterprise, and Relevance Feedback,” in Trec 2008 working notes, 2008.
[Bibtex]@inproceedings{TREC-WN:2008:balog, Abstract = {We describe the participation of the University of Amsterdam's ILPS group in the blog, enterprise and relevance feedback track at TREC 2008. Our main preliminary conclusions are that estimating mixture weights for external expansion in blog post retrieval is non-trivial and we need more analysis to find out why it works better for blog distillation than for blog post retrieval. For the relevance feedback track we observe two things: (i) in terms of statMAP, a larger number of judged non-relevant documents improves retrieval effectiveness and (ii) on the TREC Terabyte topics, we can effectively replace the estimates on the judged non-relevant documents with estimations on the document collection. Finally, since the enterprise track did not have any results yet, we only described our participation and do not draw any conclusions.}, Author = {Balog, K. and Meij, E. and Weerkamp, W. and He, J. and de Rijke, M.}, Booktitle = {TREC 2008 Working Notes}, Date-Added = {2011-10-16 10:36:44 +0200}, Date-Modified = {2012-10-28 22:01:18 +0000}, Title = {{The University of Amsterdam at TREC 2008: Blog, Enterprise, and Relevance Feedback}}, Year = {2008}}
-
S. Koulouzis, E. Meij, M. S. Marshall, and A. Belloum, “Enabling data transport between web services through alternative protocols and streaming,” in 4th ieee international conference on e-science, 2008.
[Bibtex]@inproceedings{IEEE:2008:koulouzis, Author = {Koulouzis, S. and Meij, E. and Marshall, M.S. and Belloum, A.}, Booktitle = {4th IEEE International Conference on e-Science}, Date-Added = {2011-10-16 10:35:31 +0200}, Date-Modified = {2011-10-16 10:35:31 +0200}, Title = {Enabling Data Transport between Web Services through alternative protocols and Streaming}, Year = {2008}}
-
E. Meij and S. Katrenko, “Bootstrapping language associated with biomedical entities,” in The sixteenth text retrieval conference, 2008.
[Bibtex]@inproceedings{TREC:2008:meij, Author = {Meij, E. and Katrenko, S.}, Booktitle = {The Sixteenth Text REtrieval Conference}, Date-Added = {2011-10-16 10:24:41 +0200}, Date-Modified = {2012-10-30 09:23:12 +0000}, Series = {TREC 2007}, Title = {Bootstrapping Language Associated with Biomedical Entities}, Year = {2008}}
-
E. Meij and M. de Rijke, “Using prior information derived from citations in literature search,” in Riao 2007, 2007.
[Bibtex]@inproceedings{RIAO:2007:Meij, Author = {Meij, E. and de Rijke, M.}, Booktitle = {RIAO 2007}, Date-Added = {2011-10-13 09:05:34 +0200}, Date-Modified = {2012-10-30 08:49:59 +0000}, Title = {Using Prior Information Derived from Citations in Literature Search}, Year = {2007}}
-
M. Roos, S. Katrenko, W. R. van Hage, E. Meij, M. S. Marshall, and P. W. Adriaans, “My first bioaid: heuristic support for hypothesis construction,” in Ismb-eccb’07, 2007.
[Bibtex]@inproceedings{ISMB:2007:Roos, Author = {Roos, M. and Katrenko, S. and van Hage, W.R. and Meij, E. and Marshall, M.S. and Adriaans, P.W.}, Booktitle = {ISMB-ECCB'07}, Date-Added = {2011-10-13 08:56:20 +0200}, Date-Modified = {2011-10-13 08:56:20 +0200}, Title = {My first BioAID: heuristic support for hypothesis construction}, Year = {2007}}
-
K. Balog, E. Meij, and M. de Rijke, “The University of Amsterdam at the TREC 2006 Enterprise Track,” in The fifteenth text retrieval conference, 2007.
[Bibtex]@inproceedings{TREC:2006:balog, Author = {Balog, K. and Meij, E. and de Rijke, M.}, Booktitle = {The Fifteenth Text REtrieval Conference}, Date-Added = {2011-10-12 23:33:06 +0200}, Date-Modified = {2012-10-30 09:23:12 +0000}, Series = {TREC 2006}, Title = {{The University of Amsterdam at the TREC 2006 Enterprise Track}}, Year = {2007}}
-
E. Meij, M. Jansen, and M. de Rijke, “Expanding queries using multiple resources (the AID group at TREC 2006: genomics track),” in The fifteenth text retrieval conference, 2007.
[Bibtex]@inproceedings{TREC:2006:meij, Author = {Meij, E. and Jansen, M. and de Rijke, M.}, Booktitle = {The Fifteenth Text REtrieval Conference}, Date-Added = {2011-10-12 23:24:14 +0200}, Date-Modified = {2012-10-30 09:23:12 +0000}, Series = {TREC 2006}, Title = {Expanding Queries Using Multiple Resources (The {AID} Group at {TREC} 2006: Genomics Track)}, Year = {2007}}
-
E. Meij, L. H. L. IJzereef, L. A. Azzopardi, J. Kamps, M. de Rijke, M. Voorhees, and L. P. Buckland, “Combining thesauri-based methods for biomedical retrieval,” in The fourteenth text retrieval conference, 2006.
[Bibtex]@inproceedings{TREC:2005:meij, Author = {Meij, E. and IJzereef, L.H.L. and Azzopardi, L.A. and Kamps, J. and de Rijke, M. and Voorhees, M. and Buckland, L.P.}, Booktitle = {The Fourteenth Text REtrieval Conference}, Date-Added = {2011-10-12 23:16:44 +0200}, Date-Modified = {2012-10-30 09:23:12 +0000}, Series = {TREC 2005}, Title = {Combining Thesauri-based Methods for Biomedical Retrieval}, Year = {2006}}
-
E. Meij and M. de Rijke, “Deploying lucene on the grid,” in Proceedings sigir 2006 workshop on open source information retrieval (osir2006), 2006.
[Bibtex]@inproceedings{OSIR:2005:meij, Author = {Meij, E. and de Rijke, M.}, Booktitle = {Proceedings SIGIR 2006 workshop on Open Source Information Retrieval (OSIR2006)}, Date-Added = {2011-10-12 23:08:51 +0200}, Date-Modified = {2011-10-12 23:08:51 +0200}, Title = {Deploying Lucene on the Grid}, Year = {2006}}
-
E. Meij, “Van case-based reasoning tot information retrieval; case retrieval voor de helpdesk van een webhosting bedrijf.,” Master Thesis, 2005.
[Bibtex]@mastersthesis{2005:meij, Abstract = {The helpdesk department of Hostnet, a web hosting company, daily receives 35 up to 50 questions from its customers. Within the domain in which Hostnet operates, only few off-the-shelf manuals exist and this is particularly noticeable on the helpdesk. Currently, only a few possibilities for knowledge management and/or elicitation exist within the organization. Questions are answered and problems are solved mostly by relying on the expertise of the staff. They therefore need to have up-to-date knowledge of a variety of possible questions, problem situations and solutions. They also need to be creative and flexible when handling novel questions. Hostnet uses a ticketing system to handle questions from their customers. One of many advantages of using such a system is that all questions are stored, along with their corresponding answers. Hostnet uses the system for some time now and it has thus collected a large amount of domain and organization specific knowledge. This kind of information is exactly the type on which the research area of case-based reasoning focuses. Case-based reasoning uses previously solved problems (cases) as a knowledge source to aid solving similar cases in the future. One of the main components, in any case-based reasoning system, is the retrieval module. This module searches for alike cases, given a new case and a similarity measure. Techniques from the area of Information Retrieval may be used to assist in finding these alike questions, for example by implementing vector-space based, statistical methods. This research focuses on analyzing to what extent previously solved cases can serve as a basis for a statistical information retrieval module of a case-based reasoning system within Hostnet by measuring the effects of different information retrieval techniques on the results. The evaluated techniques are stemming, term weighting and combinations thereof. The above described organizational setting is not unique to Hostnet. Every service-providing company with direct customer contacts is probably familiar with the described situation and could benefit from the presented results. The suggested approach yields adequate results by which, at best, 60% of new questions can be answered, based on the first 10 retrieved stored questions. The mean reciprocal rank of the first matching question provided room for improvement however, with a value of 7 out of 10. The most important conclusion is that the best results are achieved when applying none of the before mentioned information retrieval techniques. The suggested approach needs to be improved for a successful integration within a case-based reasoning system, but it does seem viable.}, Author = {Edgar Meij}, Date-Added = {2011-10-12 21:53:59 +0200}, Date-Modified = {2011-10-12 21:55:28 +0200}, School = {University of Amsterdam}, Title = {Van Case-Based Reasoning tot Information Retrieval; Case retrieval voor de helpdesk van een webhosting bedrijf.}, Year = {2005}}
-
E. Meij and M. de Rijke, “Integrating Conceptual Knowledge into Relevance Models: A Model and Estimation Method,” in Proceedings of the 1st international conference on theory of information retrieval, 2007.
[Bibtex]@inproceedings{ICTIR:2007:meij, Author = {E. Meij and de Rijke, M.}, Booktitle = {Proceedings of the 1st International Conference on Theory of Information Retrieval}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:50:30 +0000}, Series = {ICTIR 2007}, Title = {{Integrating Conceptual Knowledge into Relevance Models: A Model and Estimation Method}}, Year = {2007}}
-
E. Meij and M. de Rijke, “Thesaurus-based feedback to support mixed search and browsing environments,” in Research and advanced technology for digital libraries, 11th european conference, ecdl 2007, 2007.
[Bibtex]@inproceedings{ECDL:2007:meij, Author = {Edgar Meij and Maarten de Rijke}, Booktitle = {Research and Advanced Technology for Digital Libraries, 11th European Conference, ECDL 2007}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-28 23:04:22 +0000}, Title = {Thesaurus-Based Feedback to Support Mixed Search and Browsing Environments}, Year = {2007}}
- D. R. Recupero, “A new unsupervised method for document clustering by using wordnet lexical and conceptual relations,” Information retrieval, vol. 10, iss. 6, p. 563–579, 2007.
[Bibtex]@article{IR:2007:Recupero, Author = {Recupero, Diego R.}, Citeulike-Article-Id = {2414184}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2011-10-12 18:31:55 +0200}, Journal = {Information Retrieval}, Keywords = {retrieval\_model, semantic\_similarity, wordnet}, Number = {6}, Owner = {emeij}, Pages = {563--579}, Priority = {2}, Timestamp = {2008.02.22}, Title = {A new unsupervised method for document clustering by using WordNet lexical and conceptual relations}, Volume = {10}, Year = {2007}}
-
D. Trieschnigg, E. Meij, M. de Rijke, and W. Kraaij, “Measuring concept relatedness using language models,” in Proceedings of the 31st annual international acm sigir conference on research and development in information retrieval, 2008.
[Bibtex]@inproceedings{SIGIR:2008:trieschnigg, Author = {Trieschnigg, Dolf and Meij, Edgar and de Rijke, Maarten and Kraaij, Wessel}, Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:45:51 +0000}, Series = {SIGIR 2008}, Title = {Measuring concept relatedness using language models}, Year = {2008}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390523}}
-
E. Meij, “Towards a combined model for search and navigation of annotated documents,” in Proceedings of the 31st annual international acm sigir conference on research and development in information retrieval, 2008.
[Bibtex]@inproceedings{SIGIR:2008:meij-doctcons, Abstract = {Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.}, Author = {Meij, Edgar}, Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:48:04 +0000}, Series = {SIGIR 2008}, Title = {Towards a combined model for search and navigation of annotated documents}, Year = {2008}, Bdsk-Url-1 = {http://dx.doi.org/10.1145/1390334.1390573}}
-
E. Meij and M. de Rijke, “The University of Amsterdam at the CLEF 2008 Domain Specific Track – parsimonious relevance and concept models,” in Working notes for the clef 2008 workshop, 2008.
[Bibtex]@inproceedings{CLEF-WN:2008:meij, Author = {Edgar Meij and Maarten de Rijke}, Booktitle = {Working Notes for the CLEF 2008 Workshop}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 09:28:58 +0000}, Title = {The {U}niversity of {A}msterdam at the {CLEF} 2008 {Domain Specific Track} - Parsimonious Relevance and Concept Models}, Year = {2008}}
-
E. Meij, W. Weerkamp, K. Balog, and M. de Rijke, “Parsimonious relevance models,” in Proceedings of the 31st annual international acm sigir conference on research and development in information retrieval, 2008.
[Bibtex]@inproceedings{SIGIR:2008:Meij-prm, Author = {Meij, Edgar and Weerkamp, Wouter and Balog, Krisztian and de Rijke, Maarten}, Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:47:44 +0000}, Series = {SIGIR 2008}, Title = {Parsimonious relevance models}, Year = {2008}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390520}}
-
E. Meij, D. Trieschnigg, M. de Rijke, and W. Kraaij, “Parsimonious concept modeling,” in Proceedings of the 31st annual international acm sigir conference on research and development in information retrieval, 2008.
[Bibtex]@inproceedings{SIGIR:2008:Meij-cm, Author = {Meij, Edgar and Trieschnigg, Dolf and de Rijke, Maarten and Kraaij, Wessel}, Booktitle = {Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:46:38 +0000}, Series = {SIGIR 2008}, Title = {Parsimonious concept modeling}, Year = {2008}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1390334.1390519}}
-
E. Meij, W. Weerkamp, and M. de Rijke, “A query model based on normalized log-likelihood,” in Proceedings of the 18th acm conference on information and knowledge management, 2009.
[Bibtex]@inproceedings{CIKM:2009:Meij, Author = {Meij, Edgar and Weerkamp, Wouter and de Rijke, Maarten}, Booktitle = {Proceedings of the 18th ACM conference on Information and knowledge management}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:42:51 +0000}, Series = {CIKM 2009}, Title = {A query model based on normalized log-likelihood}, Year = {2009}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1645953.1646261}}
-
E. Meij, P. Mika, and H. Zaragoza, “Investigating the demand side of semantic search through query log analysis,” in Proceedings of the workshop on semantic search (semsearch 2009) at the 18th international world wide web conference (www 2009), 2009.
[Bibtex]@inproceedings{semsearch:2009:meij, Author = {Edgar Meij and P. Mika and H. Zaragoza}, Booktitle = {Proceedings of the Workshop on Semantic Search (SemSearch 2009) at the 18th International World Wide Web Conference (WWW 2009)}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:43:47 +0000}, Title = {Investigating the Demand Side of Semantic Search through Query Log Analysis}, Year = {2009}}
-
K. Hofmann, M. Tsagkias, E. Meij, and M. de Rijke, “The impact of document structure on keyphrase extraction,” in Proceedings of the 18th acm conference on information and knowledge management, 2009.
[Bibtex]@inproceedings{CIKM:2009:hofmann, Author = {Hofmann, Katja and Tsagkias, Manos and Meij, Edgar and de Rijke, Maarten}, Booktitle = {Proceedings of the 18th ACM conference on Information and knowledge management}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:42:45 +0000}, Series = {CIKM 2009}, Title = {The impact of document structure on keyphrase extraction}, Year = {2009}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1645953.1646215}}
-
P. Mika, E. Meij, and H. Zaragoza, “Investigating the semantic gap through query log analysis.,” in Proceedings of the 8th international semantic web conference, 2009.
[Bibtex]@inproceedings{ISWC:2009:mika, Author = {Peter Mika and Edgar Meij and Hugo Zaragoza}, Booktitle = {Proceedings of the 8th International Semantic Web Conference}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:45:11 +0000}, Series = {ISWC 2009}, Title = {Investigating the Semantic Gap through Query Log Analysis.}, Year = {2009}, Bdsk-Url-1 = {http://dblp.uni-trier.de/db/conf/semweb/iswc2009.html#MikaMZ09}}
-
E. Meij, P. Mika, and H. Zaragoza, “An evaluation of entity and frequency based query completion methods,” in Proceedings of the 32nd international acm sigir conference on research and development in information retrieval, 2009.
[Bibtex]@inproceedings{SIGIR:2009:meij, Author = {Meij, Edgar and Mika, Peter and Zaragoza, Hugo}, Booktitle = {Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:43:25 +0000}, Series = {SIGIR 2009}, Title = {An evaluation of entity and frequency based query completion methods}, Year = {2009}, Bdsk-Url-1 = {http://doi.acm.org/10.1145/1571941.1572074}}
-
E. Meij and M. de Rijke, “Concept models for domain-specific search,” in Evaluating systems for multilingual and multimodal information access, 9th workshop of the cross-language evaluation forum, clef 2008, aarhus, denmark, september 17-19, 2008, revised selected papers, 2009.
[Bibtex]@inproceedings{CLEF:2008:meij, Author = {Meij, Edgar and de Rijke, Maarten}, Booktitle = {Evaluating Systems for Multilingual and Multimodal Information Access, 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008, Aarhus, Denmark, September 17-19, 2008, Revised Selected Papers}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:44:35 +0000}, Title = {Concept models for domain-specific search}, Year = {2009}}
-
E. Meij, M. Bron, B. Huurnink, L. Hollink, and M. de Rijke, “Learning semantic query suggestions,” in Proceedings of the 8th international conference on the semantic web, 2009.
[Bibtex]@inproceedings{ISWC:2009:Meij, Abstract = {Learning Semantic Query Suggestions by Edgar Meij, Marc Bron, Laura Hollink, Bouke Huurnink and Maarten de Rijke is available online now. An important application of semantic web technology is recognizing human-defined concepts in text. Query transformation is a strategy often used in search engines to derive queries that are able to return more useful search results than the original query and most popular search engines provide facilities that let users complete, specify, or reformulate their queries. We study the problem of semantic query suggestion, a special type of query transformation based on identifying semantic concepts contained in user queries. We use a feature-based approach in conjunction with supervised machine learning, augmenting term-based features with search history-based and concept-specific features. We apply our method to the task of linking queries from real-world query logs (the transaction logs of the Netherlands Institute for Sound and Vision) to the DBpedia knowledge base. We evaluate the utility of different machine learning algorithms, features, and feature types in identifying semantic concepts using a manually developed test bed and show significant improvements over an already high baseline. The resources developed for this paper, i.e., queries, human assessments, and extracted features, are available for download. }, Author = {E. Meij and M. Bron and B. Huurnink and Hollink, L. and de Rijke, M.}, Booktitle = {Proceedings of the 8th International Conference on The Semantic Web}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2012-10-30 08:45:04 +0000}, Series = {ISWC 2009}, Title = {Learning Semantic Query Suggestions}, Year = {2009}}
-
E. Meij, D. Trieschnigg, M. de Rijke, and W. Kraaij, “Conceptual language models for domain-specific retrieval,” Inf. process. manage., vol. 46, iss. 4, p. 448–469, 2010.
[Bibtex]@article{IPM:2010:Meij, Address = {Tarrytown, NY, USA}, Author = {Meij, Edgar and Trieschnigg, Dolf and de Rijke, Maarten and Kraaij, Wessel}, Date-Added = {2011-10-12 18:31:55 +0200}, Date-Modified = {2011-10-12 18:31:55 +0200}, Doi = {http://dx.doi.org/10.1016/j.ipm.2009.09.005}, Issn = {0306-4573}, Journal = {Inf. Process. Manage.}, Number = {4}, Pages = {448--469}, Publisher = {Pergamon Press, Inc.}, Title = {Conceptual language models for domain-specific retrieval}, Volume = {46}, Year = {2010}, Bdsk-Url-1 = {http://dx.doi.org/10.1016/j.ipm.2009.09.005}}
-
D. Spina, E. Meij, M. de Rijke, A. Oghina, B. M. Thuong, and M. Breuss, “Identifying entity aspects in microblog posts,” in The 35th international acm sigir conference on research and development in information retrieval, 2012.
[Bibtex]@inproceedings{SIGIR:2012:spina, Author = {Damiano Spina and Meij, Edgar and de Rijke, Maarten and Andrei Oghina and Bui Minh Thuong and Mathias Breuss}, Booktitle = {The 35th International ACM SIGIR conference on research and development in Information Retrieval}, Date-Added = {2012-05-03 22:17:17 +0200}, Date-Modified = {2012-10-30 08:40:47 +0000}, Series = {SIGIR 2012}, Title = {Identifying Entity Aspects in Microblog Posts}, Year = {2012}}
Leave a Reply