Recent years show an increas­ing inter­est in ver­ti­cal search: search­ing within a par­tic­u­lar type of infor­ma­tion. Under­stand­ing what peo­ple search for in these “ver­ti­cals” gives direc­tion to research and pro­vides point­ers for the search engines them­selves. In this paper we ana­lyze the search logs of one par­tic­u­lar ver­ti­cal: peo­ple search engines. Based on an exten­sive analy­sis of the logs of a search engine geared towards find­ing peo­ple, we pro­pose a clas­si­fi­ca­tion scheme for peo­ple search at three lev­els: (a) queries, (b) ses­sions, and © users. For queries, we iden­tify three types, (i) event-based high-profile queries (peo­ple that become “pop­u­lar” because of an event hap­pen­ing), (ii) reg­u­lar high-profile queries (celebri­ties), and (iii) low-profile queries (other, less-known peo­ple). We present exper­i­ments on auto­matic clas­si­fi­ca­tion of queries. On the ses­sion level, we observe five types: (i) fam­ily ses­sions (users look­ing for rel­a­tives), (ii) event ses­sions (query­ing the main play­ers of an event), (iii) spot­ting ses­sions (try­ing to “spot” dif­fer­ent celebri­ties online), (iv) poly­mer­ous ses­sions (ses­sions with­out a clear rela­tion between queries), and (v) repet­i­tive ses­sions (query refine­ment and copy­ing). Finally, for users we iden­tify four types: (i) mon­i­tors, (ii) spot­ters, (iii) fol­low­ers, and (iv) polymers.

Our find­ings not only offer insight into search behav­ior in peo­ple search engines, but they are also use­ful to iden­tify future research direc­tions and to pro­vide point­ers for search engine improvements.

  • [PDF] [DOI] W. Weerkamp, R. Berend­sen, B. Kovachev, E. Meij, K. Balog, and M. de Rijke, “Peo­ple search­ing for peo­ple: analy­sis of a peo­ple search engine log,” in Pro­ceed­ings of the 34th inter­na­tional ACM SIGIR con­fer­ence on Research and devel­op­ment in Infor­ma­tion, New York, NY, USA, 2011, pp. 45–54.
    [Bib­tex]
    @inproceedings{sigir:2011:weerkamp,
      Acmid = {2009927},
      Address = {New York, NY, USA},
      Author = {Weerkamp, Wouter and Berendsen, Richard and Kovachev, Bogomil and Meij, Edgar and Balog, Krisztian and de Rijke, Maarten},
      Booktitle = {Proceedings of the 34th international ACM SIGIR conference on Research and development in Information},
      Date-Added = {2011-10-20 10:50:25 +0200},
      Date-Modified = {2011-10-20 10:50:35 +0200},
      Doi = {http://doi.acm.org/10.1145/2009916.2009927},
      Isbn = {978-1-4503-0757-4},
      Keywords = {classification, people search, query log analysis},
      Location = {Beijing, China},
      Numpages = {10},
      Pages = {45--54},
      Publisher = {ACM},
      Series = {SIGIR '11},
      Title = {People searching for people: analysis of a people search engine log},
      Url = {http://doi.acm.org/10.1145/2009916.2009927},
      Year = {2011},
      Bdsk-Url-1 = {http://doi.acm.org/10.1145/2009916.2009927}}

ACM DL Author-ize servicePeo­ple search­ing for peo­ple: analy­sis of a peo­ple search engine log
Wouter Weerkamp, Richard Berend­sen, Bogomil Kovachev, Edgar Meij, Kriszt­ian Balog, Maarten de Rijke
SIGIR ’11 Pro­ceed­ings of the 34th inter­na­tional ACM SIGIR con­fer­ence on Research and devel­op­ment in Information, 2011