Skip to main content

Topical and Structural Linkage in Wikipedia

  • Conference paper
Advances in Information Retrieval (ECIR 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6611))

Included in the following conference series:

Abstract

We explore statistical properties of links within Wikipedia. We demonstrate that a simple algorithm can predict many of the links that would normally be added to a new article, without considering the topic of the article itself. We then explore a variant of topic-oriented PageRank, which can effectively identify topical links within existing articles, when compared with manual judgments of their topical relevance. Based on these results, we suggest that linkages within Wikipedia arise from a combination of structural requirements and topical relationships.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Adafre, S.F., de Rijke, M.: Discovering missing links in Wikipedia. In: 3rd International Workshop on Link Discovery, Chicago, pp. 90–97 (2005)

    Google Scholar 

  2. Büttcher, S., Clarke, C.L.A., Cormack, G.V.: Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, Cambridge (2010)

    MATH  Google Scholar 

  3. Gardner, J.J., Xiong, L.: Automatic link detection: A sequence labeling approach. In: 18th CIKM, Hong Kong, pp. 1701–1704 (2009)

    Google Scholar 

  4. Huang, D.W.C., Xu, Y., Trotman, A., Geva, S.: Overview of INEX 2007 link the wiki track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 373–387. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  5. Huang, W.C., Geva, S., Trotman, A.: Overview of the INEX 2008 link the wiki track. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2008. LNCS, vol. 5631, pp. 314–325. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  6. Itakura, K.Y., Clarke, C.L.A.: University of waterloo at INEX2007: Adhoc and link-the-wiki tracks. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 417–425. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  7. Itakura, K.Y., Clarke, C.L.A.: University of waterloo at INEX 2009: Ad hoc, book, entity ranking, and link-the-wiki tracks. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 331–341. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  8. Mihalcea, R., Csomai, A.: Wikify!: Linking documents to encyclopedic knowledge. In: 16th CIKM, Lisbon, pp. 233–242 (2007)

    Google Scholar 

  9. Milne, D., Witten, I.H.: Learning to link with Wikipedia. In: 17th CIKM, pp. 509–518. Napa Valley, California (2008)

    Google Scholar 

  10. Voorhees, E.M.: Variations in relevance judgments and the measurement of retrieval effectiveness. In: 21st SIGIR, pp. 315–323 (1998)

    Google Scholar 

  11. Zhang, J., Kamps, J.: Link detection in XML documents: What about repeated links. In: SIGIR 2008 Workshop on Focused Retrieval, pp. 59–66 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Itakura, K.Y., Clarke, C.L.A., Geva, S., Trotman, A., Huang, W.C. (2011). Topical and Structural Linkage in Wikipedia. In: Clough, P., et al. Advances in Information Retrieval. ECIR 2011. Lecture Notes in Computer Science, vol 6611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20161-5_45

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20161-5_45

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20160-8

  • Online ISBN: 978-3-642-20161-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics