ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Computer Networks
Volume 51, Issue 8, 6 June 2007, Pages 1861-1881
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (552 K)

  E-mail Article   
  Add to my Quick Links   
Bookmark and share in 2collab (opens in new window)
Request permission to reuse this article
  Cited By in Scopus (0)
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.comnet.2006.09.014    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2006 Elsevier B.V. All rights reserved.

Associative search in peer to peer networks: Harnessing latent semanticsstar, open

Edith Cohena, E-mail The Corresponding Author, Amos Fiatb, E-mail The Corresponding Author and Haim Kaplanb, Corresponding Author Contact Information, 1, E-mail The Corresponding Author

aAT&T research labs, 180 Park Avenue, Florham Park, NJ, USA bSchool of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel

Received 13 December 2005; 
revised 20 July 2006; 
accepted 20 September 2006. 
Responsible Editor: C. Westphal. 
Available online 3 November 2006.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

The success of a P2P file-sharing network highly depends on the scalability and versatility of its search mechanism. Two particularly desirable search features are scope (ability to find infrequent items) and support for partial-match queries (queries that contain typos or include a subset of keywords). While centralized-index architectures (such as Napster) can support both these features, existing decentralized architectures seem to support at most one: prevailing unstructured P2P protocols (such as Gnutella and FastTrack) deploy a “blind” search mechanism where the set of peers probed is unrelated to the query; thus they support partial-match queries but have limited scope. On the other extreme, the recently-proposed distributed hash tables (DHTs) such as CAN and CHORD, couple index location with the item’s hash value, and thus have good scope but can not effectively support partial-match queries. Another hurdle to DHTs deployment is their tight control of the overlay structure and the information (part of the index) each peer maintains, which makes them more sensitive to failures and frequent joins and disconnects.

We develop a new class of decentralized P2P architectures. Our design is based on unstructured architectures such as Gnutella and FastTrack, and retains many of their appealing properties including support for partial match queries, and relative resilience to peer failures. Yet, we obtain orders of magnitude improvement in the efficiency of locating rare items. Our approach exploits associations inherent in human selections to steer the search process to peers that are more likely to have an answer to the query. We demonstrate the potential of associative search using models, analysis, and simulations.

Keywords: Networks; Search; Information retrieval; Peer to peer; Latent semantic; Rare items; Algorithms

Article Outline

1. Introduction
2. Related work
3. Model and evaluation methodology
3.1. Blind search as random search
4. Associative search
4.1. Rapier strategy
4.2. Rapier versus Prand
4.3. Rapier is no worse than Prand
4.4. The itemsets model
5. Greedy: optimizing the search strategy
5.1. The Greedy search sequence
5.2. Theoretical background
5.3. Estimating M and q
6. Data and simulation results
7. Relation to classic search techniques
8. Conclusion and future work
References
Vitae









Computer Networks
Volume 51, Issue 8, 6 June 2007, Pages 1861-1881
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.