Not logged in.

Contribution Details

Type Conference or Workshop Paper
Scope Discipline-based scholarship
Published in Proceedings Yes
Title Fast similarity search in peer-to-peer networks
Organization Unit
Authors
  • Thomas Bocek
  • Ela Hunt
  • David Hausheer
  • Burkhard Stiller
Presentation Type paper
Item Subtype Original Work
Refereed Yes
Status Published in final form
Language
  • English
ISBN 978-1-4244-2065-0
ISSN 1542-1201
Page Range 240 - 247
Event Title NOMS 2008 - 2008 IEEE Network Operations and Management Symposium
Event Type conference
Event Location Salvador, Brazil
Event Start Date April 7 - 2008
Event End Date April 11 - 2008
Series Name IEEE/IFIP Network Operations and Management Symposium (NOMS)
Number 2008
Place of Publication Los Alamitos
Publisher Institute of Electrical and Electronics Engineers
Abstract Text Peer-to-peer (P2P) systems show numerous advantages over centralized systems, such as load balancing, scalability, and fault tolerance, and they require certain functionality, such as search, repair, and message and data transfer. In particular, structured P2P networks perform an exact search in logarithmic time proportional to the number of peers. However, keyword similarity search in a structured P2P network remains a challenge. Similarity search for service discovery can significantly improve service management in a distributed environment. As services are often described informally in text form, keyword similarity search can find the required services or data items more reliably. This paper presents a fast similarity search algorithm for structured P2P systems. The new algorithm, called P2P fast similarity search (P2PFastSS), finds similar keys in any distributed hash table (DHT) using the edit distance metric, and is independent of the underlying P2P routing algorithm. Performance analysis shows that P2PFastSS carries out a similarity search in time proportional to the logarithm of the number of peers. Simulations on PlanetLab confirm these results and show that a similarity search with 34,000 peers performs in less than three seconds on average. Thus, P2PFastSS is suitable for similarity search in large-scale network infrastructures, such as service description matching in service discovery or searching for similar terms in P2P storage networks.
Digital Object Identifier 10.1109/NOMS.2008.4575140
Other Identification Number merlin-id:294
PDF File Download from ZORA
Export BibTeX
EP3 XML (ZORA)
Additional Information This paper was presented at 11th IEEE/IFIP Network Operations and Management Symposium (NOMS 2008) in Salvador, Brazil, 7–11 april 2008. // © 2008 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.