Skip to main content
Erschienen in: Peer-to-Peer Networking and Applications 2/2009

01.06.2009

XCube: Processing XPath queries in a hypercube overlay network

verfasst von: Yingguang Li, M. Tamer Özsu, Kian-Lee Tan

Erschienen in: Peer-to-Peer Networking and Applications | Ausgabe 2/2009

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we present the design and performance of XCube, a tag-based system for managing XML data in a hypercube overlay network. In XCube, each node in a d-dimensional hypercube is identified by a d-bit vector. A peer manages a smaller hypercube with dimension d′ < d. An XML document is compactly represented as a structure summary and a content summary. The structure summary comprises a d-bit vector derived from the distinct tag names in the document and a synopsis capturing the structure of the document. The content summary consists of a bit map that summarizes the document content. The metadata of a document, i.e., owner IP, document identifier, structure summary and content summary, is indexed at its anchor peer (the peer that manages the node with matching bit vector). In addition, the structure summary is further indexed at all peers that manages nodes whose bit vectors are covered by the document’s bit vector. An XPath query is processed in four phases. In phase 1, the query is routed to its anchor peer according to the bit vector of the query. In phase 2, the query is evaluated against all the synopses stored in its anchor peer and forwarded to the anchor peers of the matching synopses. In phase 3, the anchor peer of each related synopsis examines the query on the related bit maps and forwards the query to the related owner peers. Finally in phase 4, the owner peers evaluate the query on the XML documents and return answers to the querying peer. We also present a scheme that dynamically partitions the hypercube to balance the load across peers. We further exploit the partition history to remove redundant messages. We conduct a comprehensive experimental study and the results show the efficiency of XCube.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
A structure/path query can be mapped into a tag-based query by ignoring the structure.
 
2
We will use the terms document structure, synopsis, and XML tree interchangeably in the paper.
 
6
If a document can be summarized with multiple bit maps, the bit maps can be built with more bits, and thus they are more accurate.
 
7
The number of edges in a tree is (T − 1), where T is the number of nodes in the tree. Therefore, the number of 1-bits in the bit vector is bounded by 2N.
 
Literatur
1.
Zurück zum Zitat Aberer K (2001) P-Grid: a self-organizing access structure for P2P information systems. In: Proceedings of the 6th CoopIS conference, pp 179–194 Aberer K (2001) P-Grid: a self-organizing access structure for P2P information systems. In: Proceedings of the 6th CoopIS conference, pp 179–194
2.
Zurück zum Zitat Abiteboul S, Manolescu I, Preda N (2004) Constructing and querying a peer-to-peer warehouse of XML resources. In: Semantic web and databases workshop, pp 219–225 Abiteboul S, Manolescu I, Preda N (2004) Constructing and querying a peer-to-peer warehouse of XML resources. In: Semantic web and databases workshop, pp 219–225
3.
Zurück zum Zitat Bonifati A, Matrangolo U, Cuzzocrea A, Jain M (2004) XPath lookup queries in P2P networks. In: Proceedings of WIDM’04, pp 48–55 Bonifati A, Matrangolo U, Cuzzocrea A, Jain M (2004) XPath lookup queries in P2P networks. In: Proceedings of WIDM’04, pp 48–55
4.
Zurück zum Zitat Crespo A, Garcia-Molina H (2002) Routing indices for peer-to-peer systems. In: Proceedings of ICDCS’02, p 23, July Crespo A, Garcia-Molina H (2002) Routing indices for peer-to-peer systems. In: Proceedings of ICDCS’02, p 23, July
5.
Zurück zum Zitat Galanis L, Wang Y, Jeffery S, DeWitt D (2003) Locating data sources in large distributed systems. In: Proceedings of VLDB’03. Berlin, Germany, pp 874–885 Galanis L, Wang Y, Jeffery S, DeWitt D (2003) Locating data sources in large distributed systems. In: Proceedings of VLDB’03. Berlin, Germany, pp 874–885
6.
Zurück zum Zitat Galanis L, Wang Y, Jeffery SR, Dewitt DJ (2003) Processing queries in a large peer-to-peer system. In: Proceedings of the 16th CAiSE conference, pp 273–288 Galanis L, Wang Y, Jeffery SR, Dewitt DJ (2003) Processing queries in a large peer-to-peer system. In: Proceedings of the 16th CAiSE conference, pp 273–288
7.
Zurück zum Zitat Ganesan P, Bawa M, Garcia-Molina H (2004) Online balancing of range-partitioned data with applications to peer-to-peer systems. In: Proceedings of VLDB’04, pp 444–455 Ganesan P, Bawa M, Garcia-Molina H (2004) Online balancing of range-partitioned data with applications to peer-to-peer systems. In: Proceedings of VLDB’04, pp 444–455
8.
Zurück zum Zitat Goldman R, Widom J (1997) Dataguides: enabling query formulation and optimization in semistructured databases. In: Proceedings of VLDB’97, pp 436–445 Goldman R, Widom J (1997) Dataguides: enabling query formulation and optimization in semistructured databases. In: Proceedings of VLDB’97, pp 436–445
9.
Zurück zum Zitat Joung YJ, Fang CT, Yang LW (2005) Keyword search in DHT-based peer-to-peer networks. In: Proceedings of ICDCS’05, pp 339–348 Joung YJ, Fang CT, Yang LW (2005) Keyword search in DHT-based peer-to-peer networks. In: Proceedings of ICDCS’05, pp 339–348
10.
Zurück zum Zitat Kaushik R, Bohannon P, Naughton JF, Korth HF (2002) Covering indexes for branching path queries. In: Proceedings of ACM SIGMOD’02, pp 133–144 Kaushik R, Bohannon P, Naughton JF, Korth HF (2002) Covering indexes for branching path queries. In: Proceedings of ACM SIGMOD’02, pp 133–144
11.
Zurück zum Zitat Koloniari G, Pitoura E (2004) Content-based routing of path queries in peer-to-peer systems. In: Proceedings of the EDBT conference, pp 29–47 Koloniari G, Pitoura E (2004) Content-based routing of path queries in peer-to-peer systems. In: Proceedings of the EDBT conference, pp 29–47
12.
Zurück zum Zitat Polyzotis N, Garofalakis M (2006) XSKETCH synopses for XML data graphs. ACM Trans Database Syst 31(3):1014–1063CrossRef Polyzotis N, Garofalakis M (2006) XSKETCH synopses for XML data graphs. ACM Trans Database Syst 31(3):1014–1063CrossRef
13.
Zurück zum Zitat Ratnasamy S, Francis P, Handley M, Karp R, Shenker S (2001) A scalable content-addressable network. In: Proceedings of SIGCOMM’01, pp 161–172 Ratnasamy S, Francis P, Handley M, Karp R, Shenker S (2001) A scalable content-addressable network. In: Proceedings of SIGCOMM’01, pp 161–172
14.
Zurück zum Zitat Saroiu S, Gummadi PK, Gribble SD (2002) A measurement study of peer-to-peer file sharing systems. In: Proc. of multimedia computing and networking Saroiu S, Gummadi PK, Gribble SD (2002) A measurement study of peer-to-peer file sharing systems. In: Proc. of multimedia computing and networking
15.
Zurück zum Zitat Sartiani C, Manghi P, Ghelli G, Conforti G (2004) XPeer: a self-organizing XML P2P database system. In: Proceedings of the first EDBT workshop on P2P and databases Sartiani C, Manghi P, Ghelli G, Conforti G (2004) XPeer: a self-organizing XML P2P database system. In: Proceedings of the first EDBT workshop on P2P and databases
16.
Zurück zum Zitat Schlosser M, Sintek M, Decker S, Nejdl W (2002) A scalable and ontology-based p2p infrastructure for semantic web services. In: Proceedings of the second international conference on peer-to-peer computing. IEEE Computer Society, Washington, DC, USA, pp 104–111CrossRef Schlosser M, Sintek M, Decker S, Nejdl W (2002) A scalable and ontology-based p2p infrastructure for semantic web services. In: Proceedings of the second international conference on peer-to-peer computing. IEEE Computer Society, Washington, DC, USA, pp 104–111CrossRef
17.
Zurück zum Zitat Skobeltsyn G, Hauswirth M, Aberer K (2005) Efficient processing of XPath queries with structured overlay networks. In: OTM conferences, pp 1243–1260 Skobeltsyn G, Hauswirth M, Aberer K (2005) Efficient processing of XPath queries with structured overlay networks. In: OTM conferences, pp 1243–1260
18.
Zurück zum Zitat Stoica I, Morris R, Karger D, Kaashoek F, Balakrishnan H (2001) Chord: a scalable peer-to-peer lookup service for internet applications. In: Proceedings of SIGCOMM’01, pp 17–32 Stoica I, Morris R, Karger D, Kaashoek F, Balakrishnan H (2001) Chord: a scalable peer-to-peer lookup service for internet applications. In: Proceedings of SIGCOMM’01, pp 17–32
19.
Zurück zum Zitat Wang Q, Özsu MT (2004) A data locating mechanism for distributed XML data over P2P networks. In: Technical report CS-2004-45, University of Waterloo Wang Q, Özsu MT (2004) A data locating mechanism for distributed XML data over P2P networks. In: Technical report CS-2004-45, University of Waterloo
20.
Zurück zum Zitat Yao BB, Özsu MT, Khandelwal N (2004) XBench benchmark and performance testing of XML DBMSs. In: Proceedings of ICDE’04, p 621 Yao BB, Özsu MT, Khandelwal N (2004) XBench benchmark and performance testing of XML DBMSs. In: Proceedings of ICDE’04, p 621
21.
Zurück zum Zitat Zhang N, Özsu MT, Aboulnaga A, Ilyas IF (2006) XSEED: accurate and fast cardinality estimation for XPath queries. In: Proceedings of ICDE’06, p 61 Zhang N, Özsu MT, Aboulnaga A, Ilyas IF (2006) XSEED: accurate and fast cardinality estimation for XPath queries. In: Proceedings of ICDE’06, p 61
Metadaten
Titel
XCube: Processing XPath queries in a hypercube overlay network
verfasst von
Yingguang Li
M. Tamer Özsu
Kian-Lee Tan
Publikationsdatum
01.06.2009
Verlag
Springer US
Erschienen in
Peer-to-Peer Networking and Applications / Ausgabe 2/2009
Print ISSN: 1936-6442
Elektronische ISSN: 1936-6450
DOI
https://doi.org/10.1007/s12083-008-0025-3

Weitere Artikel der Ausgabe 2/2009

Peer-to-Peer Networking and Applications 2/2009 Zur Ausgabe

Premium Partner