Skip to main content
Erschienen in: The Journal of Supercomputing 1/2014

01.04.2014

Distributed replica placement algorithms for correlated data

verfasst von: Manghui Tu, I-Ling Yen

Erschienen in: The Journal of Supercomputing | Ausgabe 1/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In distributed systems, data may be correlated due to accesses from clients and the correlation has some impact on date placement, and existing research works focus on independent data objects. In this paper, we address both the scalability and the stability of the data placement solutions in internet environment. We first show that replica allocation decisions can be made locally for each replica site in a tree network, with data access knowledge of its neighbors. We then develop a new replication cost model for correlated data objects in Internet environment. Based on the cost model and the algorithms in previous research, we develop a distributed optimal replica allocation algorithm (DOPR) for correlated data in internet environment. A distributed heuristic algorithm (DHPR) is then developed to efficiently make replica placement decisions. The algorithm obtains sub-optimal solutions for the correlated data model and yields significant performance gains. Experimental studies show that the distributed heuristic allocation algorithm significantly outperforms the general frequency-based replication schemes (in which the replication decision of each data object is made based on the number of accesses on that data object).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Adya et al (2002) FARSITE: federated, available, and reliable storage for an incompletely trusted environment. In: Proceedings of the 5th OSDI Adya et al (2002) FARSITE: federated, available, and reliable storage for an incompletely trusted environment. In: Proceedings of the 5th OSDI
3.
Zurück zum Zitat Baev I, Rajaraman R, Swamy C (2010) Approximation algorithms for data placement problems. SIAM J Comput Baev I, Rajaraman R, Swamy C (2010) Approximation algorithms for data placement problems. SIAM J Comput
4.
Zurück zum Zitat Borst S, Gupta V, Walid A (2010) Distributed caching algorithms for content distribution networks. In: Proceedings of IEEE INFOCOM Borst S, Gupta V, Walid A (2010) Distributed caching algorithms for content distribution networks. In: Proceedings of IEEE INFOCOM
5.
Zurück zum Zitat Breitbart Y, Komondoor R, Rastogi R, Seshadri S, Silberschatz A (1999) Update propagation protocols for replicated databases. In: SIGMOD conference, pp 97–108 Breitbart Y, Komondoor R, Rastogi R, Seshadri S, Silberschatz A (1999) Update propagation protocols for replicated databases. In: SIGMOD conference, pp 97–108
6.
Zurück zum Zitat Chen Y et al (2002) Dynamic replica placement for scalable content delivery. In: Proceedings of the first international workshop on peer-to-peer systems (IPTPS 2002) Chen Y et al (2002) Dynamic replica placement for scalable content delivery. In: Proceedings of the first international workshop on peer-to-peer systems (IPTPS 2002)
8.
Zurück zum Zitat Kadambi S, Chen J, Cooper B, Lomax D, Ramakrishnan R, Silberstein A, Tam E, Molina H (2011) Where in the world is my data? In: Proceedings of the VLDB endowment, vol 4, no 11 Kadambi S, Chen J, Cooper B, Lomax D, Ramakrishnan R, Silberstein A, Tam E, Molina H (2011) Where in the world is my data? In: Proceedings of the VLDB endowment, vol 4, no 11
9.
Zurück zum Zitat Kalpakis K et al (2001) Optimal placement of replicas in trees with read, write and storage costs. IEEE Trans Parallel Distrib Syst 12(6) Kalpakis K et al (2001) Optimal placement of replicas in trees with read, write and storage costs. IEEE Trans Parallel Distrib Syst 12(6)
10.
Zurück zum Zitat Koruplou M, Dahlin M (2002) Coordinated placement and replacement for large scale distributed caches. IEEE Trans Knowl Data Eng 14(6) Koruplou M, Dahlin M (2002) Coordinated placement and replacement for large scale distributed caches. IEEE Trans Knowl Data Eng 14(6)
11.
Zurück zum Zitat Kubitowicz J, et al (2000) OceanStore: an architecture for global-scale persistent storage. In: Proceedings of ASPLOS’00 Kubitowicz J, et al (2000) OceanStore: an architecture for global-scale persistent storage. In: Proceedings of ASPLOS’00
12.
Zurück zum Zitat Li K, Shen Y, Lin K, Qu W (2010) Coordinated multimedia object replacement in transcoding proxies. J Supercomput 52(3) Li K, Shen Y, Lin K, Qu W (2010) Coordinated multimedia object replacement in transcoding proxies. J Supercomput 52(3)
13.
Zurück zum Zitat Lu Z, McKinley KS (2000) Partial collection replication versus cache for information retrieval systems. In: Proceedings of the ACM international conference on research and development in information retrieval, Athens, Greece, July 2000 Lu Z, McKinley KS (2000) Partial collection replication versus cache for information retrieval systems. In: Proceedings of the ACM international conference on research and development in information retrieval, Athens, Greece, July 2000
14.
Zurück zum Zitat Myint J, Naing T (2011) Management of data replication for PC cluster-based cloud storage system. Int J Cloud Comput, Serv Arch 1(3) Myint J, Naing T (2011) Management of data replication for PC cluster-based cloud storage system. Int J Cloud Comput, Serv Arch 1(3)
15.
Zurück zum Zitat Passarella A (2011) A survey on content-centric technologies for the current Internet: CDN and P2P solutions. Comput Commun 1 Passarella A (2011) A survey on content-centric technologies for the current Internet: CDN and P2P solutions. Comput Commun 1
16.
Zurück zum Zitat Paxson V (1997) End-to-end routing behavior in the Internet. IEEE/ACM Trans Netw 5(5):601–615 CrossRef Paxson V (1997) End-to-end routing behavior in the Internet. IEEE/ACM Trans Netw 5(5):601–615 CrossRef
17.
Zurück zum Zitat Qiu L, Padmanabhan V, Voelker G (2001) On the placement of web server replicas. In: IEEE 20th INFOCOM Qiu L, Padmanabhan V, Voelker G (2001) On the placement of web server replicas. In: IEEE 20th INFOCOM
18.
Zurück zum Zitat Ranganathan K, Foster I (2001) Identifying dynamic replication strategies for a high-performance data grid. In: Proceedings of the 2nd of international workshop on grid computing, Denver, CO, USA, November 2001. Lecture notes in computer science, vol 2242, pp 75–86 Ranganathan K, Foster I (2001) Identifying dynamic replication strategies for a high-performance data grid. In: Proceedings of the 2nd of international workshop on grid computing, Denver, CO, USA, November 2001. Lecture notes in computer science, vol 2242, pp 75–86
19.
Zurück zum Zitat Saito Y et al (2002) Taming aggressive replication in the Pangaea wide-area file system. In: Proceedings of the 5th symposium on operating system design and implementation Saito Y et al (2002) Taming aggressive replication in the Pangaea wide-area file system. In: Proceedings of the 5th symposium on operating system design and implementation
20.
Zurück zum Zitat Troll G, Graben P. Zipf’s law is not a consequence of the central limit theorem. Phys Rev E 57(2):1347–1355 Troll G, Graben P. Zipf’s law is not a consequence of the central limit theorem. Phys Rev E 57(2):1347–1355
21.
Zurück zum Zitat Tu M, Li P, Xiao L, Yen I, Bastani F (2006) Replica placement algorithms for mobile transaction systems. IEEE Trans Knowl Data Eng 18(7) Tu M, Li P, Xiao L, Yen I, Bastani F (2006) Replica placement algorithms for mobile transaction systems. IEEE Trans Knowl Data Eng 18(7)
22.
Zurück zum Zitat Tu M, Li P, Ma Q, Yen I, Bastani F (2010) Secure data object placement in the P2P Data grid. IEEE Trans Depend Secure Comput 7(1) Tu M, Li P, Ma Q, Yen I, Bastani F (2010) Secure data object placement in the P2P Data grid. IEEE Trans Depend Secure Comput 7(1)
23.
Zurück zum Zitat Tu M, Ma H, Yen I, Bastani F, Xu D (2013) Availability, security, access performance an load balance in P2P data grid. J Grid Comput 11(1) Tu M, Ma H, Yen I, Bastani F, Xu D (2013) Availability, security, access performance an load balance in P2P data grid. J Grid Comput 11(1)
24.
Zurück zum Zitat Wolfson O, Jajodia S, Huang Y (1997) An adaptive data replication algorithm. ACM Trans Database Syst 22(2):255–314 CrossRef Wolfson O, Jajodia S, Huang Y (1997) An adaptive data replication algorithm. ACM Trans Database Syst 22(2):255–314 CrossRef
25.
Zurück zum Zitat Ye Y, Xiao L, Yen I, Bastani F (2010) Cloud storage design based on hybrid of replication and data partitioning. In: Proceedings of the IEEE 16th international conference on parallel and distributed systems (ICPADS) Ye Y, Xiao L, Yen I, Bastani F (2010) Cloud storage design based on hybrid of replication and data partitioning. In: Proceedings of the IEEE 16th international conference on parallel and distributed systems (ICPADS)
26.
Zurück zum Zitat Zaman S, Grosu D (2011) A distributed algorithm for the replica placement problem. IEEE Trans Parallel Distrib Syst 22(9) Zaman S, Grosu D (2011) A distributed algorithm for the replica placement problem. IEEE Trans Parallel Distrib Syst 22(9)
27.
Zurück zum Zitat Zin N, Noraziah A, Fauzi A, Herawan T (2012) Replication techniques in data grid environments. In: Intelligent information and database systems. Lecture notes in computer science, vol 7197 Zin N, Noraziah A, Fauzi A, Herawan T (2012) Replication techniques in data grid environments. In: Intelligent information and database systems. Lecture notes in computer science, vol 7197
Metadaten
Titel
Distributed replica placement algorithms for correlated data
verfasst von
Manghui Tu
I-Ling Yen
Publikationsdatum
01.04.2014
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 1/2014
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-013-1036-2

Weitere Artikel der Ausgabe 1/2014

The Journal of Supercomputing 1/2014 Zur Ausgabe

Premium Partner