Skip to main content

2015 | OriginalPaper | Buchkapitel

On Replica Placement in High-Availability Storage Under Correlated Failure

verfasst von : K. Alex Mills, R. Chandrasekaran, Neeraj Mittal

Erschienen in: Combinatorial Optimization and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A new model describing dependencies among system components as a directed graph is presented and used to solve a novel replica placement problem in data centers. A criterion for optimizing replica placements is formalized and explained. In this work, the optimization goal is to choose placements in which correlated failure events disable as few replicas as possible. A fast optimization algorithm is given for dependency models represented by trees. The main contribution of the paper is an \(O(n + \rho \log \rho )\) dynamic programming algorithm for placing \(\rho \) replicas on a tree with n vertices.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bakkaloglu, M., Wylie, J.J., Wang, C., et. al: On correlated failures in survivable storage systems. Technical report CMU-CS-02-129, Carnegie Mellon University (2002) Bakkaloglu, M., Wylie, J.J., Wang, C., et. al: On correlated failures in survivable storage systems. Technical report CMU-CS-02-129, Carnegie Mellon University (2002)
2.
Zurück zum Zitat Blume, L., Easley, D., Kleinberg, J., Kleinberg, R., Tardos, E.: Which networks are least susceptible to cascading failures? In: Proceedings of the 52nd Annual Symposium on Foundations of Computer Science (FOCS) (2011) Blume, L., Easley, D., Kleinberg, J., Kleinberg, R., Tardos, E.: Which networks are least susceptible to cascading failures? In: Proceedings of the 52nd Annual Symposium on Foundations of Computer Science (FOCS) (2011)
3.
Zurück zum Zitat Chen, M., Chen, W., Liu, L., Zheng, Z.: An analytical framework and its applications for studying brick storage reliability. In: Proceedings of the 26th International Symposium on Reliable Distributed Systems (SRDS) (2007) Chen, M., Chen, W., Liu, L., Zheng, Z.: An analytical framework and its applications for studying brick storage reliability. In: Proceedings of the 26th International Symposium on Reliable Distributed Systems (SRDS) (2007)
4.
Zurück zum Zitat Ford, D., Labelle, F., Popovici, F., et al.: Availability in globally distributed storage systems. In: Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI) (2010) Ford, D., Labelle, F., Popovici, F., et al.: Availability in globally distributed storage systems. In: Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation (OSDI) (2010)
5.
Zurück zum Zitat Hu, X.D., Jia, X.H., Du, D.Z., et al.: Placement of data replicas for optimal data availability in ring networks. J. Parallel Distrib. Comput. (JPDC) 61(10), 1412–1424 (2001)CrossRefMATH Hu, X.D., Jia, X.H., Du, D.Z., et al.: Placement of data replicas for optimal data availability in ring networks. J. Parallel Distrib. Comput. (JPDC) 61(10), 1412–1424 (2001)CrossRefMATH
6.
Zurück zum Zitat Pezoa, J.E., Hayat, M.M.: Reliability of heterogeneous distributed computing systems in the presence of correlated failures. IEEE Trans. Parallel Distrib. Comput. 25(4), 1034–1043 (2014)CrossRef Pezoa, J.E., Hayat, M.M.: Reliability of heterogeneous distributed computing systems in the presence of correlated failures. IEEE Trans. Parallel Distrib. Comput. 25(4), 1034–1043 (2014)CrossRef
7.
Zurück zum Zitat Kim, J., Dobson, I.: Approximating a loading-dependent cascading failure model with a branching process. IEEE Trans. Reliab. 59(4), 691–699 (2010)CrossRef Kim, J., Dobson, I.: Approximating a loading-dependent cascading failure model with a branching process. IEEE Trans. Reliab. 59(4), 691–699 (2010)CrossRef
8.
Zurück zum Zitat Lian, Q., Chen, W., Zhang, Z.: On the impact of replica placement to the reliability of distributed brick storage systems. In: Proceedings of the International Conference on Distributed Computing Systems (ICDCS) (2005) Lian, Q., Chen, W., Zhang, Z.: On the impact of replica placement to the reliability of distributed brick storage systems. In: Proceedings of the International Conference on Distributed Computing Systems (ICDCS) (2005)
9.
Zurück zum Zitat Mills, K.A., Chandrasekaran, R., Mittal, N.: Algorithms for replica placement in high-availability storage (2015). arxiv:1503.02654 Mills, K.A., Chandrasekaran, R., Mittal, N.: Algorithms for replica placement in high-availability storage (2015). arxiv:​1503.​02654
10.
Zurück zum Zitat Nath, S., Yu, H., Gibbons, P.B., Seshan, S.: Subtleties in tolerating correlated failures in wide-area storage systems. In: Proceedings of the 3rd USENIX Symposium on Networked Systems Design and Implementation (NSDI) (2006) Nath, S., Yu, H., Gibbons, P.B., Seshan, S.: Subtleties in tolerating correlated failures in wide-area storage systems. In: Proceedings of the 3rd USENIX Symposium on Networked Systems Design and Implementation (NSDI) (2006)
11.
Zurück zum Zitat Shekhar, S., Wu, W.: Optimal placement of data replicas in distributed database with majority voting protocol. Theoret. Comput. Sci. 258(1), 555–571 (2001)MathSciNetCrossRefMATH Shekhar, S., Wu, W.: Optimal placement of data replicas in distributed database with majority voting protocol. Theoret. Comput. Sci. 258(1), 555–571 (2001)MathSciNetCrossRefMATH
12.
Zurück zum Zitat Weatherspoon, H., Moscovitz, T., Kubiatowicz, J.: Introspective failure analysis: avoiding correlated failures in peer-to-peer systems. In: Proceedings of the 21st Symposium on Reliable Distributed Systems (SRDS) (2002) Weatherspoon, H., Moscovitz, T., Kubiatowicz, J.: Introspective failure analysis: avoiding correlated failures in peer-to-peer systems. In: Proceedings of the 21st Symposium on Reliable Distributed Systems (SRDS) (2002)
13.
Zurück zum Zitat Zhang, Z., Wu, W., Shekhar, S.: Optimal placements of replicas in a ring network with majority voting protocol. J. Parallel Distrib. Comput. (JPDC) 69(5), 461–469 (2009)CrossRef Zhang, Z., Wu, W., Shekhar, S.: Optimal placements of replicas in a ring network with majority voting protocol. J. Parallel Distrib. Comput. (JPDC) 69(5), 461–469 (2009)CrossRef
14.
Zurück zum Zitat Zhu, Y., Yan, J., Sun, Y., et al.: Revealing cascading failure vulnerability in power grids using risk-graph. IEEE Trans. Parallel Distrib. Syst. (TPDS) 25(12), 3274–3284 (2014)CrossRef Zhu, Y., Yan, J., Sun, Y., et al.: Revealing cascading failure vulnerability in power grids using risk-graph. IEEE Trans. Parallel Distrib. Syst. (TPDS) 25(12), 3274–3284 (2014)CrossRef
Metadaten
Titel
On Replica Placement in High-Availability Storage Under Correlated Failure
verfasst von
K. Alex Mills
R. Chandrasekaran
Neeraj Mittal
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-26626-8_26