Skip to main content
Erschienen in: Cluster Computing 3/2017

10.11.2016

Distributed decentralized collaborative monitoring architecture for cloud infrastructures

verfasst von: Xiaolong Xu, Yun Chen, Jose M. Alcaraz Calero

Erschienen in: Cluster Computing | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Cloud computing infrastructures are demanding an efficient monitoring mechanism to warranty the operational state of large-scale virtualized data centers and to provide mechanisms to improve the efficiency and stability of such infrastructures. Traditionally, centralized monitoring models (CMM) provide high performance and availability for the group of nodes in charge of monitoring tasks. However, the centralized nature of this architecture, easily leads to a single point of failure, bottlenecks in terms of performance and an unbalanced distributions of the monitoring workloads. These facts are not being suitable for large-scale cloud infrastructures. To tackle this concern, the main contribution of this paper is a distributed collaborative monitoring model (DCMM) for cloud computing infrastructures. DCMM provides self-organized capabilities based on mutual perception and balanced monitoring of each node. DCMM also provides rapid notification and recovery mechanisms under degraded conditions. In addition, an adaptive threshold control algorithm (ATCA) is proposed to dynamically adapt the sets of thresholds used for notification purposes in order to identify unnecessary duplicate information sent back to the monitoring tool. ATCA is based on historical monitoring records. Both DCMM and ATCA are described in detail in this contribution. Several empirical experiments have been done using OpenStack cloud infrastructure in order to validate our claims. Experimental results show that DCMM with ATCA can efficiently balance monitoring workload, reduce the workload of monitoring nodes, avoid a single point of failure, and reduce bottleneck problems whereas it is contributing to the achievement of real-time monitoring and data consistency within the monitoring architecture.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Hazarika, B., Sing, T.J.: Survey paper on cloud computing & cloud monitoring: basics. SSRG Int. J. Comput. Sci. Eng. 2(1), 10–15 (2015) Hazarika, B., Sing, T.J.: Survey paper on cloud computing & cloud monitoring: basics. SSRG Int. J. Comput. Sci. Eng. 2(1), 10–15 (2015)
2.
Zurück zum Zitat Manvi, S.S., Shyamb, G.K.: Resource management for infrastructure as a service (IaaS) in cloud computing: a survey. J. Netw. Comput. Appl. 41(1), 424–440 (2014)CrossRef Manvi, S.S., Shyamb, G.K.: Resource management for infrastructure as a service (IaaS) in cloud computing: a survey. J. Netw. Comput. Appl. 41(1), 424–440 (2014)CrossRef
6.
Zurück zum Zitat Ahuja, S.P., Moore, B.: A survey of cloud computing and social networks. Netw. Commun. Technol. 2(8), 11–16 (2013) Ahuja, S.P., Moore, B.: A survey of cloud computing and social networks. Netw. Commun. Technol. 2(8), 11–16 (2013)
7.
Zurück zum Zitat Povedano-Molina, J., Lopez-Vega, J.M., Lopez-Soler, J.M., Corradi, A., Foschini, L.: DARGOS: A highly adaptable and scalable monitoring architecture for multi-tenant clouds. Future Gener. Comput. Syst. 29(8), 2041–2056 (2013)CrossRef Povedano-Molina, J., Lopez-Vega, J.M., Lopez-Soler, J.M., Corradi, A., Foschini, L.: DARGOS: A highly adaptable and scalable monitoring architecture for multi-tenant clouds. Future Gener. Comput. Syst. 29(8), 2041–2056 (2013)CrossRef
8.
Zurück zum Zitat Weinhardt, C., Anandasivam, A., Benjamin, B.: Business models in the service world. IT Prof. 11(2), 28–33 (2009)CrossRef Weinhardt, C., Anandasivam, A., Benjamin, B.: Business models in the service world. IT Prof. 11(2), 28–33 (2009)CrossRef
9.
Zurück zum Zitat Alhamazani, K., Ranjan, R., Mitra, K., Rabhi, F.: Prakash Jayaraman, P., Ullah Khan, S., Guabtni, A., Bhatnagar, V.: An overview of the commercial cloud monitoring tools: research dimensions, design issues and state-of-the-art. Computing 97(4), 357–377 (2014) Alhamazani, K., Ranjan, R., Mitra, K., Rabhi, F.: Prakash Jayaraman, P., Ullah Khan, S., Guabtni, A., Bhatnagar, V.: An overview of the commercial cloud monitoring tools: research dimensions, design issues and state-of-the-art. Computing 97(4), 357–377 (2014)
10.
Zurück zum Zitat Zhang, R., Jin, Y.Z., Yang, T.: Intelligent selection algorithm of measurement nodes in distributed network measurement. Comput. Sci. 42(9), 70–77 (2015) Zhang, R., Jin, Y.Z., Yang, T.: Intelligent selection algorithm of measurement nodes in distributed network measurement. Comput. Sci. 42(9), 70–77 (2015)
11.
Zurück zum Zitat Vyas, R.A., Prajapati, H.B., Dabhi, V.K.: Embedding custom metric in ganglia monitoring system. In: Proceedings of the IEEE International Advance Computing Conference, pp. 793–797 (2014) Vyas, R.A., Prajapati, H.B., Dabhi, V.K.: Embedding custom metric in ganglia monitoring system. In: Proceedings of the IEEE International Advance Computing Conference, pp. 793–797 (2014)
14.
Zurück zum Zitat Alcaraz Calero, J.M., Aguado, J.G.: Comparative analysis of architectures for monitoring cloud computing infrastructures. Future Gener. Comput. Syst. 47(1), 16–30 (2015) Alcaraz Calero, J.M., Aguado, J.G.: Comparative analysis of architectures for monitoring cloud computing infrastructures. Future Gener. Comput. Syst. 47(1), 16–30 (2015)
15.
Zurück zum Zitat Aceto, G., Botta, A., Donato, W.D., Pescap, A.: Cloud monitoring: a survey. Comput. Netw. 57(9), 2093–2115 (2013)CrossRef Aceto, G., Botta, A., Donato, W.D., Pescap, A.: Cloud monitoring: a survey. Comput. Netw. 57(9), 2093–2115 (2013)CrossRef
16.
Zurück zum Zitat Ward, J.S., Barker, A.: Semantic based data collection for large scale cloud systems. In: Proceedings of the 5th International Workshop on Data-Intensive Distributed Computing, pp. 13–22(2012) Ward, J.S., Barker, A.: Semantic based data collection for large scale cloud systems. In: Proceedings of the 5th International Workshop on Data-Intensive Distributed Computing, pp. 13–22(2012)
21.
Zurück zum Zitat Calero, J.M.A., Aguado, J.G.: MonPaaS: An adaptive monitoring platform as a service for cloud computing infrastructures and services. IEEE Trans. Serv. Comput. 8(1), 65–78 (2015)CrossRef Calero, J.M.A., Aguado, J.G.: MonPaaS: An adaptive monitoring platform as a service for cloud computing infrastructures and services. IEEE Trans. Serv. Comput. 8(1), 65–78 (2015)CrossRef
22.
Zurück zum Zitat Jiang, Y.M., Lan, J.L., Zhou, H.Q.: Resource monitoring policy for network virtualization environment. J. Electron. Inf. Technol. 36(3), 708–714 (2014) Jiang, Y.M., Lan, J.L., Zhou, H.Q.: Resource monitoring policy for network virtualization environment. J. Electron. Inf. Technol. 36(3), 708–714 (2014)
23.
Zurück zum Zitat Liu, X.H., Jing, N., Yin, J.P.: The hierarchical network monitoring model with bounded bandwidth and delay constraints. J. Electron. Inf. Technol. 30(3), 712–716 (2008)CrossRef Liu, X.H., Jing, N., Yin, J.P.: The hierarchical network monitoring model with bounded bandwidth and delay constraints. J. Electron. Inf. Technol. 30(3), 712–716 (2008)CrossRef
24.
Zurück zum Zitat Tan, Y.M., Venkatesh, V., Gu, X.H.: Resilient self-compressive monitoring for large-scale hosting infrastructures. IEEE Trans. Parallel Distrib. Syst. 24(3), 576–586 (2013)CrossRef Tan, Y.M., Venkatesh, V., Gu, X.H.: Resilient self-compressive monitoring for large-scale hosting infrastructures. IEEE Trans. Parallel Distrib. Syst. 24(3), 576–586 (2013)CrossRef
25.
Zurück zum Zitat Canali, C., Lancellotti, R.: Automated clustering of VMs for scalable cloud monitoring and management. In: Proceeedings of the 20th International Conference on Software, Telecommunications and Computer Networks, pp. 1–5 (2012) Canali, C., Lancellotti, R.: Automated clustering of VMs for scalable cloud monitoring and management. In: Proceeedings of the 20th International Conference on Software, Telecommunications and Computer Networks, pp. 1–5 (2012)
26.
Zurück zum Zitat Andreolini, M., Colajanni, M., Pietri, M., Tosi, S.: Adaptive, scalable and reliable monitoring of big data on clouds. J. Parallel Distrib. Comput. 79(1), 67–79 (2015)CrossRef Andreolini, M., Colajanni, M., Pietri, M., Tosi, S.: Adaptive, scalable and reliable monitoring of big data on clouds. J. Parallel Distrib. Comput. 79(1), 67–79 (2015)CrossRef
27.
Zurück zum Zitat Han, F.F., Peng, J.J., Zhang, W., Li, Q., Li, J.D., Jiang, Q.L., Yuan, Q.: Virtual resource monitoring in cloud computing. J. Shanghai Univ. (English Edition). 15(5), 381–385 (2011)CrossRef Han, F.F., Peng, J.J., Zhang, W., Li, Q., Li, J.D., Jiang, Q.L., Yuan, Q.: Virtual resource monitoring in cloud computing. J. Shanghai Univ. (English Edition). 15(5), 381–385 (2011)CrossRef
28.
Zurück zum Zitat Zheng, Z.Y., Song, C.H., Li, D., Zhang, X.J., Li, L.P.: Research for the data transmission model in cloud resource monitoring. Res. Notes Inf. Sci. 14(49), 283–289 (2013) Zheng, Z.Y., Song, C.H., Li, D., Zhang, X.J., Li, L.P.: Research for the data transmission model in cloud resource monitoring. Res. Notes Inf. Sci. 14(49), 283–289 (2013)
29.
Zurück zum Zitat Jiang, Y., Sun, H., Ding, J.M., Liu, Y.L.: A data transmission method for resource monitoring under cloud computing environment. Int. J. Grid Distrib. Comput. 8(2), 15–24 (2015)CrossRef Jiang, Y., Sun, H., Ding, J.M., Liu, Y.L.: A data transmission method for resource monitoring under cloud computing environment. Int. J. Grid Distrib. Comput. 8(2), 15–24 (2015)CrossRef
30.
Zurück zum Zitat Lin, K., Tong, W.Q., Liu, X.D., Zhang, L.P.: A self-adaptive mechanism for resource monitoring in cloud computing. In: Proceedings of the IET International Conference on Smart and Sustainable City, pp. 243–247(2013) Lin, K., Tong, W.Q., Liu, X.D., Zhang, L.P.: A self-adaptive mechanism for resource monitoring in cloud computing. In: Proceedings of the IET International Conference on Smart and Sustainable City, pp. 243–247(2013)
31.
Zurück zum Zitat Videla, A., Williams, J.J.W.: RabbitMQ in Action. Manning Publications, New York (2012) Videla, A., Williams, J.J.W.: RabbitMQ in Action. Manning Publications, New York (2012)
32.
Zurück zum Zitat Ghamri-Doudane, S., Agoulmine, N.: Enhanced DHT-based P2P architecture for effective resource discovery and management. J. Netw. Syst. Manag. 15(3), 335–354 (2007)CrossRef Ghamri-Doudane, S., Agoulmine, N.: Enhanced DHT-based P2P architecture for effective resource discovery and management. J. Netw. Syst. Manag. 15(3), 335–354 (2007)CrossRef
33.
Zurück zum Zitat Fersi, G., Louati, W., Jemaa, M.B.: Distributed Hash table-based routing and data management in wireless sensor networks: a survey. Wirel. Netw. 19(2), 219–236 (2013)CrossRef Fersi, G., Louati, W., Jemaa, M.B.: Distributed Hash table-based routing and data management in wireless sensor networks: a survey. Wirel. Netw. 19(2), 219–236 (2013)CrossRef
34.
Zurück zum Zitat Chung, W.C., Chang, R.S.: A new mechanism for resource monitoring in grid computing. Future Gener. Comput. Syst. 25(1), 1–7 (2009)CrossRef Chung, W.C., Chang, R.S.: A new mechanism for resource monitoring in grid computing. Future Gener. Comput. Syst. 25(1), 1–7 (2009)CrossRef
35.
Zurück zum Zitat Huang, H., Wang, L.Q.: P&P: A combined push-pull model for resource monitoring in cloud computing environment. In: Proceedings of the 2010 3rd IEEE International Conference on Cloud Computing, pp. 260–267(2010) Huang, H., Wang, L.Q.: P&P: A combined push-pull model for resource monitoring in cloud computing environment. In: Proceedings of the 2010 3rd IEEE International Conference on Cloud Computing, pp. 260–267(2010)
36.
Zurück zum Zitat Kumar, R., Charu, S., Jain, K.: Open source solution for cloud computing platform using OpenStack. Int. J. Comput. Sci. Mob. Comput. 3(5), 89–98 (2014) Kumar, R., Charu, S., Jain, K.: Open source solution for cloud computing platform using OpenStack. Int. J. Comput. Sci. Mob. Comput. 3(5), 89–98 (2014)
37.
Zurück zum Zitat Massie, M.L., Chun, B.N., Culler, D.E.: The ganglia distributed monitoring system: design, implementation, and experience. Parallel Comput. 30(7), 817–840 (2004)CrossRef Massie, M.L., Chun, B.N., Culler, D.E.: The ganglia distributed monitoring system: design, implementation, and experience. Parallel Comput. 30(7), 817–840 (2004)CrossRef
38.
Zurück zum Zitat Yang, G., Sui, Y.L.: Adaptive approach to monitor resource for cloud computing platform. Comput. Eng. Appl. 45(29), 14–17 (2009) Yang, G., Sui, Y.L.: Adaptive approach to monitor resource for cloud computing platform. Comput. Eng. Appl. 45(29), 14–17 (2009)
Metadaten
Titel
Distributed decentralized collaborative monitoring architecture for cloud infrastructures
verfasst von
Xiaolong Xu
Yun Chen
Jose M. Alcaraz Calero
Publikationsdatum
10.11.2016
Verlag
Springer US
Erschienen in
Cluster Computing / Ausgabe 3/2017
Print ISSN: 1386-7857
Elektronische ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-016-0675-5

Weitere Artikel der Ausgabe 3/2017

Cluster Computing 3/2017 Zur Ausgabe