Published in: Cluster Computing 2/2019

23.07.2018

Optimizing communication performance in scale-out storage system

Authors: Uiseok Song, Bodon Jeong, Sungyong Park, Kwongyong Lee

Abstract

Ceph is an object-based scale-out storage system that is widely used in cloud computing environments because of its scalability and reliability. Although many factors affect the performance of scale-out storage systems, the design of the communication subsystem plays an important role in determining their overall performance. In this paper, we first conduct an extensive analysis of the communication subsystem in Ceph, which uses an asynchronous messenger framework, called async messenger, for inter-component communication in the storage cluster. We then propose three optimization techniques to improve the performance of the Ceph messenger: (i) deploying a load-balancing algorithm among worker threads based on the amount of workload, (ii) assigning multiple worker threads (which we call dual worker) to a single connection to maximize the overlapping activity among threads, and (iii) using multiple connections between storage servers to maximize bandwidth usage. The experimental results show that the optimized Ceph messenger outperforms the original messenger implementation by up to 40% in random writes with 4 K messages. Moreover, Ceph with the optimized communication subsystem shows up to 13% performance improvement compared to the original Ceph.
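
As a rough illustration of technique (i), the Python sketch below assigns each new connection to the worker with the smallest outstanding load rather than handing connections out in a fixed order. It is not the authors' implementation and not Ceph code: the `Worker` class, the `pending_bytes` load metric, the connection names, and both function names are hypothetical, chosen only for this example. The abstract does not say which load metric the paper uses.

```python
from dataclasses import dataclass, field


@dataclass
class Worker:
    """Hypothetical stand-in for one messenger worker thread."""
    worker_id: int
    pending_bytes: int = 0  # assumed load metric, not taken from the paper
    connections: list = field(default_factory=list)


def pick_least_loaded(workers):
    """Return the worker with the smallest outstanding load.

    min() returns the first minimum it finds, so ties go to the
    worker that appears earliest in the list.
    """
    return min(workers, key=lambda w: w.pending_bytes)


def assign_connection(workers, conn_name, expected_bytes):
    """Attach a connection to the least-loaded worker and update its load."""
    worker = pick_least_loaded(workers)
    worker.connections.append(conn_name)
    worker.pending_bytes += expected_bytes
    return worker


if __name__ == "__main__":
    workers = [Worker(0), Worker(1), Worker(2)]
    # One large transfer among several small ones (sizes are made up).
    incoming = [("osd.1", 4096), ("osd.2", 65536), ("osd.3", 4096), ("osd.4", 4096)]
    for name, size in incoming:
        assign_connection(workers, name, size)
    for w in workers:
        print(w.worker_id, w.connections, w.pending_bytes)
```

With these made-up sizes, the worker that receives the large transfer is skipped when the fourth connection arrives, which a fixed round-robin order over the same three workers would not guarantee once more connections follow.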

Metadata
Title
Optimizing communication performance in scale-out storage system
Authors
Uiseok Song
Bodon Jeong
Sungyong Park
Kwongyong Lee
Publication date
23.07.2018
Publisher
Springer US
Published in
Cluster Computing / Issue 2/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-018-2831-6
