2018 | Original Paper | Book Chapter

IOscope: A Flexible I/O Tracer for Workloads’ I/O Pattern Characterization

Authors: Abdulqawi Saif, Lucas Nussbaum, Ye-Qiong Song

Published in: High Performance Computing

Publisher: Springer International Publishing

Abstract

Storage systems are becoming increasingly complex in order to meet HPC and Big Data requirements. This complexity calls for in-depth evaluations to ensure the absence of issues in all layers of these systems. However, current performance evaluation is built around high-level metrics for the sake of simplicity. It is therefore impossible to catch potential I/O issues in the lower layers of the Linux I/O stack. In this paper, we introduce IOscope, a tracer for uncovering the I/O patterns of storage systems’ workloads. It performs filtering-based profiling over fine-grained criteria inside the Linux kernel. IOscope has near-zero overhead and verified behaviour inside the kernel because it relies on the extended Berkeley Packet Filter (eBPF) technology. We demonstrate the ability of IOscope to discover pattern-related issues through a performance study on MongoDB and Cassandra. Results show that clustered MongoDB suffers from a noisy I/O pattern regardless of the storage medium used (HDDs or SSDs). Hence, IOscope supports a better troubleshooting process and contributes to an in-depth understanding of I/O performance.

Footnotes
2
A major version of MongoDB (v3.6) was released while this paper was being written. It suffers from the same performance issue discussed in Sect. 3.2, despite its optimized throughput.
 
Metadata
Title
IOscope: A Flexible I/O Tracer for Workloads’ I/O Pattern Characterization
Authors
Abdulqawi Saif
Lucas Nussbaum
Ye-Qiong Song
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-02465-9_7