Published in: Cluster Computing 5/2023

2 July 2021

Design and implementation of an efficient and parallel bad block checker for parallelism of storage devices

Authors: Jaehyun Han, Guangyu Zhu, Eunseo Lee, Yongseok Son

Abstract

Cloud computing is widely adopted in industry because it provides more computation power and improves resource utilization. In cloud computing systems, many users execute various types of applications that produce large amounts of data. To handle this data, cloud computing systems provide diverse, high-performance, large-scale clustered storage devices. At such capacities, the performance of storage maintenance becomes an important issue, since large capacity can significantly increase the time the system is suspended during maintenance. One storage maintenance technique is bad block checking: identifying blocks whose data can no longer be accessed prevents I/O failures in applications. However, an existing bad block checker (e.g., badblocks in Linux) takes a long time even when the storage devices provide parallelism (e.g., multiple disks or a multi-channel SSD), because it performs I/O and check operations in a serialized manner. To reduce the checking time, we propose an efficient and parallel bad block checker that exploits the parallelism of storage devices. Our scheme performs the I/O and check operations for bad blocks in parallel instead of serially. To do this, we first divide the series of check operations into parallel (i.e., independent) tasks. Second, we create a thread pool in which multiple workers fetch their tasks concurrently. Finally, we enable each checker to perform its own check and I/O operations in parallel. We implement and evaluate our checker on a 32-core machine with a disk array and an NVMe SSD. The experimental results show that our proposed bad block checker improves performance by up to 3.7\(\times\) on the disk array and 7.8\(\times\) on the NVMe SSD, compared with the existing bad block checker.
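The three steps named in the abstract (independent tasks, a thread pool, per-worker I/O and checking) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `make_tasks`, `check_task`, and `parallel_check`, the 4 KiB block size, and the 64-block task size are all hypothetical choices made for illustration. In line with the article's footnote, a block is reported as bad when reading it fails; treating a short read as a failure is an additional assumption. The sketch relies on `os.pread`, which is available on Unix-like systems.

```python
import os
from concurrent.futures import ThreadPoolExecutor

BLOCK_SIZE = 4096        # bytes per block (assumed value)
BLOCKS_PER_TASK = 64     # blocks per independent task (assumed value)


def make_tasks(total_blocks, blocks_per_task=BLOCKS_PER_TASK):
    """Step 1: split [0, total_blocks) into independent (first, count) tasks."""
    return [
        (first, min(blocks_per_task, total_blocks - first))
        for first in range(0, total_blocks, blocks_per_task)
    ]


def check_task(fd, first, count, block_size=BLOCK_SIZE):
    """Step 3: one worker reads its own blocks and records those that fail."""
    bad = []
    for block in range(first, first + count):
        try:
            # pread takes an explicit offset, so workers can share one fd
            # without racing on the file position.
            data = os.pread(fd, block_size, block * block_size)
            if len(data) < block_size:
                bad.append(block)  # assumption: a short read counts as a failure
        except OSError:
            bad.append(block)      # failed read: the block is considered bad
    return bad


def parallel_check(path, workers=4):
    """Step 2: a thread pool whose workers pick up the tasks concurrently."""
    fd = os.open(path, os.O_RDONLY)
    try:
        # Seeking to the end yields the size of a regular file or block device.
        total_blocks = os.lseek(fd, 0, os.SEEK_END) // BLOCK_SIZE
        with ThreadPoolExecutor(max_workers=workers) as pool:
            results = pool.map(lambda t: check_task(fd, *t),
                               make_tasks(total_blocks))
            return sorted(b for bad in results for b in bad)
    finally:
        os.close(fd)
```

Calling `parallel_check(path, workers=8)` on a readable file or device returns a sorted list of the block numbers whose reads failed. The sketch reads through the page cache; a production checker would more likely use direct I/O so that every read actually reaches the device.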

Footnotes
1
If the read operation for a block fails, the bad block checker considers the block to be a bad block.
 
Metadata
Title
Design and implementation of an efficient and parallel bad block checker for parallelism of storage devices
Authors
Jaehyun Han
Guangyu Zhu
Eunseo Lee
Yongseok Son
Publication date
2 July 2021
Publisher
Springer US
Published in
Cluster Computing / Issue 5/2023
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-021-03353-w
