Skip to main content

2019 | OriginalPaper | Buchkapitel

A Data Preparation Approach for Cloud Storage Based on Containerized Parallel Patterns

verfasst von : Diana Carrizales, Dante D. Sánchez-Gallegos, Hugo Reyes, J. L. Gonzalez-Compean, Miguel Morales-Sandoval, Jesus Carretero, Alejandro Galaviz-Mosqueda

Erschienen in: Internet and Distributed Computing Systems

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we present the design, implementation, and evaluation of an efficient data preparation and retrieval approach for cloud storage. The approach includes a deduplication subsystem that indexes the hash of each content to identify duplicated data. As a consequence, avoiding duplicated content reduces reprocessing time during uploads and other costs related to outsource data management tasks. Our proposed data preparation scheme enables organizations to add properties such as security, reliability, and cost-efficiency to their contents before sending them to the cloud. It also creates recovery schemes for organizations to share preprocessed contents with partners and end-users. The approach also includes an engine that encapsulates preprocessing applications into virtual containers (VCs) to create parallel patterns that improve the efficiency of data preparation retrieval process. In a study case, real repositories of satellite images, and organizational files were prepared to be migrated to the cloud by using processes such as compression, encryption, encoding for fault tolerance, and access control. The experimental evaluation revealed the feasibility of using a data preparation approach for organizations to mitigate risks that still could arise in the cloud. It also revealed the efficiency of the deduplication process to reduce data preparation tasks and the efficacy of parallel patterns to improve the end-user service experience.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Chow, R., et al.: Controlling data in the cloud: outsourcing computation without outsourcing control. In: CCSW 2009, pp. 85–90. ACM (2009) Chow, R., et al.: Controlling data in the cloud: outsourcing computation without outsourcing control. In: CCSW 2009, pp. 85–90. ACM (2009)
2.
Zurück zum Zitat Dworkin, M.J.: SHA-3 standard: permutation-based hash and extendable-output functions (2015) Dworkin, M.J.: SHA-3 standard: permutation-based hash and extendable-output functions (2015)
3.
Zurück zum Zitat Gantz, J., Reinsel, D.: The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east. IDC iView 2007(2012), 1–16 (2012) Gantz, J., Reinsel, D.: The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east. IDC iView 2007(2012), 1–16 (2012)
4.
Zurück zum Zitat Gonzalez, J.L., Perez, J.C., Sosa-Sosa, V.J., Sanchez, L.M., Bergua, B.: SkyCDS: a resilient content delivery service based on diversified cloud storage. Simul. Model. Pract. Theory 54, 64–85 (2015)CrossRef Gonzalez, J.L., Perez, J.C., Sosa-Sosa, V.J., Sanchez, L.M., Bergua, B.: SkyCDS: a resilient content delivery service based on diversified cloud storage. Simul. Model. Pract. Theory 54, 64–85 (2015)CrossRef
5.
Zurück zum Zitat Gonzalez, J.L., Sosa, V., Diaz, A., Carretero, J., Yanez, J.: Sacbe: a building block approach for constructing efficient and flexible end-to-end cloud storage. J. Syst. Softw. 135, 143–156 (2018)CrossRef Gonzalez, J.L., Sosa, V., Diaz, A., Carretero, J., Yanez, J.: Sacbe: a building block approach for constructing efficient and flexible end-to-end cloud storage. J. Syst. Softw. 135, 143–156 (2018)CrossRef
6.
Zurück zum Zitat Mao, B., Wu, S., Jiang, H.: Improving storage availability in cloud-of-clouds with hybrid redundant data distribution. In: IPDPS 2015m, pp. 633–642. IEEE (2015) Mao, B., Wu, S., Jiang, H.: Improving storage availability in cloud-of-clouds with hybrid redundant data distribution. In: IPDPS 2015m, pp. 633–642. IEEE (2015)
7.
Zurück zum Zitat Meister, D., Brinkmann, A.: Multi-level comparison of data deduplication in a backup scenario. In: Proceedings of SYSTOR 2009, p. 8. ACM (2009) Meister, D., Brinkmann, A.: Multi-level comparison of data deduplication in a backup scenario. In: Proceedings of SYSTOR 2009, p. 8. ACM (2009)
8.
Zurück zum Zitat Meister, D., Brinkmann, A.: dedupv1: improving deduplication throughput using solid state drives (SSD). In: MSST 2010, pp. 1–6. IEEE (2010) Meister, D., Brinkmann, A.: dedupv1: improving deduplication throughput using solid state drives (SSD). In: MSST 2010, pp. 1–6. IEEE (2010)
10.
Zurück zum Zitat Mitzenmacher, M.: The power of two choices in randomized load balancing. IEEE TPDS 12(10), 1094–1104 (2001) Mitzenmacher, M.: The power of two choices in randomized load balancing. IEEE TPDS 12(10), 1094–1104 (2001)
11.
Zurück zum Zitat Morales, M., Gonzalez, J.L., Diaz, A., Sosa, V.J.: A pairing-based cryptographic approach for data security in the cloud. IJISP 17(4), 441–461 (2018)CrossRef Morales, M., Gonzalez, J.L., Diaz, A., Sosa, V.J.: A pairing-based cryptographic approach for data security in the cloud. IJISP 17(4), 441–461 (2018)CrossRef
12.
Zurück zum Zitat Ng, W., Wen, Y., Zhu, H.: Private data deduplication protocols in cloud storage. In: Proceedings of the SAC 2012, pp. 441–446. ACM (2012) Ng, W., Wen, Y., Zhu, H.: Private data deduplication protocols in cloud storage. In: Proceedings of the SAC 2012, pp. 441–446. ACM (2012)
13.
Zurück zum Zitat Plummer, D.C., Bittman, T.J., Austin, T., Cearley, D.W., Smith, D.M.: Cloud computing: defining and describing an emerging phenomenon. Gartner, 17 June 2008 Plummer, D.C., Bittman, T.J., Austin, T., Cearley, D.W., Smith, D.M.: Cloud computing: defining and describing an emerging phenomenon. Gartner, 17 June 2008
14.
Zurück zum Zitat Rabin, M.O.: Efficient dispersal of information for security, load balancing, and fault tolerance. JACM 36(2), 335–348 (1989)MathSciNetCrossRef Rabin, M.O.: Efficient dispersal of information for security, load balancing, and fault tolerance. JACM 36(2), 335–348 (1989)MathSciNetCrossRef
15.
Zurück zum Zitat Reinsel, D., Gantz, J., Rydning, J.: The digitization of the world: from edge to core. International Data Corporation, Framingham (2018) Reinsel, D., Gantz, J., Rydning, J.: The digitization of the world: from edge to core. International Data Corporation, Framingham (2018)
16.
Zurück zum Zitat Reyes, H., Gonzalez, J., Morales, M., Carretero, J.: A data integrity verification service for cloud storage based on building blocks. In: 2018 8th CSIT, pp. 201–206. IEEE (2018) Reyes, H., Gonzalez, J., Morales, M., Carretero, J.: A data integrity verification service for cloud storage based on building blocks. In: 2018 8th CSIT, pp. 201–206. IEEE (2018)
17.
Zurück zum Zitat Sánchez, D., Gonzalez, J., Alvarado, S., Sosa, V., Tuxpan, J., Carretero, J.: A containerized service for clustering and categorization of weather records in the cloud. In: CSIT, pp. 26–31. IEEE (2018) Sánchez, D., Gonzalez, J., Alvarado, S., Sosa, V., Tuxpan, J., Carretero, J.: A containerized service for clustering and categorization of weather records in the cloud. In: CSIT, pp. 26–31. IEEE (2018)
18.
Zurück zum Zitat Singh, A., Chatterjee, K.: Cloud security issues and challenges: a survey. J. Netw. Comput. Appl. 79, 88–115 (2017)CrossRef Singh, A., Chatterjee, K.: Cloud security issues and challenges: a survey. J. Netw. Comput. Appl. 79, 88–115 (2017)CrossRef
20.
Zurück zum Zitat Zhang, J., Zhang, Z.: Secure and efficient data-sharing in clouds. CCPE 27(8), 2125–2143 (2015) Zhang, J., Zhang, Z.: Secure and efficient data-sharing in clouds. CCPE 27(8), 2125–2143 (2015)
Metadaten
Titel
A Data Preparation Approach for Cloud Storage Based on Containerized Parallel Patterns
verfasst von
Diana Carrizales
Dante D. Sánchez-Gallegos
Hugo Reyes
J. L. Gonzalez-Compean
Miguel Morales-Sandoval
Jesus Carretero
Alejandro Galaviz-Mosqueda
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-34914-1_45

Premium Partner