Skip to main content

2021 | OriginalPaper | Buchkapitel

Data Processing and Development of Big Data System: A Survey

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

At present, data are generated all the time in the world, and these data contain great value. However, these data have problems such as huge scale, complex type, fast flow rate and low value density. Therefore, the research of big data analysis means and tools has increasingly become a research hotspot. On the basis of research at home and abroad, this paper attempts to analyze the definition, framework and typical big data processing systems of big data, especially focuses on the analysis and comparison of batch data processing system, stream data processing system, hybrid processing system and graph processing system, and obtains the characteristics, processing mechanism and applicable occasions of each system. The paper hopes to provide some references for understanding big data systems, solving problems in the process of big data processing and developing big data applications, and provide reference for improving the effectiveness and efficiency of data processing.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Beniamino, D.M., Rocco, A., Giuseppina, C., Antonio, E., Joanna, K.: Big data (lost) in the cloud. Int. J. Big Data Intell. 1(1), 3–17 (2014)CrossRef Beniamino, D.M., Rocco, A., Giuseppina, C., Antonio, E., Joanna, K.: Big data (lost) in the cloud. Int. J. Big Data Intell. 1(1), 3–17 (2014)CrossRef
4.
Zurück zum Zitat Bello-Orgaz, G., Jung, J., Camacho, D.: Social big data: recent achievements and new challenges. Inf. Fusion 28, 45–59 (2016)CrossRef Bello-Orgaz, G., Jung, J., Camacho, D.: Social big data: recent achievements and new challenges. Inf. Fusion 28, 45–59 (2016)CrossRef
8.
Zurück zum Zitat Liu, C.: Market research and forecast. Mech. Ind. press 8(1), 84–85 (2017) Liu, C.: Market research and forecast. Mech. Ind. press 8(1), 84–85 (2017)
9.
Zurück zum Zitat Zikopoulos, P., Eaton, C.: Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data, 1st edn. McGraw-Hill Osborne Media, New York (2011) Zikopoulos, P., Eaton, C.: Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data, 1st edn. McGraw-Hill Osborne Media, New York (2011)
11.
Zurück zum Zitat Kumar, A., Maskara, S., Chiang, I.: Identifying semantic in high-dimensional web data using latent semantic manifold. J. Data Anal. Inf. Process. 3(4), 136–152 (2015) Kumar, A., Maskara, S., Chiang, I.: Identifying semantic in high-dimensional web data using latent semantic manifold. J. Data Anal. Inf. Process. 3(4), 136–152 (2015)
15.
Zurück zum Zitat Zaharia, M., Chowdhury, M., Franklin, M., Shenker, S. and Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX conference on Hot topics in cloud computing, pp. 10–17. USENIX Association, USA (2010) Zaharia, M., Chowdhury, M., Franklin, M., Shenker, S. and Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX conference on Hot topics in cloud computing, pp. 10–17. USENIX Association, USA (2010)
18.
Zurück zum Zitat Yong, T.: Comparative study on calculation models of spark and flink. Comput. Products Circ. 2(4), 152–153 (2019) Yong, T.: Comparative study on calculation models of spark and flink. Comput. Products Circ. 2(4), 152–153 (2019)
19.
Zurück zum Zitat Dai, M., Gao, S.: Performance evaluation of large-scale data analysis based on hadoop, spark and flink. J. Chin. Acad. Electron. Sci. 13(2), 149–155 (2018) Dai, M., Gao, S.: Performance evaluation of large-scale data analysis based on hadoop, spark and flink. J. Chin. Acad. Electron. Sci. 13(2), 149–155 (2018)
24.
Zurück zum Zitat Sun, L., Yu, Q., Peng, D., Subramani, S., Wang, X.: a fog-based framework for disease prognosis based medical sensor data streams. Comput. Mater. Continua 66(1), 603–619 (2021)CrossRef Sun, L., Yu, Q., Peng, D., Subramani, S., Wang, X.: a fog-based framework for disease prognosis based medical sensor data streams. Comput. Mater. Continua 66(1), 603–619 (2021)CrossRef
25.
Zurück zum Zitat Alhroob, A., Alzyadat, W., Imam, A.T., Jaradat, G.M.: The genetic algorithm and binary search technique in the program path coverage for improving software testing using big data. Intell. Autom. Soft Comput. 26(4), 725–733 (2020)CrossRef Alhroob, A., Alzyadat, W., Imam, A.T., Jaradat, G.M.: The genetic algorithm and binary search technique in the program path coverage for improving software testing using big data. Intell. Autom. Soft Comput. 26(4), 725–733 (2020)CrossRef
26.
Zurück zum Zitat Mirarab, A., Mirtaheri, S.L., Asghari, S.A.: A model to create organizational value with big data analytics. Comput. Syst. Sci. Eng. 35(2), 69–79 (2020)CrossRef Mirarab, A., Mirtaheri, S.L., Asghari, S.A.: A model to create organizational value with big data analytics. Comput. Syst. Sci. Eng. 35(2), 69–79 (2020)CrossRef
Metadaten
Titel
Data Processing and Development of Big Data System: A Survey
verfasst von
Shuyan Yu
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-78618-2_34

Premium Partner