2022 | OriginalPaper | Book chapter

Data Lake Versus Data Warehouse Architecture: A Comparative Study

Written by: Mohamed El Mehdi El Aissi, Sarah Benjelloun, Yassine Loukili, Younes Lakhrissi, Abdessamad El Boushaki, Hiba Chougrad, Safae Elhaj Ben Ali

Published in: WITS 2020

Publisher: Springer Singapore

Abstract

Each day, digital technologies and information systems generate huge quantities of data. Processing these massive data therefore requires a specific architecture and a good knowledge of how to handle data. Traditional database management systems can no longer be used for this type of data, since they were originally designed for limited and structured data. In response, a dedicated architecture known as the data lake has been developed to extract the valuable information hidden in data. The main objective of this paper is to explore the two architectures, namely the data warehouse and the data lake. It also describes the main differences between them and presents the key factors of each.

Metadata
Title
Data Lake Versus Data Warehouse Architecture: A Comparative Study
Written by
Mohamed El Mehdi El Aissi
Sarah Benjelloun
Yassine Loukili
Younes Lakhrissi
Abdessamad El Boushaki
Hiba Chougrad
Safae Elhaj Ben Ali
Copyright year
2022
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-33-6893-4_19
