2021 | OriginalPaper | Chapter

Hadoop/Hive Data Query Performance Comparison Between Data Warehouses Designed by Data Vault and Snowflake Methodologies

Authors: Yuri Grigoriev, Evgeny Ermakov, Oleg Ermakov

Published in: Modern Information Technology and IT Education

Publisher: Springer International Publishing

Abstract

The article discusses the differences between the Data Vault and Snowflake methodologies in a Hadoop infrastructure. It traces the development of the Data Vault methodology from its original version to the modern one and describes its main components: hubs, links, and satellites. The Data Vault approach is compared with the classic star and snowflake approaches. The TPC-H test schema is redesigned using the Data Vault methodology: business entities are identified and converted to hubs, and their relationships are allocated to satellites. The resulting storage size and query execution times are then compared between Data Vault and Snowflake.
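
To make the decomposition described in the abstract concrete, the following is a minimal sketch, not the chapter's actual design, which is not reproduced on this page. It splits one flat TPC-H CUSTOMER row into Data Vault style records under the general convention that a hub holds the business key, a link (relationship table) connects two hubs, and a satellite holds descriptive attributes. The column names come from the TPC-H schema; the function names `hash_key` and `split_customer`, the use of MD5 for surrogate keys, and the `load_ts` and `record_source` fields are assumptions made for illustration.

```python
import hashlib


def hash_key(*business_keys):
    """Build a surrogate hash key from one or more business keys.

    MD5 over the upper-cased, pipe-joined keys is an assumption of this sketch.
    """
    normalized = "|".join(str(k).strip().upper() for k in business_keys)
    return hashlib.md5(normalized.encode("utf-8")).hexdigest()


def split_customer(row, load_ts, source="tpch"):
    """Split one flat TPC-H CUSTOMER row into hub, link, and satellite records."""
    customer_hk = hash_key(row["c_custkey"])
    nation_hk = hash_key(row["c_nationkey"])
    audit = {"load_ts": load_ts, "record_source": source}

    # Hub: only the business key plus audit columns.
    hub = {"customer_hk": customer_hk, "c_custkey": row["c_custkey"], **audit}

    # Link: the customer-to-nation relationship, keyed by both business keys.
    link = {
        "customer_nation_hk": hash_key(row["c_custkey"], row["c_nationkey"]),
        "customer_hk": customer_hk,
        "nation_hk": nation_hk,
        **audit,
    }

    # Satellite: every remaining descriptive attribute of the customer.
    descriptive = {
        k: v for k, v in row.items() if k not in ("c_custkey", "c_nationkey")
    }
    sat = {"customer_hk": customer_hk, **audit, **descriptive}
    return hub, link, sat


# Hypothetical sample row; the values are invented for illustration.
row = {
    "c_custkey": 1,
    "c_name": "Customer#000000001",
    "c_nationkey": 15,
    "c_acctbal": 711.56,
    "c_mktsegment": "BUILDING",
}
hub, link, sat = split_customer(row, load_ts="2021-01-01T00:00:00Z")
```

Queries against such a layout need extra joins (hub to satellite, hub to link to hub) compared with a snowflake schema, which is the kind of cost the storage and query time comparison in the chapter addresses.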

Metadata
Copyright Year: 2021
DOI: https://doi.org/10.1007/978-3-030-78273-3_15
