Skip to main content

2014 | OriginalPaper | Buchkapitel

Big Data Benchmark - Big DS

verfasst von : Jun-Ming Zhao, Wen-Shuan Wang, Xian Liu, You-Fu Chen

Erschienen in: Advancing Big Data Benchmarks

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Performance and scalability in clusters of heterogeneous and complex Big Data Analytic environments are always unpredictable. In this paper, we are trying to address this problem by using a benchmark named “Big DS”. The benchmark adopts many great ideas from some famous industry benchmarks like TPC-H [1], TPC-DS [1], SPECvirt_sc2010 [2] and SPECjbb2005 [2], we also adopt some ideas from non-standard benchmarks liked TeraSort [3], SWIM [4], etc. By defining a configurable workload for different big data analytics environment, Big DS can be used for measuring the performance and scalability of a big data analytics platform or environment for different business.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat TPC. TPC is a trademark of the Transaction Processing Performance Council. TPC-H and TPC-DS are the decision support benchmarks of TPC organization. http://www.tpc.org TPC. TPC is a trademark of the Transaction Processing Performance Council. TPC-H and TPC-DS are the decision support benchmarks of TPC organization. http://​www.​tpc.​org
2.
Zurück zum Zitat SPEC. SPEC is a trademark of the Standard Performance Evaluation Corporation 1995–2014. SPECjbb2005 is the server side Java Benchmark of SPEC.org. SPECjbb2013 is the evaluation version of SPECjbb2005. SPECvirt_2010sc is the server consolidation virtualization benchmark of SPEC.org. http://www.spec.org SPEC. SPEC is a trademark of the Standard Performance Evaluation Corporation 1995–2014. SPECjbb2005 is the server side Java Benchmark of SPEC.org. SPECjbb2013 is the evaluation version of SPECjbb2005. SPECvirt_2010sc is the server consolidation virtualization benchmark of SPEC.org. http://​www.​spec.​org
3.
Zurück zum Zitat TeraSort. Refer to the Apache Terasort benchmark, which is a MapReduce version of Sort benchmark TeraSort. Refer to the Apache Terasort benchmark, which is a MapReduce version of Sort benchmark
4.
Zurück zum Zitat SWIM. SWIM stands for Statistical Workload Injector for MapReduce. The synthesis methodology is adopted in BigDS and it’s supporting toolset SWIM. SWIM stands for Statistical Workload Injector for MapReduce. The synthesis methodology is adopted in BigDS and it’s supporting toolset
5.
Zurück zum Zitat Apache Hadoop and it’s related projects. Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware. Hadoop is an Apache top-level project being built and used by a global community of contributors and users. It is licensed under the Apache License 2.0 Apache Hadoop and it’s related projects. Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware. Hadoop is an Apache top-level project being built and used by a global community of contributors and users. It is licensed under the Apache License 2.0
6.
Zurück zum Zitat Apache Hive. Apache Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis. [1] While initially developed by Facebook Apache Hive. Apache Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis. [1] While initially developed by Facebook
11.
Zurück zum Zitat Big Bench. Extend TPC-DS specification to include unstructured and semi-structured data; modify the TPC-DS. In: A data model for BigBench was proposed in the First WBDB Workshop by Ghazal (2012) Big Bench. Extend TPC-DS specification to include unstructured and semi-structured data; modify the TPC-DS. In: A data model for BigBench was proposed in the First WBDB Workshop by Ghazal (2012)
13.
Zurück zum Zitat Apache Drill Project. Apache Drill is an open-source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets. Drill is the open source version of Google’s Dremel system which is available as an IaaS service called Google BigQuery. http://incubator.apache.org/drill/ Apache Drill Project. Apache Drill is an open-source software framework that supports data-intensive distributed applications for interactive analysis of large-scale datasets. Drill is the open source version of Google’s Dremel system which is available as an IaaS service called Google BigQuery. http://​incubator.​apache.​org/​drill/​
Metadaten
Titel
Big Data Benchmark - Big DS
verfasst von
Jun-Ming Zhao
Wen-Shuan Wang
Xian Liu
You-Fu Chen
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-10596-3_5