Skip to main content

2014 | OriginalPaper | Buchkapitel

31. Loading Data into HBase

verfasst von : Juan Yang, Xiaopu Feng

Erschienen in: Computer Engineering and Networking

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

HBase is a top Apache open-source project that separated from Hadoop. As it has most of the features of Google’s BigTable system and is implemented in Java, it is very popular in days of massive data. HBase’s advantages are reflected in the massive data read and query. Loading huge amounts of data into HBase is the first step to use HBase. HBase itself has several methods to load data, and different methods have different application scenarios. This article made an exhaustive study and a performance testing of them. Also, this article achieved the custom loading data, and experiments show that it has good efficiency.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113.CrossRef Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113.CrossRef
2.
Zurück zum Zitat Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., et al. (2008). Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems (TOCS), 26(2), 4.CrossRef Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., et al. (2008). Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems (TOCS), 26(2), 4.CrossRef
3.
Zurück zum Zitat White, T. Hadoop: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2012. White, T. Hadoop: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2012.
4.
Zurück zum Zitat George, L. HBase: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2011. George, L. HBase: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2011.
5.
Zurück zum Zitat Huang, J., Ouyang, X., Jose, J., Wasi-ur-Rahman Md., Wang, H., Luo, M., et al. (2012). High-performance design of HBase with RDMA over infiniBand. In Proceedings of the 2012 I.E. 26th International Parallel and Distributed Processing Symposium, IPDPS 2012 (pp. 774–778). Washington, DC: IEEE Computer Society. Huang, J., Ouyang, X., Jose, J., Wasi-ur-Rahman Md., Wang, H., Luo, M., et al. (2012). High-performance design of HBase with RDMA over infiniBand. In Proceedings of the 2012 I.E. 26th International Parallel and Distributed Processing Symposium, IPDPS 2012 (pp. 774–778). Washington, DC: IEEE Computer Society.
6.
Zurück zum Zitat Li, C. (2010). Transforming relational database into HBase: A case study. In Proceedings 2010 I.E. International Conference on Software Engineering and Service Sciences, ICSESS 2010 (pp. 683–687). Piscataway, NJ: IEEE Computer Society. Li, C. (2010). Transforming relational database into HBase: A case study. In Proceedings 2010 I.E. International Conference on Software Engineering and Service Sciences, ICSESS 2010 (pp. 683–687). Piscataway, NJ: IEEE Computer Society.
7.
Zurück zum Zitat Vora, M. N. (2011). Hadoop-HBase for large-scale data. In Proceedings of 2011 International Conference on Computer Science and Network Technology, ICCSNT 2011 (pp. 601–605). Piscataway, NJ: IEEE Computer Society. Vora, M. N. (2011). Hadoop-HBase for large-scale data. In Proceedings of 2011 International Conference on Computer Science and Network Technology, ICCSNT 2011 (pp. 601–605). Piscataway, NJ: IEEE Computer Society.
8.
Zurück zum Zitat Carstoiu, D., Cernian, A., & Olteanu, A. (2010). Hadoop hbase-0.20. 2 performance evaluation. In NISS2010 – 4th International Conference on New Trends in Information Science and Service Science (pp. 84–87). Piscataway, NJ: IEEE Computer Society. Carstoiu, D., Cernian, A., & Olteanu, A. (2010). Hadoop hbase-0.20. 2 performance evaluation. In NISS2010 – 4th International Conference on New Trends in Information Science and Service Science (pp. 84–87). Piscataway, NJ: IEEE Computer Society.
Metadaten
Titel
Loading Data into HBase
verfasst von
Juan Yang
Xiaopu Feng
Copyright-Jahr
2014
Verlag
Springer International Publishing
DOI
https://doi.org/10.1007/978-3-319-01766-2_31

Neuer Inhalt