2014 | OriginalPaper | Chapter

31. Loading Data into HBase

Authors: Juan Yang, Xiaopu Feng

Published in: Computer Engineering and Networking

Publisher: Springer International Publishing

Abstract

HBase is a top-level Apache open-source project that originated as part of Hadoop. Because it offers most of the features of Google's Bigtable system and is implemented in Java, it has become very popular in the era of massive data. HBase's strengths lie in reading and querying massive data sets. Loading huge amounts of data into HBase is the first step in using it. HBase itself provides several methods for loading data, and each suits different application scenarios. This article studies these methods in detail and tests their performance. It also implements a custom data-loading method, and experiments show that it is efficient.

Metadata

Title: Loading Data into HBase
Authors: Juan Yang, Xiaopu Feng
Copyright Year: 2014
Publisher: Springer International Publishing
DOI: https://doi.org/10.1007/978-3-319-01766-2_31