Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 2/2022

07-08-2021 | Research Article-Computer Engineering and Computer Science

The Role of Vertical Elastic Namenode in Handling Big Data in Small Files

Authors: Mohammad Al-Masadeh, Fahad Al-Zahrani

Published in: Arabian Journal for Science and Engineering | Issue 2/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Hadoop is a distributed system used exclusively for BigData analysis and processing that is Hadoop distinguished itself through its high performance and a solid availability. Hadoop cluster is available for use at any time and this is one of Hadoop’s solid attributes that make it popularly known in data analysis and sciences. However, there are a several factors impacting Hadoop cluster, causing it to be inaccessible. One of these factors is BigData in small files whereby Hadoop’s availability shortage accrued when a massive amount of small files dataset is pushed toward a Hadoop cluster. This will harm the cluster’s performance, making it unavailable for access and use. This negative factor affects the Namenode itself as the Namenode is a single point of failure. Hence, once it crashes, the whole cluster will be out of service and need to jump again manually. This paper will introduce the elastic Namenode in lieu of the current traditional one. The elastic Namenode has an ability to adapt to the frequent negative factors that are affecting the whole cluster, causing it to become unavailable. The elastic Namenode will adopt the vertical elasticity manner, this type of elasticity will add more memory resources to the Namenode based on a direction from a script that traces the Namenode memory itself. The result will be a cloud elastic Namenode that can be expanded and shrunk upon request, which allows Hadoop cluster to treat BigData in small files without any negative factor or issue.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Prabhu, Y.: Transformation of Hadoop: a survey. Int. J. Sci. Technol. Eng. 4(8), 97–101 (2018) Prabhu, Y.: Transformation of Hadoop: a survey. Int. J. Sci. Technol. Eng. 4(8), 97–101 (2018)
2.
go back to reference Koh, S.; et al.: Exploring fault-tolerant erasure codes for scalable all-flash array clusters. IEEE Trans. Parallel Distrib. Syst. 30(6), 1312–1330 (2019)CrossRef Koh, S.; et al.: Exploring fault-tolerant erasure codes for scalable all-flash array clusters. IEEE Trans. Parallel Distrib. Syst. 30(6), 1312–1330 (2019)CrossRef
3.
go back to reference Mohan, L.J.; Caneleo, P.I.S.; Parampalli, U.; Harwood, A.: Geo-aware erasure coding for high-performance erasure-coded storage clusters. Ann. Telecommun. 73(1–2), 139–152 (2018)CrossRef Mohan, L.J.; Caneleo, P.I.S.; Parampalli, U.; Harwood, A.: Geo-aware erasure coding for high-performance erasure-coded storage clusters. Ann. Telecommun. 73(1–2), 139–152 (2018)CrossRef
4.
go back to reference Singh, A.; Choudhary, S.; Kumari, M.: HADOOP ecosystem analytics and big data for advanced computing platforms. Int. J. Adv. Sci. Technol. 29(5), 6633–6642 (2020) Singh, A.; Choudhary, S.; Kumari, M.: HADOOP ecosystem analytics and big data for advanced computing platforms. Int. J. Adv. Sci. Technol. 29(5), 6633–6642 (2020)
5.
go back to reference Chandrasekar, S.; Dakshinamurthy, R.; Seshakumar, P.G.; Prabavathy, B.; Babu, C.: A novel indexing scheme for efficient handling of small files in hadoop distributed file system. In: 2013 International Conference on Computer Communication and Informatics. IEEE (2013) Chandrasekar, S.; Dakshinamurthy, R.; Seshakumar, P.G.; Prabavathy, B.; Babu, C.: A novel indexing scheme for efficient handling of small files in hadoop distributed file system. In: 2013 International Conference on Computer Communication and Informatics. IEEE (2013)
6.
go back to reference Ularu, E.G.; Puican, F.C.; Apostu, A.; Velicanu, M.; Student, P.: Perspectives on big data and big data analytics. Database Syst. J. 3(4), 3–14 (2012) Ularu, E.G.; Puican, F.C.; Apostu, A.; Velicanu, M.; Student, P.: Perspectives on big data and big data analytics. Database Syst. J. 3(4), 3–14 (2012)
7.
go back to reference Korat, V.G.; Pamu, K.S.: Reduction of data at namenode in HDFS using harballing technique. Int. J. Adv. Res. Comput. Eng. Technol. 1(4), 635–642 (2012) Korat, V.G.; Pamu, K.S.: Reduction of data at namenode in HDFS using harballing technique. Int. J. Adv. Res. Comput. Eng. Technol. 1(4), 635–642 (2012)
8.
go back to reference Deshpande, P.P.: Hadoop distributed filesystem: metadata management. Int. Res. J. Eng. Technol. 10, 2395–2456 (2017) Deshpande, P.P.: Hadoop distributed filesystem: metadata management. Int. Res. J. Eng. Technol. 10, 2395–2456 (2017)
9.
go back to reference Demir, I.; Sayar, A.: Hadoop optimization for massive image processing: case study face detection. Int. J. Comput. Commun. Control 9(6) 664–671 (2014)CrossRef Demir, I.; Sayar, A.: Hadoop optimization for massive image processing: case study face detection. Int. J. Comput. Commun. Control 9(6) 664–671 (2014)CrossRef
10.
go back to reference Mrozek, D.; Daniłowicz, P.; Małysiak-Mrozek, B.: HDInsight4PSi: boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud. Inf. Sci. (Ny) 349–350, 77–101 (2016)CrossRef Mrozek, D.; Daniłowicz, P.; Małysiak-Mrozek, B.: HDInsight4PSi: boosting performance of 3D protein structure similarity searching with HDInsight clusters in Microsoft Azure cloud. Inf. Sci. (Ny) 349–350, 77–101 (2016)CrossRef
11.
go back to reference Guan, Y.; Ma, Z.; Li, L.: HDFS optimization strategy based on hierarchical storage of hot and cold data. Procedia CIRP 83, 415–418 (2019)CrossRef Guan, Y.; Ma, Z.; Li, L.: HDFS optimization strategy based on hierarchical storage of hot and cold data. Procedia CIRP 83, 415–418 (2019)CrossRef
12.
go back to reference Approach, A.N.; Undestand, T.; Files, S.; In, P.: A review on small files in Hadoop. (5), 6585–6588 (2017) Approach, A.N.; Undestand, T.; Files, S.; In, P.: A review on small files in Hadoop. (5), 6585–6588 (2017)
13.
go back to reference Chethan, R.; Chandan, K.; Jayanth, K.: A selective approach for storing small files in respective blocks of Hadoop. Int. J. Adv. Netw. Appl. (IJANA) 461–465 (2010) Chethan, R.; Chandan, K.; Jayanth, K.: A selective approach for storing small files in respective blocks of Hadoop. Int. J. Adv. Netw. Appl. (IJANA) 461–465 (2010)
14.
go back to reference Ahad, M.A.; Biswas, R.: Dynamic merging based small file storage (DM-SFS) architecture for efficiently storing small size files in Hadoop. Procedia Comput. Sci. 132, 1626–1635 (2018)CrossRef Ahad, M.A.; Biswas, R.: Dynamic merging based small file storage (DM-SFS) architecture for efficiently storing small size files in Hadoop. Procedia Comput. Sci. 132, 1626–1635 (2018)CrossRef
16.
go back to reference Mohanty, A.; Ranjana, P.; Subramanian, D.V.: Small files consolidation technique in Hadoop cluster. Int. J. Simul. Syst. Sci. Technol. 19(6), 311–315 (2018) Mohanty, A.; Ranjana, P.; Subramanian, D.V.: Small files consolidation technique in Hadoop cluster. Int. J. Simul. Syst. Sci. Technol. 19(6), 311–315 (2018)
18.
go back to reference Moltó, G.; Caballer, M.; Romero, E.; De Alfonso, C.: Elastic memory management of virtualized infrastructures for applications with dynamic memory requirements. Procedia Comput. Sci. 18, 159–168 (2013)CrossRef Moltó, G.; Caballer, M.; Romero, E.; De Alfonso, C.: Elastic memory management of virtualized infrastructures for applications with dynamic memory requirements. Procedia Comput. Sci. 18, 159–168 (2013)CrossRef
19.
go back to reference Aishwarya, K.; Arvind Ram, A.; Sreevatson, M. C.; Babu, C.; Prabavathy, B.: Efficient prefetching technique for storage of heterogeneous small files in Hadoop distributed file system federation. In: 2013 Fifth International Conference on Advanced Computing (ICoAC). IEEE (2014) Aishwarya, K.; Arvind Ram, A.; Sreevatson, M. C.; Babu, C.; Prabavathy, B.: Efficient prefetching technique for storage of heterogeneous small files in Hadoop distributed file system federation. In: 2013 Fifth International Conference on Advanced Computing (ICoAC). IEEE (2014)
23.
go back to reference He, S.; Guo, L.; Guo, Y.: Real time elastic cloud management for limited resources. In: Proc. - 2011 IEEE 4th Int. Conf. Cloud Comput. CLOUD 2011, 622–629 (2011) He, S.; Guo, L.; Guo, Y.: Real time elastic cloud management for limited resources. In: Proc. - 2011 IEEE 4th Int. Conf. Cloud Comput. CLOUD 2011, 622–629 (2011)
Metadata
Title
The Role of Vertical Elastic Namenode in Handling Big Data in Small Files
Authors
Mohammad Al-Masadeh
Fahad Al-Zahrani
Publication date
07-08-2021
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 2/2022
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-021-05929-5

Other articles of this Issue 2/2022

Arabian Journal for Science and Engineering 2/2022 Go to the issue

Research Article-Computer Engineering and Computer Science

Hand Gesture Recognition from 2D Images by Using Convolutional Capsule Neural Networks

Premium Partners