Skip to main content
Top

2018 | OriginalPaper | Chapter

High Concurrent Elastic Resource Allocation in Hadoop YARN

Authors : Yang Peng, Danyan Luo, Jian Dong, Zhibo Wu

Published in: Communications and Networking

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Efficient resource management to improve the throughput in large-scale cluster has become a research focus with the rapid development of applications of Big Data. YARN (Yet Another Resource Negotiator), as the new generation of resource management system in Hadoop, is more efficient in resource utilization and capable of handling more kinds of workload than previous systems. Due to the fact that a task usually occupies more resources than it actually uses during some stage of its life cycle, a relevant amount of resource is idle and can not be allocated to satisfy the requirements of pending tasks. In order to address the deficiencies of resource allocation in YARN, this paper presents a high concurrent elastic resource allocation strategy named Ballon, which can dynamically adjust the configured resource of a node depending on the actual resource utilization of the node. Moreover, Ballon classifies resource requests of applications into different types. Consequently the elastic resources can be allocated to proper request. Our experiments demonstrate that Ballon cluster can reduce the average execution time of application by at least 10% in most MapReduce application and can increase the resource utilization of cluster.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Shvachko, K., Kuang, H., Radia, S., et al.: The Hadoop distributed file system, pp. 1–10 (2010) Shvachko, K., Kuang, H., Radia, S., et al.: The Hadoop distributed file system, pp. 1–10 (2010)
2.
go back to reference Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: Conference on Symposium on Operating Systems Design & Implementation. DBLP (2004) Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. In: Conference on Symposium on Operating Systems Design & Implementation. DBLP (2004)
4.
go back to reference Huang, W., Meng, L., Zhang, D., et al.: In-memory parallel processing of massive remotely sensed data using an Apache Spark on Hadoop YARN model. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens. 10, 1–17 (2017)CrossRef Huang, W., Meng, L., Zhang, D., et al.: In-memory parallel processing of massive remotely sensed data using an Apache Spark on Hadoop YARN model. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens. 10, 1–17 (2017)CrossRef
5.
go back to reference Vavilapalli, V.K., Murthy, A.C., Douglas, C., et al.: Apache Hadoop YARN: yet another resource negotiator. In: Symposium on Cloud Computing, p. 5 (2013) Vavilapalli, V.K., Murthy, A.C., Douglas, C., et al.: Apache Hadoop YARN: yet another resource negotiator. In: Symposium on Cloud Computing, p. 5 (2013)
6.
go back to reference Eadline, D.: Preface to Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2. Pearson Schweiz Ag, Zug (2014) Eadline, D.: Preface to Apache Hadoop YARN: Moving beyond MapReduce and Batch Processing with Apache Hadoop 2. Pearson Schweiz Ag, Zug (2014)
7.
go back to reference Liu, Y.: High availability of network service on docker container. In: International Conference on Measurement, Instrumentation and Automation (2016) Liu, Y.: High availability of network service on docker container. In: International Conference on Measurement, Instrumentation and Automation (2016)
9.
go back to reference Genkin, M., Dehne, F., Pospelova, M., Chen, Y., Navarro, P.: Automatic, on-line tuning of YARN container memory and CPU parameters. In: IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), Sydney, NSW, pp. 317–324 (2016) Genkin, M., Dehne, F., Pospelova, M., Chen, Y., Navarro, P.: Automatic, on-line tuning of YARN container memory and CPU parameters. In: IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), Sydney, NSW, pp. 317–324 (2016)
10.
go back to reference Ding, X., Liu, Y., Qian, D.: JellyFish: online performance tuning with adaptive configuration and elastic container in Hadoop YARN. In: 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS), Melbourne, VIC, p. 831 (2015) Ding, X., Liu, Y., Qian, D.: JellyFish: online performance tuning with adaptive configuration and elastic container in Hadoop YARN. In: 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS), Melbourne, VIC, p. 831 (2015)
11.
go back to reference Shao, Y., Li, C., Dong, W., Liu, Y.: Energy-aware dynamic resource allocation on Hadoop YARN cluster. In: IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), Sydney, NSW, pp. 364–371 (2016) Shao, Y., Li, C., Dong, W., Liu, Y.: Energy-aware dynamic resource allocation on Hadoop YARN cluster. In: IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), Sydney, NSW, pp. 364–371 (2016)
12.
go back to reference Yao, Y., Gao, H., Wang, J., Mi, N., Sheng, B.: OpERA: opportunistic and efficient resource allocation in Hadoop YARN by harnessing idle resources. In: 2016 25th International Conference on Computer Communication and Networks (ICCCN), Waikoloa, HI, pp. 1–9 (2016) Yao, Y., Gao, H., Wang, J., Mi, N., Sheng, B.: OpERA: opportunistic and efficient resource allocation in Hadoop YARN by harnessing idle resources. In: 2016 25th International Conference on Computer Communication and Networks (ICCCN), Waikoloa, HI, pp. 1–9 (2016)
13.
go back to reference Zhao, Y., Wu, G.: Yadoop: an elastic resource management solution of YARN. In: 2015 IEEE Symposium on Service-Oriented System Engineering, San Francisco Bay, CA (2015) Zhao, Y., Wu, G.: Yadoop: an elastic resource management solution of YARN. In: 2015 IEEE Symposium on Service-Oriented System Engineering, San Francisco Bay, CA (2015)
14.
go back to reference Lin, J., Lee, M.: Performance evaluation of job schedulers on Hadoop YARN. Concur. Comput. Pract. Exp. 28(9), 2711–2728 (2016)CrossRef Lin, J., Lee, M.: Performance evaluation of job schedulers on Hadoop YARN. Concur. Comput. Pract. Exp. 28(9), 2711–2728 (2016)CrossRef
15.
go back to reference Shah, P.: Adaptive application master for elastic web server farms for cloud based on Hadoop YARN. In: International Conference on Cloud Computing and Big Data, pp. 461–446 (2013) Shah, P.: Adaptive application master for elastic web server farms for cloud based on Hadoop YARN. In: International Conference on Cloud Computing and Big Data, pp. 461–446 (2013)
16.
go back to reference Kakantousis, T.: Scaling YARN: a distributed resource manager for Hadoop (2015) Kakantousis, T.: Scaling YARN: a distributed resource manager for Hadoop (2015)
Metadata
Title
High Concurrent Elastic Resource Allocation in Hadoop YARN
Authors
Yang Peng
Danyan Luo
Jian Dong
Zhibo Wu
Copyright Year
2018
Publisher
Springer International Publishing
DOI
https://doi.org/10.1007/978-3-319-78130-3_54

Premium Partner