Skip to main content

2018 | OriginalPaper | Buchkapitel

Performance Enhancement of Hadoop for Big Data Using Multilevel Queue Migration (MQM) Technique

verfasst von : C. Sreedhar, N. Kasiviswanath, P. Chenna Reddy

Erschienen in: Advanced Computational and Communication Paradigms

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The recent advancements in Hadoop MapReduce scheduling techniques have demonstrated significant outcomes. The continuous tradeoff between the data-job locality and synchronization results in the higher efficiency for the framework. Thus, large number of scientific and enterprise applications have adopted the parallel and synchronized mechanism through Hadoop framework. However, with this adaptation, a large number of datacenter-based nodes are been deployed, significantly causing the increase of energy consumptions. Henceforth, the demand of the recent research is to enhance the overall efficiency of Hadoop jobs and to decrease the energy consumption without degrading the performance. The recent advancements have demonstrated by many strategies by improving the Map and Reduce job allocation techniques; conversely, the same improvement can also be achieved through multilevel queues. Hence, this work constitutes the multilevel queue with custom load balancing to demonstrate the improvement in overall performance of Hadoop job scheduling. The work results in a significant improvement of Hadoop jobs in terms of execution times and energy consumption.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat De Camargo, R.Y., Goldchleger, A., Kon, F.: InteGrade: a tool for executing parallel applications. J. Grid Comput. (2005) De Camargo, R.Y., Goldchleger, A., Kon, F.: InteGrade: a tool for executing parallel applications. J. Grid Comput. (2005)
2.
Zurück zum Zitat Dittrich, J., Quiane-Ruiz, J.-A.: Efficient Big Data Processing in Hadoop MapReduce. In: VLDB Endowment, Vol. 5, No. 12 (2012)CrossRef Dittrich, J., Quiane-Ruiz, J.-A.: Efficient Big Data Processing in Hadoop MapReduce. In: VLDB Endowment, Vol. 5, No. 12 (2012)CrossRef
3.
Zurück zum Zitat Pinedo, M.: Scheduling: Theory, Algorithms, and Systems, 3rd edn. Springer Science, Heidelberg (2008)MATH Pinedo, M.: Scheduling: Theory, Algorithms, and Systems, 3rd edn. Springer Science, Heidelberg (2008)MATH
4.
Zurück zum Zitat Morton, K., Balazinska, M., Grossman, D.: ParaTimer—a progress indicator for MapReduce DAGs. In: International Conference on Management of Data, pp. 507–518 (2010) Morton, K., Balazinska, M., Grossman, D.: ParaTimer—a progress indicator for MapReduce DAGs. In: International Conference on Management of Data, pp. 507–518 (2010)
5.
Zurück zum Zitat Wei, L.: Efficient processing of k nearest neighbor joins using MapReduce. VLDB Endowment 5(10), 1016–1027 (2012)CrossRef Wei, L.: Efficient processing of k nearest neighbor joins using MapReduce. VLDB Endowment 5(10), 1016–1027 (2012)CrossRef
6.
Zurück zum Zitat Dean, J., Ghemawat, S.: MapReduce—simplified data processing on large clusters. In: Symposium on Operating System Design and Implementation (2004) Dean, J., Ghemawat, S.: MapReduce—simplified data processing on large clusters. In: Symposium on Operating System Design and Implementation (2004)
9.
Zurück zum Zitat Kavulya, S.: An analysis of traces from a production MapReduce cluster. In: 10th IEEE/ACM CCGrid, pp. 94–103 (2010) Kavulya, S.: An analysis of traces from a production MapReduce cluster. In: 10th IEEE/ACM CCGrid, pp. 94–103 (2010)
10.
Zurück zum Zitat Liu, S., Xu, J., Liu, Z., Liu, X.: Evaluating task scheduling in hadoop-based cloud systems. In: IEEE International Conference on Big Data (2013) Liu, S., Xu, J., Liu, Z., Liu, X.: Evaluating task scheduling in hadoop-based cloud systems. In: IEEE International Conference on Big Data (2013)
12.
Zurück zum Zitat Pinedo, M.: Scheduling—Theory, Algorithms, and Systems, 3rd edn. Springer Science, Heidelberg (2008)MATH Pinedo, M.: Scheduling—Theory, Algorithms, and Systems, 3rd edn. Springer Science, Heidelberg (2008)MATH
13.
Zurück zum Zitat Baptiste, P.: Scheduling equal-length jobs on identical parallel machines. Discrete Appl. Math. 103(1), 21–32 (2000)MathSciNetCrossRef Baptiste, P.: Scheduling equal-length jobs on identical parallel machines. Discrete Appl. Math. 103(1), 21–32 (2000)MathSciNetCrossRef
15.
Zurück zum Zitat Isard, M., Prabhakaran, V., Currey, J., Wider, U., Talwar, K., Goldberg, A.: Quincy—fair scheduling for distributed computing clusters. In: ACM SIGOPS, pp. 261–276 (2009) Isard, M., Prabhakaran, V., Currey, J., Wider, U., Talwar, K., Goldberg, A.: Quincy—fair scheduling for distributed computing clusters. In: ACM SIGOPS, pp. 261–276 (2009)
16.
Zurück zum Zitat Chen, J., Wang, D., Zhao, W.: A Task Scheduling algorithm for hadoop platform. J. Comput. 8(4), 929–936 (2013) Chen, J., Wang, D., Zhao, W.: A Task Scheduling algorithm for hadoop platform. J. Comput. 8(4), 929–936 (2013)
17.
Zurück zum Zitat Tiwari, N.: Scheduling and energy efficiency improvement techniques for Hadoop MapReduce: state of art and directions for future research. Doctoral dissertation, Indian Institute of Technology, Bombay Mumbai Tiwari, N.: Scheduling and energy efficiency improvement techniques for Hadoop MapReduce: state of art and directions for future research. Doctoral dissertation, Indian Institute of Technology, Bombay Mumbai
18.
Zurück zum Zitat Nguyen, P., Simon, T., Halem, M., Chapman, D., Le, Q.: A hybrid scheduling algorithm for data intensive workloads in a MapReduce environment. In: IEEE/ACM Fifth International Conference on Utility and Cloud Computing. UCC’12, pp. 161–168. Washington, DC, USA: IEEE Computer Society (2012) Nguyen, P., Simon, T., Halem, M., Chapman, D., Le, Q.: A hybrid scheduling algorithm for data intensive workloads in a MapReduce environment. In: IEEE/ACM Fifth International Conference on Utility and Cloud Computing. UCC’12, pp. 161–168. Washington, DC, USA: IEEE Computer Society (2012)
19.
Zurück zum Zitat Zaharia, M., Konwinski, A., Joseph, A.D., Katz, R., Stoica, I.: Improving MapReduce performance in heterogeneous environments. In: USENIX Symposium on Operating Systems Design and Implementation (2008) Zaharia, M., Konwinski, A., Joseph, A.D., Katz, R., Stoica, I.: Improving MapReduce performance in heterogeneous environments. In: USENIX Symposium on Operating Systems Design and Implementation (2008)
20.
Zurück zum Zitat Chen, Q., Zhang, D., Guo, M., Deng, Q., Guo, S.: A self-adaptive MapReduce scheduling algorithm in heterogeneous environment. In: 10th IEEE International Conference on Computer and Information Technology (2010) Chen, Q., Zhang, D., Guo, M., Deng, Q., Guo, S.: A self-adaptive MapReduce scheduling algorithm in heterogeneous environment. In: 10th IEEE International Conference on Computer and Information Technology (2010)
21.
Zurück zum Zitat Zaharia, M., Borthakur, D., Sarma, J.S., Elmeleegy, K., Shenker, S., Stoica, I.: Delay scheduling—a simple technique for achieving locality and fairness in cluster scheduling. In: 5th European Conference on Computer Systems (EuroSys). ACM Press (2010) Zaharia, M., Borthakur, D., Sarma, J.S., Elmeleegy, K., Shenker, S., Stoica, I.: Delay scheduling—a simple technique for achieving locality and fairness in cluster scheduling. In: 5th European Conference on Computer Systems (EuroSys). ACM Press (2010)
22.
Zurück zum Zitat Cassales, G.W., Charao, A.S., Pinheiro, M.K., Souveyet, C., Steffene, L.A.: Context-aware scheduling for Apache Hadoop over pervasive environments. Comput. Sci. 52, 202–209 (2015)CrossRef Cassales, G.W., Charao, A.S., Pinheiro, M.K., Souveyet, C., Steffene, L.A.: Context-aware scheduling for Apache Hadoop over pervasive environments. Comput. Sci. 52, 202–209 (2015)CrossRef
Metadaten
Titel
Performance Enhancement of Hadoop for Big Data Using Multilevel Queue Migration (MQM) Technique
verfasst von
C. Sreedhar
N. Kasiviswanath
P. Chenna Reddy
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8237-5_32

Neuer Inhalt