Skip to main content
Top

2021 | OriginalPaper | Chapter

Big Data Analytics—Analysis and Comparison of Various Tools

Authors : Amit Gupta, Bhanu Prakash Dubey, Himani Sivaraman, M. C. Lohani

Published in: Advances in Information Communication Technology and Computing

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Big data is the latest terminology in the computer world. The data collection is increasing day by day, and many technological changes can bring some new methods for decision-making process in many areas such as health and finance. As the complexities are increasing due to volume, veracity, variety and velocity, our focus is on the methods to calculate the value of data using various big data analytics technologies. The analytics process used with respect to big data can be improvised by using new algorithms, which enhance the analytical aspects and can be used to extract the hidden knowledge very efficiently and effectively.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Demchenko Y, Grosso P, de Laat C, Membrey P. Addressing big data issues in scientific data infrastructure. In: 2013 international conference on collaboration technologies and systems (CTS), San Diego, 2013. IEEE, pp 48–55 Demchenko Y, Grosso P, de Laat C, Membrey P. Addressing big data issues in scientific data infrastructure. In: 2013 international conference on collaboration technologies and systems (CTS), San Diego, 2013. IEEE, pp 48–55
2.
go back to reference Cox M, Ellsworth D. Managing big data for scientific visualization. In: ACM Siggraph ’97 course #4 exploring giga-byte datasets in real-time: algorithms, data management, and time-critical design, August, 1997 Cox M, Ellsworth D. Managing big data for scientific visualization. In: ACM Siggraph ’97 course #4 exploring giga-byte datasets in real-time: algorithms, data management, and time-critical design, August, 1997
3.
go back to reference Bekkerman R, Bilenko M, Langford J (2011) Scaling up machine learning: parallel and distributed approaches. Cambridge University Press, CambridgeCrossRef Bekkerman R, Bilenko M, Langford J (2011) Scaling up machine learning: parallel and distributed approaches. Cambridge University Press, CambridgeCrossRef
4.
go back to reference Ni Z Comparative evaluation of spark and stratosphere. Thesis, KTH Royal Institute of Technology; 2013 Ni Z Comparative evaluation of spark and stratosphere. Thesis, KTH Royal Institute of Technology; 2013
5.
go back to reference Bu Y, Howe B, Balazinska M, Ernst MD (2010) HaLoop: efficient Iterative data processing on large clusters. Proceedings VLDB Endowment 3(1):285–296CrossRef Bu Y, Howe B, Balazinska M, Ernst MD (2010) HaLoop: efficient Iterative data processing on large clusters. Proceedings VLDB Endowment 3(1):285–296CrossRef
6.
go back to reference Jakovits P, Srirama SN (2014) Evaluating MapReduce frameworks for iterative scientific computing applications. In: 2014 International conference on high performance computing & simulation; 2014. pp 226–33 Jakovits P, Srirama SN (2014) Evaluating MapReduce frameworks for iterative scientific computing applications. In: 2014 International conference on high performance computing & simulation; 2014. pp 226–33
7.
go back to reference Vavilapalli VK, Murthy AC, Douglas C, Agarwal S, Konar M, Evans R, Graves T, Lowe J, Shah H, Seth S, Saha B, Curino C, O’Malley O, Radia S, Reed B, Baldeschwieler E. Apache Hadoop YARN: yet another resource negotiator. In: Proceedings of the 4th annual symposium on cloud computing; 2013 Vavilapalli VK, Murthy AC, Douglas C, Agarwal S, Konar M, Evans R, Graves T, Lowe J, Shah H, Seth S, Saha B, Curino C, O’Malley O, Radia S, Reed B, Baldeschwieler E. Apache Hadoop YARN: yet another resource negotiator. In: Proceedings of the 4th annual symposium on cloud computing; 2013
8.
go back to reference Fernández A, del Río S, López V, Bawakid A, del Jesus MJ, Benítez JM, Herrera F (2014) Big data with cloud computing: an insight on the computing environment, MapReduce, and programming frameworks. Wiley Interdiscip Rev Data Min Knowl Discov. 4(5):380–409CrossRef Fernández A, del Río S, López V, Bawakid A, del Jesus MJ, Benítez JM, Herrera F (2014) Big data with cloud computing: an insight on the computing environment, MapReduce, and programming frameworks. Wiley Interdiscip Rev Data Min Knowl Discov. 4(5):380–409CrossRef
9.
go back to reference Lin J, Kolcz A. Large-scale machine learning at twitter. In: Proceedings of the 2012 ACM SIGMOD international conference on management of data; 2012. pp 793–804 Lin J, Kolcz A. Large-scale machine learning at twitter. In: Proceedings of the 2012 ACM SIGMOD international conference on management of data; 2012. pp 793–804
10.
go back to reference Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters. In: Proceedings of the 6th symposium on operating systems design and implementation; 2004 Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters. In: Proceedings of the 6th symposium on operating systems design and implementation; 2004
11.
go back to reference Malewicz G, Austern MH, Bik AJC, Dehnert JC, Horn I, Leiser N, and Czajkowski G (2010) Pregel: A system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD international conference on management of data; 2010. pp 135–45 Malewicz G, Austern MH, Bik AJC, Dehnert JC, Horn I, Leiser N, and Czajkowski G (2010) Pregel: A system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD international conference on management of data; 2010. pp 135–45
13.
go back to reference Zaharia M, Chowdhury M, Das T, Dave A (2012) Fast and interactive analytics over Hadoop data with Spark. USENIX Login 37(4):45–51 Zaharia M, Chowdhury M, Das T, Dave A (2012) Fast and interactive analytics over Hadoop data with Spark. USENIX Login 37(4):45–51
14.
go back to reference White T (2012) Hadoop: the definitive guide, 3rd edn. O’Reilly Media, Inc., Sebastopol, CA White T (2012) Hadoop: the definitive guide, 3rd edn. O’Reilly Media, Inc., Sebastopol, CA
15.
go back to reference Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I. Spark (2010) Cluster Computing with Working Sets. In: Proceedings of the 2nd USENIX conference on hot topics in cloud computing Zaharia M, Chowdhury M, Franklin MJ, Shenker S, Stoica I. Spark (2010) Cluster Computing with Working Sets. In: Proceedings of the 2nd USENIX conference on hot topics in cloud computing
16.
go back to reference Cai Z, Gao J, Luo S, Perez LL, Vagena Z, Jermaine C. A comparison of platforms for implementing and running very large scale machine learning algorithms. In: Proceedings of the 2014 ACM SIGMOD international conference on management of data (SIGMOD’14) 2014, pp 1371–1382 Cai Z, Gao J, Luo S, Perez LL, Vagena Z, Jermaine C. A comparison of platforms for implementing and running very large scale machine learning algorithms. In: Proceedings of the 2014 ACM SIGMOD international conference on management of data (SIGMOD’14) 2014, pp 1371–1382
17.
go back to reference Zhang H, Tudor BM, Chen G, Ooi BC (2014) Efficient in-memory data management: an analysis. Proc VLDB Endowment 7(10):6–9 Zhang H, Tudor BM, Chen G, Ooi BC (2014) Efficient in-memory data management: an analysis. Proc VLDB Endowment 7(10):6–9
18.
go back to reference Singh J (2014) Big data analytic and mining with machine learning algorithm. Int J Inform Comput Technol 4(1):33–40 Singh J (2014) Big data analytic and mining with machine learning algorithm. Int J Inform Comput Technol 4(1):33–40
19.
go back to reference Ousterhout K, Rasti R, Ratnasamy S, Shenker S, Chun B (2015) Making sense of performance in data analytics frameworks. In: Proceedings of the 12th USENIX symposium. On networked systems design and implementation (NSDI 15) Ousterhout K, Rasti R, Ratnasamy S, Shenker S, Chun B (2015) Making sense of performance in data analytics frameworks. In: Proceedings of the 12th USENIX symposium. On networked systems design and implementation (NSDI 15)
20.
go back to reference Shahrivari S, Jalili S (2014) Beyond batch processing : towards real-time and streaming big data. Computers 3(4):117–129CrossRef Shahrivari S, Jalili S (2014) Beyond batch processing : towards real-time and streaming big data. Computers 3(4):117–129CrossRef
Metadata
Title
Big Data Analytics—Analysis and Comparison of Various Tools
Authors
Amit Gupta
Bhanu Prakash Dubey
Himani Sivaraman
M. C. Lohani
Copyright Year
2021
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-5421-6_48