
2019 | Original Paper | Book Chapter

DataSpeak: Data Extraction, Aggregation, and Classification Using Big Data Novel Algorithm

Authors: Venkatesh Gauri Shankar, Bali Devi, Sumit Srivastava

Published in: Computing, Communication and Signal Processing

Publisher: Springer Singapore


Abstract

The large number of computing devices in use generates a huge amount of data. Because this data comes in many varieties, processing and analyzing it is a major challenge in big data analytics. Data consistency and scalability are also major problems for large datasets. We propose “DataSpeak”, a novel approach to data extraction, aggregation, and classification. We use k-Nearest Neighbors with Spark as the reference and derive our approach from a modified version of that algorithm. We analyze the approach on large datasets from travel and tourism, placement papers, movies and historical data, and smartphones, among other domains. To assess the capability and accuracy of the algorithm, we use cross-validation, precision, recall, and a comparative statistical analysis against the existing algorithm. Compared with the existing algorithm in the same domain, our approach provides fast data access and efficient data extraction in minimal time. For data aggregation and classification, our approach reaches 98%, based on the data structure.
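
The abstract does not say how DataSpeak modifies k-Nearest Neighbors, so the following is only a minimal sketch of the reference setup it names: a plain k-NN classifier evaluated with k-fold cross-validation, precision, and recall. It is not the authors' algorithm and it does not use Spark. All function names, the synthetic two-cluster data, and the parameter values (`k=3`, `folds=5`) are assumptions chosen for illustration.

```python
import math
import random
from collections import Counter


def knn_predict(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training rows.

    `train` is a list of (features, label) pairs; distance is Euclidean.
    """
    nearest = sorted(train, key=lambda row: math.dist(row[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def precision_recall(y_true, y_pred, positive):
    """Return (precision, recall) for the class named `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def cross_validate(data, folds=5, k=3, positive="a", seed=0):
    """k-fold cross-validation: every row is predicted exactly once,
    by a model that never saw it during training."""
    rows = data[:]
    random.Random(seed).shuffle(rows)
    y_true, y_pred = [], []
    for i in range(folds):
        test = rows[i::folds]
        train = [r for j, r in enumerate(rows) if j % folds != i]
        for features, label in test:
            y_true.append(label)
            y_pred.append(knn_predict(train, features, k))
    return precision_recall(y_true, y_pred, positive)


def make_toy_data(n_per_class=50, seed=1):
    """Hypothetical stand-in data: two well-separated 2-D clusters."""
    rng = random.Random(seed)
    a = [((rng.gauss(0, 0.5), rng.gauss(0, 0.5)), "a") for _ in range(n_per_class)]
    b = [((rng.gauss(5, 0.5), rng.gauss(5, 0.5)), "b") for _ in range(n_per_class)]
    return a + b


if __name__ == "__main__":
    p, r = cross_validate(make_toy_data())
    print(f"precision={p:.2f} recall={r:.2f}")
```

Sorting every training row per query costs O(n log n) per prediction, which is fine for a toy example; at the data sizes the abstract describes, a distributed or index-based neighbor search would be needed instead.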


Metadata
Title
DataSpeak: Data Extraction, Aggregation, and Classification Using Big Data Novel Algorithm
Authors
Venkatesh Gauri Shankar
Bali Devi
Sumit Srivastava
Copyright year
2019
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-1513-8_16
