Published in: Wireless Networks 3/2022

11-12-2018

A framework for social media data analytics using Elasticsearch and Kibana

Authors: Neel Shah, Darryl Willick, Vijay Mago

Abstract

Real-time online data processing is quickly becoming an essential tool in the analysis of social media for political trends, advertising, public health awareness programs and policy making. Traditionally, processes associated with offline analysis are productive and efficient only when the data collection is a one-time process. Current cutting-edge research, however, requires real-time data analysis, which brings its own set of challenges, particularly the efficiency of continuous data fetching with present NoSQL and relational databases. In this paper, we demonstrate a solution that effectively addresses the challenges of real-time analysis using a configurable Elasticsearch search engine. We use a distributed database architecture and pre-built indexing, and standardize the Elasticsearch framework for large-scale text mining. The results from the query engine are visualized in near real time.
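
As a rough illustration of what a pre-built index and a near-real-time query might look like in Elasticsearch, the sketch below builds the two JSON request bodies as plain Python dictionaries. It is not taken from the paper: the field names (`text`, `user`, `created_at`), the shard and replica counts, and the helper names `build_index_body` and `build_recent_query` are all hypothetical, and the typeless mapping and `fixed_interval` syntax assume an Elasticsearch 7.x-style API rather than whatever version the authors used.

```python
import json


def build_index_body(shards=3, replicas=1):
    """Return a create-index body with an explicit mapping (hypothetical fields).

    Declaring the mapping up front means documents are analyzed and indexed
    as they arrive, instead of relying on dynamic field detection.
    """
    return {
        "settings": {
            "number_of_shards": shards,      # spread data across nodes
            "number_of_replicas": replicas,  # copies kept for availability
        },
        "mappings": {
            "properties": {
                "text": {"type": "text"},        # full-text searchable post body
                "user": {"type": "keyword"},     # exact-match author handle
                "created_at": {"type": "date"},  # timestamp used for time windows
            }
        },
    }


def build_recent_query(keyword, window="now-15m", bucket="1m"):
    """Return a search body counting recent posts that mention `keyword`.

    The date_histogram aggregation groups matches into time buckets, which
    is the shape of data a dashboard such as Kibana plots over time.
    """
    return {
        "size": 0,  # only the aggregation is needed, not the matching documents
        "query": {
            "bool": {
                "must": [{"match": {"text": keyword}}],
                "filter": [{"range": {"created_at": {"gte": window}}}],
            }
        },
        "aggs": {
            "per_bucket": {
                "date_histogram": {"field": "created_at", "fixed_interval": bucket}
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_index_body(), indent=2))
    print(json.dumps(build_recent_query("flu"), indent=2))
```

These bodies could be sent to a cluster's create-index and `_search` endpoints with any HTTP client; re-running the query on a short timer is one simple way to approximate a continuously refreshed view.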


Metadata
Title: A framework for social media data analytics using Elasticsearch and Kibana
Authors: Neel Shah, Darryl Willick, Vijay Mago
Publication date: 11-12-2018
Publisher: Springer US
Published in: Wireless Networks, Issue 3/2022
Print ISSN: 1022-0038
Electronic ISSN: 1572-8196
DOI: https://doi.org/10.1007/s11276-018-01896-2
