Published in: The Journal of Supercomputing 5/2020

07 December 2018

A decade of big data literature: analysis of trends in light of bibliometrics

Authors: Iftikhar Ahmad, Gulzar Ahmed, Syed Adeel Ali Shah, Ejaz Ahmed



Abstract

Bibliometrics is a quantitative tool for the analysis of literature published in a scientific field. Using Scopus as the data source, we perform a thorough analysis of scholarly works published in the field of big data from 2008 to 2017. The objective of the work is to identify the most cited articles in the given time frame, the citation trends, the authorship trends, and the trends of research work in the related area. The analysis shows that over 50% of publications do not receive any citations, and the average number of citations per publication is 3.17. It is also observed that single authorship of research publications has declined over time. The analysis reveals the pioneering role played by the USA in advancing research in big data, a lead that has lately been taken over by China, and the large-scale usage of big data analytics in various domains of science.
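The two headline figures in the abstract (the share of publications with no citations and the average number of citations per publication) can coexist because citation counts are typically skewed: a few heavily cited works lift the mean while most works stay at zero. The following is a minimal sketch of how such figures might be computed from per-publication citation counts. The function name `citation_summary` and the toy numbers are assumptions for illustration only; they are not taken from the paper or from Scopus.

```python
from statistics import mean


def citation_summary(citation_counts):
    """Return (share_uncited, mean_citations) for per-publication citation counts."""
    if not citation_counts:
        raise ValueError("citation_counts must not be empty")
    # Count the publications that received no citations at all.
    uncited = sum(1 for count in citation_counts if count == 0)
    return uncited / len(citation_counts), mean(citation_counts)


# Hypothetical toy data: ten publications, NOT the paper's dataset.
toy_counts = [0, 0, 0, 0, 0, 0, 1, 2, 5, 22]
share_uncited, average = citation_summary(toy_counts)
print(f"uncited: {share_uncited:.0%}, mean citations: {average:.2f}")
```

In this toy sample most publications are uncited, yet the mean is well above zero because a single publication accounts for most of the citations.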


Footnotes
1
“Big data” dynamic factor models for macroeconomic measurement and forecasting: A discussion of the papers by Lucrezia Reichlin and by Mark W. Watson (Book Chapter) Diebold, F.X. 2003 Advances in Economics and Econometrics: Theory and Applications, Eighth World Congress, Volume III pp. 115–122 19.
 
2
Including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics.
 
Metadata
Title
A decade of big data literature: analysis of trends in light of bibliometrics
Authors
Iftikhar Ahmad
Gulzar Ahmed
Syed Adeel Ali Shah
Ejaz Ahmed
Publication date
07 December 2018
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 5/2020
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2714-x
