Skip to main content
Top

2017 | Supplement | Chapter

An Exploration of Wikipedia Data as a Measure of Regional Knowledge Distribution

Authors : Fabian Stephany, Fabian Braesemann

Published in: Social Informatics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In today’s economies, knowledge is the key ingredient for prosperity. However, it is hard to measure this intangible asset appropriately. Standard economic models mostly rely on common measures such as enrollment rates and international test scores. However, these proxies focus rather on the quality of education of pupils than on the distribution of knowledge among the whole population, which is increasingly defined by alternative sources of education such as online learning platforms. As a consequence, the economically relevant stock of knowledge in a region is only roughly approximated. Furthermore, they are abstract in content, and both capital-, and time-consuming in census. This paper proposes to explore Wikipedia data as an alternative source of capturing the knowledge distribution on a narrow geographical scale. Wikipedia is by far the largest digital encyclopedia worldwide and provides data on usage and editing publicly. We compare Wikipedia usage worldwide and edits in the U.S. to existing measures of the acquisition and stock of knowledge. The results indicate that there is a significant correlation between Wikipedia interactions and knowledge approximations on different geographical scales. Considering these results, it seems promising to further explore Wikipedia data to develop a reliable, inexpensive, and real-time proxy of knowledge distribution around the world.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Wikipedia provides over 40 million articles in 250 languages worldwide and is ranked among the top-ten most popular websites, see [3].
 
2
According the [3] the English language version alone has more than 30 million registered editors and additionally a large number of not-registered editors.
 
5
Visualiszations of these information geographies can be found here: [18, 19].
 
8
Thus we include all articles that link to the category “Computer Science” itself (level 1), to the subcategories that link to the “Computer Science” (level 2) and to their subcategories (level 3). This number of layers has been chosen to avoid the collection of edit data that are only very weakly related to computer science.
 
10
An interactive version of the map can be accessed via the online dashboard that provides supplementary information to this article: http://​bit.​ly/​Wiki_​Dashboard.
 
11
The data stems from the U.S. census: https://​www.​census.​gov/​.
 
12
Data on the number of students stems from http://​www.​stateuniversity.​com/​. City-level covariates are collected from http://​www.​city-data.​com/​/​ and a list of academic computer science departments is available on Wikipedia: http://​bit.​ly/​CS_​Departments.
 
Literature
2.
go back to reference Mayer-Schönberger, V., Cukier, K.: Learning with Big Data: The Future of Education. Houghton Mifflin Harcourt, New York (2014) Mayer-Schönberger, V., Cukier, K.: Learning with Big Data: The Future of Education. Houghton Mifflin Harcourt, New York (2014)
4.
go back to reference Moy, C.L., Locke, J.R., Coppola, B.P., McNeil, A.J.: Improving science education and understanding through editing Wikipedia. J. Chem. Educ. 87(11), 1159–1162 (2010). doi:10.1021/ed100367v CrossRef Moy, C.L., Locke, J.R., Coppola, B.P., McNeil, A.J.: Improving science education and understanding through editing Wikipedia. J. Chem. Educ. 87(11), 1159–1162 (2010). doi:10.​1021/​ed100367v CrossRef
7.
go back to reference Collier, B., Bear, J.: Conflict, criticism, or confidence: an empirical examination of the gender gap in Wikipedia contributions. In: Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, pp. 383–392. ACM (2012) Collier, B., Bear, J.: Conflict, criticism, or confidence: an empirical examination of the gender gap in Wikipedia contributions. In: Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work, pp. 383–392. ACM (2012)
8.
go back to reference Gloor, P., De Boer, P., Lo, W., Wagner, S., Nemoto, K., Fuehres, H.: Cultural Anthropology Through the Lens of Wikipedia-A Comparison of Historical Leadership Networks in the English, Chinese, Japanese and German Wikipedia, arXiv preprint arXiv:1502.05256 Gloor, P., De Boer, P., Lo, W., Wagner, S., Nemoto, K., Fuehres, H.: Cultural Anthropology Through the Lens of Wikipedia-A Comparison of Historical Leadership Networks in the English, Chinese, Japanese and German Wikipedia, arXiv preprint arXiv:​1502.​05256
9.
go back to reference Eom, Y.-H., Aragón, P., Laniado, D., Kaltenbrunner, A., Vigna, S., Shepelyansky, D.L.: Interactions of cultures and top people of Wikipedia from ranking of 24 language editions. PloS one 10(3), e0114825 (2015)CrossRef Eom, Y.-H., Aragón, P., Laniado, D., Kaltenbrunner, A., Vigna, S., Shepelyansky, D.L.: Interactions of cultures and top people of Wikipedia from ranking of 24 language editions. PloS one 10(3), e0114825 (2015)CrossRef
10.
go back to reference Laufer, P., Wagner, C., Flöck, F., Strohmaier, M.: Mining cross-cultural relations from Wikipedia: a study of 31 European food cultures. In: Proceedings of the ACM Web Science Conference, p. 3. ACM (2015) Laufer, P., Wagner, C., Flöck, F., Strohmaier, M.: Mining cross-cultural relations from Wikipedia: a study of 31 European food cultures. In: Proceedings of the ACM Web Science Conference, p. 3. ACM (2015)
11.
go back to reference Ronen, S., Gonçalves, B., Hu, K.Z., Vespignani, A., Pinker, S., Hidalgo, C.A.: Links that speak: the global language network and its association with global fame. Proc. Nat. Acad. Sci. 111(52), E5616–E5622 (2014)CrossRef Ronen, S., Gonçalves, B., Hu, K.Z., Vespignani, A., Pinker, S., Hidalgo, C.A.: Links that speak: the global language network and its association with global fame. Proc. Nat. Acad. Sci. 111(52), E5616–E5622 (2014)CrossRef
12.
go back to reference Yasseri, T., Spoerri, A., Graham, M., Kertész, J.: The most controversial topics in Wikipedia: a multilingual and geographical analysis. arXiv:1305.5566 [physics] Yasseri, T., Spoerri, A., Graham, M., Kertész, J.: The most controversial topics in Wikipedia: a multilingual and geographical analysis. arXiv:​1305.​5566 [physics]
13.
go back to reference Borra, E., Weltevrede, E., Ciuccarelli, P., Kaltenbrunner, A., Laniado, D., Magni, G., Mauri, M., Rogers, R., Venturini, T.: Societal controversies in Wikipedia articles. In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, pp. 193–196. ACM (2015) Borra, E., Weltevrede, E., Ciuccarelli, P., Kaltenbrunner, A., Laniado, D., Magni, G., Mauri, M., Rogers, R., Venturini, T.: Societal controversies in Wikipedia articles. In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, pp. 193–196. ACM (2015)
15.
go back to reference Graham, M., Hogan, B., Straumann, R.K., Medhat, A.: Uneven geographies of user-generated information: patterns of increasing informational poverty. Ann. Assoc. Am. Geogr. 104(4), 746–764 (2014)CrossRef Graham, M., Hogan, B., Straumann, R.K., Medhat, A.: Uneven geographies of user-generated information: patterns of increasing informational poverty. Ann. Assoc. Am. Geogr. 104(4), 746–764 (2014)CrossRef
16.
go back to reference Graham, M., De Sabbata, S., Zook, M.A.: Towards a study of information geographies: (im) mutable augmentations and a mapping of the geographies of information. Geo: Geogr. Environ. 2(1), 88–105 (2015) Graham, M., De Sabbata, S., Zook, M.A.: Towards a study of information geographies: (im) mutable augmentations and a mapping of the geographies of information. Geo: Geogr. Environ. 2(1), 88–105 (2015)
17.
go back to reference Hardy, D., Frew, J., Goodchild, M.F.: Volunteered geographic information production as a spatial process. Int. J. Geogr. Inf. Sci. 26(7), 1191–1212 (2012)CrossRef Hardy, D., Frew, J., Goodchild, M.F.: Volunteered geographic information production as a spatial process. Int. J. Geogr. Inf. Sci. 26(7), 1191–1212 (2012)CrossRef
Metadata
Title
An Exploration of Wikipedia Data as a Measure of Regional Knowledge Distribution
Authors
Fabian Stephany
Fabian Braesemann
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-67256-4_4