Skip to main content
Erschienen in: International Journal on Digital Libraries 4/2019

08.02.2018

Time-focused analysis of connectivity and popularity of historical persons in Wikipedia

verfasst von: Adam Jatowt, Daisuke Kawai, Katsumi Tanaka

Erschienen in: International Journal on Digital Libraries | Ausgabe 4/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Wikipedia contains large amounts of content related to history. It is being used extensively for many knowledge intensive tasks within computer science, digital humanities and related fields. In this paper, we look into Wikipedia articles on historical people for studying link-related temporal features of articles on past people. Our study sheds new light on the characteristics of information about historical people recorded in the English Wikipedia and quantifies user interest in such data. We propose a novel style of analysis in which we use signals derived from the hyperlink structure of Wikipedia as well as from article view logs, and we overlay them over temporal dimension to understand relations between time periods, link structure and article popularity. In the latter part of the paper, we also demonstrate several ways for estimating person importance based on the temporal aspects of the link structure as well as a method for ranking cities using the computed importance scores of their related persons.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
4
Note that, naturally, the amount of possible links from the articles on “future people” decreases the closer to the latest decade due to the decreasing numbers of articles of people from the recent times from which such links could originate. Similar case applies to the links from the articles on “past people” when “moving away” from the present toward the distant past.
 
5
Due to their large size we do not show networks for the most recent centuries.
 
Literatur
1.
Zurück zum Zitat Assmann, A.: Introduction to Cultural Studies. Schmidt Erich Verlag, Wirtschaft (2008). (in German) Assmann, A.: Introduction to Cultural Studies. Schmidt Erich Verlag, Wirtschaft (2008). (in German)
2.
Zurück zum Zitat Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: A Nucleus for a Web of Open Data. In: ISWC’07/ASWC’07, pp. 722–735. Springer (2007) Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: A Nucleus for a Web of Open Data. In: ISWC’07/ASWC’07, pp. 722–735. Springer (2007)
3.
Zurück zum Zitat Yeung, C.-M. Au, Jatowt, A.: Studying how the past is remembered: towards computational history through large scale text mining. In: CIKM, pp. 1231–1240 (2011) Yeung, C.-M. Au, Jatowt, A.: Studying how the past is remembered: towards computational history through large scale text mining. In: CIKM, pp. 1231–1240 (2011)
5.
Zurück zum Zitat Carr, E.H.: What is History?. Penguin, London (1961) Carr, E.H.: What is History?. Penguin, London (1961)
6.
Zurück zum Zitat Cook, J., Das Sarma, A., Fabrikant, A., Tomkins, A.: Weeks, your two, of fame and your grandmother’s. In: WWW: ACM, New York, NY. USA, pp. 919–928 (2012) Cook, J., Das Sarma, A., Fabrikant, A., Tomkins, A.: Weeks, your two, of fame and your grandmother’s. In: WWW: ACM, New York, NY. USA, pp. 919–928 (2012)
8.
Zurück zum Zitat Düring, M.: Can Network Analysis Reveal Importance? Degree Centrality and Leaders in the EU Integration Process. Social Informatics, pp. 314–318. Springer, Berlin (2014) Düring, M.: Can Network Analysis Reveal Importance? Degree Centrality and Leaders in the EU Integration Process. Social Informatics, pp. 314–318. Springer, Berlin (2014)
9.
Zurück zum Zitat Ebbinghaus, H.: Memory: A Contribution to Experimental Psychology. Columbia University, New York (1913)CrossRef Ebbinghaus, H.: Memory: A Contribution to Experimental Psychology. Columbia University, New York (1913)CrossRef
10.
Zurück zum Zitat Eom, Y.-H., Aragón, P., Laniado, D., Kaltenbrunner, A., Vigna, S., Shepelyansky, D.L.: Interactions of cultures and top people of Wikipedia from ranking of 24 language editions. PLoS ONE 10(3), e0114825 (2014)CrossRef Eom, Y.-H., Aragón, P., Laniado, D., Kaltenbrunner, A., Vigna, S., Shepelyansky, D.L.: Interactions of cultures and top people of Wikipedia from ranking of 24 language editions. PLoS ONE 10(3), e0114825 (2014)CrossRef
11.
Zurück zum Zitat Ferron, M., Massa, P.: Collective memory building in Wikipedia: the case of North African uprisings. In: WikiSym ’11. ACM, New York, NY, USA, 114–123 (2011) Ferron, M., Massa, P.: Collective memory building in Wikipedia: the case of North African uprisings. In: WikiSym ’11. ACM, New York, NY, USA, 114–123 (2011)
12.
Zurück zum Zitat Friedman, R.: The Life Millennium: The 100 Most Important Events and People of the Past 1000 Years. Bulfinch, New York City (1998) Friedman, R.: The Life Millennium: The 100 Most Important Events and People of the Past 1000 Years. Bulfinch, New York City (1998)
13.
Zurück zum Zitat Gabrilovich, E., Markovitch, S.: Computing semantic relatedness using Wikipedia-based explicit semantic analysis. Proc. IJCAI 2007, 1606–1611 (2007) Gabrilovich, E., Markovitch, S.: Computing semantic relatedness using Wikipedia-based explicit semantic analysis. Proc. IJCAI 2007, 1606–1611 (2007)
14.
Zurück zum Zitat Gabrilovich, E., et al.: Overcoming the brittleness bottleneck using Wikipedia: Enhancing text categorization with encyclopedic knowledge. In: AAAI (2006) Gabrilovich, E., et al.: Overcoming the brittleness bottleneck using Wikipedia: Enhancing text categorization with encyclopedic knowledge. In: AAAI (2006)
15.
Zurück zum Zitat Gadamer, H.-G.: Truth and Method. Sheed and Ward, London (1975) Gadamer, H.-G.: Truth and Method. Sheed and Ward, London (1975)
16.
Zurück zum Zitat Garcia-Fernandez, A., Ligozat, A.-L., Dinarelli, M., Bernhard, D.: When was it written? Automatically determining publication dates. In: SPIRE (2011) Garcia-Fernandez, A., Ligozat, A.-L., Dinarelli, M., Bernhard, D.: When was it written? Automatically determining publication dates. In: SPIRE (2011)
17.
Zurück zum Zitat Geipel, M.: Self-organization applied to dynamic network layout. Int. J. Mod. Phys. C 18(10), 1537–1549 (2007)CrossRef Geipel, M.: Self-organization applied to dynamic network layout. Int. J. Mod. Phys. C 18(10), 1537–1549 (2007)CrossRef
18.
Zurück zum Zitat Giles, J.: Internet Encyclopaedias go head to head. Nature 438, 900–901 (2005)CrossRef Giles, J.: Internet Encyclopaedias go head to head. Nature 438, 900–901 (2005)CrossRef
19.
Zurück zum Zitat Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In VLDB 576–587, 2004 (2004) Gyöngyi, Z., Garcia-Molina, H., Pedersen, J.: Combating web spam with trustrank. In VLDB 576–587, 2004 (2004)
20.
Zurück zum Zitat Halbwachs, M.: La Mémoire Collective. Les Presses Universitaires de France (1950) (in French) Halbwachs, M.: La Mémoire Collective. Les Presses Universitaires de France (1950) (in French)
21.
Zurück zum Zitat Hart, M.H.: The 100: A Ranking of the Most Influential Persons in History. Citadel; Revised edition (2000) Hart, M.H.: The 100: A Ranking of the Most Influential Persons in History. Citadel; Revised edition (2000)
22.
Zurück zum Zitat Hoerl, C., McCormack, T.: Time and Memory: Issues in Philosophy and Psychology. Oxford University Press, Oxford (2001) Hoerl, C., McCormack, T.: Time and Memory: Issues in Philosophy and Psychology. Oxford University Press, Oxford (2001)
23.
Zurück zum Zitat Hoffart, J., et al.: YAGO2: Exploring and querying world knowledge in time, space, context, and many languages. In: WWW pp. 229–232 (2011) Hoffart, J., et al.: YAGO2: Exploring and querying world knowledge in time, space, context, and many languages. In: WWW pp. 229–232 (2011)
24.
Zurück zum Zitat Hoffmann, L.: Looking back at big data. Commun. ACM 56(4), 21–23 (2013)CrossRef Hoffmann, L.: Looking back at big data. Commun. ACM 56(4), 21–23 (2013)CrossRef
25.
Zurück zum Zitat Huet, T., Biega, J., Suchanek, F.: Mining history with Le Monde. In: AKBC 2013 workshop at CIKM2013 (2013) Huet, T., Biega, J., Suchanek, F.: Mining history with Le Monde. In: AKBC 2013 workshop at CIKM2013 (2013)
26.
Zurück zum Zitat Jacoby, R.: Social Amnesia: A Critique of Contemporary Psychology. Transaction Publishers, Piscataway (1997) Jacoby, R.: Social Amnesia: A Critique of Contemporary Psychology. Transaction Publishers, Piscataway (1997)
27.
Zurück zum Zitat Jatowt, A., Antoine, E., Kawai, Y., Akiyama, T.: Mapping temporal horizons. Analysis of collective future and past related attention in microblogging. In: WWW, pp. 484–494 (2015) Jatowt, A., Antoine, E., Kawai, Y., Akiyama, T.: Mapping temporal horizons. Analysis of collective future and past related attention in microblogging. In: WWW, pp. 484–494 (2015)
28.
Zurück zum Zitat Jatowt, A., Kawai, D., Tanaka, K.: Digital history meets Wikipedia: analyzing historical persons in Wikipedia. In: Proceedings of the 16th ACM/IEEE-CS Joint Conference on Digital Libraries. (JCDL 2016). ACM Press, Newark, USA, pp. 17–26 (2016) Jatowt, A., Kawai, D., Tanaka, K.: Digital history meets Wikipedia: analyzing historical persons in Wikipedia. In: Proceedings of the 16th ACM/IEEE-CS Joint Conference on Digital Libraries. (JCDL 2016). ACM Press, Newark, USA, pp. 17–26 (2016)
29.
Zurück zum Zitat Jatowt, A., Kawai, D., Tanaka, K.: Predicting importance of historical persons using Wikipedia. In: Proceedings of the 25th ACM International Conference on Information and Knowledge Management (CIKM 2016), ACM Press, Indianapolis, IN, USA, pp. 1909–1912 (2016) Jatowt, A., Kawai, D., Tanaka, K.: Predicting importance of historical persons using Wikipedia. In: Proceedings of the 25th ACM International Conference on Information and Knowledge Management (CIKM 2016), ACM Press, Indianapolis, IN, USA, pp. 1909–1912 (2016)
30.
Zurück zum Zitat Jatowt, A., Kawai, D., Tanaka, K.: Timestamping entities using contextual information. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017). ACM Press, Tokyo, Japan, pp. 1205–1208 (2017) Jatowt, A., Kawai, D., Tanaka, K.: Timestamping entities using contextual information. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017). ACM Press, Tokyo, Japan, pp. 1205–1208 (2017)
31.
Zurück zum Zitat Joho, H., Jatowt, A., Blanco, R.: Temporal information searching behaviour and tactics. Inf. Process. Manag. J. 51(6), 834–850 (2015)CrossRef Joho, H., Jatowt, A., Blanco, R.: Temporal information searching behaviour and tactics. Inf. Process. Manag. J. 51(6), 834–850 (2015)CrossRef
32.
Zurück zum Zitat Kanhabua, N., Niederée, C., Siberski, W.: Towards concise preservation by managed forgetting: research issues and case study. In: iPres (2013) Kanhabua, N., Niederée, C., Siberski, W.: Towards concise preservation by managed forgetting: research issues and case study. In: iPres (2013)
33.
Zurück zum Zitat Kanhabua, N., Nguyen, T.N., Niederée, C.: What triggers human remembering of events? A large-scale analysis of catalysts for collective memory in Wikipedia. In: JCDL, pp. 341–350 (2014) Kanhabua, N., Nguyen, T.N., Niederée, C.: What triggers human remembering of events? A large-scale analysis of catalysts for collective memory in Wikipedia. In: JCDL, pp. 341–350 (2014)
34.
Zurück zum Zitat Kinzler, D.: WikiSense—Mining the Wiki. In: Proceedings of Wikimania 2005. In: The First International Wikimedia Conference. Wikimedia Foundation (2005) Kinzler, D.: WikiSense—Mining the Wiki. In: Proceedings of Wikimania 2005. In: The First International Wikimedia Conference. Wikimedia Foundation (2005)
35.
Zurück zum Zitat Kittur, N., Chi, E.H., Suh, B.: What’s in Wikipedia? Mapping topics and conflict using socially annotated category structure. In: CHI ’09, pp. 1509–1512 (2009) Kittur, N., Chi, E.H., Suh, B.: What’s in Wikipedia? Mapping topics and conflict using socially annotated category structure. In: CHI ’09, pp. 1509–1512 (2009)
36.
Zurück zum Zitat Kremer, M.: Population growth and technological change: one million B.C. to 1990. Quart. J. Econ. 108, 681–716 (1993)CrossRef Kremer, M.: Population growth and technological change: one million B.C. to 1990. Quart. J. Econ. 108, 681–716 (1993)CrossRef
37.
Zurück zum Zitat Lazer, D., et al.: Computational social science. Science 323, 721–723 (2009)CrossRef Lazer, D., et al.: Computational social science. Science 323, 721–723 (2009)CrossRef
38.
Zurück zum Zitat Lendvai, P., Zervanou, K.: In: Proceedings of the 7th workshop on language technology for cultural heritage, social sciences, and humanities (LaTeCH 2013) at ACL’13 (2013) Lendvai, P., Zervanou, K.: In: Proceedings of the 7th workshop on language technology for cultural heritage, social sciences, and humanities (LaTeCH 2013) at ACL’13 (2013)
40.
Zurück zum Zitat McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Annu. Rev. Sociol. 27, 415–444 (2001)CrossRef McPherson, M., Smith-Lovin, L., Cook, J.M.: Birds of a feather: homophily in social networks. Annu. Rev. Sociol. 27, 415–444 (2001)CrossRef
41.
Zurück zum Zitat Medelyan, O., Milne, D., Legg, C., Witten, Ian H.: Mining Meaning from Wikipedia. Int. J. Hum.-Comput. Stud. 67(9), 716–754 (2009)CrossRef Medelyan, O., Milne, D., Legg, C., Witten, Ian H.: Mining Meaning from Wikipedia. Int. J. Hum.-Comput. Stud. 67(9), 716–754 (2009)CrossRef
42.
Zurück zum Zitat Michel, J.-B., et al.: Quantitative analysis of culture using millions of digitized books. Science 331(6014), 176–182 (2011)CrossRef Michel, J.-B., et al.: Quantitative analysis of culture using millions of digitized books. Science 331(6014), 176–182 (2011)CrossRef
43.
Zurück zum Zitat Milne, D., Medelyan, O., Witten, I.H.: Mining domain-specific thesauri from Wikipedia: a case study. In: WI’06, pp. 442–448 (2006) Milne, D., Medelyan, O., Witten, I.H.: Mining domain-specific thesauri from Wikipedia: a case study. In: WI’06, pp. 442–448 (2006)
45.
Zurück zum Zitat Nunes, S., Ribeiro, C., David, G.: Using neighbors to date web documents. In: Proceedings of the WIDM’07 workshop associated to CIKM’07, pp. 129–136 (2007) Nunes, S., Ribeiro, C., David, G.: Using neighbors to date web documents. In: Proceedings of the WIDM’07 workshop associated to CIKM’07, pp. 129–136 (2007)
46.
Zurück zum Zitat Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical Report, Stanford University (1998) Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the web. Technical Report, Stanford University (1998)
47.
Zurück zum Zitat Rosenzweig, R.: Can history be open source? Wikipedia and the future of the past. J. Am. Hist. 93(1), 117–46 (2006)CrossRef Rosenzweig, R.: Can history be open source? Wikipedia and the future of the past. J. Am. Hist. 93(1), 117–46 (2006)CrossRef
48.
Zurück zum Zitat Skiena, S., Ward, C.B.: Who’s Bigger. Where Historical Figures Really Rank. Cambridge University Press, Cambridge (2014) Skiena, S., Ward, C.B.: Who’s Bigger. Where Historical Figures Really Rank. Cambridge University Press, Cambridge (2014)
49.
Zurück zum Zitat Strube, M., Ponzetto, S.: WikiRelate! Computing semantic relatedness using Wikipedia. In: AAAI-06, pp. 1419–1424 (2006) Strube, M., Ponzetto, S.: WikiRelate! Computing semantic relatedness using Wikipedia. In: AAAI-06, pp. 1419–1424 (2006)
50.
Zurück zum Zitat Sturrock, J.: Structuralism and since: from Lévi Strauss to Derrida, Introduction (1979) Sturrock, J.: Structuralism and since: from Lévi Strauss to Derrida, Introduction (1979)
51.
Zurück zum Zitat Takahashi, Y., Ohshima, H., Yamamoto, M., Iwasaki, H., Oyama, S., Tanaka, K.: Evaluating significance of historical entities based on tempo-spatial impacts analysis using wikipedia link structure. In: Proceedings of HT ’11. ACM, New York, NY, USA, pp. 83–92 (2011) Takahashi, Y., Ohshima, H., Yamamoto, M., Iwasaki, H., Oyama, S., Tanaka, K.: Evaluating significance of historical entities based on tempo-spatial impacts analysis using wikipedia link structure. In: Proceedings of HT ’11. ACM, New York, NY, USA, pp. 83–92 (2011)
52.
Zurück zum Zitat Whiting, S., Jose, J.M., Alonso, O.: Wikipedia as a time machine. In: TempWeb’14 at WWW2014, pp. 857–861 (2014) Whiting, S., Jose, J.M., Alonso, O.: Wikipedia as a time machine. In: TempWeb’14 at WWW2014, pp. 857–861 (2014)
53.
Zurück zum Zitat Wood, T.: An introduction to civil registration. Federation of Family History Societies (Publications) (1994) Wood, T.: An introduction to civil registration. Federation of Family History Societies (Publications) (1994)
54.
Zurück zum Zitat Vrandečić, D., Krötzsch, M.: A free collaborative knowledge base. Commun. ACM 57(1), 78–85 (2014)CrossRef Vrandečić, D., Krötzsch, M.: A free collaborative knowledge base. Commun. ACM 57(1), 78–85 (2014)CrossRef
55.
Zurück zum Zitat Zaagsma, G.: On digital history. BMGN Low Ctries. Hist. Rev. 128(4), 3–29 (2013)CrossRef Zaagsma, G.: On digital history. BMGN Low Ctries. Hist. Rev. 128(4), 3–29 (2013)CrossRef
56.
Zurück zum Zitat Zhang, X., Asano, Y., Yoshikawa, M.: Mining knowledge on relationships between objects from the web. IEICE Trans. 97–D(1), 77–88 (2014)CrossRef Zhang, X., Asano, Y., Yoshikawa, M.: Mining knowledge on relationships between objects from the web. IEICE Trans. 97–D(1), 77–88 (2014)CrossRef
57.
Zurück zum Zitat Au Yeung, C.M., Tomoharu, T.: Extracting multi-dimensional relations: a generative model of groups of entities in a corpus. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management Au Yeung, C.M., Tomoharu, T.: Extracting multi-dimensional relations: a generative model of groups of entities in a corpus. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management
Metadaten
Titel
Time-focused analysis of connectivity and popularity of historical persons in Wikipedia
verfasst von
Adam Jatowt
Daisuke Kawai
Katsumi Tanaka
Publikationsdatum
08.02.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Digital Libraries / Ausgabe 4/2019
Print ISSN: 1432-5012
Elektronische ISSN: 1432-1300
DOI
https://doi.org/10.1007/s00799-018-0231-4

Weitere Artikel der Ausgabe 4/2019

International Journal on Digital Libraries 4/2019 Zur Ausgabe