ABSTRACT
Along the history, many researchers provided remarkable contributions to science, not only advancing knowledge but also in terms of mentoring new scientists. Currently, identifying and studying the formation of researchers over the years is a challenging task as current repositories of theses and dissertations are cataloged in a decentralized way through many local digital libraries. In this paper, we give a first step towards building a large repository that records the academic genealogy of researchers across fields and countries. We crawled data from the Networked Digital Library of Theses and Dissertations (NDLTD) and develop a framework to extract academic genealogy trees from this data and provide a series of analyses that describe the main properties of the academic genealogy trees. Our effort identified interesting findings related to the structure of academic formation, which highlight the importance of cataloging academic genealogy trees. We hope our initial framework will be the basis of a much larger crowdsourcing system.
- B. L. Alves, F. Benevenuto, and A. H. F. Laender. The Role of Research Leaders on the Evolution of Scientific Communities. In Proc. of WWW (Companion Volume), pages 649--656, Rio de Janeiro, Brazil, 2013. Google ScholarDigital Library
- A.-L. Barabâsi, H. Jeong, Z. Néda, E. Ravasz, A. Schubert, and T. Vicsek. Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and its Applications, 311(3):590--614, 2002.Google ScholarCross Ref
- F. Benevenuto, A. H. F. Laender, and B. L. Alves. The h-index paradox: your coauthors have a higher h-index than you do. Scientometrics, 106(1):469--474, 2016. Google ScholarDigital Library
- W. A. Burchard and R. M. Keller. Some approaches to best-match file searching. CACM, 16(4):230--236, 1973. Google ScholarDigital Library
- P. Calado, M. A. Gonçalves, E. A. Fox, B. A. Ribeiro-Neto, A. H. F. Laender, A. S. da Silva, D. de Castro Reis, P. A. Roberto, M. V. Vieira, and J. P. Lage. The Web-DL Environment for Building Digital Libraries from the Web. In Proc. of JCDL, pages 346--357, Houston, USA, 2003. Google ScholarDigital Library
- S. Chang. Academic genealogy of american physicists. AAPPS Bulletin, 13(6):6--41, 2003.Google Scholar
- S. V. David and B. Y. Hayden. Neurotree: A collaborative, graphical database of the academic genealogy of neuroscience. PLoS ONE, 7(10):e46608, 2012.Google ScholarCross Ref
- E. A. Fox, M. A. Gonçalves, G. McMillan, J. L. Eaton, A. Atkins, and N. A. Kipp. The networked digital library of theses and dissertations: Changes in the university community. J. Comp. in H. Educ., 13(2):102--124, 2002.Google ScholarCross Ref
- W. Gl\"anzel. National characteristics in international scientific co-authorship relations. Scientometrics, 51(1):69--115, 2001.Google ScholarCross Ref
- R. C. Griffiths. Counting genealogical trees. J. of Math. Biol., 25(4):423--431, 1987.Google ScholarCross Ref
- J. E. Hirsch. An index to quantify an individual's scientific research output. PNAS, 102(46):16569--16572, 2005.Google ScholarCross Ref
- A. Jackson. A labor of love: the mathematics genealogy project. Notices of the AMS, 54(8):1002--1003, 2007.Google Scholar
- X. Liu, J. Bollen, M. L. Nelson, and H. Van de Sompel. Coauthorship networks in the digital library research community. IPM, 41(6):1462--1480, 2005. Google ScholarDigital Library
- R. D. Malmgren, J. M. Ottino, and L. A. N. Amaral. The role of mentorship in protégé performance. Nature, 465(7298):622--626, 2010.Google ScholarCross Ref
- G. V. Menezes, N. Ziviani, A. H. F. Laender, and V. A. F. Almeida. A Geographical Analysis of Knowledge Production in Computer Science. In Proc. of WWW, pages 1041--1050, Madrid, Spain, 2009. Google ScholarDigital Library
- M. E. Newman. The structure of scientific collaboration networks. PNAS, 98(2):404--409, 2001.Google ScholarCross Ref
- M. E. Newman. Coauthorship networks and patterns of scientific collaboration. PNAS, 101(s. 1):5200--5205, 2004.Google Scholar
- L. Rossi and J. P. Mena-Chalco. Caracterizaçao de árvores de genealogia acadêmica por meio de métricas em grafos. Anais do XXXIV CSBC, pages 21--32, 2014 (in Portuguese).Google Scholar
- E. Tuesta, K. Delgado, R. Mugnaini, L. Digiampietri, J. Mena-Chalco, and J. Pérez-Alcázar. Analysis of an Advisor-Advisee Relationship: An Exploratory Study of the Area of Exact and Earth Sciences in Brazil. PloS One, 10(5):e0129065--e0129065, 2014.Google Scholar
Index Terms
- Extracting Academic Genealogy Trees from the Networked Digital Library of Theses and Dissertations
Recommendations
Knowledge discovery in digital libraries of electronic theses and dissertations: an NDLTD case study
Special Issue on Very Large Digital LibrariesMany scholarly writings today are available in electronic formats. With universities around the world choosing to make digital versions of their dissertations, theses, project reports, and related files and data sets available online, an overwhelming ...
The impact of electronic theses and dissertations: a study of the institutional repository of a university in South Africa
This paper investigates the academic and societal impacts of the engineering electronic theses and dissertations (ETDs) at the North-West University. The citation counts obtained through Google Scholar (GS) and the altmetrics data obtained from Dspace ...
Status of Electronic Theses and Dissertations (ETDs) in Academic Libraries in Zimbabwe
Globally, electronic thesis and dissertations (ETDs) are deposited within the institutional and national repository portals and related databases. The purpose was to increase visibility of research output, and preserve and manage knowledge sharing ...
Comments