Published in: Social Network Analysis and Mining 1/2021

01-12-2021 | Original Article

Effectively clustering researchers in scientific collaboration networks: case study on ResearchGate

Authors: Marcos Wander Rodrigues, Mark A. Junho Song, Luis Enrique Zárate

Abstract

Social networks play a significant role in sharing knowledge. Online scientific collaboration networks allow scientific articles and research results to be shared, and they support interaction and possible collaboration between researchers. These networks have many users and store varied data about each of them, and a subset of these data is used to characterize and group similar users. The number of attributes available for each instance (user) can reach several hundred, which makes this a high-dimensional problem. Dimensionality reduction is therefore indispensable for removing redundant and irrelevant attributes, improving the performance of machine learning algorithms, and making models more understandable. To produce an efficient recommendation system for collaborative research, one of the main challenges for dimensionality reduction techniques is guaranteeing that the information in the original data is still represented in the reduced dataset. For our dimensionality reduction we used Factor Analysis, as it preserves the relationships between the variables. In this study, we characterize the profiles of ResearchGate users after applying dimensionality reduction to two different datasets: a dataset of continuous attributes composed of profile metrics, and a dataset of dichotomous attributes containing interest topics. We evaluated our methodology using two recommendation applications: (1) identifying groups of researchers through a global profile extraction process; and (2) identifying profiles similar to a reference profile. For both applications, we used hierarchical clustering techniques to identify the groups of user profiles. Our experiments show that the Factor Analysis transformation was able to preserve the relevant information in the data, resulting in an effective clustering process for the recommendation system for collaborative networks of researchers.
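The two recommendation applications named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it skips the Factor Analysis step entirely and works directly on toy dichotomous data. The researcher names, the interest topics, the choice of Jaccard distance, and the choice of average linkage are all assumptions made for illustration; the abstract only states that hierarchical clustering was used.

```python
from itertools import combinations

# Hypothetical toy data: each researcher is a set of interest topics,
# which is equivalent to a dichotomous attribute vector. All names invented.
profiles = {
    "r1": {"clustering", "data mining", "social networks"},
    "r2": {"clustering", "data mining", "machine learning"},
    "r3": {"protein folding", "genomics"},
    "r4": {"genomics", "protein folding", "bioinformatics"},
    "r5": {"social networks", "data mining"},
}


def jaccard_distance(a, b):
    """Return 1 - |a & b| / |a | b|, or 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def average_linkage(c1, c2, data):
    """Mean pairwise Jaccard distance between the members of two clusters."""
    total = sum(jaccard_distance(data[i], data[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerative(data, n_clusters):
    """Application (1): merge the two closest clusters until n_clusters remain."""
    clusters = [frozenset([name]) for name in sorted(data)]
    while len(clusters) > n_clusters:
        a, b = min(
            combinations(clusters, 2),
            key=lambda pair: average_linkage(pair[0], pair[1], data),
        )
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters


def most_similar(reference, data, k=2):
    """Application (2): the k profiles closest to a reference profile."""
    others = [name for name in data if name != reference]
    return sorted(
        others,
        key=lambda n: (jaccard_distance(data[reference], data[n]), n),
    )[:k]


groups = agglomerative(profiles, n_clusters=2)
neighbours = most_similar("r1", profiles, k=2)
print(groups)
print(neighbours)
```

In a pipeline closer to the one the abstract describes, the inputs to the distance function would be factor scores produced by Factor Analysis rather than raw topic sets, and the distance measure would have to suit continuous values.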


Metadata
Title
Effectively clustering researchers in scientific collaboration networks: case study on ResearchGate
Authors
Marcos Wander Rodrigues
Mark A. Junho Song
Luis Enrique Zárate
Publication date
01-12-2021
Publisher
Springer Vienna
Published in
Social Network Analysis and Mining / Issue 1/2021
Print ISSN: 1869-5450
Electronic ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-021-00781-9
