Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2021

01.12.2021 | Original Article

Effectively clustering researchers in scientific collaboration networks: case study on ResearchGate

verfasst von: Marcos Wander Rodrigues, Mark A. Junho Song, Luis Enrique Zárate

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Social networks play a significant role in sharing knowledge. Scientific collaboration online networks allow scientific articles and research results to be shared, and the interaction and possible collaboration between researchers. These networks have many users and store varied data about each of them, and which of the data are used to characterize and grouping similar users. The number of attributes available about each instance (user) can reach several hundred, making this a problem with high dimensionality. Thus, dimensionality reduction is indispensable to remove redundant and irrelevant attributes to improve machine learning algorithms’ performance and make models more understandable. In order to produce an efficient recommendation system for collaborative research, one of the main challenges of dimensionality reduction techniques is guaranteeing that the information of the data is represented in the reduced dataset after the reduction. In our dimensionality reduction, we used Factor Analysis, as it preserves the relationships between the variables. In this study, we characterize the profiles of ResearchGate users after applying dimensionality reduction to two different datasets. A dataset of continuous attributes composed of profile metrics and a dataset of dichotomous attributes contained interest topics. We evaluated our methodology using two recommendation applications: (1) Identifying groups of researchers through a global profile extraction process; and (2) Identifying profiles similar to a reference profile. For both applications, we used hierarchical clustering techniques to identify the groups of user profiles. Our experiments show that the Factor Analysis transformation was able to preserve the relevant information in the data, resulting in an effective clustering process for the recommendation system for collaborative networks of researchers.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Brown TA (2015) Confirmatory factor analysis for applied research, 2nd edn. Methodology in the Social Sciences. Guilford Publications Brown TA (2015) Confirmatory factor analysis for applied research, 2nd edn. Methodology in the Social Sciences. Guilford Publications
Zurück zum Zitat Cunningham JP, Ghahramani Z (2015) Linear dimensionality reduction: survey, insights, and generalizations. J Mach Learn Res 16:2859–2900MathSciNetMATH Cunningham JP, Ghahramani Z (2015) Linear dimensionality reduction: survey, insights, and generalizations. J Mach Learn Res 16:2859–2900MathSciNetMATH
Zurück zum Zitat Galbraith JI, Bartholomew DJ, Steele F, Moustaki I (2002) The analysis and interpretation of multivariate data for social scientists. CRC Press, CambridgeCrossRef Galbraith JI, Bartholomew DJ, Steele F, Moustaki I (2002) The analysis and interpretation of multivariate data for social scientists. CRC Press, CambridgeCrossRef
Zurück zum Zitat Ghodsi A (2006) Dimensionality reduction a short tutorial. Department of Statistics and Actuarial Science, University of Waterloo, Ontario, Canada 37: 38 Ghodsi A (2006) Dimensionality reduction a short tutorial. Department of Statistics and Actuarial Science, University of Waterloo, Ontario, Canada 37: 38
Zurück zum Zitat Hoang DT, Nguyen NT, Hwang D (2018) A group recommender system for selecting experts to review a specific problem. In: Nguyen NT, Pimenidis E, Khan Z, Trawiński B (eds) Computational collective intelligence. Springer International Publishing, Cham, pp 270–280CrossRef Hoang DT, Nguyen NT, Hwang D (2018) A group recommender system for selecting experts to review a specific problem. In: Nguyen NT, Pimenidis E, Khan Z, Trawiński B (eds) Computational collective intelligence. Springer International Publishing, Cham, pp 270–280CrossRef
Zurück zum Zitat Jammalamadaka S, Sengupta A (2001) Topics in Circular Statistics. World Scientific, Series on multivariate analysis Jammalamadaka S, Sengupta A (2001) Topics in Circular Statistics. World Scientific, Series on multivariate analysis
Zurück zum Zitat Li L, He D, Zhang C (2016) Evaluating academic answer quality: a pilot study on research gate q&a. In: Nah FFH, Tan CH (eds) HCI in business, government, and organizations: ecommerce and innovation. Springer International Publishing, Cham, pp 61–71CrossRef Li L, He D, Zhang C (2016) Evaluating academic answer quality: a pilot study on research gate q&a. In: Nah FFH, Tan CH (eds) HCI in business, government, and organizations: ecommerce and innovation. Springer International Publishing, Cham, pp 61–71CrossRef
Zurück zum Zitat Maruyama GM (1997) Basics of structural equation modeling. SAGE Publications Maruyama GM (1997) Basics of structural equation modeling. SAGE Publications
Zurück zum Zitat Mukaka M (2012) A guide to appropriate use of correlation coefficient in medical research. Malawi Med J 24:69–71 Mukaka M (2012) A guide to appropriate use of correlation coefficient in medical research. Malawi Med J 24:69–71
Zurück zum Zitat dos Tiago RL, Santos LEZ (2015) Categorical data clustering: What similarity measure to recommend? Expert Syst Appl 42(3):1247–1260CrossRef dos Tiago RL, Santos LEZ (2015) Categorical data clustering: What similarity measure to recommend? Expert Syst Appl 42(3):1247–1260CrossRef
Zurück zum Zitat Stewart DW (1981) The application and misapplication of factor analysis in marketing research. J Mark Res 18(1):51–62MathSciNetCrossRef Stewart DW (1981) The application and misapplication of factor analysis in marketing research. J Mark Res 18(1):51–62MathSciNetCrossRef
Zurück zum Zitat Takahashi T, Tango K, Chikazawa Y, Katsurai M (2020) A novel researcher search system based on research content similarity and geographic information. In: In: Ishita E, Pang NLS, Zhou L (eds) Digital libraries at times of massive societal transition. ICADL 2020. Lecture Notes in Computer Science, Lecture Notes in Computer Science, pp 390–398. Springer. https://doi.org/10.1007/978-3-030-64452-9_36 Takahashi T, Tango K, Chikazawa Y, Katsurai M (2020) A novel researcher search system based on research content similarity and geographic information. In: In: Ishita E, Pang NLS, Zhou L (eds) Digital libraries at times of massive societal transition. ICADL 2020. Lecture Notes in Computer Science, Lecture Notes in Computer Science, pp 390–398. Springer. https://​doi.​org/​10.​1007/​978-3-030-64452-9_​36
Zurück zum Zitat Tang J, Wu S, Sun J, Su H (2012) Cross-domain collaboration recommendation. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, pp 1285–1293. Association for Computing Machinery, New York. https://doi.org/10.1145/2339530.2339730 Tang J, Wu S, Sun J, Su H (2012) Cross-domain collaboration recommendation. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, pp 1285–1293. Association for Computing Machinery, New York. https://​doi.​org/​10.​1145/​2339530.​2339730
Zurück zum Zitat Tucker LR, MacCallum RC (1997) Exploratory Factor Analysis. Unpublished manuscript, Ohio State University, Columbus Tucker LR, MacCallum RC (1997) Exploratory Factor Analysis. Unpublished manuscript, Ohio State University, Columbus
Metadaten
Titel
Effectively clustering researchers in scientific collaboration networks: case study on ResearchGate
verfasst von
Marcos Wander Rodrigues
Mark A. Junho Song
Luis Enrique Zárate
Publikationsdatum
01.12.2021
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2021
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-021-00781-9

Weitere Artikel der Ausgabe 1/2021

Social Network Analysis and Mining 1/2021 Zur Ausgabe

Premium Partner