2021 | Original Paper | Book chapter

Joint Autoregressive and Graph Models for Software and Developer Social Networks

Written by: Rima Hazra, Hardik Aggarwal, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti

Published in: Advances in Information Retrieval

Publisher: Springer International Publishing

Abstract

Social network research has focused on hyperlink graphs, bibliographic citations, friend/follow patterns, influence spread, etc. Large software repositories also form a highly valuable networked artifact, usually in the form of a collection of packages, their developers, dependencies among them, and bug reports. This “social network of code” is rarely studied by social network researchers. We introduce two new problems in this setting. These problems are well-motivated in the software engineering community but not closely studied by social network scientists. The first is to identify packages that are most likely to be troubled by bugs in the immediate future, thereby demanding the greatest attention. The second is to recommend developers to packages for the next development cycle. Simple autoregression can be applied to historical data for both problems, but we propose a novel method to integrate network-derived features and demonstrate that our method brings additional benefits. Apart from formalizing these problems and proposing new baseline approaches, we prepare and contribute a substantial dataset connecting multiple attributes built from the long-term history of 20 releases of Ubuntu, growing to over 25,000 packages with their dependency links, maintained by over 3,800 developers, with over 280k bug reports.
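
The "simple autoregression" baseline mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' model: it assumes a first-order autoregression on per-package bug counts, pooled over all packages and fitted by ordinary least squares, and it omits the network-derived features entirely. The function names (`fit_ar1`, `rank_packages`) and the toy data are hypothetical and exist only for illustration.

```python
from typing import Dict, List, Tuple


def fit_ar1(histories: Dict[str, List[float]]) -> Tuple[float, float]:
    """Fit y_t = a * y_{t-1} + b by least squares, pooled over all packages.

    `histories` maps a (hypothetical) package name to its bug count per release.
    """
    xs: List[float] = []
    ys: List[float] = []
    for counts in histories.values():
        # Each consecutive pair of releases yields one (previous, current) sample.
        for prev, cur in zip(counts, counts[1:]):
            xs.append(prev)
            ys.append(cur)
    n = len(xs)
    if n == 0:
        raise ValueError("need at least two releases for some package")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    if var_x == 0:
        # All previous counts are identical: fall back to predicting the mean.
        return 0.0, mean_y
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = cov_xy / var_x
    b = mean_y - a * mean_x
    return a, b


def rank_packages(histories: Dict[str, List[float]], top_k: int) -> List[str]:
    """Rank packages by predicted bug count in the next release."""
    a, b = fit_ar1(histories)
    scores = {
        pkg: a * counts[-1] + b
        for pkg, counts in histories.items()
        if counts
    }
    # Highest predicted count first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda p: (-scores[p], p))[:top_k]


# Toy data, invented for illustration (not taken from the Ubuntu dataset).
toy = {
    "pkg-a": [1, 2, 4, 8],
    "pkg-b": [5, 5, 5, 5],
    "pkg-c": [0, 0, 1, 0],
}
print(rank_packages(toy, 2))  # ['pkg-a', 'pkg-b']
```

A first-order model is chosen here only for brevity. With 20 releases per package, higher-order lags would be possible, but the abstract does not state which order the authors used.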

Footnotes
4
The dataset also contains other types of relations, such as recommends, suggests, and conflicts, but these are very infrequent.
 
9
One may argue that more complex models, such as point processes, could be a possible choice. However, we have only 20 time points, so such complex models cannot be trained sufficiently.
 
10
The full rank list has 4K packages on average.
 
11
We also tried other values of \(\alpha\) and \(\beta\), but they did not affect the results.
 
12
Changes in the value of K do not affect the final results.
 
Metadata
Title
Joint Autoregressive and Graph Models for Software and Developer Social Networks
Authors
Rima Hazra
Hardik Aggarwal
Pawan Goyal
Animesh Mukherjee
Soumen Chakrabarti
Copyright year
2021
DOI
https://doi.org/10.1007/978-3-030-72113-8_15