Published in: Education and Information Technologies 5/2021

10 May 2021

Using process mining for Git log analysis of projects in a software development course

Authors: Martin Macak, Daniela Kruzelova, Stanislav Chren, Barbora Buhnova


Abstract

Understanding educational processes, such as student learning behavior within a specific course, is key to continuous course improvement. In online learning systems, students’ learning can be tracked and examined using data collected by the systems themselves. In traditional classroom courses, however, the available data are limited, and it is non-trivial to decide how to extract the desired student behavior from them. Software development courses are a domain where student behavior analysis is especially useful, because this fast-progressing domain requires continuous teaching improvement. In this paper, we propose using process mining for improvement-motivated process analysis of a software development course (web development in particular). To this end, we analyze the Git logs of students’ projects to understand their development processes. We chose process mining because it can help us find a descriptive model of this process. The main contribution of this paper is a detailed methodology for using process mining to analyze students’ project development, considering various commit characteristics that are crucial to understanding student coding-behavior patterns. The process mining analysis proved very useful, indicating multiple directions for course improvement, which we include in this work as a secondary contribution. The third contribution is a summary and discussion of the advantages of process mining and the current gaps in process mining research for this task. The data we used are publicly available to other researchers.
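To make the idea of mining Git logs more concrete, the following minimal sketch turns commit records into an event log and counts directly-follows relations, a basic building block of many process discovery techniques. It is an illustration under stated assumptions, not the methodology of the paper: the sample data, the project identifiers, and the keyword-based `label_activity` rule are all hypothetical. The record layout mirrors what `git log --pretty=format:'%H|%an|%aI|%s'` prints, with a made-up project id prepended.

```python
from collections import Counter
from datetime import datetime

# Hypothetical sample records (project|hash|author|ISO date|subject).
# These are NOT data from the paper.
SAMPLE_LOG = """\
team1|a1|alice|2021-03-01T10:00:00+01:00|Add login page
team1|a2|bob|2021-03-02T09:30:00+01:00|Fix login redirect
team1|a3|alice|2021-03-05T18:00:00+01:00|Add tests for login
team2|b1|carol|2021-03-01T12:00:00+01:00|Add product list
team2|b2|carol|2021-03-03T08:00:00+01:00|Fix typo in template
"""


def label_activity(message):
    """Illustrative labeling rule (an assumption, not the paper's)."""
    text = message.lower()
    if "fix" in text:
        return "fix"
    if "test" in text:
        return "test"
    return "feature"


def parse_event_log(raw):
    """Turn each commit line into an event: case id, activity, timestamp."""
    events = []
    for line in raw.strip().splitlines():
        project, commit, author, timestamp, message = line.split("|", 4)
        events.append({
            "case": project,
            "activity": label_activity(message),
            "time": datetime.fromisoformat(timestamp),
            "author": author,
            "commit": commit,
        })
    return events


def directly_follows(events):
    """Count how often activity A is immediately followed by B within a case."""
    traces = {}
    for event in sorted(events, key=lambda e: e["time"]):
        traces.setdefault(event["case"], []).append(event["activity"])
    counts = Counter()
    for trace in traces.values():
        counts.update(zip(trace, trace[1:]))
    return counts


events = parse_event_log(SAMPLE_LOG)
dfg = directly_follows(events)
for (source, target), count in sorted(dfg.items()):
    print(f"{source} -> {target}: {count}")
```

Treating each project as a case and each commit as an event is one possible mapping; other choices of case and activity would give a different view of the same log.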

Footnotes
5. By position, we mean the index of the commit in the list of project commits ordered by timestamp from oldest to newest.
6. We usually consider the median time between commits a more suitable measure than the mean time because work is unevenly distributed during the semester.
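The two footnotes can be illustrated with a small sketch. The timestamps are hypothetical and only mimic a project with two bursts of work separated by a long pause, and the 1-based numbering of positions is an assumption. The point is that a single long pause inflates the mean gap, while the median stays close to the typical gap.

```python
from datetime import datetime
from statistics import mean, median

# Hypothetical commit timestamps for one project (not data from the paper),
# deliberately listed out of order.
timestamps = [
    datetime(2021, 3, 20, 9, 0),
    datetime(2021, 3, 1, 10, 0),
    datetime(2021, 3, 1, 12, 0),
    datetime(2021, 3, 1, 11, 0),
    datetime(2021, 3, 20, 10, 0),
]

# Position: index of each commit after ordering from oldest to newest
# (1-based here, which is an assumption).
ordered = sorted(timestamps)
positions = {ts: index for index, ts in enumerate(ordered, start=1)}

# Time between consecutive commits, in hours.
gaps_hours = [
    (later - earlier).total_seconds() / 3600
    for earlier, later in zip(ordered, ordered[1:])
]

print("gaps (h):", gaps_hours)
print("median gap (h):", median(gaps_hours))
print("mean gap (h):", mean(gaps_hours))
```

With these sample values, three of the four gaps are one hour long, so the median is one hour, whereas the mean is dominated by the single pause of almost nineteen days.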
 
Metadata
Title
Using process mining for Git log analysis of projects in a software development course
Authors
Martin Macak
Daniela Kruzelova
Stanislav Chren
Barbora Buhnova
Publication date
10 May 2021
Publisher
Springer US
Published in
Education and Information Technologies, Issue 5/2021
Print ISSN: 1360-2357
Electronic ISSN: 1573-7608
DOI
https://doi.org/10.1007/s10639-021-10564-6
