research-article

Modeling how students learn to program

Authors:
Chris Piech

Stanford University, Stanford, CA, USA

Stanford University, Stanford, CA, USA
View Profile

,
Mehran Sahami

Stanford University, Stanford, CA, USA

Stanford University, Stanford, CA, USA
View Profile

,
Daphne Koller

Stanford University, Stanford, CA, USA

Stanford University, Stanford, CA, USA
View Profile

,
Steve Cooper

Stanford University, Stanford, CA, USA

Stanford University, Stanford, CA, USA
View Profile

,
Paulo Blikstein

Stanford University, Stanford, CA, USA

Stanford University, Stanford, CA, USA
View Profile

SIGCSE '12: Proceedings of the 43rd ACM technical symposium on Computer Science EducationFebruary 2012Pages 153–160https://doi.org/10.1145/2157136.2157182

Published:29 February 2012Publication History

SIGCSE '12: Proceedings of the 43rd ACM technical symposium on Computer Science Education

Pages 153–160

ABSTRACT

Despite the potential wealth of educational indicators expressed in a student's approach to homework assignments, how students arrive at their final solution is largely overlooked in university courses. In this paper we present a methodology which uses machine learning techniques to autonomously create a graphical model of how students in an introductory programming course progress through a homework assignment. We subsequently show that this model is predictive of which students will struggle with material presented later in the class.

References

Booth, S. 1992. Learning to program: A phenomenographic perspective. Gothenburg, Sweden: Acta Universitatis Gothoburgensis.Google Scholar
Bowman, M., Debray, S. K., & Peterson, L. L. 1993. Reasoning about naming systems. ACM Trans. Program. Lang. Syst. 15, 5 (Nov. 1993), 795--825. Google ScholarDigital Library
Bruce, C., Buckingham, L., Hynd, J., McMahon, C., Roggenkamp, M., & Stoodley, I. 2004. Ways of experiencing the act of learning to program: A phenomenographic study of introductory programming students at university. Journal of Information Technology Education, 3, 143--160.Google ScholarCross Ref
Brusilovsky, Peter. 2000. Adaptive hypermedia: From intelligent tutoring systems to web-based education. LNCS. Google ScholarDigital Library
Conati, C., Gertner, A. S., VanLehn, K., & Druzdzel, M. J. 1997. On-line student modeling for coached problem solving using Bayesian networks. Proceedings of the 6th Int'l Conference on User Modeling (UM-96), 231--242.Google ScholarCross Ref
Cristianini, N. & Shawe-Taylor, J. 2000. An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press. Google ScholarDigital Library
Cutting, D., Karger, D., Pedersen, J., & Tukey, J. 1992. Scatter/gather: A cluster-based approach to browsing large document collections. Proc. 15th SIGIR, 1992. Google ScholarDigital Library
Dempster, A.P., Laird, N.M., & Rubin, D.B. 1977. Maximum likelihood from incomplete data via the em algorithm. J. of the Royal Statistical Society B, 39 (1): 1--38.Google Scholar
Gall, Harald et al. 2009. Change analysis with evolizer and changedistiller, Software, IEEE. Google ScholarDigital Library
Hoc, J-M. 1984. Do we really have conditional statements in our brains? Proceedings of the 2nd European Conference on Readings on Cognitive Ergonomics - Mind and Computers, G. van der Veer, M. Tauber, T. Green, and P. Gorny (Eds.), Springer-Verlag, London, UK, 92--101. Google ScholarDigital Library
Kaufman, L. and Rousseeuw, P.J. 1990. Finding groups in data: An introduction to cluster analysis, Wiley.Google Scholar
Kessler, C. & Anderson, J. 1988. Learning flow of control: Recursive and iterative procedures. In {26}, 229--260.Google Scholar
Kurland, D. & Pea, R. 1983. Children's mental models of recursive logo programs. Proceedings of the 5th Annual Conference of the Cognitive Science Society, NY, 1--5.Google Scholar
Needleman S. & Wunsch C. 1970. A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology.Google ScholarCross Ref
Paine, Carina. 2001. How students learn to program: Observations of practical tasks completed. Proceedings of the IEEE International Conference on Advanced Learning Technologies (ICALT '01). Google ScholarDigital Library
Parlante, N.,Wolfman, S., McCann, L., Roberts, E., Nevison, C., Motil, J., Cain, J., & Reges, S. 2006. Nifty assignments. SIGCSE Bull. 38, 1 (March 2006), 562--563. Google ScholarDigital Library
Rabiner, L.R., & Juang, B,H. 1986. Introduction to hidden markov models. IEEE ASSP Magazine, 3(1):4--16, 1986.Google ScholarCross Ref
Rabiner, L.R., & Juang, B,H. 1993. Fundamentals of speech recognition. Prentice-Hall, Inc. Google ScholarDigital Library
Reiser , B., Anderson , J., Farrell, R. 1985. Dynamic student modelling in an intelligent tutor for LISP programming, Proceedings of the 9th int'l joint conference on AI, 8--14. Google ScholarDigital Library
Roberts, Eric. Karel Learns Java. Available from: http://www.stanford.edu/class/cs106a/cs106a_spring11/book/karel-the-robot-learns-java.pdf Accessed 9/1/2011.Google Scholar
Sagar, Tobias et al. 2006. Detecting similar java classes using tree algorithms. Proceedings of the 2006 international workshop on mining software repositories (MSR '06). Google ScholarDigital Library
Salton, G., Wong, A., & Yang, C. S. 1975. A vector space model for automatic indexing. CACM, 18: 613--620. Google ScholarDigital Library
Samurcay, R. 1988.The concept of variable in programming: Its meaning and use in problem-solving by novice programmers. In {26}, 161--178.Google Scholar
Smyth, P. 1997. Clustering sequences with hidden markov models. Advances in Neural Information Processing Systems, volume 9, 648--654. The MIT Press.Google Scholar
Soloway,E., Bonar,J., Ehrlich,K. 1983. Cognitive strategies and looping constructs: an empirical study. CACM, 26, 11, 853--860. Google ScholarDigital Library
Soloway E. & Spohrer, J. 1988. Studying the novice programmer. L. Erlbaum Assoc. Inc., Hillsdale, NJ, USA. Google ScholarDigital Library
Spohrer, J. & Soloway, E. 1986. Analyzing the high frequency bugs in novice programs. Papers presented at the first workshop on empirical studies of programmers, E. Soloway and S. Iyengar (Eds.). Ablex Publishing, Norwood, NJ, USA, 230--251. Google ScholarDigital Library
ACM Graphics library. Available from: http://www-cs-faculty.stanford.edu/~eroberts/jtf/ Accessed 9/1/2011.Google Scholar
Berland, M. & Martin, T. 2011. Clusters and patterns of novice programmers. AERA, New Orleans, LA.Google Scholar
Blikstein, P. & Worsley, M. (2011). Learning analytics: Assessing constructionist learning using machine learning. AERA, New Orleans, LA.Google Scholar
Blikstein, P. (2011). Using learning analytics to assess students' behavior in open-ended programming tasks. Proc. of the Learning Analytics Knowledge Conference, Banff. Google ScholarDigital Library
Blikstein, P. 2008. An Atom is known by the company it keeps. Unpublished PhD. dissertation, Northwestern University, Evanston, IL. Google ScholarDigital Library

Index Terms

Modeling how students learn to program
1. Social and professional topics
  1. Professional topics
    1. Computing education
      1. Computing education programs
        Computer science education

Recommendations

Using POGIL to help students learn to program
Special Issue on Alternatives to Lecture in the Computer Science Classroom

POGIL has been successfully implemented in a scientific computing course to teach science students how to program in Python. Following POGIL guidelines, the authors have developed guided inquiry activities that lead student teams to discover and ...
Read More
Modeling learning patterns of students with a tutoring system using Hidden Markov Models
Proceedings of the 2007 conference on Artificial Intelligence in Education: Building Technology Rich Learning Contexts That Work

The current paper focuses on modeling actions of high school students with a mathematics tutoring system with Hidden Markov Models. The results indicated that including a hidden state estimate of learner engagement increased the accuracy and predictive ...
Read More
Factored Latent-Dynamic Conditional Random Fields for single and multi-label sequence modeling
Highlights
- We propose a single and multi-label generalization of LDCRF (Morency et al., 2007), called the Factored LDCRF.
Graphical abstract

Display Omitted

Abstract
Conditional Random Fields (CRF) are frequently applied for labeling and segmenting sequence data. Morency et al. (2007) introduced hidden state variables in a labeled CRF structure in order to model the latent dynamics within class ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGCSE '12: Proceedings of the 43rd ACM technical symposium on Computer Science Education
February 2012
734 pages
ISBN:9781450310987
DOI:10.1145/2157136
Conference Chairs:
Laurie Smith King
College of the Holy Cross
,
David R. Musicant,
Program Chairs:
Tracy Camp
Colorado School of Mines
,
Paul Tymann
Rochester Institute of Technology
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 29 February 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
hidden Markov model
intelligent tutor
probabilistic graphical models
program dissimilarity metric
student progress model
Qualifiers
- research-article
Conference

Acceptance Rates
SIGCSE '12 Paper Acceptance Rate100of289submissions,35%Overall Acceptance Rate1,595of4,542submissions,35%
More
Upcoming Conference
SIGCSE Virtual 2024

Sponsor:

sigcse

SIGCSE Virtual 2024: ACM Virtual Global Computing Education Conference

December 5 - 7, 2024

Virtual Event , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 135
  Total Citations
  View Citations
- 2,402
  Total Downloads
- Downloads (Last 12 months)120
- Downloads (Last 6 weeks)10
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Modeling how students learn to program

SIGCSE '12: Proceedings of the 43rd ACM technical symposium on Computer Science Education

ABSTRACT

References

Cited By

Index Terms

Recommendations

Using POGIL to help students learn to program

Modeling learning patterns of students with a tutoring system using Hidden Markov Models

Factored Latent-Dynamic Conditional Random Fields for single and multi-label sequence modeling