ABSTRACT
Despite the potential wealth of educational indicators expressed in a student's approach to homework assignments, how students arrive at their final solution is largely overlooked in university courses. In this paper we present a methodology which uses machine learning techniques to autonomously create a graphical model of how students in an introductory programming course progress through a homework assignment. We subsequently show that this model is predictive of which students will struggle with material presented later in the class.
- Booth, S. 1992. Learning to program: A phenomenographic perspective. Gothenburg, Sweden: Acta Universitatis Gothoburgensis.Google Scholar
- Bowman, M., Debray, S. K., & Peterson, L. L. 1993. Reasoning about naming systems. ACM Trans. Program. Lang. Syst. 15, 5 (Nov. 1993), 795--825. Google ScholarDigital Library
- Bruce, C., Buckingham, L., Hynd, J., McMahon, C., Roggenkamp, M., & Stoodley, I. 2004. Ways of experiencing the act of learning to program: A phenomenographic study of introductory programming students at university. Journal of Information Technology Education, 3, 143--160.Google ScholarCross Ref
- Brusilovsky, Peter. 2000. Adaptive hypermedia: From intelligent tutoring systems to web-based education. LNCS. Google ScholarDigital Library
- Conati, C., Gertner, A. S., VanLehn, K., & Druzdzel, M. J. 1997. On-line student modeling for coached problem solving using Bayesian networks. Proceedings of the 6th Int'l Conference on User Modeling (UM-96), 231--242.Google ScholarCross Ref
- Cristianini, N. & Shawe-Taylor, J. 2000. An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press. Google ScholarDigital Library
- Cutting, D., Karger, D., Pedersen, J., & Tukey, J. 1992. Scatter/gather: A cluster-based approach to browsing large document collections. Proc. 15th SIGIR, 1992. Google ScholarDigital Library
- Dempster, A.P., Laird, N.M., & Rubin, D.B. 1977. Maximum likelihood from incomplete data via the em algorithm. J. of the Royal Statistical Society B, 39 (1): 1--38.Google Scholar
- Gall, Harald et al. 2009. Change analysis with evolizer and changedistiller, Software, IEEE. Google ScholarDigital Library
- Hoc, J-M. 1984. Do we really have conditional statements in our brains? Proceedings of the 2nd European Conference on Readings on Cognitive Ergonomics - Mind and Computers, G. van der Veer, M. Tauber, T. Green, and P. Gorny (Eds.), Springer-Verlag, London, UK, 92--101. Google ScholarDigital Library
- Kaufman, L. and Rousseeuw, P.J. 1990. Finding groups in data: An introduction to cluster analysis, Wiley.Google Scholar
- Kessler, C. & Anderson, J. 1988. Learning flow of control: Recursive and iterative procedures. In {26}, 229--260.Google Scholar
- Kurland, D. & Pea, R. 1983. Children's mental models of recursive logo programs. Proceedings of the 5th Annual Conference of the Cognitive Science Society, NY, 1--5.Google Scholar
- Needleman S. & Wunsch C. 1970. A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology.Google ScholarCross Ref
- Paine, Carina. 2001. How students learn to program: Observations of practical tasks completed. Proceedings of the IEEE International Conference on Advanced Learning Technologies (ICALT '01). Google ScholarDigital Library
- Parlante, N.,Wolfman, S., McCann, L., Roberts, E., Nevison, C., Motil, J., Cain, J., & Reges, S. 2006. Nifty assignments. SIGCSE Bull. 38, 1 (March 2006), 562--563. Google ScholarDigital Library
- Rabiner, L.R., & Juang, B,H. 1986. Introduction to hidden markov models. IEEE ASSP Magazine, 3(1):4--16, 1986.Google ScholarCross Ref
- Rabiner, L.R., & Juang, B,H. 1993. Fundamentals of speech recognition. Prentice-Hall, Inc. Google ScholarDigital Library
- Reiser , B., Anderson , J., Farrell, R. 1985. Dynamic student modelling in an intelligent tutor for LISP programming, Proceedings of the 9th int'l joint conference on AI, 8--14. Google ScholarDigital Library
- Roberts, Eric. Karel Learns Java. Available from: http://www.stanford.edu/class/cs106a/cs106a_spring11/book/karel-the-robot-learns-java.pdf Accessed 9/1/2011.Google Scholar
- Sagar, Tobias et al. 2006. Detecting similar java classes using tree algorithms. Proceedings of the 2006 international workshop on mining software repositories (MSR '06). Google ScholarDigital Library
- Salton, G., Wong, A., & Yang, C. S. 1975. A vector space model for automatic indexing. CACM, 18: 613--620. Google ScholarDigital Library
- Samurcay, R. 1988.The concept of variable in programming: Its meaning and use in problem-solving by novice programmers. In {26}, 161--178.Google Scholar
- Smyth, P. 1997. Clustering sequences with hidden markov models. Advances in Neural Information Processing Systems, volume 9, 648--654. The MIT Press.Google Scholar
- Soloway,E., Bonar,J., Ehrlich,K. 1983. Cognitive strategies and looping constructs: an empirical study. CACM, 26, 11, 853--860. Google ScholarDigital Library
- Soloway E. & Spohrer, J. 1988. Studying the novice programmer. L. Erlbaum Assoc. Inc., Hillsdale, NJ, USA. Google ScholarDigital Library
- Spohrer, J. & Soloway, E. 1986. Analyzing the high frequency bugs in novice programs. Papers presented at the first workshop on empirical studies of programmers, E. Soloway and S. Iyengar (Eds.). Ablex Publishing, Norwood, NJ, USA, 230--251. Google ScholarDigital Library
- ACM Graphics library. Available from: http://www-cs-faculty.stanford.edu/~eroberts/jtf/ Accessed 9/1/2011.Google Scholar
- Berland, M. & Martin, T. 2011. Clusters and patterns of novice programmers. AERA, New Orleans, LA.Google Scholar
- Blikstein, P. & Worsley, M. (2011). Learning analytics: Assessing constructionist learning using machine learning. AERA, New Orleans, LA.Google Scholar
- Blikstein, P. (2011). Using learning analytics to assess students' behavior in open-ended programming tasks. Proc. of the Learning Analytics Knowledge Conference, Banff. Google ScholarDigital Library
- Blikstein, P. 2008. An Atom is known by the company it keeps. Unpublished PhD. dissertation, Northwestern University, Evanston, IL. Google ScholarDigital Library
Index Terms
- Modeling how students learn to program
Recommendations
Using POGIL to help students learn to program
Special Issue on Alternatives to Lecture in the Computer Science ClassroomPOGIL has been successfully implemented in a scientific computing course to teach science students how to program in Python. Following POGIL guidelines, the authors have developed guided inquiry activities that lead student teams to discover and ...
Modeling learning patterns of students with a tutoring system using Hidden Markov Models
Proceedings of the 2007 conference on Artificial Intelligence in Education: Building Technology Rich Learning Contexts That WorkThe current paper focuses on modeling actions of high school students with a mathematics tutoring system with Hidden Markov Models. The results indicated that including a hidden state estimate of learner engagement increased the accuracy and predictive ...
Factored Latent-Dynamic Conditional Random Fields for single and multi-label sequence modeling
Highlights- We propose a single and multi-label generalization of LDCRF (Morency et al., 2007), called the Factored LDCRF.
Graphical abstractDisplay Omitted
AbstractConditional Random Fields (CRF) are frequently applied for labeling and segmenting sequence data. Morency et al. (2007) introduced hidden state variables in a labeled CRF structure in order to model the latent dynamics within class ...
Comments