
2015 | Original Paper | Book Chapter

Predicting Student Performance in Distance Higher Education Using Semi-supervised Techniques

Authors: Georgios Kostopoulos, Sotiris Kotsiantis, Panagiotis Pintelas

Published in: Model and Data Engineering

Publisher: Springer International Publishing

Abstract

Predicting student performance in distance higher education has been widely researched over the past decades. Machine learning techniques, especially supervised learning, have been used in numerous studies to identify, early enough, students who are likely to fail their final exams. Identifying likely failure as early as possible could help academic staff develop learning strategies aimed at improving students’ overall performance. In this paper, we investigate the effectiveness of semi-supervised techniques in predicting students’ performance in distance higher education. We conduct several experiments that compare the accuracy of well-known semi-supervised algorithms. To the best of our knowledge, various studies address students’ performance prediction in distance learning using machine learning techniques, especially supervised methods, but none of them investigates the effectiveness of semi-supervised algorithms. Our results confirm the advantage of semi-supervised methods and, in particular, the satisfactory performance of the Tri-Training algorithm.
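To make the idea of semi-supervised learning concrete, the following is a minimal sketch of self-training, one of the simplest members of that family. It is not the Tri-Training algorithm evaluated in the paper, and it is not the authors' implementation. The nearest-centroid classifier, the two features (an assignment grade and a count of attended tutor meetings), the `min_margin` threshold, and all data values are assumptions chosen purely for illustration.

```python
def distance(p, q):
    """Euclidean distance between two equal-length feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5


def centroids(points, labels):
    """Return the mean feature vector of each class."""
    sums, counts = {}, {}
    for p, y in zip(points, labels):
        acc = sums.setdefault(y, [0.0] * len(p))
        for i, v in enumerate(p):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: tuple(s / counts[y] for s in acc) for y, acc in sums.items()}


def predict(cents, p):
    """Return (label, margin): nearest centroid and its gap to the runner-up."""
    ranked = sorted((distance(p, c), y) for y, c in cents.items())
    margin = ranked[1][0] - ranked[0][0] if len(ranked) > 1 else float("inf")
    return ranked[0][1], margin


def self_train(X_lab, y_lab, X_unlab, min_margin=1.0, max_rounds=10):
    """Self-training: repeatedly pseudo-label the confident unlabeled points."""
    X, y, pool = list(X_lab), list(y_lab), list(X_unlab)
    for _ in range(max_rounds):
        cents = centroids(X, y)
        confident, rest = [], []
        for p in pool:
            label, margin = predict(cents, p)
            (confident if margin >= min_margin else rest).append((p, label))
        if not confident:
            break  # nothing left that the current model is sure about
        for p, label in confident:
            X.append(p)
            y.append(label)
        pool = [p for p, _ in rest]
    return centroids(X, y), pool


# Hypothetical toy data: (assignment grade, tutor meetings attended).
X_lab = [(8.0, 4.0), (9.0, 3.0), (2.0, 1.0), (3.0, 0.0)]
y_lab = ["pass", "pass", "fail", "fail"]
X_unlab = [(7.5, 3.0), (2.5, 1.0), (5.5, 2.0), (8.5, 4.0)]

model, leftover = self_train(X_lab, y_lab, X_unlab)
print(predict(model, (8.0, 3.5))[0])
print(leftover)
```

In this sketch the ambiguous student record stays in the unlabeled pool, because the classifier never becomes confident enough to pseudo-label it. Multi-classifier schemes such as Tri-Training replace this single-model confidence threshold with agreement among several classifiers.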

Metadata
Title
Predicting Student Performance in Distance Higher Education Using Semi-supervised Techniques
Authors
Georgios Kostopoulos
Sotiris Kotsiantis
Panagiotis Pintelas
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-23781-7_21