Skip to main content
Top

2017 | OriginalPaper | Chapter

Comparing the Performance of Latent Semantic Analysis and Probability Latent Semantic Analysis Models on Autoscoring Essay Tasks

Authors : Xiaohua Ke, Haijiao Luo

Published in: Emerging Technologies for Education

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper evaluates the performance variances of Latent Semantic Analysis (LSA) and Probability Latent Semantic Analysis (PLSA) by judging essay text qualities as automated essay (AES) scoring tools. A correlation research design was used to examine the correlation between LSA performance and PLSA performance. We introduced 3 weight methods and performed 6 experiments to produce the scoring performances of both LSA and PLSA from a total of 2444 Chinese essays. The results show that there were strong correlations between the LSA scores and PLSA scores. While the overall performance of PLSA is better than that of LSA, the findings from the current study do not corroborate the previous findings for PLSA methods that claim a significant improvement. The implications of our research for AES reveal that both LSA and PLSA have a limited capability at this point and those more reliable measures for automated essay analyzing and scoring, such as text formats and forms, still need to be a component of text quality analysis.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
According to Burstein and Chodorow (1999), adjacent is defined as between 1 mark with a full-score of 6.
 
Literature
go back to reference Antiqueira, L., Costa, L.F.: Using complex networks for language processing: the case of summary evaluation. In: Communications, Circuits and Systems Proceedings 2006, pp. 2678–2682 (2006) Antiqueira, L., Costa, L.F.: Using complex networks for language processing: the case of summary evaluation. In: Communications, Circuits and Systems Proceedings 2006, pp. 2678–2682 (2006)
go back to reference Burstein, J., Chodorow, M.: Automated essay scoring for nonnative English speakers. In: Proceedings of the ACL99 Workshop on Computer-Mediated Language Assessment and Evaluation of Natural Language Processing (1999) Burstein, J., Chodorow, M.: Automated essay scoring for nonnative English speakers. In: Proceedings of the ACL99 Workshop on Computer-Mediated Language Assessment and Evaluation of Natural Language Processing (1999)
go back to reference Chen, Y., et al.: A topic detection method based on semantic dependency distance and PLSA. In: The Proceedings of Computer Supported Cooperative Work in Design (CSCWD), vol. 5, pp. 703–708 (2012) Chen, Y., et al.: A topic detection method based on semantic dependency distance and PLSA. In: The Proceedings of Computer Supported Cooperative Work in Design (CSCWD), vol. 5, pp. 703–708 (2012)
go back to reference Gui, S.: The theory of latent semantic analysis and its application. Liguist. Appl. Linguist. 1, 76–85 (2003) Gui, S.: The theory of latent semantic analysis and its application. Liguist. Appl. Linguist. 1, 76–85 (2003)
go back to reference Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of SIGIR 1999 (1999) Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of SIGIR 1999 (1999)
go back to reference Landauer, T.K., Dumais, S.T.: A solution to Plato’s problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104(2), 211–240 (1997)CrossRef Landauer, T.K., Dumais, S.T.: A solution to Plato’s problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104(2), 211–240 (1997)CrossRef
go back to reference Landauer, T., Laham, D., Foltz, P.: Automatic essay assessment. Assess. Educ. Princ. Policy Pract. 3, 295–309 (2003) Landauer, T., Laham, D., Foltz, P.: Automatic essay assessment. Assess. Educ. Princ. Policy Pract. 3, 295–309 (2003)
go back to reference Shermis, M.D., Burstein, J.: Handbook of Automated Essay Evaluation: Current Applications and New Directions. Routledge, New York (2013) Shermis, M.D., Burstein, J.: Handbook of Automated Essay Evaluation: Current Applications and New Directions. Routledge, New York (2013)
go back to reference Thomas, H.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 2, 177–196 (2001)MATH Thomas, H.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 2, 177–196 (2001)MATH
Metadata
Title
Comparing the Performance of Latent Semantic Analysis and Probability Latent Semantic Analysis Models on Autoscoring Essay Tasks
Authors
Xiaohua Ke
Haijiao Luo
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-52836-6_42

Premium Partner