Skip to main content
Erschienen in: Progress in Artificial Intelligence 1/2022

04.08.2021 | Regular Paper

Automatic scoring of arabic essays over three linguistic levels

verfasst von: Waleed Alsanie, Mohamed I. Alkanhal, Mohammed Alhamadi, Abdulaziz O. Alqabbany

Erschienen in: Progress in Artificial Intelligence | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The importance of open questions requiring argumentative answers to assess student’s competence, along with the increasing number of people applying to colleges, have increased the demand to have systems which automatically score written essays. Developing such a system faces two main challenges. The first is, in many cases, scoring a free answer is largely subjective and does not have well-defined criteria. The second is scoring free answers requires deep language understanding. In this paper, we present an automatic scoring system for Arabic with these two challenges being considered. We only consider the essays of learners of Arabic as a second language in the beginning and intermediate levels. We omit essays of students at advanced levels as these essays might pose different challenges that require deep language understanding. The essays are scored by extracting specific features from the three linguistic levels, lexical, syntax and semantics. Syntactic level scoring is based on the sentence structure. Each level is scored independently and then the final score of the essay is a combination of these scores. We present different experiments with linear and non-linear combination methods on a real dataset. The results obtained from our experiments show that the trained models with respect to a human rater achieve accuracies and quadratic weighted kappa values similar to the agreement between two human raters. It is evident from our results that, with some realistic assumptions, a decision support Arabic scoring system can be achieved.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
The smallest units of meaning.
 
2
SentiWordNet automatically annotates all WORDNET synsets according to their degrees of positivity, negativity and neutrality.
 
3
‘*’ here is the Kleene star.
 
6
The English translation is a correct sentence because English does not have dual number.
 
Literatur
3.
Zurück zum Zitat Alghamdi, M., Alkanhal, M., Badrashiny, M.A., Qabbany, A.A., Areshey, A., Alharbi, A.: A hybrid automatic scoring system for Arabic essays. AI Commun. 27(2), 103–111 (2014)CrossRef Alghamdi, M., Alkanhal, M., Badrashiny, M.A., Qabbany, A.A., Areshey, A., Alharbi, A.: A hybrid automatic scoring system for Arabic essays. AI Commun. 27(2), 103–111 (2014)CrossRef
4.
Zurück zum Zitat Alkanhal, M.I., Al-Badrashiny, M.A., Alghamdi, M.M., Al-Qabbany, A.O.: Automatic stochastic Arabic spelling correction with emphasis on space insertions and deletions. IEEE Trans. Audio Speech Lang. Process. 20(7), 2111–2122 (2012)CrossRef Alkanhal, M.I., Al-Badrashiny, M.A., Alghamdi, M.M., Al-Qabbany, A.O.: Automatic stochastic Arabic spelling correction with emphasis on space insertions and deletions. IEEE Trans. Audio Speech Lang. Process. 20(7), 2111–2122 (2012)CrossRef
5.
Zurück zum Zitat Attia, M.: Handling Arabic morphological and syntactic ambiguities within the LFG framework with a view to machine translation. Ph.D. thesis, School of Languages, Linguistics and Cultures, University of Manchester (2008) Attia, M.: Handling Arabic morphological and syntactic ambiguities within the LFG framework with a view to machine translation. Ph.D. thesis, School of Languages, Linguistics and Cultures, University of Manchester (2008)
6.
Zurück zum Zitat Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC’10). European Language Resources Association (ELRA) (2010) Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC’10). European Language Resources Association (ELRA) (2010)
7.
Zurück zum Zitat Bridgeman, B.: Human Ratings and Automated Essay Evaluation. Routledge, London (2013) Bridgeman, B.: Human Ratings and Automated Essay Evaluation. Routledge, London (2013)
10.
Zurück zum Zitat Chen, H., He, B.: Automated essay scoring by maximizing human-machine agreement. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1741–1752. Association for Computational Linguistics, Seattle, Washington, USA (2013) Chen, H., He, B.: Automated essay scoring by maximizing human-machine agreement. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1741–1752. Association for Computational Linguistics, Seattle, Washington, USA (2013)
12.
Zurück zum Zitat Collins, M., Duffy, N.: Convolution kernels for natural language. In: Advances in Neural Information Processing Systems 14, pp. 625–632. MIT Press (2001) Collins, M., Duffy, N.: Convolution kernels for natural language. In: Advances in Neural Information Processing Systems 14, pp. 625–632. MIT Press (2001)
13.
Zurück zum Zitat Collins, M.J.: A new statistical parser based on bigram lexical dependencies. In: Proceedings of the 34th Annual Meeting on Association for Computational Linguistics, ACL’96, pp. 184–191. Association for Computational Linguistics, Stroudsburg, PA, USA (1996). https://doi.org/10.3115/981863.981888 Collins, M.J.: A new statistical parser based on bigram lexical dependencies. In: Proceedings of the 34th Annual Meeting on Association for Computational Linguistics, ACL’96, pp. 184–191. Association for Computational Linguistics, Stroudsburg, PA, USA (1996). https://​doi.​org/​10.​3115/​981863.​981888
14.
Zurück zum Zitat Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)CrossRef Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)CrossRef
16.
Zurück zum Zitat Green, S., Manning, C.D.: Better Arabic parsing: baselines, evaluations, and analysis. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING’10, pp. 394–402. Association for Computational Linguistics, Stroudsburg, PA, USA (2010) Green, S., Manning, C.D.: Better Arabic parsing: baselines, evaluations, and analysis. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING’10, pp. 394–402. Association for Computational Linguistics, Stroudsburg, PA, USA (2010)
17.
Zurück zum Zitat Haussler, D.: Convolution kernels on discrete structures. Tech. Rep. UCS-CRL-99-10, University of California at Santa Cruz (1999) Haussler, D.: Convolution kernels on discrete structures. Tech. Rep. UCS-CRL-99-10, University of California at Santa Cruz (1999)
18.
Zurück zum Zitat Hindle, D., Rooth, M.: Structural ambiguity and lexical relations. Comput. Linguist. 19(1), 103–120 (1993) Hindle, D., Rooth, M.: Structural ambiguity and lexical relations. Comput. Linguist. 19(1), 103–120 (1993)
20.
Zurück zum Zitat Joshi, A.K., Schabes, Y.: Tree-adjoining Grammars and lexicalized grammars. Tech. Rep. MS-CIS-91-22, Department of Computer and Information Science (1991) Joshi, A.K., Schabes, Y.: Tree-adjoining Grammars and lexicalized grammars. Tech. Rep. MS-CIS-91-22, Department of Computer and Information Science (1991)
21.
Zurück zum Zitat Klein, D., Manning, C.D.: Accurate unlexicalized parsing. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, Vol. 1, ACL’03, pp. 423–430. Association for Computational Linguistics, Stroudsburg, PA, USA (2003). https://doi.org/10.3115/1075096.1075150 Klein, D., Manning, C.D.: Accurate unlexicalized parsing. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, Vol. 1, ACL’03, pp. 423–430. Association for Computational Linguistics, Stroudsburg, PA, USA (2003). https://​doi.​org/​10.​3115/​1075096.​1075150
23.
Zurück zum Zitat Landauer, T.K., Dumais, S.T.: A solution to Platos problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104(2), 211–240 (1997)CrossRef Landauer, T.K., Dumais, S.T.: A solution to Platos problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104(2), 211–240 (1997)CrossRef
25.
Zurück zum Zitat Moschitti, A.: Efficient convolution kernels for dependency and constituent syntactic trees. In: Proceedings of the 17th European Conference on Machine Learning, Lecture Notes in Computer Science, pp. 318–329. Springer (2006) Moschitti, A.: Efficient convolution kernels for dependency and constituent syntactic trees. In: Proceedings of the 17th European Conference on Machine Learning, Lecture Notes in Computer Science, pp. 318–329. Springer (2006)
26.
Zurück zum Zitat Östling, R., Smolentzov, A., Hinnerich, B.T., Höglin, E.: Automated essay scoring for Swedish. In: The 8th Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics (2013) Östling, R., Smolentzov, A., Hinnerich, B.T., Höglin, E.: Automated essay scoring for Swedish. In: The 8th Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics (2013)
27.
Zurück zum Zitat Page, E., Paulus, D.: The analysis of essays by computer. Tech. rep. (1968) Page, E., Paulus, D.: The analysis of essays by computer. Tech. rep. (1968)
28.
Zurück zum Zitat Page, E.B.: The imminence of... grading essays by computer. Phi Delta Kappan 47(5), 238–243 (1966) Page, E.B.: The imminence of... grading essays by computer. Phi Delta Kappan 47(5), 238–243 (1966)
29.
Zurück zum Zitat Rudner, L.M., Liang, T.: Automated essay scoring using Bayes’ theorem. J. Technol. Learn. Assess. 1(2) (2002) Rudner, L.M., Liang, T.: Automated essay scoring using Bayes’ theorem. J. Technol. Learn. Assess. 1(2) (2002)
30.
Zurück zum Zitat Schabes, Y., Abeillé, A., Joshi, A.K.: Parsing strategies with ’lexicalized’ grammars: application to tree adjoining grammars. Tech. Rep. MS-CIS-88-65, Department of Computer and Information Science (1988) Schabes, Y., Abeillé, A., Joshi, A.K.: Parsing strategies with ’lexicalized’ grammars: application to tree adjoining grammars. Tech. Rep. MS-CIS-88-65, Department of Computer and Information Science (1988)
32.
Zurück zum Zitat Wild, F., Stahl, C., Stermsek, G., Neumann, G.: Parameters driving effectiveness of automated essay scoring with LSA. In: Proceedings of the 9th International Computer Assisted Assessment Conference (CAA), pp. 485–495 (2005) Wild, F., Stahl, C., Stermsek, G., Neumann, G.: Parameters driving effectiveness of automated essay scoring with LSA. In: Proceedings of the 9th International Computer Assisted Assessment Conference (CAA), pp. 485–495 (2005)
Metadaten
Titel
Automatic scoring of arabic essays over three linguistic levels
verfasst von
Waleed Alsanie
Mohamed I. Alkanhal
Mohammed Alhamadi
Abdulaziz O. Alqabbany
Publikationsdatum
04.08.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
Progress in Artificial Intelligence / Ausgabe 1/2022
Print ISSN: 2192-6352
Elektronische ISSN: 2192-6360
DOI
https://doi.org/10.1007/s13748-021-00257-z

Weitere Artikel der Ausgabe 1/2022

Progress in Artificial Intelligence 1/2022 Zur Ausgabe