nach oben

Progress in Artificial Intelligence

Erschienen in:

04.08.2021 | Regular Paper

Automatic scoring of arabic essays over three linguistic levels

verfasst von: Waleed Alsanie, Mohamed I. Alkanhal, Mohammed Alhamadi, Abdulaziz O. Alqabbany

Erschienen in: Progress in Artificial Intelligence | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

The importance of open questions requiring argumentative answers to assess student’s competence, along with the increasing number of people applying to colleges, have increased the demand to have systems which automatically score written essays. Developing such a system faces two main challenges. The first is, in many cases, scoring a free answer is largely subjective and does not have well-defined criteria. The second is scoring free answers requires deep language understanding. In this paper, we present an automatic scoring system for Arabic with these two challenges being considered. We only consider the essays of learners of Arabic as a second language in the beginning and intermediate levels. We omit essays of students at advanced levels as these essays might pose different challenges that require deep language understanding. The essays are scored by extracting specific features from the three linguistic levels, lexical, syntax and semantics. Syntactic level scoring is based on the sentence structure. Each level is scored independently and then the final score of the essay is a combination of these scores. We present different experiments with linear and non-linear combination methods on a real dataset. The results obtained from our experiments show that the trained models with respect to a human rater achieve accuracies and quadratic weighted kappa values similar to the agreement between two human raters. It is evident from our results that, with some realistic assumptions, a decision support Arabic scoring system can be achieved.

Nächster Artikel Predicting human behavior in size-variant repeated games through deep convolutional neural networks

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

The smallest units of meaning.

SentiWordNet automatically annotates all WORDNET synsets according to their degrees of positivity, negativity and neutrality.

‘*’ here is the Kleene star.

http://disi.unitn.it/moschitti/Tree-Kernel.htm.

https://www.gnu.org/software/ispell/ispell.html.

The English translation is a correct sentence because English does not have dual number.

Ethnologue: Languages of the World. Nineteenth edition edn. SIL International, Dallas, Texas (2016). http://www.ethnologue.com/world

Abdelali, A., Darwish, K., Durrani, N., Mubarak, H.: Farasa: a fast and furious segmenter for Arabic. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pp. 11–16. Association for Computational Linguistics, San Diego, California (2016). https://doi.org/10.18653/v1/N16-3003. https://www.aclweb.org/anthology/N16-3003

Alghamdi, M., Alkanhal, M., Badrashiny, M.A., Qabbany, A.A., Areshey, A., Alharbi, A.: A hybrid automatic scoring system for Arabic essays. AI Commun. 27(2), 103–111 (2014)CrossRef

Alkanhal, M.I., Al-Badrashiny, M.A., Alghamdi, M.M., Al-Qabbany, A.O.: Automatic stochastic Arabic spelling correction with emphasis on space insertions and deletions. IEEE Trans. Audio Speech Lang. Process. 20(7), 2111–2122 (2012)CrossRef

Attia, M.: Handling Arabic morphological and syntactic ambiguities within the LFG framework with a view to machine translation. Ph.D. thesis, School of Languages, Linguistics and Cultures, University of Manchester (2008)

Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the 7th Conference on International Language Resources and Evaluation (LREC’10). European Language Resources Association (ELRA) (2010)

Bridgeman, B.: Human Ratings and Automated Essay Evaluation. Routledge, London (2013)

Chang, T.H., Lee, C.H.: Automatic Chinese essay scoring using connections between concepts in paragraphs. In: International Conference on Asian Language Processing (IALP’09), pp. 265–268. IEEE (2009). https://doi.org/10.1109/ialp.2009.63

Chang, T.H., Tsai, P.Y., Lee, C.H., Tam, H.P.: Automated essay scoring using set of literary sememes. In: International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE ’08), pp. 1–5. IEEE (2008). https://doi.org/10.1109/nlpke.2008.4906764

10.

Chen, H., He, B.: Automated essay scoring by maximizing human-machine agreement. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1741–1752. Association for Computational Linguistics, Seattle, Washington, USA (2013)

11.

Chomsky, N.: On certain formal properties of grammars. Inf. Control 2(2), 137–167 (1959). https://doi.org/10.1016/s0019-9958(59)90362-6MathSciNetCrossRefMATH

12.

Collins, M., Duffy, N.: Convolution kernels for natural language. In: Advances in Neural Information Processing Systems 14, pp. 625–632. MIT Press (2001)

13.

Collins, M.J.: A new statistical parser based on bigram lexical dependencies. In: Proceedings of the 34th Annual Meeting on Association for Computational Linguistics, ACL’96, pp. 184–191. Association for Computational Linguistics, Stroudsburg, PA, USA (1996). https://doi.org/10.3115/981863.981888

14.

Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. J. Am. Soc. Inf. Sci. 41(6), 391–407 (1990)CrossRef

15.

Ene, E., Kosobucki, V.: Rubrics and corrective feedback in ESL writing: a longitudinal case study of an l2 writer. Assess. Writing 30, 3–20 (2016). https://doi.org/10.1016/j.asw.2016.06.003CrossRef

16.

Green, S., Manning, C.D.: Better Arabic parsing: baselines, evaluations, and analysis. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING’10, pp. 394–402. Association for Computational Linguistics, Stroudsburg, PA, USA (2010)

17.

Haussler, D.: Convolution kernels on discrete structures. Tech. Rep. UCS-CRL-99-10, University of California at Santa Cruz (1999)

18.

Hindle, D., Rooth, M.: Structural ambiguity and lexical relations. Comput. Linguist. 19(1), 103–120 (1993)

19.

Ishioka, T., Kameda, M.: Automated Japanese Essay scoring system:Jess. In: 15th International Workshop on Database and Expert Systems Applications, pp. 4–8. IEEE (2004). https://doi.org/10.1109/dexa.2004.1333440

20.

Joshi, A.K., Schabes, Y.: Tree-adjoining Grammars and lexicalized grammars. Tech. Rep. MS-CIS-91-22, Department of Computer and Information Science (1991)

21.

Klein, D., Manning, C.D.: Accurate unlexicalized parsing. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, Vol. 1, ACL’03, pp. 423–430. Association for Computational Linguistics, Stroudsburg, PA, USA (2003). https://doi.org/10.3115/1075096.1075150

22.

Kumar, N., Dey, L.: Automatic quality assessment of documents with application to essay grading. In: 12th Mexican International Conference on Artificial Intelligence (MICAI), pp. 216–222. IEEE (2013). https://doi.org/10.1109/micai.2013.34

23.

Landauer, T.K., Dumais, S.T.: A solution to Platos problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104(2), 211–240 (1997)CrossRef

24.

Landauer, T.K., Psotka, J.: Simulating text understanding for educational applications with latent semantic analysis: introduction to LSA. Interact. Learn. Environ. 8(2), 73–86 (2000). https://doi.org/10.1076/1049-4820(200008)8:2;1-b;ft073dCrossRef

25.

Moschitti, A.: Efficient convolution kernels for dependency and constituent syntactic trees. In: Proceedings of the 17th European Conference on Machine Learning, Lecture Notes in Computer Science, pp. 318–329. Springer (2006)

26.

Östling, R., Smolentzov, A., Hinnerich, B.T., Höglin, E.: Automated essay scoring for Swedish. In: The 8th Workshop on Innovative Use of NLP for Building Educational Applications. Association for Computational Linguistics (2013)

27.

Page, E., Paulus, D.: The analysis of essays by computer. Tech. rep. (1968)

28.

Page, E.B.: The imminence of... grading essays by computer. Phi Delta Kappan 47(5), 238–243 (1966)

29.

Rudner, L.M., Liang, T.: Automated essay scoring using Bayes’ theorem. J. Technol. Learn. Assess. 1(2) (2002)

30.

Schabes, Y., Abeillé, A., Joshi, A.K.: Parsing strategies with ’lexicalized’ grammars: application to tree adjoining grammars. Tech. Rep. MS-CIS-88-65, Department of Computer and Information Science (1988)

31.

Shieber, S.M.: Evidence against the context-freeness of natural language. Linguist. Philos. 8(3), 333–343 (1985). https://doi.org/10.1007/bf00630917CrossRef

32.

Wild, F., Stahl, C., Stermsek, G., Neumann, G.: Parameters driving effectiveness of automated essay scoring with LSA. In: Proceedings of the 9th International Computer Assisted Assessment Conference (CAA), pp. 485–495 (2005)

Titel: Automatic scoring of arabic essays over three linguistic levels
verfasst von: Waleed Alsanie
Mohamed I. Alkanhal
Mohammed Alhamadi
Abdulaziz O. Alqabbany
Publikationsdatum: 04.08.2021
Verlag: Springer Berlin Heidelberg
Erschienen in: Progress in Artificial Intelligence / Ausgabe 1/2022
Print ISSN: 2192-6352
Elektronische ISSN: 2192-6360
DOI: https://doi.org/10.1007/s13748-021-00257-z

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 1/2022

Knowledge-based sentence semantic similarity: algebraical properties

Semi-causal decision trees

Predicting human behavior in size-variant repeated games through deep convolutional neural networks

Improving link prediction in social networks using local and global features: a clustering-based approach

SVGPM: evolving SVM decision function by using genetic programming to solve imbalanced classification problem

Interpretable entity meta-alignment in knowledge graphs using penalized regression: a case study in the biomedical domain