2019 | Original Paper | Book Chapter

Reading Comprehension in University Texts: The Metrics of Lexical Complexity in Corpus Analysis in Spanish

Written by: Jenny Ortiz Zambrano, Eleanor Varela Tapia

Published in: Computer and Communication Engineering

Publisher: Springer International Publishing

Abstract

This article describes the development and implementation of a software application that automatically computes eight metrics of lexical complexity over VYTEDU, a corpus of transcriptions of university educational videos in Spanish prepared by teachers at the University of Guayaquil, Ecuador. The results show the different levels of lexical complexity of the texts in terms of how comprehensible their content is. One of the main characteristics of the texts is that they differ in size and content. Notably, some longer texts had a lower lexical complexity index than shorter ones. The software is being distributed so that it can serve as a tool for further research in Natural Language Processing. It was built with free software tools, which made it easier to use Natural Language Processing libraries for analyzing the complexity of text comprehension. This research is the second step toward an automatic text simplification tool for Spanish in higher education, which is proposed as future work; the first step was the construction and publication of the VYTEDU corpus.

Footnotes

1. Lexicon: According to the dictionary of the Royal Academy of the Spanish Language, its meaning is the “set of words of a language, or those that belong to the use of a region”. Official website available at http://dle.rae.es/?id=ND3Rym3.
2. CEATIC: Center for Advanced Studies of the University of Jaén (Jaén, Spain), by its initials in Spanish.
3. VYTEDU: Videos and Transcripts in the Educational field, by its initials in Spanish.
4. UTF-8: 8-bit Unicode Transformation Format. According to Yergeau (2003), “it is a transformation format of ISO 10646”.
5. POS-tagger: Part-Of-Speech tagger, also known as POS Tagging, Mesa (2016).

References

1. Neira Martínez, A.C., Reyes Reyes, F.T., Riffo Ocares, B.E.: Academic experience and reading comprehension strategies in first-year university students. Lit. Linguist. 31, 221–244 (2015)
3. Blanco, A., Gutiérrez, C.: Readability of health web pages for patients and readers of the general population. Span. Mag. Publ. Health 76(4), 321–331 (2002)
5. Anula, A.: Readings adapted to the teaching of Spanish as L2: linguistic variables for determining the level of readability. The evaluation in learning and teaching Spanish as L2, pp. 162–170 (2008)
6. Saggion, H., Štajner, S., Bott, S., Mille, S., Rello, L., Drndarevic, B.: Making it simplext: implementation and evaluation of a text simplification system for Spanish. ACM Trans. Accessible Comput. (TACCESS) 6(4), 14 (2015)
7. Spaulding, S.: A Spanish readability formula. Mod. Lang. J. 40(8), 433–441 (1956)
8. Štajner, S., Saggion, H.: Readability indices for automatic evaluation of text simplification systems: a feasibility study for Spanish. In: Proceedings of the Sixth International Joint Conference on Natural Language Processing, pp. 374–382 (2013)
9. López-Anguita, R., Montejo-Ráez, A., Martínez-Santiago, F.J., Díaz-Galiano, M.C.: Legibility of the text, complexity metrics and the importance of words. Nat. Lang. Process. 61, 101–108 (2018)
10. Senter, R.J., Smith, E.A.: Automated readability index. Cincinnati Univ. OH (1967)
11. Rodríguez, S.: Extraction of information from emails using natural language processing techniques (2017)
12. Mesa, J.: Processing of natural language and its application in hotel services (2016)
13. Orquín, A., Rodríguez, K., Amable, A., Martín, R., Echarte, Á., Morera, D.C.: System for the pre-processing of texts for Natural Language Processing (2009)
14. De Jesús Torres, J.A.: Design of an educational software for the learning of language and literature in the punctuation marks of the first-year students of general unified baccalaureate morning section, parallel F of the technical institute superior technical center DM Quito, period 2016 (Bachelor’s thesis, Quito: UCE). http://www.dspace.uce.edu.ec/handle/25000/11181

Metadata
Title
Reading Comprehension in University Texts: The Metrics of Lexical Complexity in Corpus Analysis in Spanish
Written by
Jenny Ortiz Zambrano
Eleanor Varela Tapia
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-12018-4_9