Published in: Universal Access in the Information Society 1/2017

06.08.2015 | Long paper

An approach to treat numerical information in the text simplification process

Authors: Susana Bautista, Raquel Hervás, Pablo Gervás, Javier Rojo



Abstract

Public information services and documents should be accessible to the widest possible readership. In particular, information from these sources often takes the form of numerical expressions, which pose comprehension problems for many people, including people with disabilities, who are often also exposed to poverty, illiteracy, or lack of access to advanced technology. This paper presents an approach to treat numerical information in the text simplification process to make it more accessible. A generic model for automatic text simplification systems is presented, aimed at making documents more accessible to readers with cognitive disabilities. The proposed approach is validated with a real system to simplify numerical expressions in Spanish. This system is then evaluated and the results show that it is appropriate for the task at hand.


Metadata
Title
An approach to treat numerical information in the text simplification process
Authors
Susana Bautista
Raquel Hervás
Pablo Gervás
Javier Rojo
Publication date
06.08.2015
Publisher
Springer Berlin Heidelberg
Published in
Universal Access in the Information Society / Issue 1/2017
Print ISSN: 1615-5289
Electronic ISSN: 1615-5297
DOI
https://doi.org/10.1007/s10209-015-0426-z
