Published in: Artificial Intelligence Review 2/2016

1 February 2016

Automatic Arabic text summarization: a survey

Authors: Asma Bader Al-Saleh, Mohamed El Bachir Menai


Abstract

This survey reviews research studies on Arabic text summarization. Specifically, it addresses the summarization and evaluation methods, as well as the corpora, used in those studies. The literature in this field is fairly limited and relatively new compared with the literature available for other languages, such as English, so there is considerable opportunity for further research in Arabic text summarization. One of the largest problems in Arabic summarization has been the absence of Arabic gold-standard summaries, although this situation is beginning to change, especially with the inclusion of Arabic in the corpora and tasks of the TAC 2011 MultiLing Pilot and the ACL 2013 MultiLing Workshop. Finally, providing the required corpora and adopting them in Arabic summarization studies remains essential.


Metadata
Title
Automatic Arabic text summarization: a survey
Authors
Asma Bader Al-Saleh
Mohamed El Bachir Menai
Publication date
1 February 2016
Publisher
Springer Netherlands
Published in
Artificial Intelligence Review / Issue 2/2016
Print ISSN: 0269-2821
Electronic ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-015-9442-x