Skip to main content
Erschienen in: International Journal of Speech Technology 2/2017

02.05.2017

Semantic role labeling for Arabic language using case-based reasoning approach

verfasst von: Hamza Meguehout, Tahar Bouhadada, Mohamed Tayeb Laskri

Erschienen in: International Journal of Speech Technology | Ausgabe 2/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many natural language processing areas use semantic roles in order to improve the applications of the extracted information, the question answering and the machine translation, etc. In Arabic, the work of constructing the semantic role labeling system or the annotated corpus is extremely limited compared to their speaker’s number and to English language as well. In this paper, we present a supervised method for the semantic role labeling of Arabic sentences. Hence, we use the feedback capacity of the case-based reasoning to annotate new sentences from already annotated ones besides the use of the Arabic PropBank as a reference to the semantic labels. We test our method under a wide range corpus that contains 2332 attributes and 5291 arguments. Accordingly, an Arabic semantic role labeling system is tested, for the first time, in that corpus. As a result, our method shows the ability to annotate new sentences from the labeled sentences or the construction of the annotated corpus.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Aamodt, A., & Plaza, E. (1994). Case-based reasoning: Foundational issues, methodological variations, and system approaches. AI Commun, 7, 39–59. Aamodt, A., & Plaza, E. (1994). Case-based reasoning: Foundational issues, methodological variations, and system approaches. AI Commun, 7, 39–59.
Zurück zum Zitat Armaghan, N. (2009) Contribution à un système de retour d’expérience basé sur le raisonnement à partir de cas conversationnel: Application à la gestion des pannes de machines industrielles. Thèse de doctorat Génie des systèmes industriels, Vandoeuvre-les-Nancy, France. Armaghan, N. (2009) Contribution à un système de retour d’expérience basé sur le raisonnement à partir de cas conversationnel: Application à la gestion des pannes de machines industrielles. Thèse de doctorat Génie des systèmes industriels, Vandoeuvre-les-Nancy, France.
Zurück zum Zitat Baker, C. F., Fillmore, C. J., & Lowe, J. B. (1998). The Berkeley FrameNet Project. Paper presented at the Proceedings of the 17th international conference on Computational linguistics—Vol. 1, Montreal, Quebec, Canada. doi:10.3115/980451.980860. Baker, C. F., Fillmore, C. J., & Lowe, J. B. (1998). The Berkeley FrameNet Project. Paper presented at the Proceedings of the 17th international conference on Computational linguistics—Vol. 1, Montreal, Quebec, Canada. doi:10.​3115/​980451.​980860.
Zurück zum Zitat Begum, S., Ahmed, M. U., Funk, P., Ning, X., & Folke, M. (2011). Case-based reasoning systems in the health sciences: A survey of recent trends and developments. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 41, 421–434. doi:10.1109/TSMCC.2010.2071862.CrossRef Begum, S., Ahmed, M. U., Funk, P., Ning, X., & Folke, M. (2011). Case-based reasoning systems in the health sciences: A survey of recent trends and developments. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 41, 421–434. doi:10.​1109/​TSMCC.​2010.​2071862.CrossRef
Zurück zum Zitat Bonial, C., Hwang, J., Bonn, J., Conger, K., Babko-malaya, O., & Palmer, M. (2012). English PropBank Annotation Guidelines. Bonial, C., Hwang, J., Bonn, J., Conger, K., Babko-malaya, O., & Palmer, M. (2012). English PropBank Annotation Guidelines.
Zurück zum Zitat Campillo-Gimenez, B., Jouini, W., Bayat, S., & Cuggia, M. (2013). Improving case-based reasoning systems by combining K-nearest neighbour algorithm with logistic regression in the prediction of patients’ registration on the renal transplant waiting list. PLoS ONE, 8, e71991. doi:10.1371/journal.pone.0071991.CrossRef Campillo-Gimenez, B., Jouini, W., Bayat, S., & Cuggia, M. (2013). Improving case-based reasoning systems by combining K-nearest neighbour algorithm with logistic regression in the prediction of patients’ registration on the renal transplant waiting list. PLoS ONE, 8, e71991. doi:10.​1371/​journal.​pone.​0071991.CrossRef
Zurück zum Zitat Collins, B. (1998). Example Based Machine Translation: Adaptation Guided Retrieval Approach. PhD Thesis, University of Dublin, Trinity College. Collins, B. (1998). Example Based Machine Translation: Adaptation Guided Retrieval Approach. PhD Thesis, University of Dublin, Trinity College.
Zurück zum Zitat Cordierand, A., & Fuchs, B. (2005). Un assistant pour la conception et le développement des systèmes de RàPC. In S. Després (Ed.), Atelier Raisonnement à Partir de Cas (pp. 5–14). Nice: Plate-Forme AFIA 2005 - Atelier Raisonnement à Partir de Cas. Cordierand, A., & Fuchs, B. (2005). Un assistant pour la conception et le développement des systèmes de RàPC. In S. Després (Ed.), Atelier Raisonnement à Partir de Cas (pp. 5–14). Nice: Plate-Forme AFIA 2005 - Atelier Raisonnement à Partir de Cas.
Zurück zum Zitat Diab, M., Alkhalifa, M., ElKateb, S., Fellbaum, C., Mansouri, A., & Palmer, M. (2007a). SemEval-2007 Task 18: Arabic Semantic Labeling. In, Prague, 2007a. Workshop on Semantic Evaluations (SemEval). Diab, M., Alkhalifa, M., ElKateb, S., Fellbaum, C., Mansouri, A., & Palmer, M. (2007a). SemEval-2007 Task 18: Arabic Semantic Labeling. In, Prague, 2007a. Workshop on Semantic Evaluations (SemEval).
Zurück zum Zitat Diab, M., & Marton, Y. (2014). Semantic processing of semitic languages. In I. Zitouni (Ed.), Natural language processing of semitic languages (pp. 129–159). Berlin: Springer Berlin Heidelberg. doi:10.1007/978-3-642-45358-8_4.CrossRef Diab, M., & Marton, Y. (2014). Semantic processing of semitic languages. In I. Zitouni (Ed.), Natural language processing of semitic languages (pp. 129–159). Berlin: Springer Berlin Heidelberg. doi:10.​1007/​978-3-642-45358-8_​4.CrossRef
Zurück zum Zitat Diab, M., & Moschitti, A. (2007). Semantic parsing of modern standard Arabic. In: Recent advances in natural language processing (RANLP), Vol. 2007, Borovets: Association for Computational Linguistics (ACL). Diab, M., & Moschitti, A. (2007). Semantic parsing of modern standard Arabic. In: Recent advances in natural language processing (RANLP), Vol. 2007, Borovets: Association for Computational Linguistics (ACL).
Zurück zum Zitat Diab, M., Moschitti, A., & Pighin, D. (2007b). CUNIT: A semantic role labeling system for modern standard Arabic. Paper presented at the SemEval-2007 Workshop, co-located with ACL 2007. Diab, M., Moschitti, A., & Pighin, D. (2007b). CUNIT: A semantic role labeling system for modern standard Arabic. Paper presented at the SemEval-2007 Workshop, co-located with ACL 2007.
Zurück zum Zitat Diab, M., Moschitti, A., & Pighin, D. (2008). Semantic role labeling systems for Arabic language using kernel methods. In: 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2008: HLT), Columbus, Ohio, USA, 2008. Diab, M., Moschitti, A., & Pighin, D. (2008). Semantic role labeling systems for Arabic language using kernel methods. In: 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2008: HLT), Columbus, Ohio, USA, 2008.
Zurück zum Zitat Dufour-Lussier, V., Lieber, J., Nauer, E., & Toussaint, Y. (2010). Text adaptation using formal concept analysis. In I. Bichindaritz & S. Montani (Eds.), Case-based reasoning. research and development: 18th International Conference on Case-Based Reasoning, ICCBR 2010, Alessandria, Italy, July 19–22, 2010. Proceedings (pp. 96–110). Berlin: Springer Berlin Heidelberg. doi:10.1007/978-3-642-14274-1_9.CrossRef Dufour-Lussier, V., Lieber, J., Nauer, E., & Toussaint, Y. (2010). Text adaptation using formal concept analysis. In I. Bichindaritz & S. Montani (Eds.), Case-based reasoning. research and development: 18th International Conference on Case-Based Reasoning, ICCBR 2010, Alessandria, Italy, July 19–22, 2010. Proceedings (pp. 96–110). Berlin: Springer Berlin Heidelberg. doi:10.​1007/​978-3-642-14274-1_​9.CrossRef
Zurück zum Zitat Hechenbichler, K., & Schliep, K. (2004). Weighted k-Nearest-Neighbor Techniques and Ordinal Classification Vol. 399. Hechenbichler, K., & Schliep, K. (2004). Weighted k-Nearest-Neighbor Techniques and Ordinal Classification Vol. 399.
Zurück zum Zitat JianDe, W., ZhaoXiong, C., & HeYan, H. (2001). Intelligent case based machine translation system. In A. Gelbukh (Ed.), Computational linguistics and intelligent text processing: Second International Conference, CICLing 2001 Mexico City, Mexico, February 18–24, 2001 Proceedings (pp. 197–205). Berlin: Springer Berlin Heidelberg. doi:10.1007/3-540-44686-9_21.CrossRef JianDe, W., ZhaoXiong, C., & HeYan, H. (2001). Intelligent case based machine translation system. In A. Gelbukh (Ed.), Computational linguistics and intelligent text processing: Second International Conference, CICLing 2001 Mexico City, Mexico, February 18–24, 2001 Proceedings (pp. 197–205). Berlin: Springer Berlin Heidelberg. doi:10.​1007/​3-540-44686-9_​21.CrossRef
Zurück zum Zitat Lamontagne, L., & Lapalme, G. (2004). Textual reuse for email response. In P. Funk & P. A. González Calero (Eds.), Advances in case-based reasoning: 7th European Conference, ECCBR 2004, Madrid, Spain, August 30—September 2, 2004. Proceedings (pp. 242–256). Berlin: Springer Berlin Heidelberg. doi:10.1007/978-3-540-28631-8_19.CrossRef Lamontagne, L., & Lapalme, G. (2004). Textual reuse for email response. In P. Funk & P. A. González Calero (Eds.), Advances in case-based reasoning: 7th European Conference, ECCBR 2004, Madrid, Spain, August 30—September 2, 2004. Proceedings (pp. 242–256). Berlin: Springer Berlin Heidelberg. doi:10.​1007/​978-3-540-28631-8_​19.CrossRef
Zurück zum Zitat Maamouri, M., Bies, A., Buckwalter, T., Mekki, W. (2004). The Penn Arabic Treebank: Building a Large-Scale Annotated Arabic Corpus. In: NEMLAR Conference on Arabic Language Resources and Tools, 2004. Maamouri, M., Bies, A., Buckwalter, T., Mekki, W. (2004). The Penn Arabic Treebank: Building a Large-Scale Annotated Arabic Corpus. In: NEMLAR Conference on Arabic Language Resources and Tools, 2004.
Zurück zum Zitat Marcus, M. P., Marcinkiewicz, M. A., & Santorini, B. (1993). Building a large annotated corpus of English: The penn treebank. Computational Linguistics, 19, 313–330. Marcus, M. P., Marcinkiewicz, M. A., & Santorini, B. (1993). Building a large annotated corpus of English: The penn treebank. Computational Linguistics, 19, 313–330.
Zurück zum Zitat Mathieu-Dupas, E. (2010). Algorithme des k plus proches voisins pondérés et application en diagnostic. In: 42èmes Journées de Statistique, Marseille, France, France, 2010. Mathieu-Dupas, E. (2010). Algorithme des k plus proches voisins pondérés et application en diagnostic. In: 42èmes Journées de Statistique, Marseille, France, France, 2010.
Zurück zum Zitat Meguehout, H., Bouhadada, T., & Laskri, T. (2013). Un raisonnement à partir de cas pour la traduction automatique arabe-français basée sur la sémantique. Paper presented at the CEC-TAL’2013, Montréal, Canada. Meguehout, H., Bouhadada, T., & Laskri, T. (2013). Un raisonnement à partir de cas pour la traduction automatique arabe-français basée sur la sémantique. Paper presented at the CEC-TAL’2013, Montréal, Canada.
Zurück zum Zitat Mishra, V., & Mishra, R. B. (2010). Approach of English to Sanskrit machine translation based on case based reasoning, artificial neural networks and translation rules. International Journal of Knowledge Engineering and Soft Data Paradigms, 2, 328–348. doi:10.1504/ijkesdp.2010.037494.CrossRef Mishra, V., & Mishra, R. B. (2010). Approach of English to Sanskrit machine translation based on case based reasoning, artificial neural networks and translation rules. International Journal of Knowledge Engineering and Soft Data Paradigms, 2, 328–348. doi:10.​1504/​ijkesdp.​2010.​037494.CrossRef
Zurück zum Zitat Mohamed, M., Bies, A., Kulick, S., Krouna, S., Gaddeche, F., & Zaghouani, W. (2010). Arabic Treebank: Part 3 v 3.2 LDC2010T08. Philadelphia: Linguistic Data Consortium. Mohamed, M., Bies, A., Kulick, S., Krouna, S., Gaddeche, F., & Zaghouani, W. (2010). Arabic Treebank: Part 3 v 3.2 LDC2010T08. Philadelphia: Linguistic Data Consortium.
Zurück zum Zitat Morante, R., Daelemans, W., & Asch, V. V. (2008). A combined memory-based semantic role labeler of English. Paper presented at the Proceedings of the Twelfth Conference on Computational Natural Language Learning, Manchester, United Kingdom. Morante, R., Daelemans, W., & Asch, V. V. (2008). A combined memory-based semantic role labeler of English. Paper presented at the Proceedings of the Twelfth Conference on Computational Natural Language Learning, Manchester, United Kingdom.
Zurück zum Zitat Mousser, J. (2010). A Large Coverage Verb Taxonomy for Arabic. Paper presented at the LREC. Mousser, J. (2010). A Large Coverage Verb Taxonomy for Arabic. Paper presented at the LREC.
Zurück zum Zitat Palmer, M., Babko-Malaya, O., Bies, A., Diab, M., Maamouri, M., Mansouri, A., & Zaghouani, W. (2008). A Pilot Arabic Propbank. Marrakech: European Language Resources Association (ELRA). Palmer, M., Babko-Malaya, O., Bies, A., Diab, M., Maamouri, M., Mansouri, A., & Zaghouani, W. (2008). A Pilot Arabic Propbank. Marrakech: European Language Resources Association (ELRA).
Zurück zum Zitat Pradhan, S., Moschitti, A., Xue, N., Uryupina, O., & Zhan, Y. (2012). CoNLL-2012 Shared Task: Modeling Multilingual Unrestricted Coreference in OntoNotes. Jju Island. Pradhan, S., Moschitti, A., Xue, N., Uryupina, O., & Zhan, Y. (2012). CoNLL-2012 Shared Task: Modeling Multilingual Unrestricted Coreference in OntoNotes. Jju Island.
Zurück zum Zitat Ralph, W. et al. (2012). OntoNotes Release 5.0 with OntoNotes DB Tool v0.999 beta. Ralph, W. et al. (2012). OntoNotes Release 5.0 with OntoNotes DB Tool v0.999 beta.
Zurück zum Zitat Riesbeck, C. K., & Schank, R. C. (1989). Inside case-based reasoning. New Jersey: L. Erlbaum Associates Inc. Riesbeck, C. K., & Schank, R. C. (1989). Inside case-based reasoning. New Jersey: L. Erlbaum Associates Inc.
Zurück zum Zitat Schank, R. C. (1982). Dynamic memory: A theory of reminding and learning in computers and people. cambridge: Cambridge University Press. Schank, R. C. (1982). Dynamic memory: A theory of reminding and learning in computers and people. cambridge: Cambridge University Press.
Zurück zum Zitat Schuler, K. K. (2005). Verbnet: A broad-coverage, comprehensive verb lexicon. Philadelphia: University of Pennsylvania. Schuler, K. K. (2005). Verbnet: A broad-coverage, comprehensive verb lexicon. Philadelphia: University of Pennsylvania.
Zurück zum Zitat Schuler, K. K. (2006). VerbNet: A broad-coverage, comprehensive verb Lexicon. Philadelphia: University of Pennsylvania. Schuler, K. K. (2006). VerbNet: A broad-coverage, comprehensive verb Lexicon. Philadelphia: University of Pennsylvania.
Zurück zum Zitat Surdeanu, M., Morante, R., & Màrquez, L. (2008). Analysis of joint inference strategies for the semantic role labeling of spanish and catalan. In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing: 9th International Conference, CICLing 2008, Haifa, Israel, February 17–23, 2008. Proceedings (pp. 206–218). Berlin: Springer Berlin Heidelberg. doi:10.1007/978-3-540-78135-6_18.CrossRef Surdeanu, M., Morante, R., & Màrquez, L. (2008). Analysis of joint inference strategies for the semantic role labeling of spanish and catalan. In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing: 9th International Conference, CICLing 2008, Haifa, Israel, February 17–23, 2008. Proceedings (pp. 206–218). Berlin: Springer Berlin Heidelberg. doi:10.​1007/​978-3-540-78135-6_​18.CrossRef
Zurück zum Zitat Zaghouani, W. (2015). Le développement de corpus annotés pour la langue arabe. PhD Thesis, University Paris Ouest Nanterre la Défense. Zaghouani, W. (2015). Le développement de corpus annotés pour la langue arabe. PhD Thesis, University Paris Ouest Nanterre la Défense.
Zurück zum Zitat Zaghouani, W., Diab, M., Mansouri, A., Pradhan, S., & Palmer, M. (2010). The revised Arabic PropBank. Paper presented at the Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden. Zaghouani, W., Diab, M., Mansouri, A., Pradhan, S., & Palmer, M. (2010). The revised Arabic PropBank. Paper presented at the Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden.
Zurück zum Zitat Zaghouani, W., Hawwari, A., & Diab, M. A. (2012). Pilot PropBank Annotation for Quranic Arabic. In the NAACL-HLT 2012 Workshop on Computational Linguistics for Literature (pp. 78–83). Canada: Association for Computational Linguistics. Zaghouani, W., Hawwari, A., & Diab, M. A. (2012). Pilot PropBank Annotation for Quranic Arabic. In the NAACL-HLT 2012 Workshop on Computational Linguistics for Literature (pp. 78–83). Canada: Association for Computational Linguistics.
Zurück zum Zitat Zwarts, S., Nijholt, A., Akker DiHJAod, & Poel, M. (2004). CBR in Dependency-based Machine Translation. Paper presented at the Proceedings of Konvens 2004, Vienna, Austria. Zwarts, S., Nijholt, A., Akker DiHJAod, & Poel, M. (2004). CBR in Dependency-based Machine Translation. Paper presented at the Proceedings of Konvens 2004, Vienna, Austria.
Metadaten
Titel
Semantic role labeling for Arabic language using case-based reasoning approach
verfasst von
Hamza Meguehout
Tahar Bouhadada
Mohamed Tayeb Laskri
Publikationsdatum
02.05.2017
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 2/2017
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-017-9412-6

Weitere Artikel der Ausgabe 2/2017

International Journal of Speech Technology 2/2017 Zur Ausgabe

Neuer Inhalt