Skip to main content
Erschienen in: Arabian Journal for Science and Engineering 11/2019

06.08.2019 | Research Article - Computer Engineering and Computer Science

Bidirectional Encoder–Decoder Model for Arabic Named Entity Recognition

verfasst von: Mohammed N. A. Ali, Guanzheng Tan

Erschienen in: Arabian Journal for Science and Engineering | Ausgabe 11/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sequence labeling models with recurrent neural network variants, such as long short-term memory (LSTM) and gated recurrent units, show promising performance in several natural language processing tasks, such as named entity recognition (NER). Most current models use unidirectional decoders, which reason only about the past and remain limited to retaining future contexts while generating predictions. Therefore, these models suffer from their generation of unbalanced outputs. Moreover, most existing NER models utilize word embeddings for capturing similarities between words but sustain when handling previously unobserved or infrequently used words. We propose a bidirectional encoder–decoder model for addressing the problem of Arabic NER on the basis of recent work in deep learning, in which the encoder and decoder are bidirectional LSTMs. In addition to word-level embeddings, character-level embeddings are adopted, and they are combined via an embedding-level attention mechanism. Our model can dynamically determine the information that must be utilized from a word- or character-level component through this attention mechanism. Experimental results on the ANERCorp and AQMAR datasets show that the model with a bi-encoder–decoder network and embedding attention layer achieves a high F-score measure of approximately 92%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Shaalan, K.: A survey of Arabic named entity recognition and classification. Comput. Linguist. 40(2), 469–510 (2014)CrossRef Shaalan, K.: A survey of Arabic named entity recognition and classification. Comput. Linguist. 40(2), 469–510 (2014)CrossRef
2.
Zurück zum Zitat Nydell, M.K.: Understanding Arabs: A Guide for Modern Times. Hodder & Stoughton, London (2018) Nydell, M.K.: Understanding Arabs: A Guide for Modern Times. Hodder & Stoughton, London (2018)
3.
Zurück zum Zitat Oudah, M.; Shaalan, K.: NERA 2.0: improving coverage and performance of rule-based named entity recognition for Arabic. Nat. Lang. Eng. 23(3), 441–472 (2017)CrossRef Oudah, M.; Shaalan, K.: NERA 2.0: improving coverage and performance of rule-based named entity recognition for Arabic. Nat. Lang. Eng. 23(3), 441–472 (2017)CrossRef
4.
Zurück zum Zitat Zirikly, A.; Diab, M.: Named entity recognition for Arabic social media. In: Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, pp. 176–185 (2015) Zirikly, A.; Diab, M.: Named entity recognition for Arabic social media. In: Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, pp. 176–185 (2015)
5.
Zurück zum Zitat Zaghouani, W.: RENAR: a rule-based Arabic named entity recognition system. ACM Trans. Asian Lang. Inf. Process. 11(1), 2 (2012)MathSciNetCrossRef Zaghouani, W.: RENAR: a rule-based Arabic named entity recognition system. ACM Trans. Asian Lang. Inf. Process. 11(1), 2 (2012)MathSciNetCrossRef
6.
Zurück zum Zitat Kim, Y.; Jernite, Y.; Sontag, D.; Rush, A.M.: Character-aware neural language models. In: AAAI, pp. 2741–2749 (2016) Kim, Y.; Jernite, Y.; Sontag, D.; Rush, A.M.: Character-aware neural language models. In: AAAI, pp. 2741–2749 (2016)
7.
Zurück zum Zitat Dahou, A.; Xiong, S.; Zhou, J.; Haddoud, M.H.; Duan, P.: Word embeddings and convolutional neural network for Arabic sentiment classification. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2418–2427 (2016) Dahou, A.; Xiong, S.; Zhou, J.; Haddoud, M.H.; Duan, P.: Word embeddings and convolutional neural network for Arabic sentiment classification. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2418–2427 (2016)
8.
Zurück zum Zitat Ali, M.; Tan, G.; Hussain, A.: Bidirectional recurrent neural network approach for Arabic named entity recognition. Future Internet 10(12), 123 (2018)CrossRef Ali, M.; Tan, G.; Hussain, A.: Bidirectional recurrent neural network approach for Arabic named entity recognition. Future Internet 10(12), 123 (2018)CrossRef
9.
Zurück zum Zitat Sun, Q.; Lee, S.; Batra, D.: Bidirectional beam search: forward–backward inference in neural sequence models for fill-in-the-blank image captioning. arXiv preprint, arXiv:1705.08759 (2017). Sun, Q.; Lee, S.; Batra, D.: Bidirectional beam search: forward–backward inference in neural sequence models for fill-in-the-blank image captioning. arXiv preprint, arXiv:​1705.​08759 (2017).
10.
Zurück zum Zitat Doetsch, P.; Zeyer, A.; Ney, H.: Bidirectional decoder networks for attention-based end-to-end offline handwriting recognition. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 361–366 (2016) Doetsch, P.; Zeyer, A.; Ney, H.: Bidirectional decoder networks for attention-based end-to-end offline handwriting recognition. In 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 361–366 (2016)
11.
Zurück zum Zitat Goyal, A.; Gupta, V.; Kumar, M.: Recent named entity recognition and classification techniques: a systematic review. Comput. Sci. Rev. 29, 21–43 (2018)CrossRef Goyal, A.; Gupta, V.; Kumar, M.: Recent named entity recognition and classification techniques: a systematic review. Comput. Sci. Rev. 29, 21–43 (2018)CrossRef
12.
Zurück zum Zitat Etaiwi, W.; Awajan, A.; Suleiman, D.: Statistical Arabic name entity recognition approaches: a survey. Procedia Comput. Sci. 113, 57–64 (2017)CrossRef Etaiwi, W.; Awajan, A.; Suleiman, D.: Statistical Arabic name entity recognition approaches: a survey. Procedia Comput. Sci. 113, 57–64 (2017)CrossRef
13.
Zurück zum Zitat Al-Ayyoub, M.; Nuseir, A.; Alsmearat, K.; Jararweh, Y.; Gupta, B.: Deep learning for Arabic NLP: a survey. J. Comput. Sci. 26, 522–531 (2018)CrossRef Al-Ayyoub, M.; Nuseir, A.; Alsmearat, K.; Jararweh, Y.; Gupta, B.: Deep learning for Arabic NLP: a survey. J. Comput. Sci. 26, 522–531 (2018)CrossRef
14.
Zurück zum Zitat Shaalan, K.: Rule-based approach in Arabic natural language processing. Int. J. Inf. Commun. Technol. 3(3), 11–19 (2010) Shaalan, K.: Rule-based approach in Arabic natural language processing. Int. J. Inf. Commun. Technol. 3(3), 11–19 (2010)
15.
Zurück zum Zitat Benajiba, Y.; Rosso, P.: Arabic named entity recognition using conditional random fields. In: Proceedings of Workshop HLT NLP Within Arabic World, LREC, pp. 143–153 (2008) Benajiba, Y.; Rosso, P.: Arabic named entity recognition using conditional random fields. In: Proceedings of Workshop HLT NLP Within Arabic World, LREC, pp. 143–153 (2008)
16.
Zurück zum Zitat Meselhi, M.A.; Bakr, H.M.A.; Ziedan, I.; Shaalan, K.: Hybrid named entity recognition-application to Arabic language. In: 2014 9th International Conference on Computer Engineering & Systems (ICCES), pp. 80–85 (2014) Meselhi, M.A.; Bakr, H.M.A.; Ziedan, I.; Shaalan, K.: Hybrid named entity recognition-application to Arabic language. In: 2014 9th International Conference on Computer Engineering & Systems (ICCES), pp. 80–85 (2014)
17.
Zurück zum Zitat Ali, M.N.A.; Tan, G.; Hussain, A.: Boosting Arabic named-entity recognition with multi-attention layer. IEEE Access 7, 46575–46582 (2019)CrossRef Ali, M.N.A.; Tan, G.; Hussain, A.: Boosting Arabic named-entity recognition with multi-attention layer. IEEE Access 7, 46575–46582 (2019)CrossRef
18.
Zurück zum Zitat Mohammed, N.F.; Omar, N.: Arabic named entity recognition using artificial neural network. J. Comput. Sci. 8(8), 1285–1293 (2012)CrossRef Mohammed, N.F.; Omar, N.: Arabic named entity recognition using artificial neural network. J. Comput. Sci. 8(8), 1285–1293 (2012)CrossRef
19.
Zurück zum Zitat Gridach, M.; Haddad, H.: Arabic named entity recognition: a bidirectional GRU-CRF approach. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 264–275 (2017) Gridach, M.; Haddad, H.: Arabic named entity recognition: a bidirectional GRU-CRF approach. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 264–275 (2017)
20.
Zurück zum Zitat Awad, D.; Sabty, C.; Elmahdy, M.; Abdennadher, S.: Arabic name entity recognition using deep learning. In: International Conference on Statistical Language and Speech Processing, pp. 105–116 (2018) Awad, D.; Sabty, C.; Elmahdy, M.; Abdennadher, S.: Arabic name entity recognition using deep learning. In: International Conference on Statistical Language and Speech Processing, pp. 105–116 (2018)
21.
Zurück zum Zitat Cho, K., et al.: Learning phrase representations using RNN encoder–decoder for statistical machine translation. arXiv preprint, arXiv:1406.1078 (2014) Cho, K., et al.: Learning phrase representations using RNN encoder–decoder for statistical machine translation. arXiv preprint, arXiv:​1406.​1078 (2014)
22.
Zurück zum Zitat Xu, K., et al.: Show, attend and tell: neural image caption generation with visual attention. In: International Conference on Machine Learning, pp. 2048–2057 (2015) Xu, K., et al.: Show, attend and tell: neural image caption generation with visual attention. In: International Conference on Machine Learning, pp. 2048–2057 (2015)
23.
Zurück zum Zitat Mikolov, T.; Chen, K.; Corrado, G.; Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint, arXiv:1301.3781 (2013) Mikolov, T.; Chen, K.; Corrado, G.; Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint, arXiv:​1301.​3781 (2013)
24.
Zurück zum Zitat Li, F.; Zhang, M.; Fu, G.; Ji, D.: A neural joint model for entity and relation extraction from biomedical text. BMC Bioinformatics 18(1), 198 (2017)CrossRef Li, F.; Zhang, M.; Fu, G.; Ji, D.: A neural joint model for entity and relation extraction from biomedical text. BMC Bioinformatics 18(1), 198 (2017)CrossRef
25.
Zurück zum Zitat Ling, W., et al.: Finding function in form: Compositional character models for open vocabulary word representation. arXiv preprint, arXiv:1508.02096 (2015) Ling, W., et al.: Finding function in form: Compositional character models for open vocabulary word representation. arXiv preprint, arXiv:​1508.​02096 (2015)
26.
Zurück zum Zitat Luong, M.-T.; Manning, C.D.: Achieving open vocabulary neural machine translation with hybrid word-character models. arXiv preprint, arXiv:1604.00788 (2016) Luong, M.-T.; Manning, C.D.: Achieving open vocabulary neural machine translation with hybrid word-character models. arXiv preprint, arXiv:​1604.​00788 (2016)
27.
Zurück zum Zitat Cocos, A.; Fiks, A.G.; Masino, A.J.: Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts. J. Am. Med. Inform. Assoc. 24(4), 813–821 (2017)CrossRef Cocos, A.; Fiks, A.G.; Masino, A.J.: Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts. J. Am. Med. Inform. Assoc. 24(4), 813–821 (2017)CrossRef
28.
Zurück zum Zitat Soliman, A.B.; Eissa, K.; El-Beltagy, S.R.: AraVec: a set of arabic word embedding models for use in Arabic NLP. Procedia Comput. Sci. 117, 256–265 (2017)CrossRef Soliman, A.B.; Eissa, K.; El-Beltagy, S.R.: AraVec: a set of arabic word embedding models for use in Arabic NLP. Procedia Comput. Sci. 117, 256–265 (2017)CrossRef
29.
Zurück zum Zitat Rei, M.; Crichton, G.K.O.; Pyysalo, S.: Attending to characters in neural sequence labeling models. arXiv preprint, arXiv:1611.04361 (2016) Rei, M.; Crichton, G.K.O.; Pyysalo, S.: Attending to characters in neural sequence labeling models. arXiv preprint, arXiv:​1611.​04361 (2016)
30.
Zurück zum Zitat Benajiba, Y.; Rosso, P.; Miguel, J.; Ruiz, B.: ANERsys: an Arabic named entity recognition system based on maximum entropy. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 143–153 (2007) Benajiba, Y.; Rosso, P.; Miguel, J.; Ruiz, B.: ANERsys: an Arabic named entity recognition system based on maximum entropy. In: International Conference on Intelligent Text Processing and Computational Linguistics, pp. 143–153 (2007)
31.
Zurück zum Zitat Mohit, B.; Schneider, N.; Bhowmick, R.; Oflazer, K.; Smith, N.A.: Recall-oriented learning of named entities in Arabic Wikipedia. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 162–173 (2012) Mohit, B.; Schneider, N.; Bhowmick, R.; Oflazer, K.; Smith, N.A.: Recall-oriented learning of named entities in Arabic Wikipedia. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 162–173 (2012)
32.
Zurück zum Zitat Duchi, J.; Hazan, E.; Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res. 12, 2121–2159 (2011)MathSciNetMATH Duchi, J.; Hazan, E.; Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization. J. Mach. Learn. Res. 12, 2121–2159 (2011)MathSciNetMATH
33.
Zurück zum Zitat Azroumahli, C.; El Younoussi, Y.; Achbal, F.: An overview of a distributional word representation for an Arabic named entity recognition system. In: International Conference on Soft Computing and Pattern Recognition, pp. 130–140 (2017) Azroumahli, C.; El Younoussi, Y.; Achbal, F.: An overview of a distributional word representation for an Arabic named entity recognition system. In: International Conference on Soft Computing and Pattern Recognition, pp. 130–140 (2017)
34.
Zurück zum Zitat Devarakonda, A.; Naumov, M.; Garland, M.: AdaBatch: adaptive batch sizes for training deep neural networks. arXiv preprint arXiv:1712.02029 (2017) Devarakonda, A.; Naumov, M.; Garland, M.: AdaBatch: adaptive batch sizes for training deep neural networks. arXiv preprint arXiv:​1712.​02029 (2017)
Metadaten
Titel
Bidirectional Encoder–Decoder Model for Arabic Named Entity Recognition
verfasst von
Mohammed N. A. Ali
Guanzheng Tan
Publikationsdatum
06.08.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Arabian Journal for Science and Engineering / Ausgabe 11/2019
Print ISSN: 2193-567X
Elektronische ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-019-04068-2

Weitere Artikel der Ausgabe 11/2019

Arabian Journal for Science and Engineering 11/2019 Zur Ausgabe

Research Article - Computer Engineering and Computer Science

Storage Node Allocation Methods for Erasure Code-based Cloud Storage Systems

Research Article - Computer Engineering and Computer Science

Bayesian Versus Convolutional Networks for Arabic Handwriting Recognition

Research Article - Computer Engineering and Computer Science

A New Mobility Control Approach for Improved Route Availability in Mobile Ad Hoc Networks

Research Article - Computer Engineering and Computer Science

Priority-Based and Optimized Data Center Selection in Cloud Computing

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.