Skip to main content
Top
Published in: Arabian Journal for Science and Engineering 8/2022

08-02-2022 | Research Article-Computer Engineering and Computer Science

Improving Neural Machine Translation for Low Resource Algerian Dialect by Transductive Transfer Learning Strategy

Authors: Amel Slim, Ahlem Melouah, Usef Faghihi, Khouloud Sahib

Published in: Arabian Journal for Science and Engineering | Issue 8/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This study is the first work on a transductive transfer learning approach for low-resource neural machine translation applied to the Algerian Arabic dialect. The transductive approach is based on a fine-tuning transfer learning strategy that transfers knowledge from the parent model to the child model. This strategy helps to solve the learning problem using limited parallel corpora. We tested the approach on a sequence-to-sequence model with and without the Attention mechanism. We first trained the models on a parallel multi-dialects Arabic corpus and then switch them to a low-resource of the Algerian dialect. Transductive transfer learning raises the BLEU score for the Seq2Seq model from 0.3 to more than 34, and for the Attentional-Seq2Seq model from less than 17 to more than 35. The obtained results prove the validity of this approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Wu, L.; Tian, F.; Qin, T.; Lai, J.; Liu, T.Y.: A study of reinforcement learning for neural machine translation. In: Proceeding of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, pp. 3612–3621 (2018). https://doi.org/10.18653/v1/d18-1397 Wu, L.; Tian, F.; Qin, T.; Lai, J.; Liu, T.Y.: A study of reinforcement learning for neural machine translation. In: Proceeding of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, pp. 3612–3621 (2018). https://​doi.​org/​10.​18653/​v1/​d18-1397
3.
go back to reference Meftouh, K.; Harrat, S.; Jamoussi, S.; Abbas, M.; Smaili, K.: Machine translation experiments on PADIC: a parallel Arabic DIalect corpus. In: Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, pp. 26–34 (2015) Meftouh, K.; Harrat, S.; Jamoussi, S.; Abbas, M.; Smaili, K.: Machine translation experiments on PADIC: a parallel Arabic DIalect corpus. In: Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, pp. 26–34 (2015)
4.
go back to reference Sutskever, I.; Vinyals, O.; Le, Q.V.: Sequence to sequence learning with neural networks. Adv. Neural. Inf. Process. Syst. 4(January), 3104–3112 (2014) Sutskever, I.; Vinyals, O.; Le, Q.V.: Sequence to sequence learning with neural networks. Adv. Neural. Inf. Process. Syst. 4(January), 3104–3112 (2014)
5.
go back to reference Malki, Z.; Atlam, E.; Dagnew, G.; Alzighaibi, A.R.; Ghada, E.; Gad, I.: Bidirectional residual LSTM-based human activity recognition. Comput. Inf. Sci. 13(3), 1–40 (2020) Malki, Z.; Atlam, E.; Dagnew, G.; Alzighaibi, A.R.; Ghada, E.; Gad, I.: Bidirectional residual LSTM-based human activity recognition. Comput. Inf. Sci. 13(3), 1–40 (2020)
6.
go back to reference Xu, H.; Liu, Q.; van Genabith, J.; Xiong, D.; Zhang, M.: Multi-head highly parallelized LSTM decoder for neural machine translation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (pp. 273–282) (2021) Xu, H.; Liu, Q.; van Genabith, J.; Xiong, D.; Zhang, M.: Multi-head highly parallelized LSTM decoder for neural machine translation. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) (pp. 273–282) (2021)
7.
go back to reference Nguyen, L.H.; Pham, V.H.; Dinh, D.: Improving neural machine translation with AMR semantic graphs. Math. Probl. Eng. (2021) Nguyen, L.H.; Pham, V.H.; Dinh, D.: Improving neural machine translation with AMR semantic graphs. Math. Probl. Eng. (2021)
8.
go back to reference Shi, X.; Huang, H.; Jian, P.; Tang, Y.K.: Improving neural machine translation with sentence alignment learning. Neurocomputing 420, 15–26 (2021)CrossRef Shi, X.; Huang, H.; Jian, P.; Tang, Y.K.: Improving neural machine translation with sentence alignment learning. Neurocomputing 420, 15–26 (2021)CrossRef
10.
go back to reference Luong, M.-T.; Pham, H.; Manning, C.D.: Effective approaches to attention-based neural machine translation. CoRR, vol. abs/1508.0 (2015) Luong, M.-T.; Pham, H.; Manning, C.D.: Effective approaches to attention-based neural machine translation. CoRR, vol. abs/1508.0 (2015)
11.
go back to reference Benmamoun, E.: The Feature Structure of Functional Categories: A Comparative Study of Arabic Dialects. Oxford University Press, Oxford (2000) Benmamoun, E.: The Feature Structure of Functional Categories: A Comparative Study of Arabic Dialects. Oxford University Press, Oxford (2000)
12.
go back to reference Hughes, A.; Trudgill, P.; Watt, D.: English Accents and Dialects: An Introduction to Social and Regional Varieties of English in the British Isles. Routledge, London (2013)CrossRef Hughes, A.; Trudgill, P.; Watt, D.: English Accents and Dialects: An Introduction to Social and Regional Varieties of English in the British Isles. Routledge, London (2013)CrossRef
13.
go back to reference Wolfram, W.; Schilling, N.: American English: Dialects and Variation. Wiley, New York (2015) Wolfram, W.; Schilling, N.: American English: Dialects and Variation. Wiley, New York (2015)
14.
go back to reference Al-Gaphari, G.H.; Al-Yadoumi, M.: A method to convert Sana’ani accent to Modern Standard Arabic. Int. J. Inf. Sci. Manag. 8(1), 39–49 (2012) Al-Gaphari, G.H.; Al-Yadoumi, M.: A method to convert Sana’ani accent to Modern Standard Arabic. Int. J. Inf. Sci. Manag. 8(1), 39–49 (2012)
15.
go back to reference Hamdi, A.; Boujelbane, R.; Habash, N.; Nasr, A.: The Effects of Factorizing Root and Pattern Mapping in Bidirectional Tunisian - Standard Arabic Machine Translation. In: MT Summit 2013, hal-00908761 (2013). Hamdi, A.; Boujelbane, R.; Habash, N.; Nasr, A.: The Effects of Factorizing Root and Pattern Mapping in Bidirectional Tunisian - Standard Arabic Machine Translation. In: MT Summit 2013, hal-00908761 (2013).
16.
go back to reference Mohamed, E.; Mohit, B.; Oflazer, K.: Transforming Standard Arabic to Colloquial Arabic. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 176–180 (2012) Mohamed, E.; Mohit, B.; Oflazer, K.: Transforming Standard Arabic to Colloquial Arabic. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 176–180 (2012)
17.
go back to reference Salloum, W.; Habash, N.: Elissa: a dialectal to standard arabic machine translation system. In: Proceedings of COLING 2012: Demonstration Papers, pp. 385–392 (2012) Salloum, W.; Habash, N.: Elissa: a dialectal to standard arabic machine translation system. In: Proceedings of COLING 2012: Demonstration Papers, pp. 385–392 (2012)
19.
go back to reference Hamada, S.; Marzouk, R.M.: Developing a transfer-based system for Arabic Dialects translation. In: Shaalan, K., Hassanien, A.E., Tolba, F. (eds.) Intelligent Natural Language Processing: Trends and Applications, pp. 121–138. Springer, Cham (2018)CrossRef Hamada, S.; Marzouk, R.M.: Developing a transfer-based system for Arabic Dialects translation. In: Shaalan, K., Hassanien, A.E., Tolba, F. (eds.) Intelligent Natural Language Processing: Trends and Applications, pp. 121–138. Springer, Cham (2018)CrossRef
20.
go back to reference Jeblee, S.; Feely, W.; Bouamor, H.; Lavie, A.; Habash, N.; Oflazer, K.: Domain and Dialect adaptation for machine translation into Egyptian Arabic. In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP), pp. 196–206 (2014). https://doi.org/10.3115/v1/W14-3627 Jeblee, S.; Feely, W.; Bouamor, H.; Lavie, A.; Habash, N.; Oflazer, K.: Domain and Dialect adaptation for machine translation into Egyptian Arabic. In: Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing (ANLP), pp. 196–206 (2014). https://​doi.​org/​10.​3115/​v1/​W14-3627
21.
go back to reference Sajjad, H.; Darwish, K.; Belinkov, Y.: Translating Dialectal Arabic to English. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 1–6 (2013) Sajjad, H.; Darwish, K.; Belinkov, Y.: Translating Dialectal Arabic to English. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp. 1–6 (2013)
22.
go back to reference Zbib, R., et al.: Machine translation of Arabic dialects. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 49–59 (2012) Zbib, R., et al.: Machine translation of Arabic dialects. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 49–59 (2012)
23.
go back to reference Bakr, H.A.; Shaalan, K.; Ziedan, I.: A hybrid approach for converting written Egyptian colloquial dialect into diacritized Arabic. In: The 6th International Conference on Informatics and Systems, Infos2008. Cairo University (2008) Bakr, H.A.; Shaalan, K.; Ziedan, I.: A hybrid approach for converting written Egyptian colloquial dialect into diacritized Arabic. In: The 6th International Conference on Informatics and Systems, Infos2008. Cairo University (2008)
24.
go back to reference Sawaf, H.: Arabic dialect handling in hybrid machine translation. In: Proceedings of the Conference of the Association for Machine Translation in the Americas (AMTA), Denver, Colorado (2010) Sawaf, H.: Arabic dialect handling in hybrid machine translation. In: Proceedings of the Conference of the Association for Machine Translation in the Americas (AMTA), Denver, Colorado (2010)
25.
go back to reference Guellil, I.; Azouaou, F.; Abbas, M.: Neural Vs statistical translation of Algerian Arabic Dialect written with Arabizi and Arabic letter Neural Vs statistical translation of Algerian Arabic Dialect written with Arabizi and Arabic letter. In: 31st Pacific Asia Conference on Language, Information and Computer (PACLIC) (2017) Guellil, I.; Azouaou, F.; Abbas, M.: Neural Vs statistical translation of Algerian Arabic Dialect written with Arabizi and Arabic letter Neural Vs statistical translation of Algerian Arabic Dialect written with Arabizi and Arabic letter. In: 31st Pacific Asia Conference on Language, Information and Computer (PACLIC) (2017)
26.
go back to reference Slim, A.; Melouah, A.; Faghihi, Y., et al.: Algerian Dialect translation applied on COVID-19 social media comments. In: International Conference in Artificial Intelligence in Renewable Energetic Systems, pp. 716–726. Springer, Cham (2020) Slim, A.; Melouah, A.; Faghihi, Y., et al.: Algerian Dialect translation applied on COVID-19 social media comments. In: International Conference in Artificial Intelligence in Renewable Energetic Systems, pp. 716–726. Springer, Cham (2020)
29.
go back to reference Bouamor, H., et al.: The Madar Arabic dialect corpus and lexicon. In: Lr. 2018—International Conference on Language Resources and Evaluation, pp. 3387–3396 (2019) Bouamor, H., et al.: The Madar Arabic dialect corpus and lexicon. In: Lr. 2018—International Conference on Language Resources and Evaluation, pp. 3387–3396 (2019)
30.
go back to reference Pan, S.J.; Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2009)CrossRef Pan, S.J.; Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2009)CrossRef
31.
go back to reference Sugiyama, M.; Nakajima, S.; Kashima, H.; Von Buenau, P.; Kawanabe, M.: Direct importance estimation with model selection and its application to covariate shift adaptation. In: NIPS, vol. 7, pp. 1433–1440 (2007) Sugiyama, M.; Nakajima, S.; Kashima, H.; Von Buenau, P.; Kawanabe, M.: Direct importance estimation with model selection and its application to covariate shift adaptation. In: NIPS, vol. 7, pp. 1433–1440 (2007)
32.
go back to reference Papineni, K.; Roukos, S.; Ward, T.; Zhu, W.-J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318(2002) Papineni, K.; Roukos, S.; Ward, T.; Zhu, W.-J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318(2002)
Metadata
Title
Improving Neural Machine Translation for Low Resource Algerian Dialect by Transductive Transfer Learning Strategy
Authors
Amel Slim
Ahlem Melouah
Usef Faghihi
Khouloud Sahib
Publication date
08-02-2022
Publisher
Springer Berlin Heidelberg
Published in
Arabian Journal for Science and Engineering / Issue 8/2022
Print ISSN: 2193-567X
Electronic ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-022-06588-w

Other articles of this Issue 8/2022

Arabian Journal for Science and Engineering 8/2022 Go to the issue

Research Article-Computer Engineering and Computer Science

A Heuristic Local-sensitive Program-Wide Diffing Method for IoT Binary Files

Research Article-Computer Engineering and Computer Science

Probability Quantization Model for Sample-to-Sample Stochastic Sampling

Research Article-Computer Engineering and Computer Science

A Distributed Data Storage Strategy Based on LOPs

Premium Partners