Skip to main content

2017 | OriginalPaper | Buchkapitel

Statistical Machine Translation Context Modelling with Recurrent Neural Network and LDA

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Machine Translation of text is a fundamental problem in machine learning that resists solutions that do not take into account the dependencies between words and sentences. Recurrent Neural Networks have recently delivered outstanding results in learning about sequential dependencies in many languages. Arabic language as a target language has not received enough attention in the recent language model experiments due to its, structural and semantic difficulties. In this paper, we present a Statistical Machine Translation (SMT) Context Modelling using Recurrent Neural Networks (RNNs) and Latent Dirichlet Allocation (LDA). This research is based on the state-of-the-art RNN language model by Mikolov. Our preliminary contribution is in integrating and presenting a new hybridization to utilize Recurrent Neural Network sequential word learning dependencies as well as Latent Dirichlet Allocation context and topic classification ability to produce the most accurate language scoring.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Callison-Burch, C., Talbot, D., Osborne, M.: Statistical machine translation with word-and sentence-aligned parallel corpora. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 175 (2004) Callison-Burch, C., Talbot, D., Osborne, M.: Statistical machine translation with word-and sentence-aligned parallel corpora. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, p. 175 (2004)
2.
Zurück zum Zitat Durrani, N., Fraser, A., Schmid, H.: Model with minimal translation units, but decode with phrases. In: Proceedings of NAACL-HLT, 9–14 June 2013, Atlanta, Georgia (2013) Durrani, N., Fraser, A., Schmid, H.: Model with minimal translation units, but decode with phrases. In: Proceedings of NAACL-HLT, 9–14 June 2013, Atlanta, Georgia (2013)
3.
Zurück zum Zitat Brown, P., de Souza, P., Mercer, R., Pietra, V., Lai, J.: Class-based n-gram models of natural language. Computational Linguistics. Comput. Linguist. 18(4), 467–479 (1992) Brown, P., de Souza, P., Mercer, R., Pietra, V., Lai, J.: Class-based n-gram models of natural language. Computational Linguistics. Comput. Linguist. 18(4), 467–479 (1992)
4.
Zurück zum Zitat Lipton, Z., Berkowitz, J., Elkan, C.: A Critical review of recurrent neural networks for sequence learning’, arXiv preprint arXiv:1506.00019 (2015) Lipton, Z., Berkowitz, J., Elkan, C.: A Critical review of recurrent neural networks for sequence learning’, arXiv preprint arXiv:​1506.​00019 (2015)
5.
Zurück zum Zitat Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Learning hierarchical features for scene labeling. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1915–1929 (2013)CrossRef Farabet, C., Couprie, C., Najman, L., LeCun, Y.: Learning hierarchical features for scene labeling. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1915–1929 (2013)CrossRef
6.
Zurück zum Zitat Zhao, B., Tam, Y.: Bilingual recurrent neural networks for improved statistical machine translation. In: Spoken Language Technology Workshop (SLT), 7–10 December 2014, South Lake Tahoe, NV. IEEE (2014) Zhao, B., Tam, Y.: Bilingual recurrent neural networks for improved statistical machine translation. In: Spoken Language Technology Workshop (SLT), 7–10 December 2014, South Lake Tahoe, NV. IEEE (2014)
7.
Zurück zum Zitat Sundermeyer, M., Alkhouli, T., Wuebker, J., Ney, H.: Translation modeling with bidirectional recurrent neural networks. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 14–25 October 2014, Doha, Qatar (2014) Sundermeyer, M., Alkhouli, T., Wuebker, J., Ney, H.: Translation modeling with bidirectional recurrent neural networks. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 14–25 October 2014, Doha, Qatar (2014)
8.
Zurück zum Zitat Hopfield, J.J.: Neural networks and physical systems with emergent collective computational abilities. Proc. Nat. Acad. Sci. 79(8), 2554–2558 (1982)MathSciNetCrossRef Hopfield, J.J.: Neural networks and physical systems with emergent collective computational abilities. Proc. Nat. Acad. Sci. 79(8), 2554–2558 (1982)MathSciNetCrossRef
9.
Zurück zum Zitat Jordan, M.: Serial order: a parallel distributed processing approach. Technical report 8604, Institute for Cognitive Science, University of California, San Diego (1986) Jordan, M.: Serial order: a parallel distributed processing approach. Technical report 8604, Institute for Cognitive Science, University of California, San Diego (1986)
10.
Zurück zum Zitat Elman, J.: Finding structure in time. Cogn. Sci. 14, 179–211 (1990)CrossRef Elman, J.: Finding structure in time. Cogn. Sci. 14, 179–211 (1990)CrossRef
11.
Zurück zum Zitat Mikolov, T., Zweig, G.: Context dependent recurrent neural network language model. In: 2012 workshop on Spoken Language Technology, pp. 234–239 (2012) Mikolov, T., Zweig, G.: Context dependent recurrent neural network language model. In: 2012 workshop on Spoken Language Technology, pp. 234–239 (2012)
12.
Zurück zum Zitat Kombrink, S., Mikolov, T., Karafiat, M., Burget, L.: Recurrent neural network based language model. In: INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, 26–30 September 2010, pp. 1045–1048 (2010) Kombrink, S., Mikolov, T., Karafiat, M., Burget, L.: Recurrent neural network based language model. In: INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, 26–30 September 2010, pp. 1045–1048 (2010)
13.
Zurück zum Zitat Kalchbrenner, N., Blunsom, P.: Recurrent continuous translation models. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, October 2013, Seattle, Washington, USA, pp. 1700–1709 (2013) Kalchbrenner, N., Blunsom, P.: Recurrent continuous translation models. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, October 2013, Seattle, Washington, USA, pp. 1700–1709 (2013)
14.
Zurück zum Zitat Liu, S., Yang, N., Li, Zhou, M.: A recursive recurrent neural network for statistical machine translation. In: Proceedings of ACL, pp. 1491–1550 (2014) Liu, S., Yang, N., Li, Zhou, M.: A recursive recurrent neural network for statistical machine translation. In: Proceedings of ACL, pp. 1491–1550 (2014)
15.
Zurück zum Zitat Hu, Y., Auli, M., Gao, Q., Gao, J.: Minimum translation modeling with recurrent neural networks. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, April 2014 Hu, Y., Auli, M., Gao, Q., Gao, J.: Minimum translation modeling with recurrent neural networks. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, April 2014
16.
Zurück zum Zitat Schuster, M., Paliwal, K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45(11), 2673–2681 (1997)CrossRef Schuster, M., Paliwal, K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45(11), 2673–2681 (1997)CrossRef
17.
Zurück zum Zitat Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
18.
Zurück zum Zitat Schwenk, H.: Continuous space translation models for phrase-based statistical machine translation. In: 25th International Conference on Computational Linguistics (COLING), December, Mumbai, India, pp. 1071–1080 (2012) Schwenk, H.: Continuous space translation models for phrase-based statistical machine translation. In: 25th International Conference on Computational Linguistics (COLING), December, Mumbai, India, pp. 1071–1080 (2012)
19.
Zurück zum Zitat Mikolov, T.: Statistical Language Models Based on Neural Networks. Ph.D., Brno University of Technology (2012) Mikolov, T.: Statistical Language Models Based on Neural Networks. Ph.D., Brno University of Technology (2012)
22.
Zurück zum Zitat Guessabi, F.: The cultural problems in translating a novel from arabic to english language. AWEJ Special Issue on Translation (2), 224–232 (2013) Guessabi, F.: The cultural problems in translating a novel from arabic to english language. AWEJ Special Issue on Translation (2), 224–232 (2013)
24.
Zurück zum Zitat Ponweiser, M.: Latent dirichlet allocation in R. Diploma Thesis, Institute for Statistics and Mathematics 2 May 2012 Ponweiser, M.: Latent dirichlet allocation in R. Diploma Thesis, Institute for Statistics and Mathematics 2 May 2012
25.
Zurück zum Zitat Zhengxian, G., Guodong, Z.: Employing topic modeling for statistical machine translation. In: 2011 IEEE International Conference on Computer Science and Automation Engineering, CSAE (2011) Zhengxian, G., Guodong, Z.: Employing topic modeling for statistical machine translation. In: 2011 IEEE International Conference on Computer Science and Automation Engineering, CSAE (2011)
Metadaten
Titel
Statistical Machine Translation Context Modelling with Recurrent Neural Network and LDA
verfasst von
Shrooq Alsenan
Mourad Ykhlef
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-48308-5_8