Skip to main content

2018 | OriginalPaper | Buchkapitel

Combining Phrase and Neural-Based Machine Translation: What Worked and Did Not

verfasst von : Marta R. Costa-jussà, José A. R. Fonollosa

Erschienen in: Computational Linguistics and Intelligent Text Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Phrase-based machine translation assumes that all words are at the same distance and translates them using feature functions that approximate the probability at different levels. On the other hand, neural machine translation infers a word embedding and translates these word vectors using a neural model. At the moment, both approaches co-exist and are being intensively investigated.
This paper to the best of our knowledge is the first work that both compares and combines these two systems by: using the phrase-based output to solve unknown words in the neural machine translation output; using the neural alignment in the phrase-based system; comparing how the popular strategy of pre-reordering affects both systems; and combining both translation outputs. Improvements are achieved in Catalan-to-Spanish and German-to-English.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Sundermeyer, M., Alkhouli, T., Wuebker, J., Ney, H.: Translation modeling with bidirectional recurrent neural networks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Doha (2014) Sundermeyer, M., Alkhouli, T., Wuebker, J., Ney, H.: Translation modeling with bidirectional recurrent neural networks. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Doha (2014)
2.
Zurück zum Zitat Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR abs/1409.0473 (2015) Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. CoRR abs/1409.0473 (2015)
3.
Zurück zum Zitat Bisazza, A., Federico, M.: A survey of word reordering in statistical machine translation: computational models and language phenomena. arXiv preprint arXiv:1502.04938 (2015) Bisazza, A., Federico, M.: A survey of word reordering in statistical machine translation: computational models and language phenomena. arXiv preprint arXiv:​1502.​04938 (2015)
4.
Zurück zum Zitat Castaño, M.A., Casacuberta, F.: A connectionist approach to MT. In: Proceedings of the EUROSPEECH Conference (1997) Castaño, M.A., Casacuberta, F.: A connectionist approach to MT. In: Proceedings of the EUROSPEECH Conference (1997)
5.
Zurück zum Zitat Chiang, D.: Hierarchical phrase-based translation. Comput. Linguist. 33(2), 201–228 (2007)CrossRefMATH Chiang, D.: Hierarchical phrase-based translation. Comput. Linguist. 33(2), 201–228 (2007)CrossRefMATH
6.
Zurück zum Zitat Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder-decoder approaches. In: Proceedings of the 8th Workshop on Syntax, Semantics and Structure in Statistical Translation, Doha (2014) Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder-decoder approaches. In: Proceedings of the 8th Workshop on Syntax, Semantics and Structure in Statistical Translation, Doha (2014)
7.
Zurück zum Zitat Collins, M., Koehn, P., Kucerova, I.: Clause restructuring for statistical machine translation. In: Annual Conference of the Association for Computational Lingusitics (ACL 2005), Michigan (2005) Collins, M., Koehn, P., Kucerova, I.: Clause restructuring for statistical machine translation. In: Annual Conference of the Association for Computational Lingusitics (ACL 2005), Michigan (2005)
8.
Zurück zum Zitat Costa-Jussà, M.R., Fonollosa, J.A.R.: Statistical machine reordering. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, pp. 70–76 (2006) Costa-Jussà, M.R., Fonollosa, J.A.R.: Statistical machine reordering. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, pp. 70–76 (2006)
9.
Zurück zum Zitat Costa-Jussà, M., Poch, M., Fonollosa, J., Farrús, M., Marinno, J.: A large Spanish-Catalan parallel corpus release for machine translation. Comput. Inf. 33(4), 907–920 (2014) Costa-Jussà, M., Poch, M., Fonollosa, J., Farrús, M., Marinno, J.: A large Spanish-Catalan parallel corpus release for machine translation. Comput. Inf. 33(4), 907–920 (2014)
11.
Zurück zum Zitat Jean, S., Firat, O., Cho, K., Memisevic, R., Bengio, Y.: Montreal neural machine translation systems for WMT15. In: Proceedings of the 10th Workshop on Statistical Machine Translation, Lisbon (2015) Jean, S., Firat, O., Cho, K., Memisevic, R., Bengio, Y.: Montreal neural machine translation systems for WMT15. In: Proceedings of the 10th Workshop on Statistical Machine Translation, Lisbon (2015)
12.
Zurück zum Zitat Kalchbrenner, N., Blunsom, P.: Recurrent continuous translation models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Seattle (2013) Kalchbrenner, N., Blunsom, P.: Recurrent continuous translation models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Seattle (2013)
13.
Zurück zum Zitat Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pp. 177–180 (2007) Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pp. 177–180 (2007)
14.
Zurück zum Zitat Koehn, P., Och, F., Marcu, D.: Statistical phrase-based translation. In: Proceedings of the 41th Annual Meeting of the Association for Computational Linguistics (2003) Koehn, P., Och, F., Marcu, D.: Statistical phrase-based translation. In: Proceedings of the 41th Annual Meeting of the Association for Computational Linguistics (2003)
15.
Zurück zum Zitat Kumar, S., Byrne, W.J.: Minimum Bayes-risk decoding for statistical machine translation. In: HLT-NAACL, pp. 169–176 (2004) Kumar, S., Byrne, W.J.: Minimum Bayes-risk decoding for statistical machine translation. In: HLT-NAACL, pp. 169–176 (2004)
16.
Zurück zum Zitat Luong, T., Sutskever, I., Le, Q.V., Vinyals, O., Zaremba, W.: Addressing the rare word problem in neural machine translation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Beijing, pp. 11–19 (2015) Luong, T., Sutskever, I., Le, Q.V., Vinyals, O., Zaremba, W.: Addressing the rare word problem in neural machine translation. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Beijing, pp. 11–19 (2015)
17.
Zurück zum Zitat Och, F.: Minimum error rate training in statistical machine translation. In: Proceedings of the 41th Annual Meeting of the Association for Computational Linguistics, pp. 160–167 (2003) Och, F.: Minimum error rate training in statistical machine translation. In: Proceedings of the 41th Annual Meeting of the Association for Computational Linguistics, pp. 160–167 (2003)
18.
Zurück zum Zitat Quirk, C., Menezes, A., Cherry, C.: Dependency treelet translation: syntactically informed phrasal SMT. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 271–279 (2005) Quirk, C., Menezes, A., Cherry, C.: Dependency treelet translation: syntactically informed phrasal SMT. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp. 271–279 (2005)
19.
Zurück zum Zitat Schwenk, H., Costa-Jussà, M.R., Fonollosa, J.A.R.: Smooth bilingual n-gram translation. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, pp. 430–438 (2007) Schwenk, H., Costa-Jussà, M.R., Fonollosa, J.A.R.: Smooth bilingual n-gram translation. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Prague, pp. 430–438 (2007)
20.
Zurück zum Zitat Stolcke, A.: SRILM - an extensible language modeling toolkit. In: 7th International Conference on Spoken Language Processing, Denver (2002) Stolcke, A.: SRILM - an extensible language modeling toolkit. In: 7th International Conference on Spoken Language Processing, Denver (2002)
21.
Zurück zum Zitat Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 3104–3112. Curran Associates, Inc. (2014) Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 27, pp. 3104–3112. Curran Associates, Inc. (2014)
22.
Zurück zum Zitat Toutanova, K., Suzuki, H., Ruopp, A.: Applying morphology generation models to machine translation. In: Proceedings of the Joint Conference of the Association for Computational Linguistics and Human Language Technology, Columbus, pp. 514–522 (2008) Toutanova, K., Suzuki, H., Ruopp, A.: Applying morphology generation models to machine translation. In: Proceedings of the Joint Conference of the Association for Computational Linguistics and Human Language Technology, Columbus, pp. 514–522 (2008)
Metadaten
Titel
Combining Phrase and Neural-Based Machine Translation: What Worked and Did Not
verfasst von
Marta R. Costa-jussà
José A. R. Fonollosa
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-75487-1_2

Premium Partner