2020 | OriginalPaper | Chapter

Neural Machine Translation with Soft Reordering Knowledge

Authors: Leiying Zhou, Jie Zhou, Wenjie Lu, Kui Meng, Gongshen Liu

Published in: Neural Information Processing

Publisher: Springer International Publishing

Abstract

The Transformer architecture has been widely used in sequence-to-sequence tasks since it was proposed. However, it uses the order information of a sequence only by adding representations of absolute positions to its inputs, and it lacks an explicit structure for exploiting the reordering knowledge of words. In this paper, we propose a simple but effective method to incorporate reordering knowledge into the Transformer translation system. The reordering knowledge of each word is obtained by an additional reordering-aware attention sublayer based on the word's semantic and contextual information. The proposed approach can be easily integrated into the existing Transformer framework. Experimental results on two public translation tasks demonstrate that the proposed method achieves significant translation improvements over the basic Transformer model and also outperforms existing competitive systems.
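
To make the baseline concrete, the following is a minimal sketch of how absolute position representations can be added to the inputs of a Transformer. It assumes the sinusoidal scheme of the original Transformer paper (Vaswani et al., 2017); the abstract does not say which absolute encoding the authors use, and the function names `sinusoidal_position_encoding` and `add_positions` are hypothetical. It does not implement the reordering-aware attention sublayer proposed in the chapter.

```python
import math


def sinusoidal_position_encoding(num_positions, d_model):
    """Return a num_positions x d_model table of absolute position encodings.

    Assumption: sinusoidal encoding as in Vaswani et al. (2017).
    """
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(d_model):
            # Dimensions 2k and 2k+1 share the same frequency.
            angle = pos / (10000 ** ((2 * (i // 2)) / d_model))
            # Even dimensions use sine, odd dimensions use cosine.
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table


def add_positions(embeddings):
    """Add an absolute position encoding to each word embedding vector."""
    d_model = len(embeddings[0])
    positions = sinusoidal_position_encoding(len(embeddings), d_model)
    return [
        [e + p for e, p in zip(emb, pos)]
        for emb, pos in zip(embeddings, positions)
    ]


# Hypothetical 3-word sentence in which every word has the same embedding.
inputs = add_positions([[0.5, 0.5, 0.5, 0.5]] * 3)
```

In this sketch, identical word embeddings become distinct vectors once their positions are added, which is the only order signal the baseline model receives; nothing in the input tells the model how words should be reordered in the target language.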

Metadata
Title
Neural Machine Translation with Soft Reordering Knowledge
Authors
Leiying Zhou
Jie Zhou
Wenjie Lu
Kui Meng
Gongshen Liu
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-63820-7_79
