Skip to main content
Top

2018 | OriginalPaper | Chapter

Improving Chinese-English Neural Machine Translation with Detected Usages of Function Words

Authors : Kunli Zhang, Hongfei Xu, Deyi Xiong, Qiuhui Liu, Hongying Zan

Published in: Natural Language Processing and Chinese Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

One of difficulties in Chinese-English machine translation is that the grammatical meaning expressed by morphology or syntax in target translations is usually determined by Chinese function words or word order. In order to address this issue, we develop classifiers to automatically detect usages of common Chinese function words based on Chinese Function usage Knowledge Base (CFKB) and initially propose a function word usage embedding model to incorporate detection results into neural machine translation (NMT). Experiments on the NIST Chinese-English translation task demonstrate that the proposed method can obtain significant improvements on the quality of both translation and word alignment over the NMT baseline.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Extracted from LDC2002E18, LDC2003E07, LDC2003E14, Hansards portion of LDC2004T07, LDC2004T08 and LDC2005T06.
 
2
We observe that the “concat” and “usage” method perform better than the “part” method in terms of both translation and alignment quality in “DE”’s experiment. Therefore we don’t have the experiments of “part” method with the other function words.
 
3
After the Chinese sentence is its reference translation, “BASE” means baseline model, “FNMT” means model with function word usages, text after “/” is the usage of Chinese function words.
 
Literature
go back to reference Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. Comput. Sci. (2014) Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. Comput. Sci. (2014)
go back to reference Chang, P.C., Jurafsky, D., Manning, C.D.: Disambiguating “DE" for Chinese-English machine translation. In: The Workshop on Statistical Machine Translation, pp. 215–223 (2009) Chang, P.C., Jurafsky, D., Manning, C.D.: Disambiguating “DE" for Chinese-English machine translation. In: The Workshop on Statistical Machine Translation, pp. 215–223 (2009)
go back to reference Chen, X., Qiu, X., Zhu, C., Liu, P., Huang, X.: Long short-term memory neural networks for Chinese word segmentation. In: Conference on Empirical Methods in Natural Language Processing, pp. 1197–1206 (2015) Chen, X., Qiu, X., Zhu, C., Liu, P., Huang, X.: Long short-term memory neural networks for Chinese word segmentation. In: Conference on Empirical Methods in Natural Language Processing, pp. 1197–1206 (2015)
go back to reference Cho, K., Van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. Comput. Sci. (2014) Cho, K., Van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. Comput. Sci. (2014)
go back to reference Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRef
go back to reference Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014) Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014)
go back to reference Klein, G., Kim, Y., Deng, Y., Senellart, J., Rush, A.M.: OpenNMT: open-source toolkit for neural machine translation (2017) Klein, G., Kim, Y., Deng, Y., Senellart, J., Rush, A.M.: OpenNMT: open-source toolkit for neural machine translation (2017)
go back to reference Liu, Q., Zhang, K., Xu, H., Zan, H., Yu, S.: Research on automatic recognition of auxiliary de. In: Proceedings of CLSW2017 (2017). (in Chinese) Liu, Q., Zhang, K., Xu, H., Zan, H., Yu, S.: Research on automatic recognition of auxiliary de. In: Proceedings of CLSW2017 (2017). (in Chinese)
go back to reference Liu, Y., Sun, M.,: Contrastive unsupervised word alignment with non-local features. In: Twenty-Ninth AAAI Conference on Artificial Intelligence, pp. 2295–2301 (2015) Liu, Y., Sun, M.,: Contrastive unsupervised word alignment with non-local features. In: Twenty-Ninth AAAI Conference on Artificial Intelligence, pp. 2295–2301 (2015)
go back to reference Lu, J.M., Ma, Z.: Scattered Essays of Modern Chinese Function Words. Language and Culture Press, Beijing (1999). (in Chinese) Lu, J.M., Ma, Z.: Scattered Essays of Modern Chinese Function Words. Language and Culture Press, Beijing (1999). (in Chinese)
go back to reference Luong, M.T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation. Comput. Sci. (2015) Luong, M.T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation. Comput. Sci. (2015)
go back to reference Lv, S.X., Zhu, D.X.: Grammatical Rhetoric. Liaoning Education Press, Beijing (2002). (in Chinese) Lv, S.X., Zhu, D.X.: Grammatical Rhetoric. Liaoning Education Press, Beijing (2002). (in Chinese)
go back to reference Sennrich R., Haddow, B.: Linguistic input features improve neural machine translation, pp. 83–91 (2016) Sennrich R., Haddow, B.: Linguistic input features improve neural machine translation, pp. 83–91 (2016)
go back to reference Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems, vol. 4, pp. 3104–3112 (2014) Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Advances in Neural Information Processing Systems, vol. 4, pp. 3104–3112 (2014)
go back to reference Tu, Z., Lu, Z., Liu, Y., Liu, X., Li, H.: Modeling coverage for neural machine translation. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 76–85 (2016) Tu, Z., Lu, Z., Liu, Y., Liu, X., Li, H.: Modeling coverage for neural machine translation. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 76–85 (2016)
go back to reference Shiwen, Y., Zhu, X., Liu, Y.: Knowledge-base of generalized functional words of contemporary chinese. J. Chin. Lang. Comput. 13, 89–98 (2003) Shiwen, Y., Zhu, X., Liu, Y.: Knowledge-base of generalized functional words of contemporary chinese. J. Chin. Lang. Comput. 13, 89–98 (2003)
go back to reference Zan, H., Zhang, J.: Studies on automatic recognition of chinese adverb CAI’s usages based on statistics. In: 2009 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE, pp. 1–5 (2009). (in Chinese) Zan, H., Zhang, J.: Studies on automatic recognition of chinese adverb CAI’s usages based on statistics. In: 2009 International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE, pp. 1–5 (2009). (in Chinese)
go back to reference Zan, H., Zhu, X.: Research on the chinese function word usage knowledge base. Int. J. Asian Lang. Process. 21(4), 185–198 (2011). (in Chinese) Zan, H., Zhu, X.: Research on the chinese function word usage knowledge base. Int. J. Asian Lang. Process. 21(4), 185–198 (2011). (in Chinese)
go back to reference Zhang, J., Zan, H.: Automatic recognition research on chinese adverb DOU’s usages. Acta Scientiarum Naturalium Universitatis Pekinensis 49(1), 165–169 (2013). (in Chinese)MathSciNet Zhang, J., Zan, H.: Automatic recognition research on chinese adverb DOU’s usages. Acta Scientiarum Naturalium Universitatis Pekinensis 49(1), 165–169 (2013). (in Chinese)MathSciNet
go back to reference Zhang, K., Zan, H., Chai, Y., Han, Y., Zhao, D.: Survey of the Chinese function word usage knowledge base. J. Chin. Inf. Process. 29(3), 1–8 (2015). (in Chinese) Zhang, K., Zan, H., Chai, Y., Han, Y., Zhao, D.: Survey of the Chinese function word usage knowledge base. J. Chin. Inf. Process. 29(3), 1–8 (2015). (in Chinese)
Metadata
Title
Improving Chinese-English Neural Machine Translation with Detected Usages of Function Words
Authors
Kunli Zhang
Hongfei Xu
Deyi Xiong
Qiuhui Liu
Hongying Zan
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-73618-1_64

Premium Partner