Skip to main content
Erschienen in: Neural Computing and Applications 9/2021

27.11.2020 | S.I.: SPIoT 2020

Named entity translation method based on machine translation lexicon

verfasst von: Panpan Li, Mengxiang Wang, Jian Wang

Erschienen in: Neural Computing and Applications | Ausgabe 9/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the context of the rapid development of computer technology, communication between various languages has become increasingly important. Among the research methods of named entities, the research on named entity translation methods based on machine translation is one of the research hotspots. Named entity translation is to realize the switching between entities in two languages, which can be used by browsers, translators, etc., and can greatly reduce the cost of communication between people from all over the world. Due to the immaturity of the existing translation model technology and the lack of integration, the translation of bilingual named entities with unique composition is very challenging. Based on this, this paper proposes a fusion method of bilingual entity class named entity translation based on chunk symmetry strategy and English–Chinese transliteration model based on machine learning strategy. According to the bilingual corpus, a more standard candidate entity translation pair is generated through bilingual named entity alignment. The transliteration model is used to reorder and correct the candidate translation results, so as to achieve the correct selection of translation pairs. Experiments show that the model based on the translation of named entity translations based on chunks and transliteration models based on machine learning strategies not only effectively solves the problem of difficulty in word ordering and selection in bilingual translation, but also makes extraction to a certain extent. The accuracy of the translation results has been improved.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Choi H, Cho K, Bengio Y (2017) Context-dependent word representation for neural machine translation. Comput Speech Lang 45(9):149–160CrossRef Choi H, Cho K, Bengio Y (2017) Context-dependent word representation for neural machine translation. Comput Speech Lang 45(9):149–160CrossRef
2.
Zurück zum Zitat Gao S, Yang X, Yu Z, Pan X, Guo J (2017) Chinese-naxi machine translation method based on naxi dependency language model. Int J Mach Learn Cybern 8(1):333–342CrossRef Gao S, Yang X, Yu Z, Pan X, Guo J (2017) Chinese-naxi machine translation method based on naxi dependency language model. Int J Mach Learn Cybern 8(1):333–342CrossRef
3.
Zurück zum Zitat Chu C, Nakazawa T, Kurohashi S (2015) Parallel sentence extraction based on unsupervised bilingual lexicon extraction from comparable corpora. J Nat Lang Process 22(3):139–170CrossRef Chu C, Nakazawa T, Kurohashi S (2015) Parallel sentence extraction based on unsupervised bilingual lexicon extraction from comparable corpora. J Nat Lang Process 22(3):139–170CrossRef
4.
Zurück zum Zitat Wangzhijuan, Lifuxian (2017) Survey on cross-language named entity translation pairs extraction. Comput Sci 044(0z1):14–28 Wangzhijuan, Lifuxian (2017) Survey on cross-language named entity translation pairs extraction. Comput Sci 044(0z1):14–28
5.
Zurück zum Zitat Elayeb B, Bounhas I (2016) Arabic cross-language information retrieval: a review. ACM Trans Asian Low Resour Lang Inf Process 15(3):1–44CrossRef Elayeb B, Bounhas I (2016) Arabic cross-language information retrieval: a review. ACM Trans Asian Low Resour Lang Inf Process 15(3):1–44CrossRef
6.
Zurück zum Zitat Rahimi R, Shakery A, King I (2016) Extracting translations from comparable corpora for cross-language information retrieval using the language modeling framework. Inf Process Manag 52(2):299–318CrossRef Rahimi R, Shakery A, King I (2016) Extracting translations from comparable corpora for cross-language information retrieval using the language modeling framework. Inf Process Manag 52(2):299–318CrossRef
7.
Zurück zum Zitat Chen Y, Yang M, Li S (2018) Machine translation. Int J Comput Process Orient Lang 20(2–3):101–113 Chen Y, Yang M, Li S (2018) Machine translation. Int J Comput Process Orient Lang 20(2–3):101–113
8.
Zurück zum Zitat Haque R, Hasanuzzaman M, Way A (2019) Ruslan Mitkov, Johanna Monti, Gloria Corpas Pastor, and Violeta Seretan (eds): multiword units in machine translation and translation technology. Mach Transl 33(4):349–354CrossRef Haque R, Hasanuzzaman M, Way A (2019) Ruslan Mitkov, Johanna Monti, Gloria Corpas Pastor, and Violeta Seretan (eds): multiword units in machine translation and translation technology. Mach Transl 33(4):349–354CrossRef
9.
Zurück zum Zitat Ameur H, Seghir M, Meziane F, Guessoum A (2017) Arabic machine transliteration using an attention-based encoder–decoder model. Proc Comput 117:287–297CrossRef Ameur H, Seghir M, Meziane F, Guessoum A (2017) Arabic machine transliteration using an attention-based encoder–decoder model. Proc Comput 117:287–297CrossRef
10.
Zurück zum Zitat Kaur K, Singh P (2015) Hindi to punjabi transliteration using phonetic and orthographic rules. Int J Comput Appl 114(8):43–48 Kaur K, Singh P (2015) Hindi to punjabi transliteration using phonetic and orthographic rules. Int J Comput Appl 114(8):43–48
11.
Zurück zum Zitat Lula S, Petersel D, Chee-Keng N et al (2016) Monoclonal antibody and fusion protein biosimilars across therapeutic areas: a systematic review of published evidence. Biodrugs Clinical Immunotherapeutics Biopharmaceuticals and Gene Therapy. In: Proceedings of ACL-2016, 2016(30):489–523 Lula S, Petersel D, Chee-Keng N et al (2016) Monoclonal antibody and fusion protein biosimilars across therapeutic areas: a systematic review of published evidence. Biodrugs Clinical Immunotherapeutics Biopharmaceuticals and Gene Therapy. In: Proceedings of ACL-2016, 2016(30):489–523
12.
Zurück zum Zitat Wessels HR, Gibson HW (2016) Multi-gram syntheses of four crown ethers using k+ as templating agent. Tetrahedron 2016:396–399CrossRef Wessels HR, Gibson HW (2016) Multi-gram syntheses of four crown ethers using k+ as templating agent. Tetrahedron 2016:396–399CrossRef
13.
Zurück zum Zitat Wen X (2017) Hierarchical phrase machine translation decoding method based on tree-to-string model enhancement. Acta Tech CSAV (Ceskoslovensk Akademie Ved) 62(1):531–539 Wen X (2017) Hierarchical phrase machine translation decoding method based on tree-to-string model enhancement. Acta Tech CSAV (Ceskoslovensk Akademie Ved) 62(1):531–539
14.
Zurück zum Zitat Chen L, Mu Z, Nan B (2015) Semantic image segmentation based on hierarchical conditional random field mode. J Comput Inf Syst 11(2):527–534 Chen L, Mu Z, Nan B (2015) Semantic image segmentation based on hierarchical conditional random field mode. J Comput Inf Syst 11(2):527–534
15.
Zurück zum Zitat Orlando J, Prokofyeva E, Blaschko M (2016) A discriminatively trained fully connected conditional random field model for blood vessel segmentation in fundus images. IEEE Trans Biomed Eng 2016:1–1 Orlando J, Prokofyeva E, Blaschko M (2016) A discriminatively trained fully connected conditional random field model for blood vessel segmentation in fundus images. IEEE Trans Biomed Eng 2016:1–1
16.
Zurück zum Zitat Tang A, Jackson D, Hobbs J, Chen W, Smith JL, Patel H et al (2018) A maximum entropy model applied to spatial and temporal correlations from cortical networks in vitro. J Neuroence Off J Soc Neuroence 28(2):505–518CrossRef Tang A, Jackson D, Hobbs J, Chen W, Smith JL, Patel H et al (2018) A maximum entropy model applied to spatial and temporal correlations from cortical networks in vitro. J Neuroence Off J Soc Neuroence 28(2):505–518CrossRef
17.
Zurück zum Zitat Zhang C, Xu W, Ma Z, Gao S, Li Q, Guo J (2015) Construction of semantic bootstrapping models for relation extraction. Knowl Based Syst 83(July):128–137CrossRef Zhang C, Xu W, Ma Z, Gao S, Li Q, Guo J (2015) Construction of semantic bootstrapping models for relation extraction. Knowl Based Syst 83(July):128–137CrossRef
18.
Zurück zum Zitat Jung SW (2015) A method to improve cutset probability calculation in probabilistic safety assessment of nuclear power plants. Reliab Eng Syst Saf 134:134–142 Jung SW (2015) A method to improve cutset probability calculation in probabilistic safety assessment of nuclear power plants. Reliab Eng Syst Saf 134:134–142
19.
Zurück zum Zitat Icen D, Ersel D (2019) A new approach for probability calculation of fuzzy events in bayesian networks. Int J Approx Reason 108(MAY):76–88MathSciNetCrossRef Icen D, Ersel D (2019) A new approach for probability calculation of fuzzy events in bayesian networks. Int J Approx Reason 108(MAY):76–88MathSciNetCrossRef
20.
Zurück zum Zitat Tran P, Dinh D, Tan LE, Nguyen LHB (2018) Linguistic-relationships-based approach for improving word alignment. ACM Trans Asian Lang Inf Process 17(1):5.1-5.16 Tran P, Dinh D, Tan LE, Nguyen LHB (2018) Linguistic-relationships-based approach for improving word alignment. ACM Trans Asian Lang Inf Process 17(1):5.1-5.16
21.
Zurück zum Zitat Hkiri E, Mallat S, Zrigui M, Mars M (2017) Constructing a lexicon of Arabic–English named entity using SMT and semantic linked data. Int Arab J Inf Technol 14(6):820–825 Hkiri E, Mallat S, Zrigui M, Mars M (2017) Constructing a lexicon of Arabic–English named entity using SMT and semantic linked data. Int Arab J Inf Technol 14(6):820–825
22.
Zurück zum Zitat Dandapat S, Way A (2016) Improved named entity recognition using machine translation-based cross-lingual information. Comput Y Sistemas 20(3):495–504CrossRef Dandapat S, Way A (2016) Improved named entity recognition using machine translation-based cross-lingual information. Comput Y Sistemas 20(3):495–504CrossRef
23.
Zurück zum Zitat Podicheti R, Mockaitis K (2015) Featnotator: a tool for integrated annotation of sequence features and variation, facilitating interpretation in genomics experiments. Methods 79–80:11–17CrossRef Podicheti R, Mockaitis K (2015) Featnotator: a tool for integrated annotation of sequence features and variation, facilitating interpretation in genomics experiments. Methods 79–80:11–17CrossRef
24.
Zurück zum Zitat Jamie A, Shanahan HP (2015) Investigation into the annotation of protocol sequencing steps in the sequence read archive. GigaScience 4(1):1–11 Jamie A, Shanahan HP (2015) Investigation into the annotation of protocol sequencing steps in the sequence read archive. GigaScience 4(1):1–11
25.
Zurück zum Zitat Leopold H, Mendling J, Günther O (2016) Learning from quality issues of bpmn models from industry. IEEE Softw 33(4):26–33CrossRef Leopold H, Mendling J, Günther O (2016) Learning from quality issues of bpmn models from industry. IEEE Softw 33(4):26–33CrossRef
26.
Zurück zum Zitat Qing-Dao-Er-Ji R, Su Y, Wu N (2020) Research on Mongolian–Chinese machine translation based on the end-to-end neural network. Int J Wavelets Multiresolut Inf Process 18(01):46–59MathSciNetCrossRef Qing-Dao-Er-Ji R, Su Y, Wu N (2020) Research on Mongolian–Chinese machine translation based on the end-to-end neural network. Int J Wavelets Multiresolut Inf Process 18(01):46–59MathSciNetCrossRef
Metadaten
Titel
Named entity translation method based on machine translation lexicon
verfasst von
Panpan Li
Mengxiang Wang
Jian Wang
Publikationsdatum
27.11.2020
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 9/2021
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-020-05509-y

Weitere Artikel der Ausgabe 9/2021

Neural Computing and Applications 9/2021 Zur Ausgabe

Premium Partner