Skip to main content
Erschienen in: Artificial Intelligence Review 2/2016

01.08.2016

Learning sign language machine translation based on elastic net regularization and latent semantic analysis

verfasst von: Mehrez Boulares, Mohamed Jemni

Erschienen in: Artificial Intelligence Review | Ausgabe 2/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we present a new sign language machine translation approach based on regression method. The aim of this work is to improve the translation quality and accuracy of existing regularized regression methods. Our approach represents a methodological foundation for small-scale corpus domains such as the Sign Language Machine Translation field. Our method is based on the Elastic net regularization using linear combination of the L1 and L2 penalties of the lasso and ridge methods. We show that using both the de-bruijn graph with the Latent Semantic Analysis technique in the decoding process improves the translation results. The system is experimented on American Sign Language parallel corpora containing 300 sentences and assessed by BLEU, METEOR, NIST and F1-MESURE machine translation evaluation metrics. We obtained good experimental results compared to classical phrase based approach i.e MOSES framework. Also our approach improved the translation results compared to LASSO and RIDGE regression approaches.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Banerjee S, Lavie A (2005) METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization at the 43rd Annual Meeting of the Association of Computational Linguistics (ACL-2005), Ann Arbor, June 2005 Banerjee S, Lavie A (2005) METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization at the 43rd Annual Meeting of the Association of Computational Linguistics (ACL-2005), Ann Arbor, June 2005
Zurück zum Zitat Battison R (1978) Lexical borrowing in American sign language: phonological and morphological restructuring. Linstok Press, Silver Spring Battison R (1978) Lexical borrowing in American sign language: phonological and morphological restructuring. Linstok Press, Silver Spring
Zurück zum Zitat Biçici E, Yuret D (2010) L1 regularization for learning word alignments in sparse feature matrices. In: Proceedings of the Computer Science Student Workshop, 2010 Biçici E, Yuret D (2010) L1 regularization for learning word alignments in sparse feature matrices. In: Proceedings of the Computer Science Student Workshop, 2010
Zurück zum Zitat Bishop CM (2006). Pattern recognition and machine learning. Springer ISBN 978-0-387-31073-2, 2006 Bishop CM (2006). Pattern recognition and machine learning. Springer ISBN 978-0-387-31073-2, 2006
Zurück zum Zitat Boulares M, Jemni M (2012) Mobile sign language translation system for deaf community. In: Proceedings of the International Cross-Disciplinary Conference on Web Accessibility. ACM, ISBN: 978-1-4503-1019-2 Boulares M, Jemni M (2012) Mobile sign language translation system for deaf community. In: Proceedings of the International Cross-Disciplinary Conference on Web Accessibility. ACM, ISBN: 978-1-4503-1019-2
Zurück zum Zitat Boulares M, Jemni M (2014) Combined methodology based on kernel regression and kernel density estimation for sign language machine translation. Advances in neural networks. ISNN’14, Springer pp 374–384 Boulares M, Jemni M (2014) Combined methodology based on kernel regression and kernel density estimation for sign language machine translation. Advances in neural networks. ISNN’14, Springer pp 374–384
Zurück zum Zitat Charles A, Rebecca S (2000) Reading optimally builds on spoken language implication for deaf readers. Learning research and development center University of Pittsburgh, Pittsburgh Charles A, Rebecca S (2000) Reading optimally builds on spoken language implication for deaf readers. Learning research and development center University of Pittsburgh, Pittsburgh
Zurück zum Zitat Cortes C, Mehryar M, Jason W (2007) A general regression framework for learning string-to-string mappings. In: Predicting structured data. The MIT Press, pp 143–168, September 2007 Cortes C, Mehryar M, Jason W (2007) A general regression framework for learning string-to-string mappings. In: Predicting structured data. The MIT Press, pp 143–168, September 2007
Zurück zum Zitat Coughlin D (2003) Correlating automated and human assessments of machine translation quality. MT Summit IX, New Orleans Coughlin D (2003) Correlating automated and human assessments of machine translation quality. MT Summit IX, New Orleans
Zurück zum Zitat Deerwester SC et al (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci (1986–1998); Sep 1990; 41, 6; ABI/INFORM Global pg. 391 Deerwester SC et al (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci (1986–1998); Sep 1990; 41, 6; ABI/INFORM Global pg. 391
Zurück zum Zitat Denoual E, Lepage Y (2005) BLEU in characters: towards automatic MT evaluation in languages without word delimiters. In: Companion Volume to the Proceedings of the Second International Joint Conference on Natural Language Processing, pp 81–86 Denoual E, Lepage Y (2005) BLEU in characters: towards automatic MT evaluation in languages without word delimiters. In: Companion Volume to the Proceedings of the Second International Joint Conference on Natural Language Processing, pp 81–86
Zurück zum Zitat Doddington G (2002) Automatic evaluation of machine translation quality using n-gram cooccurrence statistics. In: Proceedings of the Human Language Technology Conference (HLT), San Diego, pp 128–132 Doddington G (2002) Automatic evaluation of machine translation quality using n-gram cooccurrence statistics. In: Proceedings of the Human Language Technology Conference (HLT), San Diego, pp 128–132
Zurück zum Zitat Doddington G (2002) The NIST automated measure and its relation to IBMs BLEU. In: Proceedings of LREC-2002 Workshopon Machine Translation Evaluation: Human Evaluators Meet Automated Metrics Doddington G (2002) The NIST automated measure and its relation to IBMs BLEU. In: Proceedings of LREC-2002 Workshopon Machine Translation Evaluation: Human Evaluators Meet Automated Metrics
Zurück zum Zitat Efthimiou E et al (2009) Sign language recognition, generation, and modelling: a research effort with applications in deaf communication. In: Universal access in human–computer interaction addressing diversity. Springer, pp 21–30 Efthimiou E et al (2009) Sign language recognition, generation, and modelling: a research effort with applications in deaf communication. In: Universal access in human–computer interaction addressing diversity. Springer, pp 21–30
Zurück zum Zitat Efthimiou E et al (2007) GSLC: creation and annotation of a greek sign language corpus for HCI. In: Stephanidis C (ed) HCI 2007, LNCS, vol 4554. Springer, Heidelberg, pp 657–666 Efthimiou E et al (2007) GSLC: creation and annotation of a greek sign language corpus for HCI. In: Stephanidis C (ed) HCI 2007, LNCS, vol 4554. Springer, Heidelberg, pp 657–666
Zurück zum Zitat Emmorey K (2005) The confluence of space and language in signed languages. Linguist Am Sign Lang Introd 3:318–346 Emmorey K (2005) The confluence of space and language in signed languages. Linguist Am Sign Lang Introd 3:318–346
Zurück zum Zitat Finch A, Hwang Y-S, Sumita E (2005) Using machine translation evaluation techniques to determine sentence-level semantic equivalence. IWP2005 2005 Finch A, Hwang Y-S, Sumita E (2005) Using machine translation evaluation techniques to determine sentence-level semantic equivalence. IWP2005 2005
Zurück zum Zitat Huenerfauth M, Lu P (2011) Effect of spatial reference and verb inflection on usability of sign language animations. Springer-Verlag Univ Access Inf Soc. doi:10.1007/s10209-011-0247-7 Huenerfauth M, Lu P (2011) Effect of spatial reference and verb inflection on usability of sign language animations. Springer-Verlag Univ Access Inf Soc. doi:10.​1007/​s10209-011-0247-7
Zurück zum Zitat Hung-Yu S, Chung-Hsien W (2009) Improving structural statistical machine translation for sign language with small corpus using thematic role templates as translation memory. IEEE Trans Audio Speech Lang Process 17(7):1305–1315 September 2009CrossRef Hung-Yu S, Chung-Hsien W (2009) Improving structural statistical machine translation for sign language with small corpus using thematic role templates as translation memory. IEEE Trans Audio Speech Lang Process 17(7):1305–1315 September 2009CrossRef
Zurück zum Zitat Koehn P et al (2007) Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions, pp 177–180 Koehn P et al (2007) Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions, pp 177–180
Zurück zum Zitat Koehn P, Hoang H (2007) Factored translation models. In: Proceedings of EMNLP-CoNLL’07 Koehn P, Hoang H (2007) Factored translation models. In: Proceedings of EMNLP-CoNLL’07
Zurück zum Zitat Koehn P, Och FJ, Marcu D (2003) Statistical phrase-based translation. In: Proceedings of HAACL-HLT’03, pp 48–54 Koehn P, Och FJ, Marcu D (2003) Statistical phrase-based translation. In: Proceedings of HAACL-HLT’03, pp 48–54
Zurück zum Zitat Lavie A, Sagae K, Jayaraman S (2004) The Significance of Recall in Automatic Metrics for MT Evaluation. In: Proceedings of AMTA 2004, Washington DC. September 2004 Lavie A, Sagae K, Jayaraman S (2004) The Significance of Recall in Automatic Metrics for MT Evaluation. In: Proceedings of AMTA 2004, Washington DC. September 2004
Zurück zum Zitat Leslie C, Eskin E, Stafford W (2002) The spectrum kernel: a string kernel forsvm protein classifi-cation. Pacific symposium on Biocomputing. pp 566–575. 2002 Leslie C, Eskin E, Stafford W (2002) The spectrum kernel: a string kernel forsvm protein classifi-cation. Pacific symposium on Biocomputing. pp 566–575. 2002
Zurück zum Zitat Lodhi H, Saunders C, Shawe-Taylor J, Nello C, Watkins C (2002) Text classification using string kernels. J Mach Learn Res 2:419–444MATH Lodhi H, Saunders C, Shawe-Taylor J, Nello C, Watkins C (2002) Text classification using string kernels. J Mach Learn Res 2:419–444MATH
Zurück zum Zitat Neidle C et al (2000) The syntax of American sign language: functional categories and hierarchical structure. MIT Press, Cambridge Neidle C et al (2000) The syntax of American sign language: functional categories and hierarchical structure. MIT Press, Cambridge
Zurück zum Zitat Neidle C, Sclaroff S (2002) Data collected at the national center for sign language and gesture resources. Boston University, Boston Neidle C, Sclaroff S (2002) Data collected at the national center for sign language and gesture resources. Boston University, Boston
Zurück zum Zitat Papineni K et al (2002) BLEU: a method for automatic evaluation of machine translation. ACL-2002: 40th Annual meeting of the Association for Computational Linguistics. pp 311–318. CiteSeerX: 10.1.1.19.9416 Papineni K et al (2002) BLEU: a method for automatic evaluation of machine translation. ACL-2002: 40th Annual meeting of the Association for Computational Linguistics. pp 311–318. CiteSeerX: 10.1.1.19.9416
Zurück zum Zitat Powers DMW (2011) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness & correlation. J Mach Learn Technol 2(1):37–63MathSciNet Powers DMW (2011) Evaluation: from precision, recall and F-measure to ROC, informedness, markedness & correlation. J Mach Learn Technol 2(1):37–63MathSciNet
Zurück zum Zitat Przybocki M (2004) NIST machine translation 2004 evaluation summary of results. In: Machine Translation Evaluation Workshop Przybocki M (2004) NIST machine translation 2004 evaluation summary of results. In: Machine Translation Evaluation Workshop
Zurück zum Zitat Sandler W (1989) Phonological representation of the sign: linearity and nonlinearity in American sign language. Foris, DordrechtCrossRef Sandler W (1989) Phonological representation of the sign: linearity and nonlinearity in American sign language. Foris, DordrechtCrossRef
Zurück zum Zitat Schmidt C, Koller O, Ney H, Hoyoux T, Piater J (2013) Using viseme recognition to improve a sign language translation system. International Workshop on Spoken Language Translation, pp 197–203 Schmidt C, Koller O, Ney H, Hoyoux T, Piater J (2013) Using viseme recognition to improve a sign language translation system. International Workshop on Spoken Language Translation, pp 197–203
Zurück zum Zitat Serrano N, Andres-Ferrer J, Casacuberta F (2009) On a kernel regression approach to machine translation. In: Iberian Conference on Pattern Recognition and Image Analysis, pp 394–401, 2009 Serrano N, Andres-Ferrer J, Casacuberta F (2009) On a kernel regression approach to machine translation. In: Iberian Conference on Pattern Recognition and Image Analysis, pp 394–401, 2009
Zurück zum Zitat Shawe-Taylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge university press, CambridgeCrossRefMATH Shawe-Taylor J, Cristianini N (2004) Kernel methods for pattern analysis. Cambridge university press, CambridgeCrossRefMATH
Zurück zum Zitat Stein D (2012) Analysis, preparation, and optimization of statistical sign language machine translation. Mach Trans 26(4):325–357CrossRef Stein D (2012) Analysis, preparation, and optimization of statistical sign language machine translation. Mach Trans 26(4):325–357CrossRef
Zurück zum Zitat Stokoe WC (1978) Sign language structure. 1978. ERIC Stokoe WC (1978) Sign language structure. 1978. ERIC
Zurück zum Zitat Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc. Series B (Methodol). 267–288 Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc. Series B (Methodol). 267–288
Zurück zum Zitat Trevor H, Jonathan T, Robert T, Guenther W (2006) Forward stagewise regression and the monotone lasso. Electron J Stat 1:1–29MathSciNetMATH Trevor H, Jonathan T, Robert T, Guenther W (2006) Forward stagewise regression and the monotone lasso. Electron J Stat 1:1–29MathSciNetMATH
Zurück zum Zitat Trevor H, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction, 2nd edn. Springer, New YorkMATH Trevor H, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction, 2nd edn. Springer, New YorkMATH
Zurück zum Zitat Valli C, Lucas C (2000) Linguistics of American sign language: an introduction. Gallaudet University Press, Washington, DC Valli C, Lucas C (2000) Linguistics of American sign language: an introduction. Gallaudet University Press, Washington, DC
Zurück zum Zitat Watkins C (2000) Dynamic alignment kernels. Adv Large Margin Classif, 39–50 Watkins C (2000) Dynamic alignment kernels. Adv Large Margin Classif, 39–50
Zurück zum Zitat Zhuoran W, Shawe-Taylor J (2008) Kernel regression framework for machine translation: UCL system description for WMT 2008 shared translation task. In: Proceedings of the Third Workshop on Statistical Machine Translation, pp 155–158, 2008 Zhuoran W, Shawe-Taylor J (2008) Kernel regression framework for machine translation: UCL system description for WMT 2008 shared translation task. In: Proceedings of the Third Workshop on Statistical Machine Translation, pp 155–158, 2008
Zurück zum Zitat Zhuoran W, Shawe-Taylor J, Sandor S (2007) Kernel regression based machine translation. In: Human Language Technologies. The Conference of the North American Chapter of the Association for Computational Linguistics; pp 185–188, 2007 Zhuoran W, Shawe-Taylor J, Sandor S (2007) Kernel regression based machine translation. In: Human Language Technologies. The Conference of the North American Chapter of the Association for Computational Linguistics; pp 185–188, 2007
Zurück zum Zitat Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc: Series B (Stat Methodol) 67(2):301–320 April 2005MathSciNetCrossRefMATH Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc: Series B (Stat Methodol) 67(2):301–320 April 2005MathSciNetCrossRefMATH
Metadaten
Titel
Learning sign language machine translation based on elastic net regularization and latent semantic analysis
verfasst von
Mehrez Boulares
Mohamed Jemni
Publikationsdatum
01.08.2016
Verlag
Springer Netherlands
Erschienen in
Artificial Intelligence Review / Ausgabe 2/2016
Print ISSN: 0269-2821
Elektronische ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-016-9460-3