2018 | Original Paper | Book Chapter

5. Formant-Based Lip Motion Generation and Evaluation in Humanoid Robots

Authors: Carlos T. Ishi, Chaoran Liu, Hiroshi Ishiguro, Norihiro Hagita

Published in: Geminoid Studies

Publisher: Springer Singapore


Abstract

Generating natural motion in robots is important for improving human–robot interaction. We have developed a teleoperation system in which the lip motion of a remote humanoid robot is automatically controlled by the operator's voice. In the present work, we introduce an improved version of our speech-driven lip motion generation method, in which the degrees of lip height and lip width are estimated from vowel formant information. The method requires calibration of only one parameter for speaker normalization. Lip height control is evaluated on two humanoid robots (Telenoid-R2 and Geminoid-F). Subjective evaluations indicate that the proposed audio-based method generates lip motion that is more natural than that produced by vision-based and motion-capture-based approaches. Partial lip width control is shown to further improve lip motion naturalness in Geminoid-F, which also has an actuator for stretching the lip corners. Issues regarding online real-time processing are also discussed.
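To make the mapping described in the abstract concrete, the following Python code is a minimal sketch, not the authors' implementation. It assumes LPC-based formant extraction (here via librosa.lpc), simple linear mappings from F1 to lip height and from F2 to lip width, and a single gain `alpha` standing in for the one speaker-normalization parameter mentioned above; all frequency bounds and constants are illustrative values, not the chapter's calibrated ones.

```python
import numpy as np
import librosa

def estimate_formants(frame, sr, order=12):
    """Estimate formant frequencies (Hz) of one windowed speech frame via LPC analysis."""
    a = librosa.lpc(frame.astype(float), order=order)   # prediction-polynomial coefficients
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]                   # keep one root per complex-conjugate pair
    return np.sort(np.angle(roots) * sr / (2 * np.pi))  # ascending: ~F1, F2, ...

def lip_shape(f1, f2, alpha=1.0):
    """Map (F1, F2) to normalized lip height/width commands in [0, 1].

    `alpha` plays the role of the single speaker-normalization parameter;
    the vowel-space bounds below are illustrative values for an adult
    speaker, not calibrated ones.
    """
    height = alpha * (f1 - 250.0) / (850.0 - 250.0)     # open vowels (high F1) -> wide opening
    width = alpha * (f2 - 800.0) / (2500.0 - 800.0)     # front vowels (high F2) -> spread lips
    return float(np.clip(height, 0, 1)), float(np.clip(width, 0, 1))

# Per-frame usage on the operator's voice (placeholder signal shown here):
sr, n = 16000, 512
frame = np.hamming(n) * np.random.randn(n)              # stand-in for a real 32-ms speech frame
formants = estimate_formants(frame, sr)
if len(formants) >= 2:
    h, w = lip_shape(formants[0], formants[1], alpha=1.0)
    print(f"lip height={h:.2f}, lip width={w:.2f}")
```

In a real-time teleoperation loop, such per-frame estimates would typically be smoothed before being sent as actuator commands, since raw LPC root frequencies can jitter from frame to frame.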


Metadata
Title
Formant-Based Lip Motion Generation and Evaluation in Humanoid Robots
Authors
Carlos T. Ishi
Chaoran Liu
Hiroshi Ishiguro
Norihiro Hagita
Copyright year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8702-8_5
