Your browser does not support JavaScript!
http://iet.metastore.ingenta.com
1887

Voice conversion based on state-space model for modelling spectral trajectory

Voice conversion based on state-space model for modelling spectral trajectory

For access to this article, please select a purchase option:

Buy article PDF
£12.50
(plus tax if applicable)
Buy Knowledge Pack
10 articles for £75.00
(plus taxes if applicable)

IET members benefit from discounts to all IET publications and free access to E&T Magazine. If you are an IET member, log in to your account and the discounts will automatically be applied.

Learn more about IET membership 

Recommend Title Publication to library

You must fill out fields marked with: *

Librarian details
Name:*
Email:*
Your details
Name:*
Email:*
Department:*
Why are you recommending this title?
Select reason:
 
 
 
 
 
Electronics Letters — Recommend this title to your library

Thank you

Your recommendation has been sent to your librarian.

A novel voice conversion (VC) method using a state-space model (SSM) is presented. The SSM, which has never been shown before in the context of VC, has the advantage of explicitly modelling spectral parameter trajectory. Thus, it will be superior to the conventional Gaussian mixture model (GMM)-based method, where the conversion algorithm is performed on a frame-by-frame procedure, ignoring the correlation between adjacent frames. Experiments using both objective and subjective measurements show that the proposed SSM-based method significantly outperforms the traditional GMM-based technique in the view of both speech quality and conversion accuracy for speaker individuality.

References

    1. 1)
      • M. Athanasios , V.S. Jan , M. Paul . Nonparallel training for voice conversion based on a parameter adaptation approach. IEEE Trans. Audio, Speech Lang. Process. , 3 , 952 - 963
    2. 2)
      • T. Toda , A.W. Black , K. Tokuda . Voice conversion based on maximum likelihood estimation of spectral parameter trajectory. IEEE Trans. Audio, Speech Lang. Process. , 8 , 2222 - 2235
    3. 3)
      • Y. Stylianou , O. Cappe , E. Moulines . Continuous probabilistic transform for voice conversion. IEEE Trans. Speech Audio Process. , 2 , 131 - 142
    4. 4)
      • Abe, M., Nakamura, S., Shikano, K., Kuwabara, H.: `Voice conversion through vector quantization', IEEE ICASSP, April 1988, New York, NY, USA, 1, p. 655–658.
    5. 5)
      • N. Iwahashi , Y. Sagisaka . Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks. Speech Commun. , 2 , 139 - 151
    6. 6)
      • H. Tanizaki . (1996) Nonlinear Filters.
    7. 7)
      • H. Kawahara , I. Masuda-Katsuse , A. de Cheveigné . Restructuring speech representations using a pitch adaptive time-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Commun. , 3 , 187 - 207
http://iet.metastore.ingenta.com/content/journals/10.1049/el.2009.0904
Loading

Related content

content/journals/10.1049/el.2009.0904
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading
This is a required field
Please enter a valid email address