Skip to main content
Erschienen in: Neural Computing and Applications 3/2010

01.04.2010 | Original Article

Unaligned training for voice conversion based on a local nonlinear principal component analysis approach

verfasst von: Behrooz Makki, Mona Noori Hosseini, Seyyed Ali Seyyedsalehi, Nasser Sadati

Erschienen in: Neural Computing and Applications | Ausgabe 3/2010

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

During the past years, various principal component analysis algorithms have been developed. In this paper, a new approach for local nonlinear principal component analysis is proposed which is applied to capture voice conversion (VC). A new structure of autoassociative neural network is designed which not only performs data partitioning but also extracts nonlinear principal components of the clusters. Performance of the proposed method is evaluated by means of two experiments that illustrate its efficiency; at first, performance of the network is described by means of an artificial dataset and then, the developed method is applied to perform VC.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
2.
Zurück zum Zitat Cheng-Yuan L, Jang J-SR (2003) New refinement schemes for voice conversion. In: Proceeding of the ICME 2003, vol 2. IEEE Computer Society, Washington, DC, pp 725–728 Cheng-Yuan L, Jang J-SR (2003) New refinement schemes for voice conversion. In: Proceeding of the ICME 2003, vol 2. IEEE Computer Society, Washington, DC, pp 725–728
6.
Zurück zum Zitat Hui Y, Young S (2006) Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Trans Audio Speech Lang Processing 14(4):1301–1312CrossRef Hui Y, Young S (2006) Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Trans Audio Speech Lang Processing 14(4):1301–1312CrossRef
10.
Zurück zum Zitat Latorre J, Iwano K, Furui S (2005) Polyglot synthesis using a mixture of monolingual corpora. In: Proceeding of IEEE international conference on acoustics, speech, and signal processing (ICASSP 2005). IEEE, Philadelphia, pp 1–4 Latorre J, Iwano K, Furui S (2005) Polyglot synthesis using a mixture of monolingual corpora. In: Proceeding of IEEE international conference on acoustics, speech, and signal processing (ICASSP 2005). IEEE, Philadelphia, pp 1–4
11.
Zurück zum Zitat Ma J, Liu W (2005) Voice conversion based on joint pitch and spectral transformation with component group GMM. In: Proceeding of IEEE conference on natural language processing and knowledge engineering, 2005. IEEE, Beijing, pp 199–203 Ma J, Liu W (2005) Voice conversion based on joint pitch and spectral transformation with component group GMM. In: Proceeding of IEEE conference on natural language processing and knowledge engineering, 2005. IEEE, Beijing, pp 199–203
12.
Zurück zum Zitat Makki B, Noori Hosseini M, Seyyedsalehi SA (2008) Unsupervised extraction of meaningful nonlinear principal components applied for voice conversion. In: IEEE joint conference on Neural Networks, 2008, Hong Kong, pp 1370–1373 Makki B, Noori Hosseini M, Seyyedsalehi SA (2008) Unsupervised extraction of meaningful nonlinear principal components applied for voice conversion. In: IEEE joint conference on Neural Networks, 2008, Hong Kong, pp 1370–1373
13.
Zurück zum Zitat Makki B, Seyedsalehi SA, Noori Hosseini M, Sadati N (2007) Principal component analysis using constructive neural networks. In: IEEE international joint conference on neural networks, Orlando, 2007, pp 558–562 Makki B, Seyedsalehi SA, Noori Hosseini M, Sadati N (2007) Principal component analysis using constructive neural networks. In: IEEE international joint conference on neural networks, Orlando, 2007, pp 558–562
14.
Zurück zum Zitat Makki B, Salehi SA, Sadati N, Noori Hosseini M (2007) Voice conversion using nonlinear principal component analysis. In: IEEE symposium on computational intelligence in image and signal processing, 2007, IEEE, Hawaii, pp 336–339 Makki B, Salehi SA, Sadati N, Noori Hosseini M (2007) Voice conversion using nonlinear principal component analysis. In: IEEE symposium on computational intelligence in image and signal processing, 2007, IEEE, Hawaii, pp 336–339
17.
21.
Zurück zum Zitat Zuo G, Chen Y, Ruan XG, Liu WJ (2005) Learning Mandarin tone mapping codebook for voice conversion. In: Proceeding of international conference on machine learning and cybernetics, vol 8. IEEE, Guangzhou, pp 4824–4828 Zuo G, Chen Y, Ruan XG, Liu WJ (2005) Learning Mandarin tone mapping codebook for voice conversion. In: Proceeding of international conference on machine learning and cybernetics, vol 8. IEEE, Guangzhou, pp 4824–4828
22.
Zurück zum Zitat Zuo G, Liu W (2004) Genetic algorithm based RBF neural network for voice conversion. In: Fifth world congress on intelligent control and automation, vol 5. IEEE, Beijing, pp 4215–4218 Zuo G, Liu W (2004) Genetic algorithm based RBF neural network for voice conversion. In: Fifth world congress on intelligent control and automation, vol 5. IEEE, Beijing, pp 4215–4218
Metadaten
Titel
Unaligned training for voice conversion based on a local nonlinear principal component analysis approach
verfasst von
Behrooz Makki
Mona Noori Hosseini
Seyyed Ali Seyyedsalehi
Nasser Sadati
Publikationsdatum
01.04.2010
Verlag
Springer-Verlag
Erschienen in
Neural Computing and Applications / Ausgabe 3/2010
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-009-0275-x

Weitere Artikel der Ausgabe 3/2010

Neural Computing and Applications 3/2010 Zur Ausgabe

Premium Partner