Skip to main content
Erschienen in: Neural Computing and Applications 3-4/2006

01.06.2006 | Original Article

The construction of wavelet network for speech signal processing

verfasst von: D. Shi, F. Chen, G. S. Ng, J. Gao

Erschienen in: Neural Computing and Applications | Ausgabe 3-4/2006

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Wavelet decomposition reconstructs a signal by a series of scaled and translated wavelets. Incorporating discrete wavelet decomposition theory with neural network techniques, wavelet networks have recently emerged as a powerful tool for many applications in the field of signal processing, such as data compression and function approximation. In this paper, four contributions are claimed: (1) From the point of view of machine learning, we analyse and construct wavelet network to achieve the compact representation of a signal. (2) A new algorithm of constructing wavelet network is proposed. The orthogonal least square (OLS) is employed to prune the wavelet network. (3) Our experiments on speech signal processing results show that the wavelet network pruned by OLS achieves the best approximation and prediction capabilities among the representative speech processing techniques. (4) Our proposed methodology has been successfully applied to speech synthesis for a talking head to read web texts.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Gao JB, Harris CJ, Gunn SR (2001) On a class of support vector kernels based on frames in function hilber spaces. Neural Comput 13:1975–1994MATHCrossRef Gao JB, Harris CJ, Gunn SR (2001) On a class of support vector kernels based on frames in function hilber spaces. Neural Comput 13:1975–1994MATHCrossRef
2.
Zurück zum Zitat Gorriz JM, Puntonet CG, Salmeron M, de la Rosa JJG (2004) A new model for time-series forecasting using radial basis functions and exogenous data. Neural Comput Appl 13:101–111 Gorriz JM, Puntonet CG, Salmeron M, de la Rosa JJG (2004) A new model for time-series forecasting using radial basis functions and exogenous data. Neural Comput Appl 13:101–111
3.
4.
Zurück zum Zitat Salmeron M, Ortega J, Puntonet CG, Prieto A (2001) Improved RAN sequential prediction using orthogonal techniques. Neurocomputing 41:153–172MATHCrossRef Salmeron M, Ortega J, Puntonet CG, Prieto A (2001) Improved RAN sequential prediction using orthogonal techniques. Neurocomputing 41:153–172MATHCrossRef
5.
Zurück zum Zitat Moulines E, Charpentier F (1990) Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun 9:453–467CrossRef Moulines E, Charpentier F (1990) Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun 9:453–467CrossRef
6.
Zurück zum Zitat McAulay RJ, Quatieri TF (1986) Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans Acoust Speech Signal Process 34:744–754CrossRef McAulay RJ, Quatieri TF (1986) Speech analysis/synthesis based on a sinusoidal representation. IEEE Trans Acoust Speech Signal Process 34:744–754CrossRef
7.
Zurück zum Zitat Mallat SG (1989) A theory of multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Machine Intell 11:674–693MATHCrossRef Mallat SG (1989) A theory of multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Machine Intell 11:674–693MATHCrossRef
8.
Zurück zum Zitat Daubechies I (1990) The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory 36:961–1005MATHCrossRefMathSciNet Daubechies I (1990) The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory 36:961–1005MATHCrossRefMathSciNet
9.
Zurück zum Zitat Mallat SG, Zhong S (1992) Characterization of signals from multiscale edges. IEEE Trans Pattern Anal Machine Intell 14(7):710–732CrossRef Mallat SG, Zhong S (1992) Characterization of signals from multiscale edges. IEEE Trans Pattern Anal Machine Intell 14(7):710–732CrossRef
10.
Zurück zum Zitat Zhang Q, Benveniste A (1992) Wavelet network. IEEE Trans Neural Networks 3(6):889–898CrossRef Zhang Q, Benveniste A (1992) Wavelet network. IEEE Trans Neural Networks 3(6):889–898CrossRef
11.
Zurück zum Zitat Zhang Q (1997) Using wavelet network in nonparametric estimation. IEEE Trans Neural Networks 8(2):227–236CrossRef Zhang Q (1997) Using wavelet network in nonparametric estimation. IEEE Trans Neural Networks 8(2):227–236CrossRef
12.
Zurück zum Zitat Bishop CM (19991) Improving the generalization properties of radial basis function neural networks. Neural Comput 3(4):579–588CrossRefMathSciNet Bishop CM (19991) Improving the generalization properties of radial basis function neural networks. Neural Comput 3(4):579–588CrossRefMathSciNet
13.
Zurück zum Zitat Chen S, Cowan CF, Grant PM (1991) Orthogonal least squares learning algorithms for radial basis function networks. IEEE Trans Neural Networks 2(2):302–309CrossRef Chen S, Cowan CF, Grant PM (1991) Orthogonal least squares learning algorithms for radial basis function networks. IEEE Trans Neural Networks 2(2):302–309CrossRef
14.
Zurück zum Zitat Chen S, Chng ES, Alkadhimim K (1996) Regularized orthogonal least squares algorithm for constructing radial basis function networks. Int J Control 64(5):829–837MATHCrossRef Chen S, Chng ES, Alkadhimim K (1996) Regularized orthogonal least squares algorithm for constructing radial basis function networks. Int J Control 64(5):829–837MATHCrossRef
15.
Zurück zum Zitat Chen S, Wu Y, Luk BL (1999) Combined genetic algorithm optimisation and regularised orthogonal least squares learning for radial basis function networks. IEEE Trans Neural Networks 10(5):1239–1243CrossRef Chen S, Wu Y, Luk BL (1999) Combined genetic algorithm optimisation and regularised orthogonal least squares learning for radial basis function networks. IEEE Trans Neural Networks 10(5):1239–1243CrossRef
16.
Zurück zum Zitat Gomm JB, Yu DL (2000) Selecting radial basis function network centers with recursive orthogonal least squares training. IEEE Trans Neural Networks 11:306–314CrossRef Gomm JB, Yu DL (2000) Selecting radial basis function network centers with recursive orthogonal least squares training. IEEE Trans Neural Networks 11:306–314CrossRef
18.
Zurück zum Zitat Chen F, Spinko V, Shi D (2005) Real-time lip synchronization using wavelet network. In: Proceedings of International Conference on Cyberworlds, Singapore Chen F, Spinko V, Shi D (2005) Real-time lip synchronization using wavelet network. In: Proceedings of International Conference on Cyberworlds, Singapore
19.
Zurück zum Zitat Vapnik VN (1999) An overview of statistical learning theory. IEEE Trans Neural Networks 10:988–999CrossRef Vapnik VN (1999) An overview of statistical learning theory. IEEE Trans Neural Networks 10:988–999CrossRef
20.
Zurück zum Zitat Scholkopf B, Sung KK, Burges CJC, Girosi F, Niyogi P, Poggio T, Vapnik V (1997) Comparing support vector machines with gaussian kernels to radial basis function classifiers. IEEE Trans Signal Process 45(11):2758–2765CrossRef Scholkopf B, Sung KK, Burges CJC, Girosi F, Niyogi P, Poggio T, Vapnik V (1997) Comparing support vector machines with gaussian kernels to radial basis function classifiers. IEEE Trans Signal Process 45(11):2758–2765CrossRef
Metadaten
Titel
The construction of wavelet network for speech signal processing
verfasst von
D. Shi
F. Chen
G. S. Ng
J. Gao
Publikationsdatum
01.06.2006
Verlag
Springer-Verlag
Erschienen in
Neural Computing and Applications / Ausgabe 3-4/2006
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-005-0016-8

Weitere Artikel der Ausgabe 3-4/2006

Neural Computing and Applications 3-4/2006 Zur Ausgabe

Premium Partner