Skip to main content

2016 | OriginalPaper | Buchkapitel

Segment-Level Probabilistic Sequence Kernel Based Support Vector Machines for Classification of Varying Length Patterns of Speech

verfasst von : Shikha Gupta, Veena Thenkanidiyoor, Dileep Aroor Dinesh

Erschienen in: Neural Information Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work we propose the segment-level probabilistic sequence kernel (SLPSK) as dynamic kernel to be used in support vector machine (SVM) for classification of varying length patterns of long duration speech represented as sets of feature vectors. SLPSK is built upon a set of Gaussian basis functions, where half of the basis functions contain class specific information while the other half implicates the common characteristics of all the speech utterances of all classes. The proposed kernel is computed between the pair of examples, by partitioning the speech signal into fixed number of segments and then matching the corresponding segments. We study the performance of the SVM-based classifiers using the proposed SLPSK using different pooling technique for speech emotion recognition and speaker identification and compare with that of the SVM-based classifiers using other kernels for varying length patterns.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dileep, A.D., Chandra Sekhar, C.: GMM-based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1421–1432 (2014)CrossRef Dileep, A.D., Chandra Sekhar, C.: GMM-based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1421–1432 (2014)CrossRef
2.
Zurück zum Zitat Smith, N., Gales, M., Niranjan, M.: Data-dependent kernels in SVM classification of speech patterns. Technical report CUED/F-INFENG/TR.387, Cambridge University Engineering Department, Trumpington Street, Cambridge, CB2 1PZ, U.K., April 2001 Smith, N., Gales, M., Niranjan, M.: Data-dependent kernels in SVM classification of speech patterns. Technical report CUED/F-INFENG/TR.387, Cambridge University Engineering Department, Trumpington Street, Cambridge, CB2 1PZ, U.K., April 2001
3.
Zurück zum Zitat Lee, K-A., You, C.H., Li, H., Kinnunen, T.: A GMM-based probabilistic sequence kernel for speaker verification. In: Proceedings of INTERSPEECH, Antwerp, Belgium, pp. 294–297, August 2007 Lee, K-A., You, C.H., Li, H., Kinnunen, T.: A GMM-based probabilistic sequence kernel for speaker verification. In: Proceedings of INTERSPEECH, Antwerp, Belgium, pp. 294–297, August 2007
4.
Zurück zum Zitat Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)CrossRef Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308–311 (2006)CrossRef
5.
Zurück zum Zitat You, C.H., Lee, K.A., Li, H.: An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition. IEEE Signal Process. Lett. 16(1), 49–52 (2009)CrossRef You, C.H., Lee, K.A., Li, H.: An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition. IEEE Signal Process. Lett. 16(1), 49–52 (2009)CrossRef
6.
Zurück zum Zitat Dileep, A.D., Chandra Sekhar, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365–379 (2012)CrossRef Dileep, A.D., Chandra Sekhar, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365–379 (2012)CrossRef
7.
Zurück zum Zitat Sachdev, A., Dileep, A.D., Thenkanidiyoor, V.: Example-specific density based matching kernel for classificationof varying length patterns of speech using support vector machines. In: Proceedings of ICONIP, Istanbul, Turkey, pp.177–184, November 2015 Sachdev, A., Dileep, A.D., Thenkanidiyoor, V.: Example-specific density based matching kernel for classificationof varying length patterns of speech using support vector machines. In: Proceedings of ICONIP, Istanbul, Turkey, pp.177–184, November 2015
8.
Zurück zum Zitat Yu, K., Lv, F., Huang, T., Wang, J., Yang, J., Gong, Y.: Locality-constrained linear coding for image classification. In: Proceedings of CVPR 2010, pp. 3360–3367. IEEE (2010) Yu, K., Lv, F., Huang, T., Wang, J., Yang, J., Gong, Y.: Locality-constrained linear coding for image classification. In: Proceedings of CVPR 2010, pp. 3360–3367. IEEE (2010)
9.
Zurück zum Zitat Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of CVPR 2009, pp. 1794–1801. IEEE (2009) Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of CVPR 2009, pp. 1794–1801. IEEE (2009)
10.
Zurück zum Zitat Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011) Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)
11.
Zurück zum Zitat Burkhardt, F., Paeschke, A., Rolfes, M., Weiss, W.S.B.: A database of German emotional speech. In: Proceedings of INTERSPEECH, Lisbon, Portugal, pp. 1517–1520, September 2005 Burkhardt, F., Paeschke, A., Rolfes, M., Weiss, W.S.B.: A database of German emotional speech. In: Proceedings of INTERSPEECH, Lisbon, Portugal, pp. 1517–1520, September 2005
12.
Zurück zum Zitat Steidl, S.: Automatic classification of emotion-related user states inspontaneous childern’s speech. Ph.D. thesis, Der Technischen Fakultät der Universität Erlangen-Nürnberg, Germany (2009) Steidl, S.: Automatic classification of emotion-related user states inspontaneous childern’s speech. Ph.D. thesis, Der Technischen Fakultät der Universität Erlangen-Nürnberg, Germany (2009)
Metadaten
Titel
Segment-Level Probabilistic Sequence Kernel Based Support Vector Machines for Classification of Varying Length Patterns of Speech
verfasst von
Shikha Gupta
Veena Thenkanidiyoor
Dileep Aroor Dinesh
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46681-1_39