2015 | OriginalPaper | Book chapter

Use of Different Features for Emotion Recognition Using MLP Network

Authors: H. K. Palo, Mihir Narayana Mohanty, Mahesh Chandra

Published in: Computational Vision and Robotics

Publisher: Springer India

Abstract

Recognising human emotion is one of the major challenges in today's complicated world of political and criminal scenarios. In this paper, an attempt is made to recognise two classes of speech emotion: high arousal, such as anger and surprise, and low arousal, such as sadness and boredom. Linear prediction coefficients (LPC), linear prediction cepstral coefficients (LPCC), Mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) features are used for emotion recognition with a multilayer perceptron (MLP). These features are extracted from the audio channel of the emotional speech and used for training and testing. Two hundred utterances from ten subjects were collected across four emotion categories. One hundred and seventy-five utterances were used for training and twenty-five for testing.
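
The abstract names LPC and LPCC among the features but does not say how they are computed. As an illustration only, the sketch below derives LPC coefficients for a single frame with the autocorrelation method and the Levinson-Durbin recursion, then converts them to cepstral coefficients with the usual LPC-to-LPCC recursion. The choice of method, the prediction order and all function names are assumptions made for this example and are not taken from the chapter.

```python
"""Sketch: LPC via the autocorrelation method, then LPC -> LPCC conversion.

Assumptions (not from the chapter): the autocorrelation method, the
Levinson-Durbin recursion, and every function name here are illustrative.
"""


def autocorrelation(frame, max_lag):
    """Return r[0..max_lag] for one (already windowed) speech frame."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def levinson_durbin(r, order):
    """Solve the normal equations for predictor coefficients a_1..a_p.

    Model: s[n] is approximated by sum_k a_k * s[n - k].
    Returns (coefficients, final prediction error energy).
    Assumes r[0] > 0, i.e. the frame is not all zeros.
    """
    a = [0.0] * (order + 1)  # a[0] is unused padding
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient for this order
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:], err


def lpc_to_lpcc(a):
    """Convert predictor coefficients to the first len(a) cepstral coefficients.

    Uses c_n = a_n + sum_{k=1}^{n-1} (k / n) * c_k * a_{n-k}.
    The gain-dependent term c_0 is left out of this sketch.
    """
    p = len(a)
    c = [0.0] * (p + 1)
    for n in range(1, p + 1):
        c[n] = a[n - 1] + sum(
            (k / n) * c[k] * a[n - k - 1] for k in range(1, n)
        )
    return c[1:]


if __name__ == "__main__":
    # Autocorrelation of an ideal first-order process with coefficient 0.5.
    r = [1.0, 0.5, 0.25]
    coeffs, err = levinson_durbin(r, 2)
    print("LPC:", coeffs, "error:", err)
    print("LPCC:", lpc_to_lpcc(coeffs))
```

In a full pipeline, each utterance would be split into short windowed frames, the per-frame coefficients would be stacked or averaged into a feature vector, and those vectors would be passed to the MLP. None of these steps is specified in the abstract, so they are left out here.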

Metadata
Title
Use of Different Features for Emotion Recognition Using MLP Network
Authors
H. K. Palo
Mihir Narayana Mohanty
Mahesh Chandra
Copyright year
2015
Publisher
Springer India
DOI
https://doi.org/10.1007/978-81-322-2196-8_2