Skip to main content
Erschienen in: International Journal of Speech Technology 4/2018

25.09.2018

Tamil and English speech database for heartbeat estimation

verfasst von: A. Milton, K. Anish Monsely

Erschienen in: International Journal of Speech Technology | Ausgabe 4/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The aim of this research work is to provide an open source database containing speech signals and the corresponding heartbeat rates, so as to further widen the area of research in speech signal processing, especially estimation of heartbeat rate from speech. Tamil and English Speech Database for Heartbeat Estimation consists of 10,040 speech recordings. The speech signals were recorded from 109 persons, 52 females and 57 males with an average age of 25 years and 6 months. The informed consented volunteers were asked to perform three tasks; like answering and reading in rest state; answering and reading after physical exercise and answering after watching video clips. 24-th and 72-nd order Mel-Frequency Cepstral Coefficients and 14-th and 52-nd order Auto Regressive Reflection Coefficients are extracted from the speech signal. Prediction of heartbeat is done by linear regression using support vector machine. The statistical significance of the heartbeat prediction results are improved by 10-fold speaker-independent cross validation scheme. Experimental results show a minimum average estimation error of ± 13.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Barros, A. K., & Ohnishi, N. (2001). Heart instantaneous frequency (HIF): An alternative approach to extract heart rate variability. IEEE Transactions on Biomedical Engineering, 48(8), 850–855.CrossRef Barros, A. K., & Ohnishi, N. (2001). Heart instantaneous frequency (HIF): An alternative approach to extract heart rate variability. IEEE Transactions on Biomedical Engineering, 48(8), 850–855.CrossRef
Zurück zum Zitat Bernardi, L., Wdowczyk-Szulc, J., Valenti, C., Castoldi, S., Passino, C., Spadacini, G., G., & Sleight, P. (2000). Effects of controlled breathing, mental activity and mental stress with or without verbalization on heart rate variability. Journal of the American College of Cardiology, 35(6), 1462–1469.CrossRef Bernardi, L., Wdowczyk-Szulc, J., Valenti, C., Castoldi, S., Passino, C., Spadacini, G., G., & Sleight, P. (2000). Effects of controlled breathing, mental activity and mental stress with or without verbalization on heart rate variability. Journal of the American College of Cardiology, 35(6), 1462–1469.CrossRef
Zurück zum Zitat Hayre, H. S., & Holland, J. C. (1980). Cross-correlation of voice and heart rate as stress measures. Applied Acoustics, 13(1), 57–62.CrossRef Hayre, H. S., & Holland, J. C. (1980). Cross-correlation of voice and heart rate as stress measures. Applied Acoustics, 13(1), 57–62.CrossRef
Zurück zum Zitat Johnson, H. J., & Campos, J. J. (1967). The effect of cognitive tasks and verbalization instructions on heart rate and skin conductance. Psychophysiology, 4(2), 143–150.CrossRef Johnson, H. J., & Campos, J. J. (1967). The effect of cognitive tasks and verbalization instructions on heart rate and skin conductance. Psychophysiology, 4(2), 143–150.CrossRef
Zurück zum Zitat Kathol, A., & Shriberg, E. (2015). The SRI biofrustration corpus: audio, video, and physiological signals for continuous user modelling. In Proceedings of AAAI Spring Symposium Series 2015, (pp. 96–99) Palo Alto, California. Kathol, A., & Shriberg, E. (2015). The SRI biofrustration corpus: audio, video, and physiological signals for continuous user modelling. In Proceedings of AAAI Spring Symposium Series 2015, (pp. 96–99) Palo Alto, California.
Zurück zum Zitat Makhoul, J. (1975). Linear prediction: a tutorial review. Proceedings of the IEEE, 63(4), 561–580.CrossRef Makhoul, J. (1975). Linear prediction: a tutorial review. Proceedings of the IEEE, 63(4), 561–580.CrossRef
Zurück zum Zitat Mesleh, A., Skopin, D., Baglikov, S., & Quteishat, A. (2012). Heart rate extraction from vowel speech signals. Journal of computer science and technology, 27(6), 1243–1251.CrossRef Mesleh, A., Skopin, D., Baglikov, S., & Quteishat, A. (2012). Heart rate extraction from vowel speech signals. Journal of computer science and technology, 27(6), 1243–1251.CrossRef
Zurück zum Zitat Milton, A. (2015). Automatic recognition of speech emotions using class-specific multiple classifier scheme. Ph.D. Thesis, Anna University, Chennai, India. Milton, A. (2015). Automatic recognition of speech emotions using class-specific multiple classifier scheme. Ph.D. Thesis, Anna University, Chennai, India.
Zurück zum Zitat Rabiner, L. R., & Schafer, R. W. (2004s). Digital processing of speech signals. Delhi: Pearson Education (Singapore) Pte.Ltd. Rabiner, L. R., & Schafer, R. W. (2004s). Digital processing of speech signals. Delhi: Pearson Education (Singapore) Pte.Ltd.
Zurück zum Zitat Ryskaliyev, A., Askaruly, S., & James, A. (2016). Speech signal analysis for the estimation of heart rates under different emotional states. In Proceedings of IEEE International Conference on Advances in Computing, Communications and Informatics, (pp. 1160–1165) Jaipur, India. Ryskaliyev, A., Askaruly, S., & James, A. (2016). Speech signal analysis for the estimation of heart rates under different emotional states. In Proceedings of IEEE International Conference on Advances in Computing, Communications and Informatics, (pp. 1160–1165) Jaipur, India.
Zurück zum Zitat Schnell, I., Potchter, O., Epstein, Y., Yaakov, Y., Hermesh, H., Brenner, S., & Tirosh, E. (2013). The effects of exposure to environmental factors on heart rate variability: An ecological perspective. Environmental Pollution, 183, 7–13.CrossRef Schnell, I., Potchter, O., Epstein, Y., Yaakov, Y., Hermesh, H., Brenner, S., & Tirosh, E. (2013). The effects of exposure to environmental factors on heart rate variability: An ecological perspective. Environmental Pollution, 183, 7–13.CrossRef
Zurück zum Zitat Schuller, B., Friedmann, F., & Eyben, F. (2013). Automatic recognition of physiological parameters in the human voice: heart rate and skin conductance. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (pp. 7219–7223) Vancouver, BC, Canada. Schuller, B., Friedmann, F., & Eyben, F. (2013). Automatic recognition of physiological parameters in the human voice: heart rate and skin conductance. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (pp. 7219–7223) Vancouver, BC, Canada.
Zurück zum Zitat Schuller, B., Friedmann, F., & Eyben, F. (2014). The Munich biovoice corpus: effects of physical exercising, heart rate and skin conductance on human speech production. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, (pp. 1506–1510) Reykjavik, Iceland. Schuller, B., Friedmann, F., & Eyben, F. (2014). The Munich biovoice corpus: effects of physical exercising, heart rate and skin conductance on human speech production. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, (pp. 1506–1510) Reykjavik, Iceland.
Zurück zum Zitat Seraganian, P., Szabob, A., & Brown, T. G. (1997). The effect of vocalization on the heart rate response to mental arithmetic. Physiology & Behavior, 62(2), 221–224.CrossRef Seraganian, P., Szabob, A., & Brown, T. G. (1997). The effect of vocalization on the heart rate response to mental arithmetic. Physiology & Behavior, 62(2), 221–224.CrossRef
Zurück zum Zitat Smith, J., Tsiartas, A., Shriberg, E., Kathol, A., Willoughby, A., & Zambotti, M. D. (2017). Analysis and prediction of heart rate using speech features from natural speech. In IEEE International Conference in Acoustics, Speech and Signal Processing, (pp. 989–993) New Orleans, LA, USA. Smith, J., Tsiartas, A., Shriberg, E., Kathol, A., Willoughby, A., & Zambotti, M. D. (2017). Analysis and prediction of heart rate using speech features from natural speech. In IEEE International Conference in Acoustics, Speech and Signal Processing, (pp. 989–993) New Orleans, LA, USA.
Zurück zum Zitat Tsiartas, A., Kathol, A., Shriberg, E., Zambotti, M. D., & Willoughby, A. (2015). Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system. In Proceedings of Interspeech 2015, (pp. 3175–3179) Dresden, Germany. Tsiartas, A., Kathol, A., Shriberg, E., Zambotti, M. D., & Willoughby, A. (2015). Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system. In Proceedings of Interspeech 2015, (pp. 3175–3179) Dresden, Germany.
Zurück zum Zitat Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.MATH Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.MATH
Metadaten
Titel
Tamil and English speech database for heartbeat estimation
verfasst von
A. Milton
K. Anish Monsely
Publikationsdatum
25.09.2018
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 4/2018
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-018-9557-y

Weitere Artikel der Ausgabe 4/2018

International Journal of Speech Technology 4/2018 Zur Ausgabe

Neuer Inhalt