Skip to main content
Top
Published in: Cluster Computing 2/2015

01-06-2015

Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system

Author: Ghulam Muhammad

Published in: Cluster Computing | Issue 2/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Cloud computing brings several advantages such as flexibility, scalability, and ubiquity in terms of data acquisition, data storage, and data transmission. This can help remote healthcare among other applications in a great deal. This paper proposes a cloud based framework for speech enabling healthcare. In the proposed framework, the patients or any healthy person seeking for some medical assistance can send his/her request by speech commands. The commands are managed and processed in the cloud server. Any doctor with proper authentication can receive the request. By analyzing the request, the doctor can assist the patient or the person. This paper also proposes a new feature extraction technique, namely, interlaced derivative pattern (IDP), to automatic speech recognition (ASR) system to be deployed into the cloud server. The IDP exploits the relative Mel-filter bank coefficients along different neighborhood directions from the speech signal. Experimental results show that the proposed IDP-based ASR system performs reasonably well even when the speech is transmitted via smart phones.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Bercovitz, A., Moss, A., Sengupta, M., Park-Lee, E.Y., Jones, A., Harris-Kojetin, L.D., Squillace, M.R.: An overview of home health aides: United States, 2007. National Health Statistics Reports, U.S. Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics, Number 34, May 19, (2011). http://www.cdc.gov/nchs/data/nhsr/nhsr034.pdf Bercovitz, A., Moss, A., Sengupta, M., Park-Lee, E.Y., Jones, A., Harris-Kojetin, L.D., Squillace, M.R.: An overview of home health aides: United States, 2007. National Health Statistics Reports, U.S. Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics, Number 34, May 19, (2011). http://​www.​cdc.​gov/​nchs/​data/​nhsr/​nhsr034.​pdf
3.
go back to reference Kaur, P.D., Chan, I.: Cloud based intelligent system for delivering health care as a service. Comput. Methods Progr. Biomed. 113(1), 346–359 (2014)CrossRef Kaur, P.D., Chan, I.: Cloud based intelligent system for delivering health care as a service. Comput. Methods Progr. Biomed. 113(1), 346–359 (2014)CrossRef
4.
go back to reference Hossain, M.S., Muhammad, G.: Cloud-based collaborative media service framework for health-care. Int. J. Distrib. Sensor Netw. 2014 (2014). doi:10.1155/2014/858712 Hossain, M.S., Muhammad, G.: Cloud-based collaborative media service framework for health-care. Int. J. Distrib. Sensor Netw. 2014 (2014). doi:10.​1155/​2014/​858712
5.
go back to reference Muhammad, G., Masud, M., Alelaiwi, A., Rahman, M.A., Karime, A., Alamri, A., Hossain, M.S.: Spectro-temporal directional derivative based automatic speech recognition for a serious game scenario. Multimed. Tools Appl. (2014). doi:10.1007/s11042-014-1973-7 Muhammad, G., Masud, M., Alelaiwi, A., Rahman, M.A., Karime, A., Alamri, A., Hossain, M.S.: Spectro-temporal directional derivative based automatic speech recognition for a serious game scenario. Multimed. Tools Appl. (2014). doi:10.​1007/​s11042-014-1973-7
6.
go back to reference Glasberg, R., Hartmann, M., Draheim, M., Tamm, G., Hessel, F.: Risks and crises for healthcare providers: the impact of cloud computing. Sci. World J. 2014 (2014) Glasberg, R., Hartmann, M., Draheim, M., Tamm, G., Hessel, F.: Risks and crises for healthcare providers: the impact of cloud computing. Sci. World J. 2014 (2014)
7.
go back to reference Oh, S.Y., Chung, K.Y.: Target speech feature extraction using non-parametric correlation coefficient. Clust. Comput. 17(3), 839–899 (2014) Oh, S.Y., Chung, K.Y.: Target speech feature extraction using non-parametric correlation coefficient. Clust. Comput. 17(3), 839–899 (2014)
8.
go back to reference Lee, D., Lee, H., Park, D., Jeong, Y.-S.: Proxy based seamless connection management method in mobile cloud computing. Clust. Comput. 16(4), 733–744 (2013)CrossRef Lee, D., Lee, H., Park, D., Jeong, Y.-S.: Proxy based seamless connection management method in mobile cloud computing. Clust. Comput. 16(4), 733–744 (2013)CrossRef
9.
go back to reference Jung, E.-Y., Kim, J., Chung, K.-Y., Park, D.K.: Mobile healthcare application with EMR interoperability for diabetes patients. Clust. Comput. 17(3), 871–880 (2014)CrossRef Jung, E.-Y., Kim, J., Chung, K.-Y., Park, D.K.: Mobile healthcare application with EMR interoperability for diabetes patients. Clust. Comput. 17(3), 871–880 (2014)CrossRef
10.
go back to reference Durling, S., Lumsden, J.: Speech recognition use in healthcare applications. In: Proceedings of the 6th International Conference on Advances in Mobile Computing and Multimedia (MoMM ’08), NY, USA, pp. 473–478 (2008) Durling, S., Lumsden, J.: Speech recognition use in healthcare applications. In: Proceedings of the 6th International Conference on Advances in Mobile Computing and Multimedia (MoMM ’08), NY, USA, pp. 473–478 (2008)
11.
go back to reference Hamill, M., Young, V., Boger, J., Mihailidis, A.: Development of an automated speech recognition interface for personal emergency response systems. J. Neuro Eng. Rehabil. 6(26), (2009) Hamill, M., Young, V., Boger, J., Mihailidis, A.: Development of an automated speech recognition interface for personal emergency response systems. J. Neuro Eng. Rehabil. 6(26), (2009)
12.
go back to reference Fagan, M.J., Ell, S.R., Gilbert, J.M., Sarrazin, E., Chapman, P.M.: Development of a (silent) speech recognition system for patients following laryngectomy. Med. Eng. Phys. 30(4), 419–425 (2008)CrossRef Fagan, M.J., Ell, S.R., Gilbert, J.M., Sarrazin, E., Chapman, P.M.: Development of a (silent) speech recognition system for patients following laryngectomy. Med. Eng. Phys. 30(4), 419–425 (2008)CrossRef
13.
go back to reference Muhammad, G., Mesallam, T.A., Malki, K.H., Farahat, M., Alsulaiman, M., Bukhari, M.: Formant analysis in dysphonic patients and automatic Arabic digit speech recognition. BioMed. Eng. OnLine 10, 41 (2011)CrossRef Muhammad, G., Mesallam, T.A., Malki, K.H., Farahat, M., Alsulaiman, M., Bukhari, M.: Formant analysis in dysphonic patients and automatic Arabic digit speech recognition. BioMed. Eng. OnLine 10, 41 (2011)CrossRef
14.
go back to reference Muhammad, G., Melhem, M.: Pathological voice detection and binary classification using MPEG-7 audio features. Biomed. Signal Process. Controls 11, 1–9 (2014)CrossRef Muhammad, G., Melhem, M.: Pathological voice detection and binary classification using MPEG-7 audio features. Biomed. Signal Process. Controls 11, 1–9 (2014)CrossRef
15.
go back to reference Alamri, A., Hassan, M.M., Hossain, M.A., Al-Qurishi, M., Aldukhayyil, Y., Hossain, M.S.: Evaluating the impact of a cloud-based serious game on obese people. Comput. Hum. Behav. 30, 468–475 (2014) Alamri, A., Hassan, M.M., Hossain, M.A., Al-Qurishi, M., Aldukhayyil, Y., Hossain, M.S.: Evaluating the impact of a cloud-based serious game on obese people. Comput. Hum. Behav. 30, 468–475 (2014)
16.
go back to reference Hossain, M.S., Muhammad, G.: Cloud-assisted speech and face recognition framework for health monitoring. Mobile Networks and Applications (MONET) (2015) Hossain, M.S., Muhammad, G.: Cloud-assisted speech and face recognition framework for health monitoring. Mobile Networks and Applications (MONET) (2015)
17.
go back to reference Shobeirinejad, A., Gao, Y.: Gender classification using interlaced derivative patterns. In: Proceedings of the 20th International Conference on Pattern Recognition (ICPR), pp. 1509–1512 (2010) Shobeirinejad, A., Gao, Y.: Gender classification using interlaced derivative patterns. In: Proceedings of the 20th International Conference on Pattern Recognition (ICPR), pp. 1509–1512 (2010)
18.
go back to reference Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006) Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)
19.
go back to reference Shan, S., Chen, J., He, C., Zhao, G., Pietikainen, M., Gao, W.: WLD: a robust local image descriptor. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1–3 (2010) Shan, S., Chen, J., He, C., Zhao, G., Pietikainen, M., Gao, W.: WLD: a robust local image descriptor. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1–3 (2010)
20.
go back to reference Rabiner, L.R., Schafer, R.W.: Introduction to digital speech processing. Found. Trends Signal Process. 1, 1–194 (2007)CrossRef Rabiner, L.R., Schafer, R.W.: Introduction to digital speech processing. Found. Trends Signal Process. 1, 1–194 (2007)CrossRef
21.
go back to reference Nakamura, S., Takeda, K., Yamamoto, K., Yamada, T., Kuroiwa, S., Kitaoka, N., Nishiura, T., Sasou, A., Mizumachi, M., Miyajima, C., Fujimoto, M., Endo, T.: AURORA-2J: an evaluation framework for Japanese noisy speech recognition. In: IEICE Transactions Information and Systems, E88-D(3), pp. 535–544 (2005) Nakamura, S., Takeda, K., Yamamoto, K., Yamada, T., Kuroiwa, S., Kitaoka, N., Nishiura, T., Sasou, A., Mizumachi, M., Miyajima, C., Fujimoto, M., Endo, T.: AURORA-2J: an evaluation framework for Japanese noisy speech recognition. In: IEICE Transactions Information and Systems, E88-D(3), pp. 535–544 (2005)
22.
go back to reference Muhammad, G., Mesallam, T.A., Almalki, K., Farahat, M., Mahmood, A., Alsulaiman, M.: Multi directional regression (MDR) based features for automatic voice disorder detection. J. Voice 26(6), 817.e19–817.e27 (2012)CrossRef Muhammad, G., Mesallam, T.A., Almalki, K., Farahat, M., Mahmood, A., Alsulaiman, M.: Multi directional regression (MDR) based features for automatic voice disorder detection. J. Voice 26(6), 817.e19–817.e27 (2012)CrossRef
23.
go back to reference Hossain, M.S., El Saddik, A.: A biologically-inspired multimedia content repurposing system in heterogeneous network environments. ACM/Springer Multimed. Syst. J. 14(3), 135–143 (2008)CrossRef Hossain, M.S., El Saddik, A.: A biologically-inspired multimedia content repurposing system in heterogeneous network environments. ACM/Springer Multimed. Syst. J. 14(3), 135–143 (2008)CrossRef
24.
go back to reference Hossain, M.S., El Saddik, A.: Scalability Measurement of a proxy based personalized multimedia repurposing system. In: Proceedings of IEEE Instrumentation and Measurement Technology Conference (IEEE-IMTC’06), Sorrento, Italia, (2006) Hossain, M.S., El Saddik, A.: Scalability Measurement of a proxy based personalized multimedia repurposing system. In: Proceedings of IEEE Instrumentation and Measurement Technology Conference (IEEE-IMTC’06), Sorrento, Italia, (2006)
25.
go back to reference Martinez, D., Lleida, E., Ortega, A., Miguel, A., Villalba, J.: Voice pathology detection on the Saarbrucken voice database with calibration and fusion of scores using MultiFocal Toolkit. In: Toledano, D.T. et al. (eds.) IberSpeech 2012, CCIS, vol. 328, pp. 99–109 (2012) Martinez, D., Lleida, E., Ortega, A., Miguel, A., Villalba, J.: Voice pathology detection on the Saarbrucken voice database with calibration and fusion of scores using MultiFocal Toolkit. In: Toledano, D.T. et al. (eds.) IberSpeech 2012, CCIS, vol. 328, pp. 99–109 (2012)
Metadata
Title
Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system
Author
Ghulam Muhammad
Publication date
01-06-2015
Publisher
Springer US
Published in
Cluster Computing / Issue 2/2015
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-015-0439-7

Other articles of this Issue 2/2015

Cluster Computing 2/2015 Go to the issue

Premium Partner