Skip to main content
Top
Published in: International Journal of Speech Technology 3/2016

25-03-2016

Performance of speaker identification using CSM and TM

Authors: R. Visalakshi, P. Dhanalakshmi

Published in: International Journal of Speech Technology | Issue 3/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The main objective of this paper is to develop the system of speaker identification. Speaker identification is a technology that allows a computer to automatically identify the person who is speaking, based on the information received from speech signal. One of the most difficult problems in speaker recognition is dealing with noises. The performance of speaker recognition using close speaking microphone (CSM) is affected in background noises. To overcome this problem throat microphone (TM) which has a transducer held at the throat resulting in a clean signal and unaffected by background noises is used. Acoustic features namely linear prediction coefficients, linear prediction cepstral coefficients, Mel frequency cepstral coefficients and relative spectral transform-perceptual linear prediction are extracted. These features are classified using RBFNN and AANN and their performance is analyzed. A new method was proposed for identification of speakers in clean and noisy using combined CSM and TM. The identification performance of the combined system is increased than individual system due to complementary nature of CSM and TM.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Chauhan, T., Soni, H., & Zafar, S. (2013). A review of automatic speaker recognition system. International Journal of Soft Computing and Engineering, 3, 132–135. Chauhan, T., Soni, H., & Zafar, S. (2013). A review of automatic speaker recognition system. International Journal of Soft Computing and Engineering, 3, 132–135.
go back to reference Dhanalakshmi, P., Palanivel, S., & Ramalingam, V. (2011). Classification of audio signals using aann and gmm. Applied Soft Computing, 11(10), 716–723.CrossRef Dhanalakshmi, P., Palanivel, S., & Ramalingam, V. (2011). Classification of audio signals using aann and gmm. Applied Soft Computing, 11(10), 716–723.CrossRef
go back to reference Erzin, E. (2009). Improved throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings. IEEE, 17(7), 1558–7916. Erzin, E. (2009). Improved throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings. IEEE, 17(7), 1558–7916.
go back to reference Gbadamosi, L. (2013). Text independent biometric speaker recognition system. International Journal of Research in Computer Science, 3, 9–15.CrossRef Gbadamosi, L. (2013). Text independent biometric speaker recognition system. International Journal of Research in Computer Science, 3, 9–15.CrossRef
go back to reference Haykin, S. (2001). Neural networks: A comprehensive foundation. Singapore: Pearson Education.MATH Haykin, S. (2001). Neural networks: A comprehensive foundation. Singapore: Pearson Education.MATH
go back to reference Hermansky, H. (1990). Perceptual linear predictive (plp) analysis for speech. The Journal of the Acoustical Society of America, 87(4), 1738–1752.CrossRef Hermansky, H. (1990). Perceptual linear predictive (plp) analysis for speech. The Journal of the Acoustical Society of America, 87(4), 1738–1752.CrossRef
go back to reference Kinnunen, T., & Li, H. (2010). An overview of text-independent speaker recognition: From features to supervectors. Speech Communication, 52, 12–40.CrossRef Kinnunen, T., & Li, H. (2010). An overview of text-independent speaker recognition: From features to supervectors. Speech Communication, 52, 12–40.CrossRef
go back to reference Krishnamoorthy, P., Jayanna, H. S., & Prasanna, S. R. M. (2011). Speaker recognition under limited data condition by noise addition. Expert System with Applications, 38(10), 13487–13490.CrossRef Krishnamoorthy, P., Jayanna, H. S., & Prasanna, S. R. M. (2011). Speaker recognition under limited data condition by noise addition. Expert System with Applications, 38(10), 13487–13490.CrossRef
go back to reference Kumar, P., Jakhanwal, N., & Chandra, M. (2011). Text dependent speaker identification in noisy environment. In IEEE international conference on device and communication (pp. 1–4). Kumar, P., Jakhanwal, N., & Chandra, M. (2011). Text dependent speaker identification in noisy environment. In IEEE international conference on device and communication (pp. 1–4).
go back to reference Mubeen, N., Shahina, A., Nayeemulla Khan, A., & Vinoth, G. (2012). Combining spectral features of standard and throat microphones for speaker identification. In IEEE ICRTIT (pp. 119–122), Chennai, Tamil Nadu. Mubeen, N., Shahina, A., Nayeemulla Khan, A., & Vinoth, G. (2012). Combining spectral features of standard and throat microphones for speaker identification. In IEEE ICRTIT (pp. 119–122), Chennai, Tamil Nadu.
go back to reference Nath, D., & Kalita, S. K. (2015). Composite feature selection method based on spoken word and speaker recognition. International Journal of Computer Applications, 121(8), 18–23.CrossRef Nath, D., & Kalita, S. K. (2015). Composite feature selection method based on spoken word and speaker recognition. International Journal of Computer Applications, 121(8), 18–23.CrossRef
go back to reference Nigade, Anuradha S., & Chitode, J. S. (2012). Throat microphone signals for isolated word recognition using LPC. International Journal of Advanced Research in Computer Science and Software Engineering, 2(8), 401–407. Nigade, Anuradha S., & Chitode, J. S. (2012). Throat microphone signals for isolated word recognition using LPC. International Journal of Advanced Research in Computer Science and Software Engineering, 2(8), 401–407.
go back to reference Palanivel, S. (2004). Person authentication using speech, face and visual speech, Ph.D. Thesis, IIT, Madras. Palanivel, S. (2004). Person authentication using speech, face and visual speech, Ph.D. Thesis, IIT, Madras.
go back to reference Patel, J. K., & Nandurbarkar, A. (2015). Development and implementation of algorithm for speaker recognition for Gujarati language. International Research Journal of Engineering and Technology, 2(2), 444–448. Patel, J. K., & Nandurbarkar, A. (2015). Development and implementation of algorithm for speaker recognition for Gujarati language. International Research Journal of Engineering and Technology, 2(2), 444–448.
go back to reference Rabiner, L., & Schafer, R. W. (2005). Digital processing of speech signals. Upper Saddle River, NJ: Pearson Education. Rabiner, L., & Schafer, R. W. (2005). Digital processing of speech signals. Upper Saddle River, NJ: Pearson Education.
go back to reference Sadic, S., & Bilginer Gulmezoglu, M. (2011). Common vector approach and its combination with GMM for text independent speaker recognition. Expert System with Applications, 38(9), 11394–11400.CrossRef Sadic, S., & Bilginer Gulmezoglu, M. (2011). Common vector approach and its combination with GMM for text independent speaker recognition. Expert System with Applications, 38(9), 11394–11400.CrossRef
go back to reference Shahina, A., Yegnanarayanan, B., & Kesheorey, M. R. (2004). Throat microphone signal for speaker recognition. In Proceedings of the international conference on spoken language processing. Shahina, A., Yegnanarayanan, B., & Kesheorey, M. R. (2004). Throat microphone signal for speaker recognition. In Proceedings of the international conference on spoken language processing.
go back to reference Shaughnessy, D. O. (1986). Speaker recognition. In IEEE international conference on acoustics, speech, signal processing (Vol. 3, pp. 4–17). Shaughnessy, D. O. (1986). Speaker recognition. In IEEE international conference on acoustics, speech, signal processing (Vol. 3, pp. 4–17).
go back to reference Sumithra, M. G., Thanuskodi, K., & Archana, A. H. J. (2011). A new speaker recognition system with combined feature extraction techniques. Journal of Computer Scence, 3, 459–465.CrossRef Sumithra, M. G., Thanuskodi, K., & Archana, A. H. J. (2011). A new speaker recognition system with combined feature extraction techniques. Journal of Computer Scence, 3, 459–465.CrossRef
go back to reference Wali, S. S., Hatture, S. M., & Nandyal, S. (2015). MFCC based text-dependent speaker identification using BPNN. International Journal of Signal Processing Systems, 3(1), 30–34. Wali, S. S., Hatture, S. M., & Nandyal, S. (2015). MFCC based text-dependent speaker identification using BPNN. International Journal of Signal Processing Systems, 3(1), 30–34.
go back to reference Xu, C., Maddage, N. C., & Shao, X. (2005). Automatic music classification and summarization. IEEE Transactions on Speech and Audio Processing, 13, 441–450.CrossRef Xu, C., Maddage, N. C., & Shao, X. (2005). Automatic music classification and summarization. IEEE Transactions on Speech and Audio Processing, 13, 441–450.CrossRef
go back to reference Yujin, Y., Peihua, Z., & Qun, Z. (2010). Research of speaker recognition based on combination of LPCC and MFCC. In 2010 IEEE international conference on intelligent computing and intelligent systems (ICIS). Yujin, Y., Peihua, Z., & Qun, Z. (2010). Research of speaker recognition based on combination of LPCC and MFCC. In 2010 IEEE international conference on intelligent computing and intelligent systems (ICIS).
go back to reference Zhu, L., & Yang, Q. (2012). Speaker recognition system based on weighted feature parameter. In International conference on solid state devices and materials science (pp. 1515–1522), Macao. Zhu, L., & Yang, Q. (2012). Speaker recognition system based on weighted feature parameter. In International conference on solid state devices and materials science (pp. 1515–1522), Macao.
Metadata
Title
Performance of speaker identification using CSM and TM
Authors
R. Visalakshi
P. Dhanalakshmi
Publication date
25-03-2016
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 3/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9339-3

Other articles of this Issue 3/2016

International Journal of Speech Technology 3/2016 Go to the issue