Top

International Journal of Speech Technology

Published in:

25-03-2016

Performance of speaker identification using CSM and TM

Authors: R. Visalakshi, P. Dhanalakshmi

Published in: International Journal of Speech Technology | Issue 3/2016

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

The main objective of this paper is to develop the system of speaker identification. Speaker identification is a technology that allows a computer to automatically identify the person who is speaking, based on the information received from speech signal. One of the most difficult problems in speaker recognition is dealing with noises. The performance of speaker recognition using close speaking microphone (CSM) is affected in background noises. To overcome this problem throat microphone (TM) which has a transducer held at the throat resulting in a clean signal and unaffected by background noises is used. Acoustic features namely linear prediction coefficients, linear prediction cepstral coefficients, Mel frequency cepstral coefficients and relative spectral transform-perceptual linear prediction are extracted. These features are classified using RBFNN and AANN and their performance is analyzed. A new method was proposed for identification of speakers in clean and noisy using combined CSM and TM. The identification performance of the combined system is increased than individual system due to complementary nature of CSM and TM.

previous article Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson’s disease and healthy people

next article Performance of speaker localization using microphone array

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Chauhan, T., Soni, H., & Zafar, S. (2013). A review of automatic speaker recognition system. International Journal of Soft Computing and Engineering, 3, 132–135.

Dhanalakshmi, P., Palanivel, S., & Ramalingam, V. (2011). Classification of audio signals using aann and gmm. Applied Soft Computing, 11(10), 716–723.CrossRef

Erzin, E. (2009). Improved throat microphone speech recognition by joint analysis of throat and acoustic microphone recordings. IEEE, 17(7), 1558–7916.

Gbadamosi, L. (2013). Text independent biometric speaker recognition system. International Journal of Research in Computer Science, 3, 9–15.CrossRef

Haykin, S. (2001). Neural networks: A comprehensive foundation. Singapore: Pearson Education.MATH

Hermansky, H. (1990). Perceptual linear predictive (plp) analysis for speech. The Journal of the Acoustical Society of America, 87(4), 1738–1752.CrossRef

Kinnunen, T., & Li, H. (2010). An overview of text-independent speaker recognition: From features to supervectors. Speech Communication, 52, 12–40.CrossRef

Krishnamoorthy, P., Jayanna, H. S., & Prasanna, S. R. M. (2011). Speaker recognition under limited data condition by noise addition. Expert System with Applications, 38(10), 13487–13490.CrossRef

Kumar, P., Jakhanwal, N., & Chandra, M. (2011). Text dependent speaker identification in noisy environment. In IEEE international conference on device and communication (pp. 1–4).

Mubeen, N., Shahina, A., Nayeemulla Khan, A., & Vinoth, G. (2012). Combining spectral features of standard and throat microphones for speaker identification. In IEEE ICRTIT (pp. 119–122), Chennai, Tamil Nadu.

Nath, D., & Kalita, S. K. (2015). Composite feature selection method based on spoken word and speaker recognition. International Journal of Computer Applications, 121(8), 18–23.CrossRef

Nigade, Anuradha S., & Chitode, J. S. (2012). Throat microphone signals for isolated word recognition using LPC. International Journal of Advanced Research in Computer Science and Software Engineering, 2(8), 401–407.

Palanivel, S. (2004). Person authentication using speech, face and visual speech, Ph.D. Thesis, IIT, Madras.

Patel, J. K., & Nandurbarkar, A. (2015). Development and implementation of algorithm for speaker recognition for Gujarati language. International Research Journal of Engineering and Technology, 2(2), 444–448.

Rabiner, L., & Schafer, R. W. (2005). Digital processing of speech signals. Upper Saddle River, NJ: Pearson Education.

Sadic, S., & Bilginer Gulmezoglu, M. (2011). Common vector approach and its combination with GMM for text independent speaker recognition. Expert System with Applications, 38(9), 11394–11400.CrossRef

Shahina, A., Yegnanarayanan, B., & Kesheorey, M. R. (2004). Throat microphone signal for speaker recognition. In Proceedings of the international conference on spoken language processing.

Shaughnessy, D. O. (1986). Speaker recognition. In IEEE international conference on acoustics, speech, signal processing (Vol. 3, pp. 4–17).

Sumithra, M. G., Thanuskodi, K., & Archana, A. H. J. (2011). A new speaker recognition system with combined feature extraction techniques. Journal of Computer Scence, 3, 459–465.CrossRef

Wali, S. S., Hatture, S. M., & Nandyal, S. (2015). MFCC based text-dependent speaker identification using BPNN. International Journal of Signal Processing Systems, 3(1), 30–34.

Xu, C., Maddage, N. C., & Shao, X. (2005). Automatic music classification and summarization. IEEE Transactions on Speech and Audio Processing, 13, 441–450.CrossRef

Yujin, Y., Peihua, Z., & Qun, Z. (2010). Research of speaker recognition based on combination of LPCC and MFCC. In 2010 IEEE international conference on intelligent computing and intelligent systems (ICIS).

Zhu, L., & Yang, Q. (2012). Speaker recognition system based on weighted feature parameter. In International conference on solid state devices and materials science (pp. 1515–1522), Macao.

Title: Performance of speaker identification using CSM and TM
Authors: R. Visalakshi
P. Dhanalakshmi
Publication date: 25-03-2016
Publisher: Springer US
Published in: International Journal of Speech Technology / Issue 3/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI: https://doi.org/10.1007/s10772-016-9339-3

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 3/2016

Simultaneous speech coding and de-noising in a dictionary based quantized CS framework

Speech transmission with COFDM based on different discrete transforms

Arabic speech synthesis and diacritic recognition

Wavelet energy based voice activity detection and adaptive thresholding for efficient speech coding

Performance of speaker localization using microphone array

Automatic genre classification of Indian Tamil and western music using fractional MFCC