Skip to main content
Top

2015 | OriginalPaper | Chapter

Speaker Identification Using Semi-supervised Learning

Authors : Nikos Fazakis, Stamatis Karlos, Sotiris Kotsiantis, Kyriakos Sgarbas

Published in: Speech and Computer

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Semi-supervised classification methods use available unlabeled data, along with a small set of labeled examples, to increase the classification accuracy in comparison with training a supervised method using only the labeled data. In this work, a new semi-supervised method for speaker identification is presented. We present a comparison with other well-known semi-supervised and supervised classification methods on benchmark datasets and verify that the presented technique exhibits better accuracy in most cases.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Khaled, D.: Wavelet entropy and neural network for text-independent speaker identification. Engg. Appl. Artif. Intell. 24, 796–802 (2011)CrossRef Khaled, D.: Wavelet entropy and neural network for text-independent speaker identification. Engg. Appl. Artif. Intell. 24, 796–802 (2011)CrossRef
2.
go back to reference Grimaldi, M., Cummins, F.: Speaker identification using instantaneous frequencies. IEEE TASLP 16(6), 1097–1111 (2008) Grimaldi, M., Cummins, F.: Speaker identification using instantaneous frequencies. IEEE TASLP 16(6), 1097–1111 (2008)
3.
go back to reference Manikandan, J., Venkataramani, B.: Design of a real time automatic speech recognition system using modified one against all SVM classifier. Microproc. Microsyst. 35(6), 568–578 (2011)CrossRef Manikandan, J., Venkataramani, B.: Design of a real time automatic speech recognition system using modified one against all SVM classifier. Microproc. Microsyst. 35(6), 568–578 (2011)CrossRef
4.
go back to reference Dileep, A., Chandra, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365–379 (2012)CrossRef Dileep, A., Chandra, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365–379 (2012)CrossRef
5.
go back to reference Friedhelm, S., Edmondo, T.: Pattern classification and clustering: a review of partially supervised learning approaches. Pattern Recogn. Lett. 37, 4–14 (2014)CrossRef Friedhelm, S., Edmondo, T.: Pattern classification and clustering: a review of partially supervised learning approaches. Pattern Recogn. Lett. 37, 4–14 (2014)CrossRef
6.
go back to reference Lan, Y., Hu, Z., Soh, Y.C., Huang, G.-B.: An extreme learning machine approach for speaker recognition. Neural Comput. Appl. 22(3–4), 417–425 (2013)CrossRef Lan, Y., Hu, Z., Soh, Y.C., Huang, G.-B.: An extreme learning machine approach for speaker recognition. Neural Comput. Appl. 22(3–4), 417–425 (2013)CrossRef
7.
go back to reference Zhi-Hua, Z., Li, M.: Tri-training: exploiting unlabeled data using three classifiers. IEEE TKDE 17(11), 1529–1541 (2005) Zhi-Hua, Z., Li, M.: Tri-training: exploiting unlabeled data using three classifiers. IEEE TKDE 17(11), 1529–1541 (2005)
8.
go back to reference Chapelle, O., Schlkopf, B., Zien, A.: Semi-supervised learning. MIT Press, Cambridge (2006)CrossRef Chapelle, O., Schlkopf, B., Zien, A.: Semi-supervised learning. MIT Press, Cambridge (2006)CrossRef
9.
go back to reference Shuang, W., Linsheng, W., Licheng, J., Hongying, L.: Improve the performance of co-training by committee with refinement of class probability estimations. Neurocomputing 136, 30–40 (2014)CrossRef Shuang, W., Linsheng, W., Licheng, J., Hongying, L.: Improve the performance of co-training by committee with refinement of class probability estimations. Neurocomputing 136, 30–40 (2014)CrossRef
10.
go back to reference Xu, J., He, H., Man, H.: DCPE co-training for classification. Neurocomputing 86, 75–85 (2012)CrossRef Xu, J., He, H., Man, H.: DCPE co-training for classification. Neurocomputing 86, 75–85 (2012)CrossRef
11.
go back to reference Li, M., Zhou, Z.: Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE TSMC 37, 1088–1098 (2007) Li, M., Zhou, Z.: Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE TSMC 37, 1088–1098 (2007)
12.
go back to reference Hady, M., Schwenker, F.: Co-training by committee: a new semi-supervised learning framework, In: Proceedings of the IEEE International Conference on Data Mining Workshops, pp. 563–572 (2008) Hady, M., Schwenker, F.: Co-training by committee: a new semi-supervised learning framework, In: Proceedings of the IEEE International Conference on Data Mining Workshops, pp. 563–572 (2008)
13.
go back to reference Zhou, Y., Goldman, S.: Democratic co-learning. In: 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI’04), pp. 594–202 (2004) Zhou, Y., Goldman, S.: Democratic co-learning. In: 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI’04), pp. 594–202 (2004)
14.
go back to reference Hyon, S., Dang, J., Feng, H., Wang, H., Honda, K.: Detection of speaker individual information using a phoneme effect suppression method. Speech Commun. 57, 87–100 (2014)CrossRef Hyon, S., Dang, J., Feng, H., Wang, H., Honda, K.: Detection of speaker individual information using a phoneme effect suppression method. Speech Commun. 57, 87–100 (2014)CrossRef
15.
go back to reference Sun, S.: A survey of multi-view machine learning. Neural Comput. Appl. 23(7–8), 2031–2038 (2013)CrossRef Sun, S.: A survey of multi-view machine learning. Neural Comput. Appl. 23(7–8), 2031–2038 (2013)CrossRef
16.
go back to reference Deng, C., Guo, M.Z.: A new co-training-style random forest for computer aided diagnosis. J. Intell. Inf. Syst. 36, 253–281 (2011)CrossRef Deng, C., Guo, M.Z.: A new co-training-style random forest for computer aided diagnosis. J. Intell. Inf. Syst. 36, 253–281 (2011)CrossRef
17.
go back to reference Wang, J., Luo, S., Zeng, X.: A random subspace method for co-training. In: IEEE International Joint Conference on Computational Intelligence, pp. 195–200 (2008) Wang, J., Luo, S., Zeng, X.: A random subspace method for co-training. In: IEEE International Joint Conference on Computational Intelligence, pp. 195–200 (2008)
18.
go back to reference Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)CrossRef Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)CrossRef
19.
go back to reference Susan, S., Sharma, S.: A fuzzy nearest neighbor classifier for speaker identification. In: 4th International Conference on Computational Intelligence and Communication Networks, CICN 2012, pp. 842–845 (2012) Susan, S., Sharma, S.: A fuzzy nearest neighbor classifier for speaker identification. In: 4th International Conference on Computational Intelligence and Communication Networks, CICN 2012, pp. 842–845 (2012)
20.
go back to reference Jiang, Z., Zhang, S., Zeng, J.: A hybrid generative/discriminative method for semi-supervised classification. Knowl.-Based Syst. 37, 137–145 (2013)CrossRef Jiang, Z., Zhang, S., Zeng, J.: A hybrid generative/discriminative method for semi-supervised classification. Knowl.-Based Syst. 37, 137–145 (2013)CrossRef
21.
go back to reference Didaci, L., Fumera, G., Roli, F.: Analysis of co-training algorithm with very small training sets. In: Gimel’farb, G., Hancock, E., Imiya, A., Kuijper, A., Kudo, M., Omachi, S., Windeatt, T., Yamada, K. (eds.) SSPR &SPR 2012, vol. 7626, pp. 719–726. Springer, Berlin (2012)CrossRef Didaci, L., Fumera, G., Roli, F.: Analysis of co-training algorithm with very small training sets. In: Gimel’farb, G., Hancock, E., Imiya, A., Kuijper, A., Kudo, M., Omachi, S., Windeatt, T., Yamada, K. (eds.) SSPR &SPR 2012, vol. 7626, pp. 719–726. Springer, Berlin (2012)CrossRef
22.
go back to reference Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)CrossRef Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)CrossRef
23.
go back to reference Frank, E., Hall, M., Pfahringer, B.: Locally weighted naive Bayes. In: 19th Conference on Uncertainty in Artificial Intelligence. Mexico (2003) Frank, E., Hall, M., Pfahringer, B.: Locally weighted naive Bayes. In: 19th Conference on Uncertainty in Artificial Intelligence. Mexico (2003)
24.
go back to reference Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 29, 103–130 (1997)CrossRefMATH Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 29, 103–130 (1997)CrossRefMATH
25.
go back to reference Du, J., Ling, C.X., Zhou, Z.-H.: When does cotraining work in real data? IEEE TKDE 23(5), 788–799 (2011) Du, J., Ling, C.X., Zhou, Z.-H.: When does cotraining work in real data? IEEE TKDE 23(5), 788–799 (2011)
26.
go back to reference Goldberg, X.: Introduction to semi-supervised learning. In: Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan Claypool (2009) Goldberg, X.: Introduction to semi-supervised learning. In: Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan Claypool (2009)
27.
go back to reference Bourouba, H., Korba, C.A., Djemili, R.: Novel approach in speaker identification using SVM and GMM. Control Engg. Appl. Inf. 15(3), 87–95 (2013) Bourouba, H., Korba, C.A., Djemili, R.: Novel approach in speaker identification using SVM and GMM. Control Engg. Appl. Inf. 15(3), 87–95 (2013)
28.
go back to reference Pal, A., Bose, S., Basak, G.K., Mukhopadhyay, A.: Speaker identification by aggregating Gaussian mixture models (GMMs) based on uncorrelated MFCC-derived features. Int. J. Pattern Recogn. Artif. Intell. 28(4), 25 (2014) Pal, A., Bose, S., Basak, G.K., Mukhopadhyay, A.: Speaker identification by aggregating Gaussian mixture models (GMMs) based on uncorrelated MFCC-derived features. Int. J. Pattern Recogn. Artif. Intell. 28(4), 25 (2014)
29.
go back to reference Zhao, X., Wang, Y., Wang, D.: Robust speaker identification in noisy and reverberant conditions. IEEE TASLP 22(4), 836–845 (2014) Zhao, X., Wang, Y., Wang, D.: Robust speaker identification in noisy and reverberant conditions. IEEE TASLP 22(4), 836–845 (2014)
30.
go back to reference Alcal-Fdez, J., Fernandez, A., Luengo, J., Derrac, J., Garca, S., Snchez, L.: KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Multi.-Valued Logic Soft Comput. 17(2–3), 255–287 (2011) Alcal-Fdez, J., Fernandez, A., Luengo, J., Derrac, J., Garca, S., Snchez, L.: KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Multi.-Valued Logic Soft Comput. 17(2–3), 255–287 (2011)
31.
go back to reference Triguero, I., Garca, S., Herrera, F.: Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study. Knowl. Inf. Syst. 42(2), 245–284 (2015)CrossRef Triguero, I., Garca, S., Herrera, F.: Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study. Knowl. Inf. Syst. 42(2), 245–284 (2015)CrossRef
32.
go back to reference Namrata, D.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Engg Technol. 1(6), 1–4 (2013) Namrata, D.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Engg Technol. 1(6), 1–4 (2013)
Metadata
Title
Speaker Identification Using Semi-supervised Learning
Authors
Nikos Fazakis
Stamatis Karlos
Sotiris Kotsiantis
Kyriakos Sgarbas
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-23132-7_48

Premium Partner