Skip to main content
Top

2019 | OriginalPaper | Chapter

DANTE Speaker Recognition Module. An Efficient and Robust Automatic Speaker Searching Solution for Terrorism-Related Scenarios

Authors : Jesús Jorrín, Luis Buera

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The vast amount of data crossing the net with terrorism-related content, including voice, is so immense that the use of powerful filtering/detection tools with great discriminative capacities becomes essential. Although the analysis of this content often ends with some manual inspection, a first filtering process becomes basic. In this direction, we propose a speaker clustering solution based on a speaker identification system. We show that both the speaker clustering and the speaker recognition solution can be used individually to efficiently solve searching tasks in several terrorism-related scenarios.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Atal, B.S.: Automatic recognition of speakers from their voices. Proc. IEEE 64, 460–475 (1976)CrossRef Atal, B.S.: Automatic recognition of speakers from their voices. Proc. IEEE 64, 460–475 (1976)CrossRef
3.
go back to reference Doddington, G.R.: Speaker recognition-identifying people by their voices. Proc. IEEE 73, 1651–1664 (1985)CrossRef Doddington, G.R.: Speaker recognition-identifying people by their voices. Proc. IEEE 73, 1651–1664 (1985)CrossRef
4.
5.
go back to reference Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)CrossRef Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process. 19(4), 788–798 (2011)CrossRef
6.
go back to reference Castaldo, F., Colibro, D., Dalmasso, E., Laface, P., Vair, C.: Compensation of nuisance factors for speaker and language recognition. IEEE Trans. Audio Speech Lang. Process. 15(7), 1969–1978 (2007)CrossRef Castaldo, F., Colibro, D., Dalmasso, E., Laface, P., Vair, C.: Compensation of nuisance factors for speaker and language recognition. IEEE Trans. Audio Speech Lang. Process. 15(7), 1969–1978 (2007)CrossRef
7.
go back to reference Cumani, S., Laface, P.: Training pairwise support vector machines with large scale datasets. In: 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2014, Florence (Italy), pp. 1664–1668 (2014) Cumani, S., Laface, P.: Training pairwise support vector machines with large scale datasets. In: 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2014, Florence (Italy), pp. 1664–1668 (2014)
8.
go back to reference Cumani, S., Laface, P.: Large scale training of pairwise support vector machines for speaker recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 22(11), 1590–1600 (2014)CrossRef Cumani, S., Laface, P.: Large scale training of pairwise support vector machines for speaker recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 22(11), 1590–1600 (2014)CrossRef
9.
go back to reference Lei, Y., Scheffer, N., Ferrer, L., McLaren, M.: A novel scheme for speaker recognition using a phonetically-aware deep neural network. In: Proceedings of ICASSP 2014, pp. 1714–1718 (2014) Lei, Y., Scheffer, N., Ferrer, L., McLaren, M.: A novel scheme for speaker recognition using a phonetically-aware deep neural network. In: Proceedings of ICASSP 2014, pp. 1714–1718 (2014)
10.
go back to reference Cumani, S., Batzu, P.D., Colibro, D., Vair, C., Laface, P., Vasilakakis, V.: Comparison of speaker recognition approaches for real applications. In: Interspeech 2011, Florence, Italy, pp. 2365–2368 (2011) Cumani, S., Batzu, P.D., Colibro, D., Vair, C., Laface, P., Vasilakakis, V.: Comparison of speaker recognition approaches for real applications. In: Interspeech 2011, Florence, Italy, pp. 2365–2368 (2011)
11.
go back to reference Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press, Cambridge (2008)MATH Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Academic Press, Cambridge (2008)MATH
12.
go back to reference Leeuwen, D.A.V.: Speaker linking in large datasets. In: Odyssey 2010, the Speaker Language and Recognition Workshop, Brno, Czech Republic, pp. 202–208 (2010) Leeuwen, D.A.V.: Speaker linking in large datasets. In: Odyssey 2010, the Speaker Language and Recognition Workshop, Brno, Czech Republic, pp. 202–208 (2010)
13.
go back to reference Jorrín-Prieto, J.., Vaquero, C., García, P.: Analysis of the impact of the audio database characteristics in the accuracy of a speaker clustering system. In: Odyssey 2016, the Speaker Language and Recognition Workshop, Bilbao, Spain, pp. 393–399 (2016) Jorrín-Prieto, J.., Vaquero, C., García, P.: Analysis of the impact of the audio database characteristics in the accuracy of a speaker clustering system. In: Odyssey 2016, the Speaker Language and Recognition Workshop, Bilbao, Spain, pp. 393–399 (2016)
14.
go back to reference Bolle, R.M., Connell, J.H., Pankanti, S., Ratha, N.K., Senior, A.W.: The relation between the ROC curve and the CMC. In: Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID 2005), pp. 15–20 (2005) Bolle, R.M., Connell, J.H., Pankanti, S., Ratha, N.K., Senior, A.W.: The relation between the ROC curve and the CMC. In: Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID 2005), pp. 15–20 (2005)
15.
go back to reference Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to Information Retrieval, vol. 1, 1st edn. Cambridge University Press, Cambridge (2008)CrossRef Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to Information Retrieval, vol. 1, 1st edn. Cambridge University Press, Cambridge (2008)CrossRef
Metadata
Title
DANTE Speaker Recognition Module. An Efficient and Robust Automatic Speaker Searching Solution for Terrorism-Related Scenarios
Authors
Jesús Jorrín
Luis Buera
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-05710-7_58