Skip to main content
Log in

Forensic investigation for twin identification from speech: perceptual and gamma-tone features and models

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

To assist an investigation process, forensic experts compare and analyze audio recordings. Speech utterances are compared by humans and/or machines for use in court for investigation. Scientific research community insists for specific automatic or human-based approach to identify uniquxy2e audio features from identical twins group. Filters can be employed to enhance an audio recording for improving clarity. This may entail removal of unnecessary noise to enrich the intelligibility of speech. Forensic audio experts can examine a variety of characteristics of the audio recording to decide the possibility of alterations in the collected evidences. This includes confirming the integrity and authenticating that the content is what it purports to be. Thiswork named as FIST(Forensic Investigation for Twin Identification from Speech: Perceptual and Gamma-tone Features and Models) proposes an automated system to identify a twin from identical twin pairs by the use of gamma-tone features and perceptual features.The proposed features are excerpted from the set of training speeches and templates are created for each twin based on vector quantisation (VQ), Fuzzy C means clustering (FCM) and multivariate hidden Markov modelling (MHMM) techniques. For testing, features are extracted from the set of test utterances and worked out to the templates for classification. Based on the type of classifier used, classification of twin is carried out with minimum distance and maximum loglikelihood value. The proposed features are examinedfor sub-optimal and true success rates as key performance metrics to assess the system and also a comparative analysis is made across the proposed features. Among the inspected features, Gammatone energy features expose better performance in comparison to perceptual features by attaining the overall sub-optimal success rate and true success rate as97.8375% and 92.75% for Gammatone energy features with VQ based modelling technique. This work FIST has also been analysed by inducing disturbance in the form of speech interference from their own twin pairs and Gamma-tone energy feature with VQ based modelling technique performs better for twin identification. A high claim of 99.625% and 95.0625% accuracy has been achieved by employing decision level fusion classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Ariyaeeinia A, Morrison C, Malegaonkar A, Black S (2008) A test of the effectiveness of speaker verification for differentiating between identical twins. Science and Justice 48:182–186

    Article  Google Scholar 

  2. Bolt RH, Cooper FS, Green DM, Hamlet SL, McKnight JG, Pickett JM, Tosi O, Underwood BD, Hogan DL (1979) On the theory and practice of voice identification. Washington, D.C, National Research Council, National Academy of Sciences

    Google Scholar 

  3. Debruyne F, Decoster W, Van Gijsel A, Vercammen J (2002) Speaking fundamental frequency in monozygotic and dizygotic twins. Journal of Voice 4:466–471

    Article  Google Scholar 

  4. Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874

    Article  MathSciNet  Google Scholar 

  5. Feiser HS (2009) Acoustic similarities and differences in the voices of same-sex siblings. 18th Annual Conference of the International Association for Forensic Phonetics and Acoustics (IAFPA). UK, Cambridge

    Google Scholar 

  6. Künzel HJ (2010) Automatic speaker recognition of identical twins. International Journal of Speech, Language and the Law 17(2):251–277

    Google Scholar 

  7. Leng, L., & Teoh, ABJ, “Alignment-free row-co-occurrence cancelable palmprint fuzzy vault”, Pattern Recognition, 48(7), 2290-2303. 2015. Comment 2

  8. Leng L, Teoh ABJ (2015) Dual-key-binding cancelable palmprint cryptosystem for palmprint protection and information security. Journal of Network and Computer Applications 34(6):1979–1989 Comment 2

    Article  Google Scholar 

  9. Leng L, Zhang J, Khan MK, Chen X, Alghathbar K (2010) Dynamic weighted discrimination power analysis: a novel approach for face and palmprint recognition in DCT domain. International Journal of the Physical Sciences 5(17):2543–2554 Comment 1 and comment 3

    Google Scholar 

  10. Leng, Lu, et al. (2011) Two-directional two-dimensional random projection and its variations for face and palmprint recognition. International conference on computational science and its applications. Springer, Berlin, Heidelberg, page no -458-470. Comment 1

  11. Leng L, Teoh ABJ, Li M, Khan MK (2014) A remote cancelable palmprint authentication protocol based on multi-directional two-dimensional PalmPhasor-fusion. Security and Communication Networks 7(11):1860–1871 Comment 2

    Article  Google Scholar 

  12. Leng, L., Li, M., Kim, C., & Bi, X, “Dual-source discrimination power analysis for multi-instance contactless palmprint recognition”, Multimedia Tools and Applications, 76(1), 333-354. 2017. Comment 2

    Article  Google Scholar 

  13. Nolan F, Oh T (1996) Identical twins, different voices. International Journal of Speech Language and the Law 3(1):39–49

    Article  Google Scholar 

  14. Patil HA, Basu TK (2004) Teager energy melcepstrum for identification of twins in Marathi. Proceedings of the IEEE INDICON 2004. First India Annual Conference, 58-61

  15. Przybyla BD, Horii Y, Crawford MH (1992) Vocal fundamental frequency in a twin sample: Looking for a genetic effect. Journal of Voice 6(3):261–266

    Article  Google Scholar 

  16. Rabiner L, Juang BH (1993) Fundamentals of speech recognition, Prentice Hall

  17. Revathi A, Venkataramani Y (2008) Use of perceptual features in iterative clustering based twins identification system. 2008 International Conference on Computing, Communication and Networking, page no-1-6

  18. Revathi A, Chinnadurai R, Venkataramani Y (2007) Effectiveness of LP derived features and DCTC in twins identification- iterative speaker clustering approach. International Conference on Computational Intelligence and Multimedia Applications, page no-535-539

  19. Revathi A, Sasikaladevi N, Nagakrishnan R, Jeyalakshmi C (September 2018) Robust emotion recognition from speech: Gamma tone features and models. International Journal of speech technology 21(3):723–739

    Article  Google Scholar 

  20. Rose P (2002) Forensic speaker identification. Taylor & Francis, London

    Book  Google Scholar 

  21. Ryalls J, Shaw H, Simon M (2004) Voice onset time production in older and younger female monozygotic twins. Folia PhoniatLogopaed 56:165–169

    Article  Google Scholar 

  22. Sabatier SB, Trester MR, Dawson JM (2019) Measurement of the impact of identical twin voices on automatic speaker recognition. International journal on Measurements 134:385–389

    Google Scholar 

  23. San Segundo, E (2012) Glottal source parameters for forensic voice comparison: An approach to voice quality in twins' voices. 21st Annual Conference of the International Association for Forensic Phonetics and Acoustics (IAFPA)

  24. San Segundo E (2013) Guess who is laughing: a perceptual experiment on twin and non-twin siblings’ identification. 31st International Conference AESLA

  25. SanSegundo E, Gómez-Vilda P (2013) Voice biometrical match of twin and non-twin siblings. 8th international Workshow on models and analysis of vocal emissions for biomedical Applications (MAVEBA 2013), page no – 253-256

  26. San Segundo E (2013) A phonetic corpus of Spanish male twins and siblings: Corpus design and forensic application. Proceedia - social and Behavioral sciences 95:59–67

    Article  Google Scholar 

  27. Segundo ES, Kunzel H (2005) Automatic speaker recognition of spanish siblings: (monozygotic and dizygotic) twins and non-twin brothers. Proceedings of the IEEE INDICON 2004, First India Annual Conference 2:1–10

    Google Scholar 

  28. Sun Z, Paulino AA, Feng J, Chai Z, Tieniu, Anil K. Jain (2010) A study of multi biometric traits of identical twins. Proc. SPIE 7667, Biometric Technology for Human Identification VII

  29. Van Gysel WD, Vercammen J, Debruyne (2001) Voice similarity in identical twins. Acta Oto-Rhino-Laryngologica Belg 55:49–55

    Google Scholar 

  30. Van Lierde KM, Vinck B, De Ley S, Clement G, Van Cauwenberge P (2004) Genetics of vocal quality characteristics in monozygotic twins: a multiparameter approach. Journal of voice 19(4):511–518

    Article  Google Scholar 

  31. Zuo D, Mok PPK (2015) Formant dynamics of bilingual identical twins. Journal of Phonetics 52:1–12

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sasikaladevi N..

Ethics declarations

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

This article does not contain any studies with human participants.

Conflict of interest

First author declares that she has no conflict of interest. Second and third authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

A., R., N., S. & K., G. Forensic investigation for twin identification from speech: perceptual and gamma-tone features and models. Multimed Tools Appl 80, 18301–18315 (2021). https://doi.org/10.1007/s11042-021-10639-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-10639-z

Keywords

Navigation