Skip to main content
Top

2018 | OriginalPaper | Chapter

Computer-Based Statistical Description of Phonetical Balance for Romanian Utterances

Authors : A. Cocioceanu, T. Ivănoaica, A. I. Nicolin, M. C. Raportaru

Published in: ICT Innovations 2016

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Motivated by the advent of security solutions which rely on voice biometrics, we revisit by means of extensive computer-based investigations the concept of phonetical balance for Romanian utterances. We show that the standard distribution of phonems offers only a partial description of the phonetics of the language and that more detailed statistical indicators are needed. To this end, we introduce a simple indicator that measures vowel-consonant (or consonant-vowel) sequences and analyze the distribution of consonant clusters for Romanian words. Our results show that the distribution of consonant clusters is scale-free-like (akin to the distribution of words and phrases in large texts) and that large clusters of vowels or consonants are infrequent. This, in turn, indicates that utterances consisting of words which are statistically unrepresentative with respect to the previous indicators are good candidates for benchmarking the efficency of voice biometrics solutions.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bonneau, J., Herley, C., van Oorschot, P.C., Stajano, F.: Passwords and the evolution of imperfect authentication. Commun. ACM 58, 78–87 (2015)CrossRef Bonneau, J., Herley, C., van Oorschot, P.C., Stajano, F.: Passwords and the evolution of imperfect authentication. Commun. ACM 58, 78–87 (2015)CrossRef
2.
go back to reference See, for example, Apple’s Touch ID, the biometric fingerprint plus PIN code, the Google 2-Step Verification, and the user telephone number plus user password See, for example, Apple’s Touch ID, the biometric fingerprint plus PIN code, the Google 2-Step Verification, and the user telephone number plus user password
3.
go back to reference Lupton, D.: Digital Sociology. Routledge, Abingdon (2014) Lupton, D.: Digital Sociology. Routledge, Abingdon (2014)
4.
go back to reference Jain, A.K., Flynn, P., Ross, A.A. (eds.): Hanbook of Biometrics. Springer, New York (2008) Jain, A.K., Flynn, P., Ross, A.A. (eds.): Hanbook of Biometrics. Springer, New York (2008)
5.
go back to reference See, in particular, the SpeechXRays project (http://www.speechxrays.com) which “will develop and test in real-life environments a user recognition platform based on voice acoustics analysis and audio-visual identity verification” which uses “text-independent speaker identification (no pass phrase)” and “low sensitivity to surrounding noise” (as retrived June 2016). The testing will be done in Greek and Romanian (2016) See, in particular, the SpeechXRays project (http://​www.​speechxrays.​com) which “will develop and test in real-life environments a user recognition platform based on voice acoustics analysis and audio-visual identity verification” which uses “text-independent speaker identification (no pass phrase)” and “low sensitivity to surrounding noise” (as retrived June 2016). The testing will be done in Greek and Romanian (2016)
7.
go back to reference Stanescu, M., Cucu, H., Buzo, A., Burileanu, C.: ASR for low-resourced languages: building a phonetically balanced Romanian speech corpus. In: 20th European Signal Processing Conference (EUSIPCO 2012) (2012). ISSN 2076-1465 Stanescu, M., Cucu, H., Buzo, A., Burileanu, C.: ASR for low-resourced languages: building a phonetically balanced Romanian speech corpus. In: 20th European Signal Processing Conference (EUSIPCO 2012) (2012). ISSN 2076-1465
8.
go back to reference Stanescu, M., Buzo, A., Cucu, H., Burileanu, C.: Statistical phonetic analysis of the Romanian language for speech recognition and synthesis tasks. In: 54th International Symposium ELMAR (2012) Stanescu, M., Buzo, A., Cucu, H., Burileanu, C.: Statistical phonetic analysis of the Romanian language for speech recognition and synthesis tasks. In: 54th International Symposium ELMAR (2012)
9.
go back to reference The Dexonline dictionary is publicly available at https://dexonline.ro and consists of more than fifty distinct dictionaries (see the complete list at https://dexonline.ro/surse) which have a very broad coverage. The dictionaries go from general-use prescriptive dictionaries and thesauruses to topical and orthographic dictionaries The Dexonline dictionary is publicly available at https://​dexonline.​ro and consists of more than fifty distinct dictionaries (see the complete list at https://​dexonline.​ro/​surse) which have a very broad coverage. The dictionaries go from general-use prescriptive dictionaries and thesauruses to topical and orthographic dictionaries
10.
go back to reference Explanatory Dictionary of the Romanian Language (in Romanian), Romanian Academy, “Iorgu Iordan” Liguistic Institute, Editura Univers Enciclopedic, 2nd Edn. (1996) Explanatory Dictionary of the Romanian Language (in Romanian), Romanian Academy, “Iorgu Iordan” Liguistic Institute, Editura Univers Enciclopedic, 2nd Edn. (1996)
11.
go back to reference Zipf, G.K.: Human Behavior and the Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Boston (1949) Zipf, G.K.: Human Behavior and the Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Boston (1949)
12.
go back to reference Ha, L.Q., Sicilia-Garcia, E.I., Ming, J., Smith, F.J.: Extension of Zipf’s law to words and phrases. In: Proceedings of 19th International Conference on Computational Linguistics - COLING (2012) Ha, L.Q., Sicilia-Garcia, E.I., Ming, J., Smith, F.J.: Extension of Zipf’s law to words and phrases. In: Proceedings of 19th International Conference on Computational Linguistics - COLING (2012)
14.
go back to reference i Cancho, R.F., Sole, R.V.: Least effort and the origins of scaling in human language. Proc. Nat. Acad. Sci. 100, 1788–1791 (2003)MathSciNetCrossRefMATH i Cancho, R.F., Sole, R.V.: Least effort and the origins of scaling in human language. Proc. Nat. Acad. Sci. 100, 1788–1791 (2003)MathSciNetCrossRefMATH
15.
go back to reference Lin, R., Ma, Q.D.Y., Bian, C.: Scaling laws in human speech, decreasing emergence of new words and a generalized model (2015). arXiv:1412.4846v2 Lin, R., Ma, Q.D.Y., Bian, C.: Scaling laws in human speech, decreasing emergence of new words and a generalized model (2015). arXiv:​1412.​4846v2
16.
go back to reference Dinu, M.: Personalitatea Limbii Române, Cartea Românească (1996). (in Romanian) Dinu, M.: Personalitatea Limbii Române, Cartea Românească (1996). (in Romanian)
17.
go back to reference Juilland, A., Edwards, P., Juilland, I.: Frequency Dictionary of Romanian Words. Mouton & Comp., Clearwater (1965) Juilland, A., Edwards, P., Juilland, I.: Frequency Dictionary of Romanian Words. Mouton & Comp., Clearwater (1965)
Metadata
Title
Computer-Based Statistical Description of Phonetical Balance for Romanian Utterances
Authors
A. Cocioceanu
T. Ivănoaica
A. I. Nicolin
M. C. Raportaru
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-68855-8_6

Premium Partner