Skip to main content

2018 | OriginalPaper | Buchkapitel

Computer-Based Statistical Description of Phonetical Balance for Romanian Utterances

verfasst von : A. Cocioceanu, T. Ivănoaica, A. I. Nicolin, M. C. Raportaru

Erschienen in: ICT Innovations 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Motivated by the advent of security solutions which rely on voice biometrics, we revisit by means of extensive computer-based investigations the concept of phonetical balance for Romanian utterances. We show that the standard distribution of phonems offers only a partial description of the phonetics of the language and that more detailed statistical indicators are needed. To this end, we introduce a simple indicator that measures vowel-consonant (or consonant-vowel) sequences and analyze the distribution of consonant clusters for Romanian words. Our results show that the distribution of consonant clusters is scale-free-like (akin to the distribution of words and phrases in large texts) and that large clusters of vowels or consonants are infrequent. This, in turn, indicates that utterances consisting of words which are statistically unrepresentative with respect to the previous indicators are good candidates for benchmarking the efficency of voice biometrics solutions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bonneau, J., Herley, C., van Oorschot, P.C., Stajano, F.: Passwords and the evolution of imperfect authentication. Commun. ACM 58, 78–87 (2015)CrossRef Bonneau, J., Herley, C., van Oorschot, P.C., Stajano, F.: Passwords and the evolution of imperfect authentication. Commun. ACM 58, 78–87 (2015)CrossRef
2.
Zurück zum Zitat See, for example, Apple’s Touch ID, the biometric fingerprint plus PIN code, the Google 2-Step Verification, and the user telephone number plus user password See, for example, Apple’s Touch ID, the biometric fingerprint plus PIN code, the Google 2-Step Verification, and the user telephone number plus user password
3.
Zurück zum Zitat Lupton, D.: Digital Sociology. Routledge, Abingdon (2014) Lupton, D.: Digital Sociology. Routledge, Abingdon (2014)
4.
Zurück zum Zitat Jain, A.K., Flynn, P., Ross, A.A. (eds.): Hanbook of Biometrics. Springer, New York (2008) Jain, A.K., Flynn, P., Ross, A.A. (eds.): Hanbook of Biometrics. Springer, New York (2008)
5.
Zurück zum Zitat See, in particular, the SpeechXRays project (http://www.speechxrays.com) which “will develop and test in real-life environments a user recognition platform based on voice acoustics analysis and audio-visual identity verification” which uses “text-independent speaker identification (no pass phrase)” and “low sensitivity to surrounding noise” (as retrived June 2016). The testing will be done in Greek and Romanian (2016) See, in particular, the SpeechXRays project (http://​www.​speechxrays.​com) which “will develop and test in real-life environments a user recognition platform based on voice acoustics analysis and audio-visual identity verification” which uses “text-independent speaker identification (no pass phrase)” and “low sensitivity to surrounding noise” (as retrived June 2016). The testing will be done in Greek and Romanian (2016)
7.
Zurück zum Zitat Stanescu, M., Cucu, H., Buzo, A., Burileanu, C.: ASR for low-resourced languages: building a phonetically balanced Romanian speech corpus. In: 20th European Signal Processing Conference (EUSIPCO 2012) (2012). ISSN 2076-1465 Stanescu, M., Cucu, H., Buzo, A., Burileanu, C.: ASR for low-resourced languages: building a phonetically balanced Romanian speech corpus. In: 20th European Signal Processing Conference (EUSIPCO 2012) (2012). ISSN 2076-1465
8.
Zurück zum Zitat Stanescu, M., Buzo, A., Cucu, H., Burileanu, C.: Statistical phonetic analysis of the Romanian language for speech recognition and synthesis tasks. In: 54th International Symposium ELMAR (2012) Stanescu, M., Buzo, A., Cucu, H., Burileanu, C.: Statistical phonetic analysis of the Romanian language for speech recognition and synthesis tasks. In: 54th International Symposium ELMAR (2012)
9.
Zurück zum Zitat The Dexonline dictionary is publicly available at https://dexonline.ro and consists of more than fifty distinct dictionaries (see the complete list at https://dexonline.ro/surse) which have a very broad coverage. The dictionaries go from general-use prescriptive dictionaries and thesauruses to topical and orthographic dictionaries The Dexonline dictionary is publicly available at https://​dexonline.​ro and consists of more than fifty distinct dictionaries (see the complete list at https://​dexonline.​ro/​surse) which have a very broad coverage. The dictionaries go from general-use prescriptive dictionaries and thesauruses to topical and orthographic dictionaries
10.
Zurück zum Zitat Explanatory Dictionary of the Romanian Language (in Romanian), Romanian Academy, “Iorgu Iordan” Liguistic Institute, Editura Univers Enciclopedic, 2nd Edn. (1996) Explanatory Dictionary of the Romanian Language (in Romanian), Romanian Academy, “Iorgu Iordan” Liguistic Institute, Editura Univers Enciclopedic, 2nd Edn. (1996)
11.
Zurück zum Zitat Zipf, G.K.: Human Behavior and the Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Boston (1949) Zipf, G.K.: Human Behavior and the Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Boston (1949)
12.
Zurück zum Zitat Ha, L.Q., Sicilia-Garcia, E.I., Ming, J., Smith, F.J.: Extension of Zipf’s law to words and phrases. In: Proceedings of 19th International Conference on Computational Linguistics - COLING (2012) Ha, L.Q., Sicilia-Garcia, E.I., Ming, J., Smith, F.J.: Extension of Zipf’s law to words and phrases. In: Proceedings of 19th International Conference on Computational Linguistics - COLING (2012)
14.
Zurück zum Zitat i Cancho, R.F., Sole, R.V.: Least effort and the origins of scaling in human language. Proc. Nat. Acad. Sci. 100, 1788–1791 (2003)MathSciNetCrossRefMATH i Cancho, R.F., Sole, R.V.: Least effort and the origins of scaling in human language. Proc. Nat. Acad. Sci. 100, 1788–1791 (2003)MathSciNetCrossRefMATH
15.
Zurück zum Zitat Lin, R., Ma, Q.D.Y., Bian, C.: Scaling laws in human speech, decreasing emergence of new words and a generalized model (2015). arXiv:1412.4846v2 Lin, R., Ma, Q.D.Y., Bian, C.: Scaling laws in human speech, decreasing emergence of new words and a generalized model (2015). arXiv:​1412.​4846v2
16.
Zurück zum Zitat Dinu, M.: Personalitatea Limbii Române, Cartea Românească (1996). (in Romanian) Dinu, M.: Personalitatea Limbii Române, Cartea Românească (1996). (in Romanian)
17.
Zurück zum Zitat Juilland, A., Edwards, P., Juilland, I.: Frequency Dictionary of Romanian Words. Mouton & Comp., Clearwater (1965) Juilland, A., Edwards, P., Juilland, I.: Frequency Dictionary of Romanian Words. Mouton & Comp., Clearwater (1965)
Metadaten
Titel
Computer-Based Statistical Description of Phonetical Balance for Romanian Utterances
verfasst von
A. Cocioceanu
T. Ivănoaica
A. I. Nicolin
M. C. Raportaru
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-68855-8_6

Premium Partner