Skip to main content
Top

2020 | OriginalPaper | Chapter

Automatic Marking of Allophone Boundaries in Isolated English Spoken Words

Authors : Janusz Rafałko, Andrzej Czyżewski

Published in: Computer Information Systems and Industrial Management

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The work presents a method that allows delimiting the borders of allophones in isolated English words. The described method is based on the DTW algorithm combining two signals, a reference signal and an analyzed one. As the reference signal, recordings from the MODALITY database were used, from which the words were extracted. This database was also used for tests, which were described. Test results show that the automatic determination of the allophone limits in English words is possible with good accuracy. Tests have been carried out to determine the error of particular allophones borders marking and to find out the cost of matching the given allophone to the reference one. Based on this cost, a coefficient has been introduced that allows for determining in percentage how much the automatically marked allophone is similar to the reference one. This coefficient can be used for an assessment of the correctness of the pronunciation of the allophone. The possibilities of further research and development of this method were also analyzed.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bellman, R., Kalaba, R.: On adaptive control processes, automatic control. IRE Trans. 4(2), 1–9 (1959)MATH Bellman, R., Kalaba, R.: On adaptive control processes, automatic control. IRE Trans. 4(2), 1–9 (1959)MATH
2.
go back to reference Crystal, D.: English as a Global Language, 2nd edn. Cambridge University Press, Cambridge (2003)CrossRef Crystal, D.: English as a Global Language, 2nd edn. Cambridge University Press, Cambridge (2003)CrossRef
3.
go back to reference Czyżewski, A., Ciszewski, T., Kostek, B.: Methodology and technology for the polymodal allophonic speech transcription. J. Acoust. Soc. Am. 139(4), 2017 (2017)CrossRef Czyżewski, A., Ciszewski, T., Kostek, B.: Methodology and technology for the polymodal allophonic speech transcription. J. Acoust. Soc. Am. 139(4), 2017 (2017)CrossRef
4.
go back to reference Czyżewski, A., Kostek, B., Bratoszewski, P., Kotus, J., Szykulski, M.: An audio-visual corpus for multimodal automatic speech recognition. J. Intell. Inf. Syst. 49(2), 167–192 (2017)CrossRef Czyżewski, A., Kostek, B., Bratoszewski, P., Kotus, J., Szykulski, M.: An audio-visual corpus for multimodal automatic speech recognition. J. Intell. Inf. Syst. 49(2), 167–192 (2017)CrossRef
5.
go back to reference Gafos, A.: The Articulatory Basis of Locality in Phonology. Routledge Taylor & Francis Group, Abingdon (1999) Gafos, A.: The Articulatory Basis of Locality in Phonology. Routledge Taylor & Francis Group, Abingdon (1999)
6.
go back to reference Harris, F.J.: On the use of windows for harmonic analysis with the discrete fourier transform. Proc. IEEE 66(1), 51–84 (1978)CrossRef Harris, F.J.: On the use of windows for harmonic analysis with the discrete fourier transform. Proc. IEEE 66(1), 51–84 (1978)CrossRef
7.
go back to reference Keogh, E.J., Pazzani, M.J.: Derivative dynamic time warping. In: the 1st SIAM International Conference on Data Mining, Chicago, IL, USA (2001) Keogh, E.J., Pazzani, M.J.: Derivative dynamic time warping. In: the 1st SIAM International Conference on Data Mining, Chicago, IL, USA (2001)
8.
go back to reference Kiritani, S., Itoh, K., Hirose, H., Sawashima, M.: Coordination of the consonant and vowel articulations—X-ray microbeam study on Japanese and English. Ann. Bull. Res. Inst. Logoped. Phoniatry 11, 31–37 (1977) Kiritani, S., Itoh, K., Hirose, H., Sawashima, M.: Coordination of the consonant and vowel articulations—X-ray microbeam study on Japanese and English. Ann. Bull. Res. Inst. Logoped. Phoniatry 11, 31–37 (1977)
9.
go back to reference Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007). Part I, chapter 4, Dynamic Time Warping, pp. 69–74CrossRef Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007). Part I, chapter 4, Dynamic Time Warping, pp. 69–74CrossRef
10.
go back to reference Myers, C.S., Rabiner, L.R.: A comparative study of several dynamic time-warping algorithms for connected word recognition. Bell Syst. Tech. J. 60, 1389–1409 (1981)CrossRef Myers, C.S., Rabiner, L.R.: A comparative study of several dynamic time-warping algorithms for connected word recognition. Bell Syst. Tech. J. 60, 1389–1409 (1981)CrossRef
11.
go back to reference Rabiner, L.R., Rosenberg, A., Levinson, S.: Considerations in dynamic time warping algorithms for discrete word recognition. IEEE Trans. Acoust. Speech Signal Process. 26, 575–582 (1978)CrossRef Rabiner, L.R., Rosenberg, A., Levinson, S.: Considerations in dynamic time warping algorithms for discrete word recognition. IEEE Trans. Acoust. Speech Signal Process. 26, 575–582 (1978)CrossRef
12.
15.
go back to reference Salvador, S., Chan, P.: FastDTW: toward accurate dynamic time warping in linear time and space. In: KDD Workshop on Mining Temporal and Sequential Data, pp. 70–80 (2004) Salvador, S., Chan, P.: FastDTW: toward accurate dynamic time warping in linear time and space. In: KDD Workshop on Mining Temporal and Sequential Data, pp. 70–80 (2004)
16.
go back to reference Szpilewski, E., Piórkowska, B., Rafałko, J., Lobanov, B., Kiselov, V., Tsirulnik, L.: Polish TTS in multi-voice slavonic languages speech synthesis system. In: SPECOM’2004 Proceedings, 9th International Conference Speech and Computer, Saint-Petersburg, Russia, pp. 565–570 (2004) Szpilewski, E., Piórkowska, B., Rafałko, J., Lobanov, B., Kiselov, V., Tsirulnik, L.: Polish TTS in multi-voice slavonic languages speech synthesis system. In: SPECOM’2004 Proceedings, 9th International Conference Speech and Computer, Saint-Petersburg, Russia, pp. 565–570 (2004)
Metadata
Title
Automatic Marking of Allophone Boundaries in Isolated English Spoken Words
Authors
Janusz Rafałko
Andrzej Czyżewski
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-47679-3_5

Premium Partner