Skip to main content
Top
Published in: Medical & Biological Engineering & Computing 12/2017

27-05-2017 | Original Article

Glottal Gap tracking by a continuous background modeling using inpainting

Authors: Gustavo Andrade-Miranda, Juan Ignacio Godino-Llorente

Published in: Medical & Biological Engineering & Computing | Issue 12/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The visual examination of the vibration patterns of the vocal folds is an essential method to understand the phonation process and diagnose voice disorders. However, a detailed analysis of the phonation based on this technique requires a manual or a semi-automatic segmentation of the glottal area, which is difficult and time consuming. The present work presents a cuasi-automatic framework to accurately segment the glottal area introducing several techniques not explored before in the state of the art. The method takes advantage of the possibility of a minimal user intervention for those cases where the automatic computation fails. The presented method shows a reliable delimitation of the glottal gap, achieving an average improvement of 13 and 18% with respect to two other approaches found in the literature, while reducing the error of wrong detection of total closure instants. Additionally, the results suggest that the set of validation guidelines proposed can be used to standardize the criteria of accuracy and efficiency of the segmentation algorithms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Bohr C, Kräck A, Dubrovskiy D, Eysholdt U, Švec J, Psychogios G, Ziethe A, Döllinger M (2014) Spatiotemporal analysis of high-speed videolaryngoscopic imaging of organic pathologies in males. J Speech Lang Hear Res 57(4):1148–1161CrossRefPubMed Bohr C, Kräck A, Dubrovskiy D, Eysholdt U, Švec J, Psychogios G, Ziethe A, Döllinger M (2014) Spatiotemporal analysis of high-speed videolaryngoscopic imaging of organic pathologies in males. J Speech Lang Hear Res 57(4):1148–1161CrossRefPubMed
2.
go back to reference Voigt D, Döllinger M, Braunschweig T, Yang A, Eysholdt U, Lohscheller J (2010) Classification of functional voice disorders based on phonovibrograms. Artif Intell Med 49(1):51–59CrossRefPubMed Voigt D, Döllinger M, Braunschweig T, Yang A, Eysholdt U, Lohscheller J (2010) Classification of functional voice disorders based on phonovibrograms. Artif Intell Med 49(1):51–59CrossRefPubMed
3.
go back to reference Döllinger M., Lohscheller J, Švec JG, McWhorter A, Kunduk M (2011) Support Vector Machine Classification of Vocal Fold Vibrations Based on Phonovibrogram. Intech Döllinger M., Lohscheller J, Švec JG, McWhorter A, Kunduk M (2011) Support Vector Machine Classification of Vocal Fold Vibrations Based on Phonovibrogram. Intech
4.
go back to reference Unger J, Lohscheller J, Reiter M, Eder K, Betz CS, Schuster M (2015) A noninvasive procedure for early-stage discrimination of malignant and precancerous vocal fold lesions based on laryngeal dynamics analysis. Cancer Res 75(1):31–39CrossRefPubMed Unger J, Lohscheller J, Reiter M, Eder K, Betz CS, Schuster M (2015) A noninvasive procedure for early-stage discrimination of malignant and precancerous vocal fold lesions based on laryngeal dynamics analysis. Cancer Res 75(1):31–39CrossRefPubMed
5.
go back to reference Herbst CT, Lohscheller J, Švec JG, Henrich N, Weissengruber G, Fitch WT (2014) Glottal opening and closing events investigated by electroglottography and super-high-speed video recordings. J Exp Biol 217(6):955–963CrossRefPubMed Herbst CT, Lohscheller J, Švec JG, Henrich N, Weissengruber G, Fitch WT (2014) Glottal opening and closing events investigated by electroglottography and super-high-speed video recordings. J Exp Biol 217(6):955–963CrossRefPubMed
6.
go back to reference Švec JG, Schutte HK (1996) Videokymography: high-speed line scanning of vocal fold vibration. J Voice 10:201–5CrossRefPubMed Švec JG, Schutte HK (1996) Videokymography: high-speed line scanning of vocal fold vibration. J Voice 10:201–5CrossRefPubMed
7.
go back to reference Walker J, Murphy P (2007) Progress in nonlinear speech processing. ch. A Review of Glottal Waveform Analysis. Springer, Berlin, pp 1–21CrossRef Walker J, Murphy P (2007) Progress in nonlinear speech processing. ch. A Review of Glottal Waveform Analysis. Springer, Berlin, pp 1–21CrossRef
8.
go back to reference Lohscheller J, Toy H, Rosanowski F, Eysholdt U, Dollinger M (2007) Clinically evaluated procedure for the reconstruction of vocal fold vibrations from endoscopic digital high-speed videos. IEEE Trans Med Imaging 11(4):400–413 Lohscheller J, Toy H, Rosanowski F, Eysholdt U, Dollinger M (2007) Clinically evaluated procedure for the reconstruction of vocal fold vibrations from endoscopic digital high-speed videos. IEEE Trans Med Imaging 11(4):400–413
9.
go back to reference Karakozoglou S-Z, Nathalie H, D’Alessandro C, Stylianou Y (2011) Automatic glottal segmentation using local-based active contours and application to glottovibrography. Speech Comm 54(5):641–654CrossRef Karakozoglou S-Z, Nathalie H, D’Alessandro C, Stylianou Y (2011) Automatic glottal segmentation using local-based active contours and application to glottovibrography. Speech Comm 54(5):641–654CrossRef
10.
go back to reference Lohscheller J, Eysholdt U (2008) Phonovibrography: mapping high-speed movies of vocal fold vibrations into 2-D diagrams for visualizing and analyzing the underlying laryngeal dynamics. IEEE Trans Med Imaging 27(3):300–309CrossRefPubMed Lohscheller J, Eysholdt U (2008) Phonovibrography: mapping high-speed movies of vocal fold vibrations into 2-D diagrams for visualizing and analyzing the underlying laryngeal dynamics. IEEE Trans Med Imaging 27(3):300–309CrossRefPubMed
11.
go back to reference Yan Y, Du G, Zhu C, Marriott G (2012) Snake based automatic tracing of vocal-fold motion from high-speed digital images. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 593–596 Yan Y, Du G, Zhu C, Marriott G (2012) Snake based automatic tracing of vocal-fold motion from high-speed digital images. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 593–596
12.
go back to reference Skalski A, Zielinki T, Deliyski D (2008) Analysis of vocal folds movement in high speed videoendoscopy based on level set segmentation and image registration. In: International conference on signals and electronic systems, ICSES, pp 223–226 Skalski A, Zielinki T, Deliyski D (2008) Analysis of vocal folds movement in high speed videoendoscopy based on level set segmentation and image registration. In: International conference on signals and electronic systems, ICSES, pp 223–226
13.
go back to reference Mehta DD, Deliyski DD, Quatieri TF, Hillman RE (2013) Automated measurement of vocal fold vibratory asymmetry from high-speed videoendoscopy recordings. J Speech Lang Hear Res 54(1):47–54CrossRef Mehta DD, Deliyski DD, Quatieri TF, Hillman RE (2013) Automated measurement of vocal fold vibratory asymmetry from high-speed videoendoscopy recordings. J Speech Lang Hear Res 54(1):47–54CrossRef
14.
go back to reference Chen J, Gunturk BK, Kunduk M (2013) Glottis segmentation using dynamic programming. In: Proceeding of SPIE, medical imaging, image processing, vol. 8669, pp 86693L Chen J, Gunturk BK, Kunduk M (2013) Glottis segmentation using dynamic programming. In: Proceeding of SPIE, medical imaging, image processing, vol. 8669, pp 86693L
15.
go back to reference Moukalled HJ, Deliyski DD, Schwarz RR, Wang S (2009) Segmentation of laryngeal high-speed videondoscopy in temporal domain using paired active contours. In: Sixth international workshop on models and analysis of vocal emissions for biomedical applications, MAVEBA, pp 137–140 Moukalled HJ, Deliyski DD, Schwarz RR, Wang S (2009) Segmentation of laryngeal high-speed videondoscopy in temporal domain using paired active contours. In: Sixth international workshop on models and analysis of vocal emissions for biomedical applications, MAVEBA, pp 137–140
16.
go back to reference Demeyer J, Dubuisson T, Gosselin B, Remacle M (2009) Glottis segmentation with a high-speed glottography: a fully automatic method. In: 3rd advanced voice function assessment international workshop, pp 113–116 Demeyer J, Dubuisson T, Gosselin B, Remacle M (2009) Glottis segmentation with a high-speed glottography: a fully automatic method. In: 3rd advanced voice function assessment international workshop, pp 113–116
17.
18.
go back to reference Andrade-Miranda G, Godino-Llorente JI, Moro-Velázquez L, Gómez-García JA (2015) An automatic method to detect and track the glottal gap from high speed videoendoscopic images. BioMedical Engineering OnLine 14(1):1–29CrossRef Andrade-Miranda G, Godino-Llorente JI, Moro-Velázquez L, Gómez-García JA (2015) An automatic method to detect and track the glottal gap from high speed videoendoscopic images. BioMedical Engineering OnLine 14(1):1–29CrossRef
19.
go back to reference Lee JS, Kim E, Sung MW, Kim KH, Sung MY, Park KS (2001) A method for assessing the regional vibratory pattern of vocal folds by analysing the video recording of stroboscopy. Med Biol Eng Comput 39(3):273–278CrossRefPubMed Lee JS, Kim E, Sung MW, Kim KH, Sung MY, Park KS (2001) A method for assessing the regional vibratory pattern of vocal folds by analysing the video recording of stroboscopy. Med Biol Eng Comput 39(3):273–278CrossRefPubMed
20.
go back to reference Osma-Ruiz V, Godino-Llorente JI, Sáenz-Lechón N, Fraile R (2008) Segmentation of the glottal space from laryngeal images using the watershed transform. Comput Med Imaging Graph 32:193–201CrossRefPubMed Osma-Ruiz V, Godino-Llorente JI, Sáenz-Lechón N, Fraile R (2008) Segmentation of the glottal space from laryngeal images using the watershed transform. Comput Med Imaging Graph 32:193–201CrossRefPubMed
21.
go back to reference Gloger O, Lehnert B, Schrade A, Volzke H (2015) Fully automated glottis segmentation in endoscopic videos using local color and shape features of glottal regions. IEEE Trans Biomed Eng 62:795–806CrossRefPubMed Gloger O, Lehnert B, Schrade A, Volzke H (2015) Fully automated glottis segmentation in endoscopic videos using local color and shape features of glottal regions. IEEE Trans Biomed Eng 62:795–806CrossRefPubMed
22.
go back to reference JungHwan O, Sae H, JeongKyu L, Wallapak T, Johnny W, Piet dGC (2007) Informative frame classification for endoscopy video. Med Image Anal 11(2):110–127CrossRef JungHwan O, Sae H, JeongKyu L, Wallapak T, Johnny W, Piet dGC (2007) Informative frame classification for endoscopy video. Med Image Anal 11(2):110–127CrossRef
23.
go back to reference Mallick S, Zickler T, Belhumeur P, Kriegman D (2006) Specularity removal in images and videos: a PDE approach. In: Computer Vision â ECCV 2006, vol. 3951 of lecture notes in computer science, pp 550–563. Springer, Berlin Mallick S, Zickler T, Belhumeur P, Kriegman D (2006) Specularity removal in images and videos: a PDE approach. In: Computer Vision â ECCV 2006, vol. 3951 of lecture notes in computer science, pp 550–563. Springer, Berlin
24.
go back to reference Paris S, Kornprobst P, Tumblin J, Durand F (2009) Bilateral filtering: Theory and applications. Foundations and Trends in Computer Graphics and Vision 4(1):1–73CrossRef Paris S, Kornprobst P, Tumblin J, Durand F (2009) Bilateral filtering: Theory and applications. Foundations and Trends in Computer Graphics and Vision 4(1):1–73CrossRef
25.
go back to reference Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. ACM Computing Surveys, vol. 38 Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. ACM Computing Surveys, vol. 38
26.
go back to reference Telea A (2004) An image inpainting technique based on the fast marching method. J Graphics, GPU, Game Tools 9(1):23–34CrossRef Telea A (2004) An image inpainting technique based on the fast marching method. J Graphics, GPU, Game Tools 9(1):23–34CrossRef
27.
go back to reference Ridler TW, Calvard S (1978) Picture thresholding using an iterative selection method. IEEE Trans Syst Man Cybern 8:630–632CrossRef Ridler TW, Calvard S (1978) Picture thresholding using an iterative selection method. IEEE Trans Syst Man Cybern 8:630–632CrossRef
28.
go back to reference Birkholz P (2016) Glottalimageexplorer - an open source tool for glottis segmentation in endoscopic high-speed videos of the vocal folds. In: Jokisch, O (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2016, TUDPress-Dresden Birkholz P (2016) Glottalimageexplorer - an open source tool for glottis segmentation in endoscopic high-speed videos of the vocal folds. In: Jokisch, O (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2016, TUDPress-Dresden
29.
go back to reference Zhang H, Fritts JE, Goldman SA (1996) A survey on evaluation methods for image segmentation. Pattern Recog 29:1335–1346CrossRef Zhang H, Fritts JE, Goldman SA (1996) A survey on evaluation methods for image segmentation. Pattern Recog 29:1335–1346CrossRef
30.
go back to reference Ko T, Ciloglu T (2014) Automatic segmentation of high speed video images of vocal folds. J Appl Math 2014:16 Ko T, Ciloglu T (2014) Automatic segmentation of high speed video images of vocal folds. J Appl Math 2014:16
31.
go back to reference Taha AA, Hanbury A (2015) Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool. BMC Medical Imaging 15(1):1–28CrossRef Taha AA, Hanbury A (2015) Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool. BMC Medical Imaging 15(1):1–28CrossRef
Metadata
Title
Glottal Gap tracking by a continuous background modeling using inpainting
Authors
Gustavo Andrade-Miranda
Juan Ignacio Godino-Llorente
Publication date
27-05-2017
Publisher
Springer Berlin Heidelberg
Published in
Medical & Biological Engineering & Computing / Issue 12/2017
Print ISSN: 0140-0118
Electronic ISSN: 1741-0444
DOI
https://doi.org/10.1007/s11517-017-1652-8

Other articles of this Issue 12/2017

Medical & Biological Engineering & Computing 12/2017 Go to the issue

Premium Partner