Skip to main content
Top
Published in: International Journal of Multimedia Information Retrieval 2/2015

01-06-2015 | Regular Paper

Large image modality labeling initiative using semi-supervised and optimized clustering

Authors: Szilárd Vajda, Daekeun You, Sameer Antani, George Thoma

Published in: International Journal of Multimedia Information Retrieval | Issue 2/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Medical image modality detection is a key step for indexing images from biomedical articles. Traditionally, complex supervised classification methods have been used for this. However, they rely on proportionally sized labeled training samples. With the increase in availability of image data it has become increasingly challenging to obtain reasonably accurate manual labels to train classifiers. Toward meeting this shortcoming, we propose a semi-automatic labeling strategy that reduces the human annotator effort. Each image is projected into several feature spaces, and each entry in these spaces is clustered in an unsupervised manner. The cluster centers for each feature representation are then labeled by a human annotator, and the labels propagated through each cluster. To find the optimal cluster numbers for each feature space, a so-called “jump” method is used. The final label of an image is decided by a voting scheme that summarizes the different opinions on the same image provided by the different feature representations. The proposed method is evaluated on ImageCLEFmed2012 data set containing approximately 300,000 images, and showed that annotating \(<\)1 % of the data is sufficient to label correctly 49.95 % of the images. The method spared approximately 700 h of human annotation labor and associated costs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Chatzichristofis SA, Boutalis YS (2008) Cedd: color and edge directivity descriptor: a compact descriptor for image indexing and retrieval. In: Proceedings of the 6th international conference on computer vision systems, ICVS’08Springer. Berlin, Heidelberg, pp 312–322 Chatzichristofis SA, Boutalis YS (2008) Cedd: color and edge directivity descriptor: a compact descriptor for image indexing and retrieval. In: Proceedings of the 6th international conference on computer vision systems, ICVS’08Springer. Berlin, Heidelberg, pp 312–322
3.
go back to reference Fritzke B (1995) A growing neural gas network learns topologies. In: Tesauro G, Touretzky DS, Leen TK (eds) Advances in neural information processing systems, vol 7. MIT Press, Cambridge, pp 625–632 Fritzke B (1995) A growing neural gas network learns topologies. In: Tesauro G, Touretzky DS, Leen TK (eds) Advances in neural information processing systems, vol 7. MIT Press, Cambridge, pp 625–632
4.
go back to reference He J, Tan AH, Tan CL, Sung SY (2003) On quantitative evaluation of clustering systems. Kluwer Academic Publishers, Boston He J, Tan AH, Tan CL, Sung SY (2003) On quantitative evaluation of clustering systems. Kluwer Academic Publishers, Boston
5.
go back to reference Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recogn Lett 31(8):651–666CrossRef Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recogn Lett 31(8):651–666CrossRef
6.
go back to reference Kahn CE, Rubin DL (2009) Automated semantic indexing of figure captions to improve radiology image retrieval. J Am Med Inform Assoc 16:380–386CrossRef Kahn CE, Rubin DL (2009) Automated semantic indexing of figure captions to improve radiology image retrieval. J Am Med Inform Assoc 16:380–386CrossRef
7.
go back to reference Kohonen T, Schroeder MR, Huang TS (eds) (2001) Self-organizing maps, 3rd edn. Springer-Verlag New York Inc, Secaucus Kohonen T, Schroeder MR, Huang TS (eds) (2001) Self-organizing maps, 3rd edn. Springer-Verlag New York Inc, Secaucus
8.
go back to reference Krishnamachari S, Yamada A, Abdel-Mottaleb M, Kasutani E (2000) Multimedia content filtering, browsing, and matching using MPEG-7 compact color descriptors. In: Laurini R (ed) Advances in visual information systems, vol 1929., Lecture notes in computer scienceSpringer, Berlin Heidelberg, pp 200–211CrossRef Krishnamachari S, Yamada A, Abdel-Mottaleb M, Kasutani E (2000) Multimedia content filtering, browsing, and matching using MPEG-7 compact color descriptors. In: Laurini R (ed) Advances in visual information systems, vol 1929., Lecture notes in computer scienceSpringer, Berlin Heidelberg, pp 200–211CrossRef
9.
go back to reference Kuncheva LI (2004) Combining pattern classifiers: methods and algorithms. Wiley-Interscience, New YorkCrossRef Kuncheva LI (2004) Combining pattern classifiers: methods and algorithms. Wiley-Interscience, New YorkCrossRef
10.
go back to reference Li J, Mouchère H, Viard-Gaudin C (2014) An annotation assistance system using an unsupervised codebook composed of handwritten graphical multi-stroke symbols. Pattern Recogn Lett 35:46–57CrossRef Li J, Mouchère H, Viard-Gaudin C (2014) An annotation assistance system using an unsupervised codebook composed of handwritten graphical multi-stroke symbols. Pattern Recogn Lett 35:46–57CrossRef
12.
go back to reference Müller H, de Herrera AGS, Kalpathy-Cramer J, Demner-Fushman D, Antani S, Eggel I (2012) Overview of the ImageCLEF 2012 medical image retrieval and classification tasks Müller H, de Herrera AGS, Kalpathy-Cramer J, Demner-Fushman D, Antani S, Eggel I (2012) Overview of the ImageCLEF 2012 medical image retrieval and classification tasks
13.
go back to reference Müller H, Kalpathy-Cramer J, Demner-Fushman D, Antani S (2012) Creating a classification of image types in the medical literature for visual categorization. In: SPIE medical imaging Müller H, Kalpathy-Cramer J, Demner-Fushman D, Antani S (2012) Creating a classification of image types in the medical literature for visual categorization. In: SPIE medical imaging
14.
go back to reference Park DK, Jeon YS, Won CS (2000) Efficient use of local edge histogram descriptor. In:Proceedings of the 2000 ACM workshops on multimedia., Multimedia ’00ACM, New York, NY, USA, pp 51–54 Park DK, Jeon YS, Won CS (2000) Efficient use of local edge histogram descriptor. In:Proceedings of the 2000 ACM workshops on multimedia., Multimedia ’00ACM, New York, NY, USA, pp 51–54
15.
go back to reference Rahman M, You D, Simpson M, Antani SK, Demner-Fushman D, Thoma GR (2013) Multimodal biomedical image retrieval using hierarchical classification and modality fusion. Int J Multimed Inform Retriev 2(3):159–173CrossRef Rahman M, You D, Simpson M, Antani SK, Demner-Fushman D, Thoma GR (2013) Multimodal biomedical image retrieval using hierarchical classification and modality fusion. Int J Multimed Inform Retriev 2(3):159–173CrossRef
16.
go back to reference Richarz J, Vajda S, Grzeszick R, Fink GA (2014) Semi-supervised learning for character recognition in historical archive documents. Pattern Recogn 47(3):1011–1020CrossRef Richarz J, Vajda S, Grzeszick R, Fink GA (2014) Semi-supervised learning for character recognition in historical archive documents. Pattern Recogn 47(3):1011–1020CrossRef
17.
go back to reference Rokach L (2009) Pattern classification using ensemble methods, series in machine perception and artificial intelligence. World Scientific Publishing Company, Singapore Rokach L (2009) Pattern classification using ensemble methods, series in machine perception and artificial intelligence. World Scientific Publishing Company, Singapore
18.
go back to reference Settles B (2009) Active learning literature survey. Tech. Rep. 1648, University of Wisconsin-Madison Settles B (2009) Active learning literature survey. Tech. Rep. 1648, University of Wisconsin-Madison
19.
go back to reference Simpson MS, Rahman MM, Phadnis S, Apostolova E, Demner-Fushman D, Antani S, Thoma GR (2011) Text and content-based approaches to image modality classification and retrieval for the imageclef 2011 medical retrieval track. In: CLEF (Notebook Papers/Labs/Workshop) Simpson MS, Rahman MM, Phadnis S, Apostolova E, Demner-Fushman D, Antani S, Thoma GR (2011) Text and content-based approaches to image modality classification and retrieval for the imageclef 2011 medical retrieval track. In: CLEF (Notebook Papers/Labs/Workshop)
20.
go back to reference Sugar CA, James GM (2003) Finding the number of clusters in a dataset: an information-theoretic approach. J Am Stat Assoc 98(463):750–763CrossRefMATHMathSciNet Sugar CA, James GM (2003) Finding the number of clusters in a dataset: an information-theoretic approach. J Am Stat Assoc 98(463):750–763CrossRefMATHMathSciNet
21.
go back to reference Toselli AH, Romero V, Pastor M, Vidal E (2010) Multimodal interactive transcription of text images. Pattern Recogn 43(5):1814–1825CrossRefMATH Toselli AH, Romero V, Pastor M, Vidal E (2010) Multimodal interactive transcription of text images. Pattern Recogn 43(5):1814–1825CrossRefMATH
22.
go back to reference Vajda S, Junaidi A, Fink GA (2011) A semi-supervised ensemble learning approach for character labeling with minimal human effort. In: ICDAR, pp 259–263 (2011) Vajda S, Junaidi A, Fink GA (2011) A semi-supervised ensemble learning approach for character labeling with minimal human effort. In: ICDAR, pp 259–263 (2011)
23.
go back to reference You D, Rahman MM, Antani S, Demner-Fushman D, Thoma GR (2013) Text- and content-based biomedical image modality classification. In: Proceedings of SPIE medical imaging, pp 86740L–86740L–8 You D, Rahman MM, Antani S, Demner-Fushman D, Thoma GR (2013) Text- and content-based biomedical image modality classification. In: Proceedings of SPIE medical imaging, pp 86740L–86740L–8
24.
go back to reference Zhou ZH (2009) When semi-supervised learning meets ensemble learning. In: MCS, pp 529–538 (2009) Zhou ZH (2009) When semi-supervised learning meets ensemble learning. In: MCS, pp 529–538 (2009)
Metadata
Title
Large image modality labeling initiative using semi-supervised and optimized clustering
Authors
Szilárd Vajda
Daekeun You
Sameer Antani
George Thoma
Publication date
01-06-2015
Publisher
Springer London
Published in
International Journal of Multimedia Information Retrieval / Issue 2/2015
Print ISSN: 2192-6611
Electronic ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-015-0078-z

Other articles of this Issue 2/2015

International Journal of Multimedia Information Retrieval 2/2015 Go to the issue

Premium Partner