Published in: Multimedia Systems, issue 4/2013

1 July 2013 | Regular Paper

Performance analysis of various local and global shape descriptors for image retrieval

Authors: Chandan Singh, Pooja Sharma

Abstract

In this paper, several prominent local and global descriptors are evaluated against each other to analyze their performance in shape-based image retrieval. The local descriptors include Fourier descriptors, Weber’s local descriptor, local binary patterns, and local ternary patterns. The global descriptors include moment invariants, the generic Fourier descriptor (GFD), the angular radial transform (ART), wavelet moments (WM), and the Zernike moment descriptor (ZMD). In addition, a novel local descriptor is proposed, based on histograms of circular arcs and linear edges detected by means of the Hough transform. The proposed local descriptor provides features that are invariant to geometric transformations and more robust to noise than some existing prominent local descriptors. We also propose improving the performance of the global descriptors GFD, ART, WM, and ZMD by using their phase information, along with their magnitude, in the comparison process. Subsequently, the local and global descriptors with the best retrieval performance are combined to design an effective retrieval system, which enhances retrieval performance substantially further. All descriptors are analyzed in terms of the six principles set by MPEG-7. Detailed experiments, including rotation-invariance and noise tests, are performed on standard benchmark image databases. The experimental results reveal that the proposed fusion of local and global descriptors outperforms the other major descriptors.
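
The abstract's point about phase information can be illustrated with a toy example. The Python sketch below is an illustration only and does not reproduce the authors' method: the function names, the sample coefficient vectors, and the weighted-sum fusion rule are all assumptions made here, and the sketch ignores the phase correction that a rotation-invariant comparison would need. It shows that two sets of complex coefficients with identical magnitudes can still differ once phase is taken into account, and how a local and a global distance might be combined into one score.

```python
import math


def magnitude_distance(a, b):
    """Euclidean distance between coefficient magnitudes only (phase is discarded)."""
    return math.sqrt(sum((abs(x) - abs(y)) ** 2 for x, y in zip(a, b)))


def complex_distance(a, b):
    """Euclidean distance between complex coefficients (uses magnitude and phase)."""
    return math.sqrt(sum(abs(x - y) ** 2 for x, y in zip(a, b)))


def fused_distance(d_local, d_global, w=0.5):
    """Hypothetical weighted-sum fusion of a local and a global distance.

    The weight w is an assumption for illustration, not a value from the paper.
    """
    return w * d_local + (1.0 - w) * d_global


# Hypothetical coefficient vectors: same magnitudes, different phase in the 2nd term.
coeffs_a = [1 + 0j, 0 + 1j]
coeffs_b = [1 + 0j, 0 - 1j]

if __name__ == "__main__":
    print(magnitude_distance(coeffs_a, coeffs_b))  # magnitudes match, so no difference is seen
    print(complex_distance(coeffs_a, coeffs_b))    # the phase difference now contributes
    print(fused_distance(0.2, 0.6, w=0.25))
```

In this toy case the magnitude-only distance cannot separate the two vectors, while the complex distance can, which is the general motivation for keeping phase in the comparison.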

Metadata
Title
Performance analysis of various local and global shape descriptors for image retrieval
Authors
Chandan Singh
Pooja Sharma
Publication date
1 July 2013
Publisher
Springer-Verlag
Published in
Multimedia Systems / Issue 4/2013
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-012-0288-7