Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 3/2016

01.09.2016 | Regular Paper

Deep shape-aware descriptor for nonrigid 3D object retrieval

verfasst von: Hamed Ghodrati, A. Ben Hamza

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 3/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep learning is a rapidly growing discipline that models high-level features in data as multilayered neural networks. In this paper, we propose a deep learning approach for 3D shape retrieval using a multi-level feature learning methodology. We first extract low-level features or local descriptors from a 3D shape using spectral graph wavelets. Then, we construct mid-level features from these local descriptors via the bag-of-features paradigm by employing locality-constrained linear coding as a feature coding method, together with the biharmonic distance as a measure of the spatial relationship between each pair of bag-of-feature descriptors. Finally, high-level shape features are learned via a deep auto-encoder, resulting in a deep shape-aware descriptor that is compact, geometrically informative and efficient to compute. The proposed 3D shape retrieval approach is evaluated on SHREC-2014 and SHREC-2015 datasets through extensive experiments, and the results show compelling superiority of our approach over the state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
ARPACK (ARnoldi PACKage) is a MATLAB library for computing the eigenvalues and eigenvectors of large matrices.
 
Literatur
1.
Zurück zum Zitat Pickup D, Sun X, Rosin P, Martin R, Cheng Z, Lian Z, Aono M, Ben Hamza A, Bronstein A, Bronstein M, Bu S, Castellani U, Cheng S, Garro V, Giachetti A, Godil A, Han J, Johan H, Lai L, Li B, Li C, Li H, Litman R, Liu X, Liu Z, Lu Y, Tatsuma A, Ye J (2014) SHREC’14 track: shape retrieval of non-rigid 3D human models. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval, pp 1–10 Pickup D, Sun X, Rosin P, Martin R, Cheng Z, Lian Z, Aono M, Ben Hamza A, Bronstein A, Bronstein M, Bu S, Castellani U, Cheng S, Garro V, Giachetti A, Godil A, Han J, Johan H, Lai L, Li B, Li C, Li H, Litman R, Liu X, Liu Z, Lu Y, Tatsuma A, Ye J (2014) SHREC’14 track: shape retrieval of non-rigid 3D human models. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval, pp 1–10
2.
Zurück zum Zitat Lian JZZ, Choi S, ElNaghy H, El-Sana J, Furuya T, Giachetti A, Isaia RGL, Lai L, Li C, Li H, Limberger F, Martin R, Nakanishi R, Nonato ANL, Ohbuchi R, Pevzner K, Pickup D, Rosin P, Sharf A, Sun L, Sun X, Tari S, Unal G, Wilson R (2015) SHREC’15 track: non-rigid 3D shape retrieval. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval, pp 1–14 Lian JZZ, Choi S, ElNaghy H, El-Sana J, Furuya T, Giachetti A, Isaia RGL, Lai L, Li C, Li H, Limberger F, Martin R, Nakanishi R, Nonato ANL, Ohbuchi R, Pevzner K, Pickup D, Rosin P, Sharf A, Sun L, Sun X, Tari S, Unal G, Wilson R (2015) SHREC’15 track: non-rigid 3D shape retrieval. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval, pp 1–14
3.
Zurück zum Zitat Rustamov R (2007) Laplace-Beltrami eigenfunctions for deformation invariant shape representation. In: Proc. Symp. Geometry Processing, pp 225–233 Rustamov R (2007) Laplace-Beltrami eigenfunctions for deformation invariant shape representation. In: Proc. Symp. Geometry Processing, pp 225–233
4.
Zurück zum Zitat Sun J, Ovsjanikov M, Guibas L (2009) A concise and provably informative multi-scale signature based on heat diffusion. Comput Graph Forum 28(5):1383–1392CrossRef Sun J, Ovsjanikov M, Guibas L (2009) A concise and provably informative multi-scale signature based on heat diffusion. Comput Graph Forum 28(5):1383–1392CrossRef
5.
Zurück zum Zitat Bronstein M, Kokkinos I (2010) Scale-invariant heat kernel signatures for non-rigid shape recognition. In: Proceedings of the CVPR, pp 1704–1711 Bronstein M, Kokkinos I (2010) Scale-invariant heat kernel signatures for non-rigid shape recognition. In: Proceedings of the CVPR, pp 1704–1711
6.
Zurück zum Zitat Aubry M, Schlickewei U, Cremers D (2011) The wave kernel signature: a quantum mechanical approach to shape analysis. In: Proceedings of the Computational Methods for the Innovative Design of Electrical Devices, pp 1626–1633 Aubry M, Schlickewei U, Cremers D (2011) The wave kernel signature: a quantum mechanical approach to shape analysis. In: Proceedings of the Computational Methods for the Innovative Design of Electrical Devices, pp 1626–1633
7.
Zurück zum Zitat Li C, Ben Hamza A (2013) A multiresolution descriptor for deformable 3D shape retrieval. Visual Comput 29:513–524CrossRef Li C, Ben Hamza A (2013) A multiresolution descriptor for deformable 3D shape retrieval. Visual Comput 29:513–524CrossRef
8.
Zurück zum Zitat Reuter M, Wolter F, Peinecke N (2006) Laplace-Beltrami spectra as ‘Shape-DNA’ of surfaces and solids. Comput Aided Design 38(4):342–366CrossRef Reuter M, Wolter F, Peinecke N (2006) Laplace-Beltrami spectra as ‘Shape-DNA’ of surfaces and solids. Comput Aided Design 38(4):342–366CrossRef
9.
Zurück zum Zitat Chaudhari A, Leahy R, Wise B, Lane N, Badawi R, Joshi A (2014) Global point signature for shape analysis of carpal bones. Phys Med Biol 59:961–973CrossRef Chaudhari A, Leahy R, Wise B, Lane N, Badawi R, Joshi A (2014) Global point signature for shape analysis of carpal bones. Phys Med Biol 59:961–973CrossRef
10.
Zurück zum Zitat Ye J, Yu Y (2015) A fast modal space transform for robust nonrigid shape retrieval. Visual Comput 32(5):553–568MathSciNetCrossRef Ye J, Yu Y (2015) A fast modal space transform for robust nonrigid shape retrieval. Visual Comput 32(5):553–568MathSciNetCrossRef
11.
Zurück zum Zitat Bronstein A, Bronstein M, Guibas L, Ovsjanikov M (2011) Shape google: geometric words and expressions for invariant shape retrieval. ACM Trans Graph 30(1):1–20 Bronstein A, Bronstein M, Guibas L, Ovsjanikov M (2011) Shape google: geometric words and expressions for invariant shape retrieval. ACM Trans Graph 30(1):1–20
12.
Zurück zum Zitat Litman R, Bronstein A, Bronstein M, Castellani U (2014) Supervised learning of bag-of-features shape descriptors using sparse coding. Comput Graph Forum 33(5):127–136CrossRef Litman R, Bronstein A, Bronstein M, Castellani U (2014) Supervised learning of bag-of-features shape descriptors using sparse coding. Comput Graph Forum 33(5):127–136CrossRef
13.
Zurück zum Zitat LeCun Y, Boser B, Denker J, Henderson D, Howard R, Hubbard W, Jackel L (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef LeCun Y, Boser B, Denker J, Henderson D, Howard R, Hubbard W, Jackel L (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef
14.
Zurück zum Zitat Ciresan D, Meier U, Masci J, Gambardella L, Schmidhuber J (2011) Flexible, high performance convolutional neural networks for image classification. In: Proceedings of the IJAC, pp 1237–1242 Ciresan D, Meier U, Masci J, Gambardella L, Schmidhuber J (2011) Flexible, high performance convolutional neural networks for image classification. In: Proceedings of the IJAC, pp 1237–1242
15.
Zurück zum Zitat Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y. OverFeat: integrated recognition, localization and detection using convolutional networks. In: Proceedings of the ICLR Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y. OverFeat: integrated recognition, localization and detection using convolutional networks. In: Proceedings of the ICLR
16.
Zurück zum Zitat Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the CVPR Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the CVPR
17.
Zurück zum Zitat Lee H, Grosse R, Ranganath R, Ng A, Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: Proceedings of the ICML, pp 609–616 Lee H, Grosse R, Ranganath R, Ng A, Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: Proceedings of the ICML, pp 609–616
18.
Zurück zum Zitat Masci J, Meier U, Ciresan D, Schmidhuber J (2011) Stacked convolutional auto-encoders for hierarchical feature extraction. In: Proceedings of the International Conference on Artificial Neural Networks, p 5259 Masci J, Meier U, Ciresan D, Schmidhuber J (2011) Stacked convolutional auto-encoders for hierarchical feature extraction. In: Proceedings of the International Conference on Artificial Neural Networks, p 5259
19.
20.
Zurück zum Zitat Hinton G (2012) A practical guide to training restricted Boltzmann machines. In: Montavon G, Orr GB, Müller K (eds) Neural networks: tricks of the trade. Springer, Berlin, pp 599–619 Hinton G (2012) A practical guide to training restricted Boltzmann machines. In: Montavon G, Orr GB, Müller K (eds) Neural networks: tricks of the trade. Springer, Berlin, pp 599–619
21.
Zurück zum Zitat Xie J, Xu L, Chen E (2012) Image denoising and inpainting with deep neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp 350–358 Xie J, Xu L, Chen E (2012) Image denoising and inpainting with deep neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp 350–358
22.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp 1097–1105 Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp 1097–1105
23.
Zurück zum Zitat Karpathy A, Fei-Fei L (2015) Deep visual-semantic alignments for generating image descriptions. In: Proceedings of the CVPR, pp 3128–3137 Karpathy A, Fei-Fei L (2015) Deep visual-semantic alignments for generating image descriptions. In: Proceedings of the CVPR, pp 3128–3137
24.
Zurück zum Zitat Zhang J, Shan S, Kan M, Chen X (2014) Coarse-to-fine autoencoder networks (CFAN) for real-time face alignment. In: Proceedings of the ECCV, pp 1–16 Zhang J, Shan S, Kan M, Chen X (2014) Coarse-to-fine autoencoder networks (CFAN) for real-time face alignment. In: Proceedings of the ECCV, pp 1–16
25.
Zurück zum Zitat Eslami S, Heess N, Williams C, Winn J (2014) The shape Boltzmann machine: a strong model of object shape. Int J Comput Vis 107(2):155–176MathSciNetCrossRefMATH Eslami S, Heess N, Williams C, Winn J (2014) The shape Boltzmann machine: a strong model of object shape. Int J Comput Vis 107(2):155–176MathSciNetCrossRefMATH
26.
Zurück zum Zitat Wu Z, Song S, Khosla A, Yu F, Zhang L, Tang X, Xiao J (2015) 3D ShapeNets: a deep representation for volumetric shapes. In: Proceedings of the CVPR, pp 1912–1920 Wu Z, Song S, Khosla A, Yu F, Zhang L, Tang X, Xiao J (2015) 3D ShapeNets: a deep representation for volumetric shapes. In: Proceedings of the CVPR, pp 1912–1920
27.
Zurück zum Zitat Su H, Maji S, Kalogerakis E, Learned-Miller E (2015) Multi-view convolutional neural networks for 3D shape recognition. In: Proceedings of the ICCV, pp 945–953 Su H, Maji S, Kalogerakis E, Learned-Miller E (2015) Multi-view convolutional neural networks for 3D shape recognition. In: Proceedings of the ICCV, pp 945–953
28.
Zurück zum Zitat Zhu Z, Wang X, Bai S, Yao C, Bai X (2016) Deep learning representation using autoencoder for 3D shape retrieval. Neurocomputing 204(2016):41–50 Zhu Z, Wang X, Bai S, Yao C, Bai X (2016) Deep learning representation using autoencoder for 3D shape retrieval. Neurocomputing 204(2016):41–50
29.
Zurück zum Zitat Qi C, Su H, Nießner M, Dai A, Yan M, Guibas L (2016) Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the CVPR Qi C, Su H, Nießner M, Dai A, Yan M, Guibas L (2016) Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the CVPR
30.
Zurück zum Zitat Savva M, Yu F, Su H, Aono M, Chen B, Cohen-Or D, Deng W, Su H, Bai S, Bai X, Fish JHN, Kalogerakis E, Learned-Miller E, Li Y, Liao M, Maji S, Wang Y, Zhang N, Zhou Z (2016) SHREC’16 track: large-scale 3D shape retrieval from ShapeNet Core55. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval Savva M, Yu F, Su H, Aono M, Chen B, Cohen-Or D, Deng W, Su H, Bai S, Bai X, Fish JHN, Kalogerakis E, Learned-Miller E, Li Y, Liao M, Maji S, Wang Y, Zhang N, Zhou Z (2016) SHREC’16 track: large-scale 3D shape retrieval from ShapeNet Core55. In: Proceedings of the Eurographics Workshop on 3D Object Retrieval
31.
Zurück zum Zitat Fang Y, Xie J, Dai G, Wang M, Zhu F, Xu T, Wong E (2015) 3D deep shape descriptor. In: Proceedings of the CVPR, pp 2319–2328 Fang Y, Xie J, Dai G, Wang M, Zhu F, Xu T, Wong E (2015) 3D deep shape descriptor. In: Proceedings of the CVPR, pp 2319–2328
32.
Zurück zum Zitat Bu S, Liu Z, Han J, Wu J, Ji R (2014) Learning high-level feature by deep belief networks for 3-D model retrieval and recognition. IEEE Trans Multimed 24(16):2154–2167 Bu S, Liu Z, Han J, Wu J, Ji R (2014) Learning high-level feature by deep belief networks for 3-D model retrieval and recognition. IEEE Trans Multimed 24(16):2154–2167
33.
Zurück zum Zitat Lipman Y, Rustamov R, Funkhouser T (2010) Biharmonic distance. ACM Trans Graph 29(3):1–11CrossRef Lipman Y, Rustamov R, Funkhouser T (2010) Biharmonic distance. ACM Trans Graph 29(3):1–11CrossRef
34.
Zurück zum Zitat Rosenberg S (1997) The Laplacian on a Riemannian manifold. Cambridge University Press Rosenberg S (1997) The Laplacian on a Riemannian manifold. Cambridge University Press
35.
Zurück zum Zitat Meyer M, Desbrun M, Schröder P, Barr A (2003) Discrete differential-geometry operators for triangulated 2-manifolds. Vis Math III 3(7):35–57MathSciNetMATH Meyer M, Desbrun M, Schröder P, Barr A (2003) Discrete differential-geometry operators for triangulated 2-manifolds. Vis Math III 3(7):35–57MathSciNetMATH
36.
Zurück zum Zitat Li C, Ben Hamza A (2013) Intrinsic spatial pyramid matching for deformable 3D shape retrieval. Int J Multimed Inf Retr 2:261–271CrossRef Li C, Ben Hamza A (2013) Intrinsic spatial pyramid matching for deformable 3D shape retrieval. Int J Multimed Inf Retr 2:261–271CrossRef
37.
Zurück zum Zitat Dong W, Li X, Zhang D, Shi G (2010) Sparsity-based image denoising via dictionary learning and structural clustering. In: Proceedings of the CVPR, pp 3360–3367 Dong W, Li X, Zhang D, Shi G (2010) Sparsity-based image denoising via dictionary learning and structural clustering. In: Proceedings of the CVPR, pp 3360–3367
38.
Zurück zum Zitat Ben Hamza A, Krim H (2006) Geodesic matching of triangulated surfaces. IEEE Trans Image Process 15(8):2249–2258CrossRef Ben Hamza A, Krim H (2006) Geodesic matching of triangulated surfaces. IEEE Trans Image Process 15(8):2249–2258CrossRef
39.
Zurück zum Zitat Lian Z, Godil A, Bustos B, Daoudi M, Hermans J, Kawamura S, Kurita Y, Lavoué G, Nguyen H, Ohbuchi R, Ohkita Y, Ohishi Y, Porikli F, Reuter M, Sipiran I, Smeets D, Suetens P, Tabia H, Vandermeulen D (2011) SHREC’11 track: shape retrieval on non-rigid 3D watertight meshes. In: Proceedings of the Eurographics/ACM SIGGRAPH Symposium on 3D Object Retrieval, pp 79–88 Lian Z, Godil A, Bustos B, Daoudi M, Hermans J, Kawamura S, Kurita Y, Lavoué G, Nguyen H, Ohbuchi R, Ohkita Y, Ohishi Y, Porikli F, Reuter M, Sipiran I, Smeets D, Suetens P, Tabia H, Vandermeulen D (2011) SHREC’11 track: shape retrieval on non-rigid 3D watertight meshes. In: Proceedings of the Eurographics/ACM SIGGRAPH Symposium on 3D Object Retrieval, pp 79–88
40.
Zurück zum Zitat Giachetti A, Lovato C (2012) Radial symmetry detection and shape characterization with the multiscale area projection transform. Comput Graph Forum 31(5):1669–1678CrossRef Giachetti A, Lovato C (2012) Radial symmetry detection and shape characterization with the multiscale area projection transform. Comput Graph Forum 31(5):1669–1678CrossRef
41.
Zurück zum Zitat Pickup D, Sun X, Rosin P, Martin R (2015) Geometry and context for semantic correspondences and functionality recognition in manmade 3D shapes. Pattern Recognit 48(8):2500–2512CrossRef Pickup D, Sun X, Rosin P, Martin R (2015) Geometry and context for semantic correspondences and functionality recognition in manmade 3D shapes. Pattern Recognit 48(8):2500–2512CrossRef
42.
Zurück zum Zitat Shilane P, Min P, Kazhdan M, Funkhouser T (2004) The Princeton shape benchmark. In: Proceedings of the SMI, pp 167–178 Shilane P, Min P, Kazhdan M, Funkhouser T (2004) The Princeton shape benchmark. In: Proceedings of the SMI, pp 167–178
Metadaten
Titel
Deep shape-aware descriptor for nonrigid 3D object retrieval
verfasst von
Hamed Ghodrati
A. Ben Hamza
Publikationsdatum
01.09.2016
Verlag
Springer London
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 3/2016
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-016-0103-x

Weitere Artikel der Ausgabe 3/2016

International Journal of Multimedia Information Retrieval 3/2016 Zur Ausgabe