Skip to main content

2016 | OriginalPaper | Buchkapitel

Deep Learning 3D Shape Surfaces Using Geometry Images

verfasst von : Ayan Sinha, Jing Bai, Karthik Ramani

Erschienen in: Computer Vision – ECCV 2016

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Surfaces serve as a natural parametrization to 3D shapes. Learning surfaces using convolutional neural networks (CNNs) is a challenging task. Current paradigms to tackle this challenge are to either adapt the convolutional filters to operate on surfaces, learn spectral descriptors defined by the Laplace-Beltrami operator, or to drop surfaces altogether in lieu of voxelized inputs. Here we adopt an approach of converting the 3D shape into a ‘geometry image’ so that standard CNNs can directly be used to learn 3D shapes. We qualitatively and quantitatively validate that creating geometry images using authalic parametrization on a spherical domain is suitable for robust learning of 3D shape surfaces. This spherically parameterized shape is then projected and cut to convert the original 3D shape into a flat and regular geometry image. We propose a way to implicitly learn the topology and structure of 3D shapes using geometry images encoded with suitable features. We show the efficacy of our approach to learn 3D shape surfaces for classification and retrieval tasks on non-rigid and rigid shape datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
Note we do not report the scores for SG on Mcgill because the author provided implementation failed on several shapes and produced spurious results.
 
Literatur
1.
Zurück zum Zitat Boscaini, D., Masci, J., Melzi, S., Bronstein, M.M., Castellani, U., Vandergheynst, P.: Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks. Comput. Graph. Forum 34, 13–23 (2015)CrossRef Boscaini, D., Masci, J., Melzi, S., Bronstein, M.M., Castellani, U., Vandergheynst, P.: Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks. Comput. Graph. Forum 34, 13–23 (2015)CrossRef
2.
Zurück zum Zitat Bronstein, A.M., Bronstein, M.M., Guibas, L.J., Ovsjanikov, M.: Shape google: geometric words and expressions for invariant shape retrieval. ACM Trans. Graph. (TOG) 30(1), 1 (2011)CrossRef Bronstein, A.M., Bronstein, M.M., Guibas, L.J., Ovsjanikov, M.: Shape google: geometric words and expressions for invariant shape retrieval. ACM Trans. Graph. (TOG) 30(1), 1 (2011)CrossRef
3.
Zurück zum Zitat Chen, D.-Y., Tian, X.-P., Shen, Y.-T., Ouhyoung, M.: On visual similarity based 3D model retrieval. Comput. Graph. Forum 22(3), 223–232 (2003)CrossRef Chen, D.-Y., Tian, X.-P., Shen, Y.-T., Ouhyoung, M.: On visual similarity based 3D model retrieval. Comput. Graph. Forum 22(3), 223–232 (2003)CrossRef
4.
Zurück zum Zitat Desbrun, M., Meyer, M., Alliez, P.: Intrinsic parameterizations of surface meshes. Comput. Graph. Forum 21, (2002) Desbrun, M., Meyer, M., Alliez, P.: Intrinsic parameterizations of surface meshes. Comput. Graph. Forum 21, (2002)
5.
Zurück zum Zitat Dominitz, A., Tannenbaum, A.: Texture mapping via optimal mass transport. IEEE Trans. Vis. Comput. Graph. 16(3), 419–433 (2010)CrossRef Dominitz, A., Tannenbaum, A.: Texture mapping via optimal mass transport. IEEE Trans. Vis. Comput. Graph. 16(3), 419–433 (2010)CrossRef
6.
Zurück zum Zitat Fang, Y., Xie, J., Dai, G., Wang, M., Zhu, F., Xu, T., Wong, E.: 3D deep shape descriptor. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2319–2328 (2015) Fang, Y., Xie, J., Dai, G., Wang, M., Zhu, F., Xu, T., Wong, E.: 3D deep shape descriptor. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2319–2328 (2015)
7.
Zurück zum Zitat Floater, M.S., Hormann, K.: Surface parameterization: a tutorial and survey. Advances in Multiresolution for Geometric Modelling, pp. 157–186. Springer, Heidelberg (2005)CrossRef Floater, M.S., Hormann, K.: Surface parameterization: a tutorial and survey. Advances in Multiresolution for Geometric Modelling, pp. 157–186. Springer, Heidelberg (2005)CrossRef
8.
Zurück zum Zitat Friedel, I., Schröder, P., Desbrun, M.: Unconstrained spherical parameterization. J. Graph. Tools 12(1), 17–26 (2007)CrossRef Friedel, I., Schröder, P., Desbrun, M.: Unconstrained spherical parameterization. J. Graph. Tools 12(1), 17–26 (2007)CrossRef
9.
Zurück zum Zitat Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587. IEEE (2014) Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587. IEEE (2014)
10.
Zurück zum Zitat Gotsman, C., Gu, X., Sheffer, A.: Fundamentals of spherical parameterization for 3D meshes. In: Proceedings of the 2006 Symposium on Interactive 3D Graphics and Games, 14–17 March 2006, pp. 28–29 (2003) Gotsman, C., Gu, X., Sheffer, A.: Fundamentals of spherical parameterization for 3D meshes. In: Proceedings of the 2006 Symposium on Interactive 3D Graphics and Games, 14–17 March 2006, pp. 28–29 (2003)
11.
Zurück zum Zitat Gu, X., Gortler, S.J., Hoppe, H.: Geometry images. In: Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH 2002, pp. 355–361. ACM, New York (2002) Gu, X., Gortler, S.J., Hoppe, H.: Geometry images. In: Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH 2002, pp. 355–361. ACM, New York (2002)
12.
Zurück zum Zitat Gu, X., et al.: Genus zero surface conformal mapping and its application to brain surface mapping. IEEE Trans. Medical Imaging (2003) Gu, X., et al.: Genus zero surface conformal mapping and its application to brain surface mapping. IEEE Trans. Medical Imaging (2003)
13.
Zurück zum Zitat Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-scale video classification with convolutional neural networks. In: Proceedings of International Computer Vision and Pattern Recognition (CVPR 2014) (2014) Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Large-scale video classification with convolutional neural networks. In: Proceedings of International Computer Vision and Pattern Recognition (CVPR 2014) (2014)
14.
Zurück zum Zitat Kazhdan, M., Bolitho, M., Hoppe, H.: Poisson surface reconstruction. In: Proceedings of the Fourth Eurographics Symposium on Geometry Processing, SGP 2006, pp. 61–70. Eurographics Association, Aire-la-Ville (2006) Kazhdan, M., Bolitho, M., Hoppe, H.: Poisson surface reconstruction. In: Proceedings of the Fourth Eurographics Symposium on Geometry Processing, SGP 2006, pp. 61–70. Eurographics Association, Aire-la-Ville (2006)
15.
Zurück zum Zitat Kazhdan, M., Funkhouser, T., Rusinkiewicz, S.: Rotation invariant spherical harmonic representation of 3D shape descriptors, June 2003 Kazhdan, M., Funkhouser, T., Rusinkiewicz, S.: Rotation invariant spherical harmonic representation of 3D shape descriptors, June 2003
16.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C., Bottou, L., Weinberger, K. (eds.) Advances in Neural Information Processing Systems, vol. 25, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C., Bottou, L., Weinberger, K. (eds.) Advances in Neural Information Processing Systems, vol. 25, pp. 1097–1105 (2012)
17.
Zurück zum Zitat Laga, H.T., Schreck, A., Ferreira, A., Godil, I.P., Meshes, W., Lian, Z., Godil, A., Bustos, B., Daoudi, M., Hermans, J., Kawamura, S., Kurita, Y., Lavou, G., Nguyen, H.V., Ohbuchi, R., Ohkita, Y., Ohishi, Y., Porikli, F., Reuter, M., Sipiran, I., Smeets, D., Suetens, P., Tabia, H.: SHREC 2011 Track: shape retrieval on non-rigid 3D (2011) Laga, H.T., Schreck, A., Ferreira, A., Godil, I.P., Meshes, W., Lian, Z., Godil, A., Bustos, B., Daoudi, M., Hermans, J., Kawamura, S., Kurita, Y., Lavou, G., Nguyen, H.V., Ohbuchi, R., Ohkita, Y., Ohishi, Y., Porikli, F., Reuter, M., Sipiran, I., Smeets, D., Suetens, P., Tabia, H.: SHREC 2011 Track: shape retrieval on non-rigid 3D (2011)
18.
Zurück zum Zitat Lévy, B., Petitjean, S., Ray, N., Maillot, J.: Least squares conformal maps for automatic texture atlas generation. ACM Trans. Graph. 21(3), 362–371 (2002)CrossRef Lévy, B., Petitjean, S., Ray, N., Maillot, J.: Least squares conformal maps for automatic texture atlas generation. ACM Trans. Graph. 21(3), 362–371 (2002)CrossRef
19.
Zurück zum Zitat Masci, J., Boscaini, D., Bronstein, M.M., Vandergheynst, P.: Shapenet: Convolutional neural networks on non-euclidean manifolds. arXiv preprint arXiv:1501.06297 (2015) Masci, J., Boscaini, D., Bronstein, M.M., Vandergheynst, P.: Shapenet: Convolutional neural networks on non-euclidean manifolds. arXiv preprint arXiv:​1501.​06297 (2015)
20.
Zurück zum Zitat Maturana, D., Scherer, S.: Voxnet: a 3D convolutional neural network for real-time object recognition. In: Signal Processing Letters (2015) Maturana, D., Scherer, S.: Voxnet: a 3D convolutional neural network for real-time object recognition. In: Signal Processing Letters (2015)
21.
Zurück zum Zitat Novotni, M., Klein, R.: Shape retrieval using 3D zernike descriptors. Comput. Aided Design 36, 1047–1062 (2004)CrossRef Novotni, M., Klein, R.: Shape retrieval using 3D zernike descriptors. Comput. Aided Design 36, 1047–1062 (2004)CrossRef
22.
Zurück zum Zitat Praun, E., Hoppe, H.: Spherical parametrization and remeshing. In: ACM Transactions on Graphics (TOG), vol. 22, pp. 340–349. ACM (2003) Praun, E., Hoppe, H.: Spherical parametrization and remeshing. In: ACM Transactions on Graphics (TOG), vol. 22, pp. 340–349. ACM (2003)
23.
Zurück zum Zitat Rustamov, R.M.: Laplace-beltrami eigen functions for deformation invariant shape representation. Proceedings of the Fifth Eurographics Symposium on Geometry Processing, SGP 2007, pp. 225–233, Aire-la-Ville (2007) Rustamov, R.M.: Laplace-beltrami eigen functions for deformation invariant shape representation. Proceedings of the Fifth Eurographics Symposium on Geometry Processing, SGP 2007, pp. 225–233, Aire-la-Ville (2007)
24.
Zurück zum Zitat Shen, L., Makedon, F.: Spherical mapping for processing of 3-D closed surfaces. In: Image and Vision Computing (2006) Shen, L., Makedon, F.: Spherical mapping for processing of 3-D closed surfaces. In: Image and Vision Computing (2006)
25.
Zurück zum Zitat Shen, L., Makedon, F.: Spherical mapping for processing of 3D closed surfaces. Image Vis. Comput. 24(7), 743–761 (2006)CrossRef Shen, L., Makedon, F.: Spherical mapping for processing of 3D closed surfaces. Image Vis. Comput. 24(7), 743–761 (2006)CrossRef
26.
Zurück zum Zitat Shi, B., Bai, S., Zhou, Z., Bai, X.: DeepPano: deep panoramic representation for 3-D shape recognition. IEEE Signal Process. Lett. 22(12), 2339–2343 (2015)CrossRef Shi, B., Bai, S., Zhou, Z., Bai, X.: DeepPano: deep panoramic representation for 3-D shape recognition. IEEE Signal Process. Lett. 22(12), 2339–2343 (2015)CrossRef
27.
Zurück zum Zitat Sinha, A., Choi, C., Ramani, K.: DeepHand: robust hand pose estimation by completing a matrix imputed with deep features. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016 Sinha, A., Choi, C., Ramani, K.: DeepHand: robust hand pose estimation by completing a matrix imputed with deep features. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016
28.
Zurück zum Zitat Sinha, A., Ramani, K.: Multi-scale kernels using random walks. Comput. Graphics Forum 33(1), 164–177 (2014)CrossRef Sinha, A., Ramani, K.: Multi-scale kernels using random walks. Comput. Graphics Forum 33(1), 164–177 (2014)CrossRef
29.
Zurück zum Zitat Solomon, J., de Goes, F., Studios, P.A., Peyré, G., Cuturi, M., Butscher, A., Nguyen, A., Du, T., Guibas, L.: Convolutional wasserstein distances: efficient optimal transportation on geometric domains. ACM Transactions on Graphics (Proceeding SIGGRAPH 2015) (2015) Solomon, J., de Goes, F., Studios, P.A., Peyré, G., Cuturi, M., Butscher, A., Nguyen, A., Du, T., Guibas, L.: Convolutional wasserstein distances: efficient optimal transportation on geometric domains. ACM Transactions on Graphics (Proceeding SIGGRAPH 2015) (2015)
30.
Zurück zum Zitat Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.G.: Multi-view convolutional neural networks for 3D shape recognition. In: Proceeding ICCV (2015) Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.G.: Multi-view convolutional neural networks for 3D shape recognition. In: Proceeding ICCV (2015)
31.
Zurück zum Zitat Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. In: Proceedings of the Symposium on Geometry Processing, SGP 2009, pp. 1383–1392, Aire-la-Ville (2009) Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. In: Proceedings of the Symposium on Geometry Processing, SGP 2009, pp. 1383–1392, Aire-la-Ville (2009)
32.
Zurück zum Zitat Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., Xiao, J.: 3D ShapeNets: a deep representation for volumetric shapes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1912–1920 (2015) Wu, Z., Song, S., Khosla, A., Yu, F., Zhang, L., Tang, X., Xiao, J.: 3D ShapeNets: a deep representation for volumetric shapes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1912–1920 (2015)
33.
Zurück zum Zitat Yu, Y., Zhou, K., Xu, D., Shi, X., Bao, H., Guo, B., Shum, H.-Y.: Mesh editing with poisson-based gradient field manipulation. ACM Trans. Graph. 23(3), 644–651 (2004)CrossRef Yu, Y., Zhou, K., Xu, D., Shi, X., Bao, H., Guo, B., Shum, H.-Y.: Mesh editing with poisson-based gradient field manipulation. ACM Trans. Graph. 23(3), 644–651 (2004)CrossRef
34.
Zurück zum Zitat Zhao, X., Su, Z., Gu, X.D., Kaufman, A., Sun, J., Gao, J., Luo, F.: Area-preservation mapping using optimal mass transport. IEEE Trans. Visual Comput. Graphics 19(12), 2838–2847 (2013)CrossRef Zhao, X., Su, Z., Gu, X.D., Kaufman, A., Sun, J., Gao, J., Luo, F.: Area-preservation mapping using optimal mass transport. IEEE Trans. Visual Comput. Graphics 19(12), 2838–2847 (2013)CrossRef
35.
Zurück zum Zitat Zou, G., Hu, J., Gu, X., Hua, J.: Authalic parameterization of general surfaces using lie advection. IEEE Trans. Visual Comput. Graphics 17(12), 2005–2014 (2011)CrossRef Zou, G., Hu, J., Gu, X., Hua, J.: Authalic parameterization of general surfaces using lie advection. IEEE Trans. Visual Comput. Graphics 17(12), 2005–2014 (2011)CrossRef
Metadaten
Titel
Deep Learning 3D Shape Surfaces Using Geometry Images
verfasst von
Ayan Sinha
Jing Bai
Karthik Ramani
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-46466-4_14