Skip to main content
Erschienen in:
Buchtitelbild

2018 | OriginalPaper | Buchkapitel

Learning 3D Keypoint Descriptors for Non-rigid Shape Matching

verfasst von : Hanyu Wang, Jianwei Guo, Dong-Ming Yan, Weize Quan, Xiaopeng Zhang

Erschienen in: Computer Vision – ECCV 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we present a novel deep learning framework that derives discriminative local descriptors for 3D surface shapes. In contrast to previous convolutional neural networks (CNNs) that rely on rendering multi-view images or extracting intrinsic shape properties, we parameterize the multi-scale localized neighborhoods of a keypoint into regular 2D grids, which are termed as ‘geometry images’. The benefits of such geometry images include retaining sufficient geometric information, as well as allowing the usage of standard CNNs. Specifically, we leverage a triplet network to perform deep metric learning, which takes a set of triplets as input, and a newly designed triplet loss function is minimized to distinguish between similar and dissimilar pairs of keypoints. At the testing stage, given a geometry image of a point of interest, our network outputs a discriminative local descriptor for it. Experimental results for non-rigid shape matching on several benchmarks demonstrate the superior performance of our learned descriptors over traditional descriptors and the state-of-the-art learning-based alternatives.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Van Kaick, O., Zhang, H., Hamarneh, G., Cohen-Or, D.: A survey on shape correspondence. Comput. Graph. Forum 30(6), 1681–1707 (2011)CrossRef Van Kaick, O., Zhang, H., Hamarneh, G., Cohen-Or, D.: A survey on shape correspondence. Comput. Graph. Forum 30(6), 1681–1707 (2011)CrossRef
2.
Zurück zum Zitat Ovsjanikov, M., Ben-Chen, M., Solomon, J., Butscher, A., Guibas, L.: Functional maps: a flexible representation of maps between shapes. ACM Trans. Graph. (Proceedings of SIGGRAPH) 31(4), 30 (2012) Ovsjanikov, M., Ben-Chen, M., Solomon, J., Butscher, A., Guibas, L.: Functional maps: a flexible representation of maps between shapes. ACM Trans. Graph. (Proceedings of SIGGRAPH) 31(4), 30 (2012)
3.
Zurück zum Zitat Guo, Y., Bennamoun, M., Sohel, F., Lu, M., Wan, J.: 3D object recognition in cluttered scenes with local surface features: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 36(11), 2270–2287 (2014)CrossRef Guo, Y., Bennamoun, M., Sohel, F., Lu, M., Wan, J.: 3D object recognition in cluttered scenes with local surface features: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 36(11), 2270–2287 (2014)CrossRef
5.
Zurück zum Zitat Cosmo, L., Rodola, E., Masci, J., Torsello, A., Bronstein, M.M.: Matching deformable objects in clutter. In: Fourth International Conference on 3D Vision (3DV), pp. 1–10. IEEE (2016) Cosmo, L., Rodola, E., Masci, J., Torsello, A., Bronstein, M.M.: Matching deformable objects in clutter. In: Fourth International Conference on 3D Vision (3DV), pp. 1–10. IEEE (2016)
6.
Zurück zum Zitat Bronstein, A.M., Bronstein, M.M., Guibas, L.J., Ovsjanikov, M.: Shape Google: geometric words and expressions for invariant shape retrieval. ACM Trans. Graph. 30(1), 1 (2011)CrossRef Bronstein, A.M., Bronstein, M.M., Guibas, L.J., Ovsjanikov, M.: Shape Google: geometric words and expressions for invariant shape retrieval. ACM Trans. Graph. 30(1), 1 (2011)CrossRef
7.
Zurück zum Zitat Lian, Z., et al.: A comparison of methods for non-rigid 3D shape retrieval. Pattern Recognit. 46(1), 449–461 (2013)CrossRef Lian, Z., et al.: A comparison of methods for non-rigid 3D shape retrieval. Pattern Recognit. 46(1), 449–461 (2013)CrossRef
8.
Zurück zum Zitat Shah, S.A.A., Bennamoun, M., Boussaid, F.: A novel 3D vorticity based approach for automatic registration of low resolution range images. Pattern Recognit. 48(9), 2859–2871 (2015)CrossRef Shah, S.A.A., Bennamoun, M., Boussaid, F.: A novel 3D vorticity based approach for automatic registration of low resolution range images. Pattern Recognit. 48(9), 2859–2871 (2015)CrossRef
10.
Zurück zum Zitat Sumner, R.W., Popović, J.: Deformation transfer for triangle meshes. ACM Trans. Graph. (Proceedings of SIGGRAPH) 23(3), 399–405 (2004)CrossRef Sumner, R.W., Popović, J.: Deformation transfer for triangle meshes. ACM Trans. Graph. (Proceedings of SIGGRAPH) 23(3), 399–405 (2004)CrossRef
11.
Zurück zum Zitat Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3D scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1999)CrossRef Johnson, A.E., Hebert, M.: Using spin images for efficient object recognition in cluttered 3D scenes. IEEE Trans. Pattern Anal. Mach. Intell. 21(5), 433–449 (1999)CrossRef
12.
Zurück zum Zitat Gal, R., Cohen-Or, D.: Salient geometric features for partial shape matching and similarity. ACM Trans. Graph. 25(1), 130–150 (2006)CrossRef Gal, R., Cohen-Or, D.: Salient geometric features for partial shape matching and similarity. ACM Trans. Graph. 25(1), 130–150 (2006)CrossRef
13.
Zurück zum Zitat Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum (Proceedings of SGP) 28(5), 1383–1392 (2009)CrossRef Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum (Proceedings of SGP) 28(5), 1383–1392 (2009)CrossRef
14.
Zurück zum Zitat Huang, H., Kalogerakis, E., Chaudhuri, S., Ceylan, D., Kim, V., Yumer, E.: Learning local shape descriptors from part correspondences with multi-view convolutional networks. ACM Trans. Graph. 37(1), 6:1–6:14 (2018)CrossRef Huang, H., Kalogerakis, E., Chaudhuri, S., Ceylan, D., Kim, V., Yumer, E.: Learning local shape descriptors from part correspondences with multi-view convolutional networks. ACM Trans. Graph. 37(1), 6:1–6:14 (2018)CrossRef
15.
Zurück zum Zitat Zeng, A., Song, S., Nießner, M., Fisher, M., Xiao, J., Funkhouser, T.: 3DMatch: learning local geometric descriptors from RGB-D reconstructions. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 199–208 (2017) Zeng, A., Song, S., Nießner, M., Fisher, M., Xiao, J., Funkhouser, T.: 3DMatch: learning local geometric descriptors from RGB-D reconstructions. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 199–208 (2017)
16.
Zurück zum Zitat Monti, F., Boscaini, D., Masci, J., Rodolà, E., Svoboda, J., Bronstein, M.M.: Geometric deep learning on graphs and manifolds using mixture model CNNs. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 5425–5434 (2017) Monti, F., Boscaini, D., Masci, J., Rodolà, E., Svoboda, J., Bronstein, M.M.: Geometric deep learning on graphs and manifolds using mixture model CNNs. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 5425–5434 (2017)
18.
Zurück zum Zitat Bromley, J., Guyon, I., LeCun, Y., Säckinger, E., Shah, R.: Signature verification using a “siamese” time delay neural network. In: Advances in Neural Information Processing Systems, pp. 737–744 (1994) Bromley, J., Guyon, I., LeCun, Y., Säckinger, E., Shah, R.: Signature verification using a “siamese” time delay neural network. In: Advances in Neural Information Processing Systems, pp. 737–744 (1994)
19.
Zurück zum Zitat Wang, J., et al.: Learning fine-grained image similarity with deep ranking. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 1386–1393 (2014) Wang, J., et al.: Learning fine-grained image similarity with deep ranking. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 1386–1393 (2014)
20.
Zurück zum Zitat Guo, Y., Bennamoun, M., Sohel, F., Lu, M., Wan, J., Kwok, N.M.: A comprehensive performance evaluation of 3D local feature descriptors. Int. J. Comput. Vis. 116(1), 66–89 (2016)MathSciNetCrossRef Guo, Y., Bennamoun, M., Sohel, F., Lu, M., Wan, J., Kwok, N.M.: A comprehensive performance evaluation of 3D local feature descriptors. Int. J. Comput. Vis. 116(1), 66–89 (2016)MathSciNetCrossRef
21.
Zurück zum Zitat Yang, J., Zhang, Q., Cao, Z.: The effect of spatial information characterization on 3D local feature descriptors: a quantitative evaluation. Pattern Recognit. 66, 375–391 (2017)CrossRef Yang, J., Zhang, Q., Cao, Z.: The effect of spatial information characterization on 3D local feature descriptors: a quantitative evaluation. Pattern Recognit. 66, 375–391 (2017)CrossRef
23.
Zurück zum Zitat Zaharescu, A., Boyer, E., Varanasi, K., Horaud, R.: Surface feature detection and description with applications to mesh matching. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 373–380 (2009) Zaharescu, A., Boyer, E., Varanasi, K., Horaud, R.: Surface feature detection and description with applications to mesh matching. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 373–380 (2009)
25.
Zurück zum Zitat Guo, Y., Sohel, F., Bennamoun, M., Lu, M., Wan, J.: Rotational projection statistics for 3D local surface description and object recognition. Int. J. Comput. Vis. 105(1), 63–86 (2013)MathSciNetCrossRef Guo, Y., Sohel, F., Bennamoun, M., Lu, M., Wan, J.: Rotational projection statistics for 3D local surface description and object recognition. Int. J. Comput. Vis. 105(1), 63–86 (2013)MathSciNetCrossRef
26.
Zurück zum Zitat Elad, A., Kimmel, R.: On bending invariant signatures for surfaces. IEEE Trans. Pattern Anal. Mach. Intell. 25(10), 1285–1295 (2003)CrossRef Elad, A., Kimmel, R.: On bending invariant signatures for surfaces. IEEE Trans. Pattern Anal. Mach. Intell. 25(10), 1285–1295 (2003)CrossRef
27.
Zurück zum Zitat Aubry, M., Schlickewei, U., Cremers, D.: The wave kernel signature: a quantum mechanical approach to shape analysis. In: IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 1626–1633 (2011) Aubry, M., Schlickewei, U., Cremers, D.: The wave kernel signature: a quantum mechanical approach to shape analysis. In: IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 1626–1633 (2011)
28.
Zurück zum Zitat Kokkinos, I., Bronstein, M.M., Litman, R., Bronstein, A.M.: Intrinsic shape context descriptors for deformable shapes. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 159–166. IEEE (2012) Kokkinos, I., Bronstein, M.M., Litman, R., Bronstein, A.M.: Intrinsic shape context descriptors for deformable shapes. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 159–166. IEEE (2012)
29.
Zurück zum Zitat Litman, R., Bronstein, A.M.: Learning spectral descriptors for deformable shape correspondence. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 171–180 (2014)CrossRef Litman, R., Bronstein, A.M.: Learning spectral descriptors for deformable shape correspondence. IEEE Trans. Pattern Anal. Mach. Intell. 36(1), 171–180 (2014)CrossRef
30.
Zurück zum Zitat Wei, L., Huang, Q., Ceylan, D., Vouga, E., Li, H.: Dense human body correspondences using convolutional networks. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 1544–1553 (2016) Wei, L., Huang, Q., Ceylan, D., Vouga, E., Li, H.: Dense human body correspondences using convolutional networks. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 1544–1553 (2016)
31.
Zurück zum Zitat Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3D classification and segmentation. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 77–85 (2017) Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3D classification and segmentation. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 77–85 (2017)
32.
Zurück zum Zitat Khoury, M., Zhou, Q.Y., Koltun, V.: Learning compact geometric features. In: IEEE International Conference on Computer Vision (ICCV), pp. 153–161 (2017) Khoury, M., Zhou, Q.Y., Koltun, V.: Learning compact geometric features. In: IEEE International Conference on Computer Vision (ICCV), pp. 153–161 (2017)
33.
Zurück zum Zitat Bronstein, M.M., Bruna, J., LeCun, Y., Szlam, A., Vandergheynst, P.: Geometric deep learning: going beyond Euclidean data. IEEE Signal Process. Mag. 34(4), 18–42 (2017)CrossRef Bronstein, M.M., Bruna, J., LeCun, Y., Szlam, A., Vandergheynst, P.: Geometric deep learning: going beyond Euclidean data. IEEE Signal Process. Mag. 34(4), 18–42 (2017)CrossRef
34.
Zurück zum Zitat Boscaini, D., Masci, J., Melzi, S., Bronstein, M.M., Castellani, U., Vandergheynst, P.: Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks. Comput. Graph. Forum (Proceedings of SGP) 34(5), 13–23 (2015)CrossRef Boscaini, D., Masci, J., Melzi, S., Bronstein, M.M., Castellani, U., Vandergheynst, P.: Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks. Comput. Graph. Forum (Proceedings of SGP) 34(5), 13–23 (2015)CrossRef
35.
Zurück zum Zitat Masci, J., Boscaini, D., Bronstein, M., Vandergheynst, P.: Geodesic convolutional neural networks on Riemannian manifolds. In: IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 37–45 (2015) Masci, J., Boscaini, D., Bronstein, M., Vandergheynst, P.: Geodesic convolutional neural networks on Riemannian manifolds. In: IEEE International Conference on Computer Vision Workshops (ICCV Workshops), pp. 37–45 (2015)
36.
Zurück zum Zitat Boscaini, D., Masci, J., Rodolà, E., Bronstein, M.: Learning shape correspondence with anisotropic convolutional neural networks. In: Advances in Neural Information Processing (NIPS), pp. 3189–3197 (2016) Boscaini, D., Masci, J., Rodolà, E., Bronstein, M.: Learning shape correspondence with anisotropic convolutional neural networks. In: Advances in Neural Information Processing (NIPS), pp. 3189–3197 (2016)
37.
Zurück zum Zitat Litany, O., Remez, T., Rodola, E., Bronstein, A.M., Bronstein, M.M.: Deep functional maps: structured prediction for dense shape correspondence. In: IEEE International Conference on Computer Vision (ICCV), vol. 2, p. 8 (2017) Litany, O., Remez, T., Rodola, E., Bronstein, A.M., Bronstein, M.M.: Deep functional maps: structured prediction for dense shape correspondence. In: IEEE International Conference on Computer Vision (ICCV), vol. 2, p. 8 (2017)
38.
Zurück zum Zitat Gu, X., Gortler, S.J., Hoppe, H.: Geometry images. ACM Trans. Graph. (Proceedings of SIGGRAPH) 21(3), 355–361 (2002) Gu, X., Gortler, S.J., Hoppe, H.: Geometry images. ACM Trans. Graph. (Proceedings of SIGGRAPH) 21(3), 355–361 (2002)
39.
Zurück zum Zitat Bogo, F., Romero, J., Loper, M., Black, M.J.: Faust: dataset and evaluation for 3D mesh registration. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 3794–3801 (2014) Bogo, F., Romero, J., Loper, M., Black, M.J.: Faust: dataset and evaluation for 3D mesh registration. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 3794–3801 (2014)
40.
Zurück zum Zitat Sipiran, I., Bustos, B.: Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes. Vis. Comput. 27(11), 963–976 (2011)CrossRef Sipiran, I., Bustos, B.: Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes. Vis. Comput. 27(11), 963–976 (2011)CrossRef
41.
Zurück zum Zitat Desbrun, M., Meyer, M., Alliez, P.: Intrinsic parameterizations of surface meshes. In: Computer Graphics Forum, vol. 21, pp. 209–218. Wiley Online Library (2002) Desbrun, M., Meyer, M., Alliez, P.: Intrinsic parameterizations of surface meshes. In: Computer Graphics Forum, vol. 21, pp. 209–218. Wiley Online Library (2002)
42.
Zurück zum Zitat Boscaini, D., Masci, J., Rodolà, E., Bronstein, M.M., Cremers, D.: Anisotropic diffusion descriptors. Comput. Graph. Forum (Proceedings of EUROGRAPHICS) 35(2), 431–441 (2016)CrossRef Boscaini, D., Masci, J., Rodolà, E., Bronstein, M.M., Cremers, D.: Anisotropic diffusion descriptors. Comput. Graph. Forum (Proceedings of EUROGRAPHICS) 35(2), 431–441 (2016)CrossRef
43.
Zurück zum Zitat Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 815–823 (2015) Schroff, F., Kalenichenko, D., Philbin, J.: Facenet: a unified embedding for face recognition and clustering. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 815–823 (2015)
44.
Zurück zum Zitat Kumar, B., Carneiro, G., Reid, I., et al.: Learning local image descriptors with deep Siamese and triplet convolutional networks by minimising global loss functions. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 5385–5394 (2016) Kumar, B., Carneiro, G., Reid, I., et al.: Learning local image descriptors with deep Siamese and triplet convolutional networks by minimising global loss functions. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 5385–5394 (2016)
45.
Zurück zum Zitat Svoboda, J., Masci, J., Bronstein, M.M.: Palmprint recognition via discriminative index learning. In: 23rd International Conference on Pattern Recognition (ICPR), pp. 4232–4237. IEEE (2016) Svoboda, J., Masci, J., Bronstein, M.M.: Palmprint recognition via discriminative index learning. In: 23rd International Conference on Pattern Recognition (ICPR), pp. 4232–4237. IEEE (2016)
46.
Zurück zum Zitat Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30 (2013) Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30 (2013)
47.
Zurück zum Zitat Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249–256 (2010) Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249–256 (2010)
49.
Zurück zum Zitat Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J., Davis, J.: SCAPE: shape completion and animation of people. ACM Trans. Graph. (Proceedings of SIGGRAPH) 24(3), 408–416 (2005)CrossRef Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J., Davis, J.: SCAPE: shape completion and animation of people. ACM Trans. Graph. (Proceedings of SIGGRAPH) 24(3), 408–416 (2005)CrossRef
50.
Zurück zum Zitat Yang, Y., Yu, Y., Zhou, Y., Du, S., Davis, J., Yang, R.: Semantic parametric reshaping of human body models. In: 2nd International Conference on 3D Vision (3DV), vol. 2, pp. 41–48. IEEE (2014) Yang, Y., Yu, Y., Zhou, Y., Du, S., Davis, J., Yang, R.: Semantic parametric reshaping of human body models. In: 2nd International Conference on 3D Vision (3DV), vol. 2, pp. 41–48. IEEE (2014)
52.
Zurück zum Zitat Sagonas, C., Antonakos, E., Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: 300 faces in-the-wild challenge: database and results. Image Vis. Comput. 47, 3–18 (2016)CrossRef Sagonas, C., Antonakos, E., Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: 300 faces in-the-wild challenge: database and results. Image Vis. Comput. 47, 3–18 (2016)CrossRef
Metadaten
Titel
Learning 3D Keypoint Descriptors for Non-rigid Shape Matching
verfasst von
Hanyu Wang
Jianwei Guo
Dong-Ming Yan
Weize Quan
Xiaopeng Zhang
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-01237-3_1