Skip to main content

2018 | OriginalPaper | Buchkapitel

GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints

verfasst von : Zixin Luo, Tianwei Shen, Lei Zhou, Siyu Zhu, Runze Zhang, Yao Yao, Tian Fang, Long Quan

Erschienen in: Computer Vision – ECCV 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Learned local descriptors based on Convolutional Neural Networks (CNNs) have achieved significant improvements on patch-based benchmarks, whereas not having demonstrated strong generalization ability on recent benchmarks of image-based 3D reconstruction. In this paper, we mitigate this limitation by proposing a novel local descriptor learning approach that integrates geometry constraints from multi-view reconstructions, which benefits the learning process in terms of data generation, data sampling and loss computation. We refer to the proposed descriptor as GeoDesc, and demonstrate its superior performance on various large-scale benchmarks, and in particular show its great success on challenging reconstruction tasks. Moreover, we provide guidelines towards practical integration of learned descriptors in Structure-from-Motion (SfM) pipelines, showing the good trade-off that GeoDesc delivers to 3D reconstruction tasks between accuracy and efficiency.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
1.
Zurück zum Zitat Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems. arXiv (2016) Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems. arXiv (2016)
2.
Zurück zum Zitat Arandjelovic, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: CVPR (2012) Arandjelovic, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: CVPR (2012)
3.
Zurück zum Zitat Balntas, V., Lenc, K., Vedaldi, A., Mikolajczyk, K.: HPatches: a benchmark and evaluation of handcrafted and learned local lescriptors. In: CVPR (2017) Balntas, V., Lenc, K., Vedaldi, A., Mikolajczyk, K.: HPatches: a benchmark and evaluation of handcrafted and learned local lescriptors. In: CVPR (2017)
4.
Zurück zum Zitat Balntas, V., Riba, E., Ponsa, D., Mikolajczyk, K.: Learning local feature descriptors with triplets and shallow convolutional neural networks. In: BMVC (2016) Balntas, V., Riba, E., Ponsa, D., Mikolajczyk, K.: Learning local feature descriptors with triplets and shallow convolutional neural networks. In: BMVC (2016)
5.
Zurück zum Zitat Balntas, V., Johns, E., Tang, L., Mikolajczyk, K.: PN-net: conjoined triple deep network for learning local image descriptors. arXiv (2016) Balntas, V., Johns, E., Tang, L., Mikolajczyk, K.: PN-net: conjoined triple deep network for learning local image descriptors. arXiv (2016)
6.
Zurück zum Zitat Brown, M.A., Hua, G., Winder, S.A.J.: Discriminative learning of local image descriptors. PAMI 33, 43–57 (2011)CrossRef Brown, M.A., Hua, G., Winder, S.A.J.: Discriminative learning of local image descriptors. PAMI 33, 43–57 (2011)CrossRef
7.
Zurück zum Zitat Vijay Kumar, B.G., Carneiro, G., Reid, I.: Learning local image descriptors with deep siamese and triplet convolutional networks by minimizing global loss functions. In: CVPR (2016) Vijay Kumar, B.G., Carneiro, G., Reid, I.: Learning local image descriptors with deep siamese and triplet convolutional networks by minimizing global loss functions. In: CVPR (2016)
8.
Zurück zum Zitat Han, X., Leung, T., Jia, Y., Sukthankar, R., Berg, A.C.: MatchNet - unifying feature and metric learning for patch-based matching. In: CVPR (2015) Han, X., Leung, T., Jia, Y., Sukthankar, R., Berg, A.C.: MatchNet - unifying feature and metric learning for patch-based matching. In: CVPR (2015)
11.
Zurück zum Zitat Labatut, P., Pons, J.P., Keriven, R.: Efficient multi-view reconstruction of large-scale scenes using interest points, delaunay triangulation and graph cuts. In: ICCV (2007) Labatut, P., Pons, J.P., Keriven, R.: Efficient multi-view reconstruction of large-scale scenes using interest points, delaunay triangulation and graph cuts. In: ICCV (2007)
12.
Zurück zum Zitat Li, S., Yuan, L., Sun, J., Quan, L.: Dual-feature warping-based motion model estimation. In: ICCV (2015) Li, S., Yuan, L., Sun, J., Quan, L.: Dual-feature warping-based motion model estimation. In: ICCV (2015)
13.
Zurück zum Zitat Lin, J., Morere, O., Chandrasekhar, V., Veillard, A., Goh, H.: DeepHash: getting regularization, depth and fine-tuning right. arXiv (2015) Lin, J., Morere, O., Chandrasekhar, V., Veillard, A., Goh, H.: DeepHash: getting regularization, depth and fine-tuning right. arXiv (2015)
15.
Zurück zum Zitat Markuš, N., Pandžić, I.S., Ahlberg, J.: Learning local descriptors by optimizing the keypoint-correspondence criterion. In: ICPR (2016) Markuš, N., Pandžić, I.S., Ahlberg, J.: Learning local descriptors by optimizing the keypoint-correspondence criterion. In: ICPR (2016)
16.
Zurück zum Zitat Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. PAMI 27, 1615–1630 (2005)CrossRef Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. PAMI 27, 1615–1630 (2005)CrossRef
17.
Zurück zum Zitat Mishchuk, A., Mishkin, D., Radenovic, F.: Working hard to know your neighbor’s margins: local descriptor learning loss. In: NIPS (2017) Mishchuk, A., Mishkin, D., Radenovic, F.: Working hard to know your neighbor’s margins: local descriptor learning loss. In: NIPS (2017)
18.
Zurück zum Zitat Mishkin, D., Matas, J., Perdoch, M., Lenc, K.: WxBS: wide baseline stereo generalizations. In: BMVC (2015) Mishkin, D., Matas, J., Perdoch, M., Lenc, K.: WxBS: wide baseline stereo generalizations. In: BMVC (2015)
19.
Zurück zum Zitat Mitra, R., et al.: A large dataset for improving patch matching. arXiv (2018) Mitra, R., et al.: A large dataset for improving patch matching. arXiv (2018)
20.
Zurück zum Zitat Morel, J.M., Yu, G.: ASIFT: a new framework for fully affine invariant image comparison. SIAM J. Imaging Sci. 2, 438–469 (2009)MathSciNetCrossRef Morel, J.M., Yu, G.: ASIFT: a new framework for fully affine invariant image comparison. SIAM J. Imaging Sci. 2, 438–469 (2009)MathSciNetCrossRef
21.
Zurück zum Zitat Movshovitz-Attias, Y., Toshev, A., Leung, T.K., Ioffe, S., Singh, S.: No Fuss Distance Metric Learning using Proxies. In: ICCV (2017) Movshovitz-Attias, Y., Toshev, A., Leung, T.K., Ioffe, S., Singh, S.: No Fuss Distance Metric Learning using Proxies. In: ICCV (2017)
22.
Zurück zum Zitat Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: CVPR (2007) Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: CVPR (2007)
24.
Zurück zum Zitat Schnberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: CVPR (2016) Schnberger, J.L., Frahm, J.M.: Structure-from-motion revisited. In: CVPR (2016)
25.
Zurück zum Zitat Schönberger, J.L., Hardmeier, H., Sattler, T., Pollefeys, M.: Comparative evaluation of hand-crafted and learned local features. In: CVPR (2017) Schönberger, J.L., Hardmeier, H., Sattler, T., Pollefeys, M.: Comparative evaluation of hand-crafted and learned local features. In: CVPR (2017)
27.
Zurück zum Zitat Simo-Serra, E., Trulls, E., Ferraz, L., Kokkinos, I., Fua, P., Moreno-Noguer, F.: Discriminative learning of deep convolutional feature point descriptors. In: CVPR (2015) Simo-Serra, E., Trulls, E., Ferraz, L., Kokkinos, I., Fua, P., Moreno-Noguer, F.: Discriminative learning of deep convolutional feature point descriptors. In: CVPR (2015)
28.
Zurück zum Zitat Sohn, K.: Improved deep metric learning with multi-class n-pair loss objective. In: NIPS (2016) Sohn, K.: Improved deep metric learning with multi-class n-pair loss objective. In: NIPS (2016)
29.
Zurück zum Zitat Tian, B.F.Y., Wu, F: L2-net: deep learning of discriminative patch descriptor in Euclidean space. In: CVPR (2017) Tian, B.F.Y., Wu, F: L2-net: deep learning of discriminative patch descriptor in Euclidean space. In: CVPR (2017)
31.
Zurück zum Zitat Winder, S., Hua, G., Brown, M.: Picking the best daisy. In: CVPR (2009) Winder, S., Hua, G., Brown, M.: Picking the best daisy. In: CVPR (2009)
34.
Zurück zum Zitat Yi, K.M., Verdie, Y., Fua, P., Lepetit, V.: Learning to assign orientations to feature points. In: CVPR (2015) Yi, K.M., Verdie, Y., Fua, P., Lepetit, V.: Learning to assign orientations to feature points. In: CVPR (2015)
35.
Zurück zum Zitat Zagoruyko, S., Komodakis, N.: Learning to compare image patches via convolutional neural networks. In: CVPR (2015) Zagoruyko, S., Komodakis, N.: Learning to compare image patches via convolutional neural networks. In: CVPR (2015)
36.
Zurück zum Zitat Zhang, R., Li, S., Fang, T., Zhu, S., Quan, L.: Joint camera clustering and surface segmentation for large-scale multi-view stereo. In: ICCV (2015) Zhang, R., Li, S., Fang, T., Zhu, S., Quan, L.: Joint camera clustering and surface segmentation for large-scale multi-view stereo. In: ICCV (2015)
37.
Zurück zum Zitat Zhang, R., Zhu, S., Fang, T., Quan, L.: Distributed very large scale bundle adjustment by global camera consensus. In: ICCV (2017) Zhang, R., Zhu, S., Fang, T., Quan, L.: Distributed very large scale bundle adjustment by global camera consensus. In: ICCV (2017)
38.
Zurück zum Zitat Zhang, X., Yu, F.X., Kumar, S., Chang, S.F.: Learning spread-out local feature descriptors. In: CVPR (2017) Zhang, X., Yu, F.X., Kumar, S., Chang, S.F.: Learning spread-out local feature descriptors. In: CVPR (2017)
39.
Zurück zum Zitat Zhou, L., Zhu, S., Shen, T., Wang, J., Fang, T., Quan, L.: Progressive large scale-invariant image matching in scale space. In: ICCV (2017) Zhou, L., Zhu, S., Shen, T., Wang, J., Fang, T., Quan, L.: Progressive large scale-invariant image matching in scale space. In: ICCV (2017)
40.
Zurück zum Zitat Zhu, S., Fang, T., Xiao, J., Quan, L.: Local readjustment for high-resolution 3D reconstruction. In: CVPR (2014) Zhu, S., Fang, T., Xiao, J., Quan, L.: Local readjustment for high-resolution 3D reconstruction. In: CVPR (2014)
41.
Zurück zum Zitat Zhu, S., et al.: Very large-scale global SFM by distributed motion averaging. In: CVPR (2018) Zhu, S., et al.: Very large-scale global SFM by distributed motion averaging. In: CVPR (2018)
Metadaten
Titel
GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints
verfasst von
Zixin Luo
Tianwei Shen
Lei Zhou
Siyu Zhu
Runze Zhang
Yao Yao
Tian Fang
Long Quan
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-01240-3_11