
2017 | Supplement | Chapter

Down to Earth: Using Semantics for Robust Hypothesis Selection for the Five-Point Algorithm

Authors: Andreas Kuhn, True Price, Jan-Michael Frahm, Helmut Mayer

Published in: Pattern Recognition

Publisher: Springer International Publishing


Abstract

The computation of the essential matrix using the five-point algorithm is a staple task usually considered as being solved. However, we show that the algorithm frequently selects erroneous solutions in the presence of noise and outliers. These errors arise when the supporting point correspondences supplied to the algorithm do not adequately cover all essential planes in the scene, leading to ambiguous essential matrix solutions. This is not merely a theoretical problem: such scene conditions often occur in 3D reconstruction of real-world data when fronto-parallel point correspondences, such as points on building facades, are captured but correspondences on obliquely observed planes, such as the ground plane, are missed. To solve this problem, we propose to leverage semantic labelings of image features to guide hypothesis selection in the five-point algorithm. More specifically, we propose a two-stage RANSAC procedure in which, in the first step, only features classified as ground points are processed. These inlier ground features are subsequently used to score two-view geometry hypotheses generated by the five-point algorithm using samples of non-ground points. Results for scenes with prominent ground regions demonstrate the ability of our approach to recover epipolar geometries that describe the entire scene, rather than only well-sampled scene planes.
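The scoring step of the second stage can be illustrated with a minimal sketch. It assumes that candidate essential matrices have already been generated elsewhere (for example by a five-point solver run on non-ground samples) and that a set of inlier ground correspondences is available from the first stage. The abstract does not state which error measure or threshold the authors use, so the Sampson error, the `threshold` value, and the function names `skew`, `sampson_error`, and `select_by_ground_support` are assumptions made for illustration only.

```python
import numpy as np


def skew(t):
    """Cross-product matrix [t]_x of a 3-vector t."""
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])


def sampson_error(E, x1, x2):
    """Sampson (first-order geometric) error of x2^T E x1 = 0.

    x1, x2: (N, 2) arrays of normalized image coordinates in views 1 and 2.
    Returns an (N,) array with one error value per correspondence.
    """
    h1 = np.hstack([x1, np.ones((len(x1), 1))])
    h2 = np.hstack([x2, np.ones((len(x2), 1))])
    Ex1 = h1 @ E.T   # each row is E @ x1_i
    Etx2 = h2 @ E    # each row is E^T @ x2_i
    num = np.sum(h2 * Ex1, axis=1) ** 2
    den = Ex1[:, 0] ** 2 + Ex1[:, 1] ** 2 + Etx2[:, 0] ** 2 + Etx2[:, 1] ** 2
    return num / den


def select_by_ground_support(candidates, ground_x1, ground_x2, threshold=1e-6):
    """Pick the candidate E that is supported by the most ground inliers.

    candidates: list of 3x3 essential matrix hypotheses (assumed given).
    threshold:  hypothetical Sampson error bound for counting a ground
                correspondence as consistent with a hypothesis.
    Returns (index of best candidate, list of support counts).
    """
    counts = [int(np.sum(sampson_error(E, ground_x1, ground_x2) < threshold))
              for E in candidates]
    return int(np.argmax(counts)), counts
```

The design choice reflected here is that hypotheses are ranked by how well they explain ground correspondences rather than by support from the points they were sampled from, so a hypothesis that fits only a well-sampled facade receives a low score.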

Metadata
Title
Down to Earth: Using Semantics for Robust Hypothesis Selection for the Five-Point Algorithm
Authors
Andreas Kuhn
True Price
Jan-Michael Frahm
Helmut Mayer
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-66709-6_31
