Published in: Machine Vision and Applications, Issue 7-8/2019

5 September 2019 | Original Paper

View synthesis for pose computation

Authors: Pierre Rolin, Marie-Odile Berger, Frédéric Sur

Abstract

Geometrical registration of a query image with respect to a 3D model, or pose estimation, is the cornerstone of many computer vision applications. It is often based on matching local photometric descriptors that are invariant only to limited viewpoint changes. However, when the query image has been acquired from a camera position not covered by the model images, pose estimation is often inaccurate and sometimes fails outright, precisely because of this limited invariance. In this paper, we propose to add descriptors to the model, obtained from synthesized views associated with virtual cameras that complete the coverage of the scene by the real cameras. We propose an efficient strategy to place the virtual cameras in the scene and to generate valuable descriptors from the synthetic views. We also discuss a guided sampling strategy for registration in this context. Experiments show that the accuracy of pose estimation is dramatically improved when large viewpoint changes make the matching of classic descriptors a challenging task.
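
The abstract does not detail the guided sampling strategy, so the following is only a generic illustration of the idea, not the authors' method. It is a minimal sketch, assuming each 2D-3D correspondence carries a positive confidence score (a hypothetical quantity, for instance derived from descriptor distances); the function name `guided_sample` and the example scores are likewise assumptions. Minimal sets for a RANSAC-style pose estimator are drawn so that higher-scored correspondences are picked more often.

```python
import random


def guided_sample(scores, k, rng):
    """Draw k distinct correspondence indices, favouring high scores.

    scores: positive confidence values, one per 2D-3D correspondence
            (hypothetical; the abstract does not define such a score).
    k:      size of the minimal set required by the pose solver.
    rng:    a random.Random instance, passed in for reproducibility.
    """
    if k > len(scores):
        raise ValueError("not enough correspondences for a minimal set")
    remaining = list(range(len(scores)))
    chosen = []
    for _ in range(k):
        # Weighted draw among the indices that have not been picked yet.
        weights = [scores[i] for i in remaining]
        pick = rng.choices(remaining, weights=weights, k=1)[0]
        chosen.append(pick)
        remaining.remove(pick)
    return chosen


rng = random.Random(0)
scores = [0.9, 0.8, 0.1, 0.7, 0.05, 0.6]  # illustrative values only
minimal_set = guided_sample(scores, 3, rng)
```

In a full pipeline, each minimal set would be handed to a pose solver and the resulting pose scored by its inlier count; uniform sampling is recovered by giving every correspondence the same score.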

Metadata
Title
View synthesis for pose computation
Authors
Pierre Rolin
Marie-Odile Berger
Frédéric Sur
Publication date
5 September 2019
Publisher
Springer Berlin Heidelberg
Published in
Machine Vision and Applications / Issue 7-8/2019
Print ISSN: 0932-8092
Electronic ISSN: 1432-1769
DOI
https://doi.org/10.1007/s00138-019-01045-5