Skip to main content

2018 | OriginalPaper | Buchkapitel

Deep Model-Based 6D Pose Refinement in RGB

verfasst von : Fabian Manhardt, Wadim Kehl, Nassir Navab, Federico Tombari

Erschienen in: Computer Vision – ECCV 2018

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We present a novel approach for model-based 6D pose refinement in color data. Building on the established idea of contour-based pose tracking, we teach a deep neural network to predict a translational and rotational update. At the core, we propose a new visual loss that drives the pose update by aligning object contours, thus avoiding the definition of any explicit appearance model. In contrast to previous work our method is correspondence-free, segmentation-free, can handle occlusion and is agnostic to geometrical symmetry as well as visual ambiguities. Additionally, we observe a strong robustness towards rough initialization. The approach can run in real-time and produces pose accuracies that come close to 3D ICP without the need for depth data. Furthermore, our networks are trained from purely synthetic data and will be published together with the refinement code at http://​campar.​in.​tum.​de/​Main/​FabianManhardt to ensure reproducibility.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The authors acknowledged our conclusions in correspondence.
 
Literatur
5.
Zurück zum Zitat Brachmann, E., Michel, F., Krull, A., Yang, M.Y., Gumhold, S., Rother, C.: uncertainty-driven 6D pose estimation of objects and scenes from a single RGB image. In: CVPR (2016) Brachmann, E., Michel, F., Krull, A., Yang, M.Y., Gumhold, S., Rother, C.: uncertainty-driven 6D pose estimation of objects and scenes from a single RGB image. In: CVPR (2016)
6.
Zurück zum Zitat Brox, T., Rosenhahn, B., Gall, J., Cremers, D.: Combined region and motion-based 3D tracking of rigid and articulated objects. TPAMI 32(3), 402–415 (2010)CrossRef Brox, T., Rosenhahn, B., Gall, J., Cremers, D.: Combined region and motion-based 3D tracking of rigid and articulated objects. TPAMI 32(3), 402–415 (2010)CrossRef
7.
Zurück zum Zitat Choi, C., Christensen, H.: RGB-D object tracking: a particle filter approach on GPU. In: IROS (2013) Choi, C., Christensen, H.: RGB-D object tracking: a particle filter approach on GPU. In: IROS (2013)
8.
Zurück zum Zitat Dambreville, S., Sandhu, R., Yezzi, A., Tannenbaum, A.: A geometric approach to joint 2D region-based segmentation and 3D pose estimation using a 3D shape prior. SIAM J. Imaging Sci. 3, 110–132 (2010)MathSciNetCrossRef Dambreville, S., Sandhu, R., Yezzi, A., Tannenbaum, A.: A geometric approach to joint 2D region-based segmentation and 3D pose estimation using a 3D shape prior. SIAM J. Imaging Sci. 3, 110–132 (2010)MathSciNetCrossRef
9.
Zurück zum Zitat Drummond, T., Cipolla, R.: Real-time visual tracking of complex structures. TPAMI 24, 932–946 (2002)CrossRef Drummond, T., Cipolla, R.: Real-time visual tracking of complex structures. TPAMI 24, 932–946 (2002)CrossRef
11.
Zurück zum Zitat Hexner, J., Hagege, R.R.: 2D–3D pose estimation of heterogeneous objects using a region based approach. IJCV 118, 95–112 (2016)MathSciNetCrossRef Hexner, J., Hagege, R.R.: 2D–3D pose estimation of heterogeneous objects using a region based approach. IJCV 118, 95–112 (2016)MathSciNetCrossRef
17.
Zurück zum Zitat Kehl, W., Manhardt, F., Ilic, S., Tombari, F., Navab, N.: SSD-6D: making RGB-based 3D detection and 6D pose estimation great again. In: ICCV (2017) Kehl, W., Manhardt, F., Ilic, S., Tombari, F., Navab, N.: SSD-6D: making RGB-based 3D detection and 6D pose estimation great again. In: ICCV (2017)
18.
Zurück zum Zitat Kehl, W., Tombari, F., Ilic, S., Navab, N.: Real-time 3D model tracking in color and depth on a single CPU core. In: CVPR (2017) Kehl, W., Tombari, F., Ilic, S., Navab, N.: Real-time 3D model tracking in color and depth on a single CPU core. In: CVPR (2017)
20.
Zurück zum Zitat Kendall, A., Grimes, M., Cipolla, R.: PoseNet: a convolutional network for real-time 6-DOF camera relocalization. In: ICCV (2015) Kendall, A., Grimes, M., Cipolla, R.: PoseNet: a convolutional network for real-time 6-DOF camera relocalization. In: ICCV (2015)
23.
Zurück zum Zitat Park, Y., Lepetit, V.: Multiple 3D object tracking for augmented reality. In: ISMAR (2008) Park, Y., Lepetit, V.: Multiple 3D object tracking for augmented reality. In: ISMAR (2008)
24.
Zurück zum Zitat Pauwels, K., Rubio, L., Diaz, J., Ros, E.: Real-time model-based rigid object pose estimation and tracking combining dense and sparse visual cues. In: CVPR (2013) Pauwels, K., Rubio, L., Diaz, J., Ros, E.: Real-time model-based rigid object pose estimation and tracking combining dense and sparse visual cues. In: CVPR (2013)
26.
Zurück zum Zitat Prisacariu, V.A., Murray, D.W., Reid, I.D.: Real-time 3D tracking and reconstruction on mobile phones. TVCG 21, 557–570 (2015) Prisacariu, V.A., Murray, D.W., Reid, I.D.: Real-time 3D tracking and reconstruction on mobile phones. TVCG 21, 557–570 (2015)
27.
Zurück zum Zitat Prisacariu, V.A., Reid, I.D.: PWP3D: real-time segmentation and tracking of 3D objects. IJCV 98, 335–354 (2012)MathSciNetCrossRef Prisacariu, V.A., Reid, I.D.: PWP3D: real-time segmentation and tracking of 3D objects. IJCV 98, 335–354 (2012)MathSciNetCrossRef
29.
Zurück zum Zitat Rosenhahn, B., Brox, T., Cremers, D., Seidel, H.-P.: A comparison of shape matching methods for contour based pose estimation. In: Reulke, R., Eckardt, U., Flach, B., Knauer, U., Polthier, K. (eds.) IWCIA 2006. LNCS, vol. 4040, pp. 263–276. Springer, Heidelberg (2006). https://doi.org/10.1007/11774938_21CrossRef Rosenhahn, B., Brox, T., Cremers, D., Seidel, H.-P.: A comparison of shape matching methods for contour based pose estimation. In: Reulke, R., Eckardt, U., Flach, B., Knauer, U., Polthier, K. (eds.) IWCIA 2006. LNCS, vol. 4040, pp. 263–276. Springer, Heidelberg (2006). https://​doi.​org/​10.​1007/​11774938_​21CrossRef
31.
Zurück zum Zitat Schmaltz, C., Rosenhahn, B., Brox, T., Weickert, J.: Region-based pose tracking with occlusions using 3D models. MVA 23, 557–577 (2012) Schmaltz, C., Rosenhahn, B., Brox, T., Weickert, J.: Region-based pose tracking with occlusions using 3D models. MVA 23, 557–577 (2012)
32.
Zurück zum Zitat Seo, B.K., Park, H., Park, J.I., Hinterstoisser, S., Ilic, S.: Optimal local searching for fast and robust textureless 3D object tracking in highly cluttered backgrounds. In: TVCG (2014) Seo, B.K., Park, H., Park, J.I., Hinterstoisser, S., Ilic, S.: Optimal local searching for fast and robust textureless 3D object tracking in highly cluttered backgrounds. In: TVCG (2014)
34.
Zurück zum Zitat Tan, D.J., Tombari, F., Ilic, S., Navab, N.: A versatile learning-based 3D temporal tracker: scalable, robust. In: ICCV, Online (2015) Tan, D.J., Tombari, F., Ilic, S., Navab, N.: A versatile learning-based 3D temporal tracker: scalable, robust. In: ICCV, Online (2015)
35.
Zurück zum Zitat Tateno, K., Kotake, D., Uchiyama, S.: Model-based 3D object tracking with online texture update. In: MVA (2009) Tateno, K., Kotake, D., Uchiyama, S.: Model-based 3D object tracking with online texture update. In: MVA (2009)
39.
Zurück zum Zitat Ummenhofer, B., et al.: DeMoN: depth and motion network for learning monocular stereo. In: CVPR (2017) Ummenhofer, B., et al.: DeMoN: depth and motion network for learning monocular stereo. In: CVPR (2017)
40.
Zurück zum Zitat Vacchetti, L., Lepetit, V., Fua, P.: Stable real-time 3D tracking using online and offline information. TPAMI 26, 1385–1391 (2004)CrossRef Vacchetti, L., Lepetit, V., Fua, P.: Stable real-time 3D tracking using online and offline information. TPAMI 26, 1385–1391 (2004)CrossRef
41.
Zurück zum Zitat Wang, S., Clark, R., Wen, H., Trigoni, N.: DeepVO: Towards End to End Visual Odometry with Deep Recurrent Convolutional Neural Networks. In: ICRA (2017) Wang, S., Clark, R., Wen, H., Trigoni, N.: DeepVO: Towards End to End Visual Odometry with Deep Recurrent Convolutional Neural Networks. In: ICRA (2017)
Metadaten
Titel
Deep Model-Based 6D Pose Refinement in RGB
verfasst von
Fabian Manhardt
Wadim Kehl
Nassir Navab
Federico Tombari
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-01264-9_49