Skip to main content
Erschienen in: Autonomous Robots 2/2018

18.04.2017

Improving object orientation estimates by considering multiple viewpoints

Orientation histograms of symmetries and measurement models for view selection

verfasst von: Zoltán Csaba Márton, Serkan Türker, Christian Rink, Manuel Brucker, Simon Kriegel, Tim Bodenmüller, Sebastian Riedel

Erschienen in: Autonomous Robots | Ausgabe 2/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This article describes a probabilistic approach for improving the accuracy of general object pose estimation algorithms. We propose a histogram filter variant that uses the exploration capabilities of robots, and supports active perception through a next-best-view proposal algorithm. For the histogram-based fusion method we focus on the orientation of the 6 degrees of freedom (DoF) pose, since the position can be processed with common filtering techniques. The detected orientations of the object, estimated with a pose estimator, are used to update the hypothesis of its actual orientation. We discuss the design of experiments to estimate the error model of a detection method, and describe a suitable representation of the orientation histograms. This allows us to consider priors about likely object poses or symmetries, and use information gain measures for view selection. The method is validated and compared to alternatives, based on the outputs of different 6 DoF pose estimators, using real-world depth images acquired using different sensors, and on a large synthetic dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
2
In our experiments the prior was always uniformly distributed, but it can be easily modified to any distribution the discretization allows to represent (which is much more flexible than a parametric model).
 
3
We do not apply an additional uncertainty to the estimates after a viewpoint change, as we were using very precise sensor movements to obtain ground truth.
 
5
Please note that the box plot drawing library detects outliers based on interquartile range distance, marks them as dots, and excludes them from percentile calculations.
 
Literatur
Zurück zum Zitat Aldoma, A., Marton, Z. C., Tombari, F., Wohlkinger, W., Potthast, C., Zeisl, B., et al. (2012). Tutorial: Point cloud library—three-dimensional object recognition and 6 DoF pose estimation. IEEE Robotics & Automation Magazine, 19(3), 80–91.CrossRef Aldoma, A., Marton, Z. C., Tombari, F., Wohlkinger, W., Potthast, C., Zeisl, B., et al. (2012). Tutorial: Point cloud library—three-dimensional object recognition and 6 DoF pose estimation. IEEE Robotics & Automation Magazine, 19(3), 80–91.CrossRef
Zurück zum Zitat Arbel, T., & Ferrie, F. P. (2001). Entropy-based gaze planning. Image and Vision Computing, 19(11), 779–786.CrossRef Arbel, T., & Ferrie, F. P. (2001). Entropy-based gaze planning. Image and Vision Computing, 19(11), 779–786.CrossRef
Zurück zum Zitat Barequet, G., & Sharir, M. (1994). Partial surface and volume matching in three dimensions. IEEE Transactions PAMI, 19, 929–948.CrossRef Barequet, G., & Sharir, M. (1994). Partial surface and volume matching in three dimensions. IEEE Transactions PAMI, 19, 929–948.CrossRef
Zurück zum Zitat Barequet, G., & Sharir, M. (1999). Partial surface matching by using directed footprints. Computational Geometry, 12, 45–62.MathSciNetCrossRefMATH Barequet, G., & Sharir, M. (1999). Partial surface matching by using directed footprints. Computational Geometry, 12, 45–62.MathSciNetCrossRefMATH
Zurück zum Zitat Chen, S., Li, Y., & Kwok, N. M. (2011). Active vision in robotic systems: A survey of recent developments. IJRR, 30(11), 1343–1377. Chen, S., Li, Y., & Kwok, N. M. (2011). Active vision in robotic systems: A survey of recent developments. IJRR, 30(11), 1343–1377.
Zurück zum Zitat Denzler, J., & Brown, C. M. (2002). Information theoretic sensor data selection for active object recognition and state estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2), 145–157.CrossRef Denzler, J., & Brown, C. M. (2002). Information theoretic sensor data selection for active object recognition and state estimation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2), 145–157.CrossRef
Zurück zum Zitat Doya, K., Ishii, S., Pouget, A., & Rao, R. P. N. (2007). Bayesian brain: Probabilistic approaches to neural coding. Cambridge: The MIT Press.MATH Doya, K., Ishii, S., Pouget, A., & Rao, R. P. N. (2007). Bayesian brain: Probabilistic approaches to neural coding. Cambridge: The MIT Press.MATH
Zurück zum Zitat Drost, B., Ulrich, M., Navab, N., & Ilic, S. (2010). Model globally, match locally: Efficient and robust 3d object recognition. In IEEE conference on computer vision and pattern recognition (CVPR), 2010, pp. 998–1005. doi:10.1109/CVPR.2010.5540108 Drost, B., Ulrich, M., Navab, N., & Ilic, S. (2010). Model globally, match locally: Efficient and robust 3d object recognition. In IEEE conference on computer vision and pattern recognition (CVPR), 2010, pp. 998–1005. doi:10.​1109/​CVPR.​2010.​5540108
Zurück zum Zitat Eidenberger, R., Grundmann, T., Feiten, W., & Zoellner, R. (2008). Fast parametric viewpoint estimation for active object detection. In IEEE international conference on multisensor fusion and integration for intelligent systems, 2008 (pp. 309–314). IEEE. Eidenberger, R., Grundmann, T., Feiten, W., & Zoellner, R. (2008). Fast parametric viewpoint estimation for active object detection. In IEEE international conference on multisensor fusion and integration for intelligent systems, 2008 (pp. 309–314). IEEE.
Zurück zum Zitat Fitzgibbon, A. (2001). Robust registration of 2D and 3D point sets. In: Proceedings of the British Machine Vision Conference (pp. 662–670). Manchester, UK. Fitzgibbon, A. (2001). Robust registration of 2D and 3D point sets. In: Proceedings of the British Machine Vision Conference (pp. 662–670). Manchester, UK.
Zurück zum Zitat Glover, J., Rusu, R., & Bradski, G. (2011). Monte Carlo pose estimation with quaternion kernels and the bingham distribution. In Proceedings of robotics: Science and systems. Los Angeles, CA, USA. Glover, J., Rusu, R., & Bradski, G. (2011). Monte Carlo pose estimation with quaternion kernels and the bingham distribution. In Proceedings of robotics: Science and systems. Los Angeles, CA, USA.
Zurück zum Zitat Grzyb, B. J., Castelló, V., & del Pobil, A. P. (2012). Reachable by walking: Inappropriate integration of near and far space may lead to distance errors. In: Szufnarowska, J. (Ed.), Proceedings of the post-graduate conference on robotics and development of cognition (pp. 12–15). Grzyb, B. J., Castelló, V., & del Pobil, A. P. (2012). Reachable by walking: Inappropriate integration of near and far space may lead to distance errors. In: Szufnarowska, J. (Ed.), Proceedings of the post-graduate conference on robotics and development of cognition (pp. 12–15).
Zurück zum Zitat Hough, P. (1962). Method and means for recognizing complex patterns. U.S. Patent 3.069.654. Hough, P. (1962). Method and means for recognizing complex patterns. U.S. Patent 3.069.654.
Zurück zum Zitat Kasper, A., Xue, Z., & Dillmann, R. (2012). The KIT object models database: An object model database for object recognition, localization and manipulation in service robotics. IJRR, 31(8), 927–934. Kasper, A., Xue, Z., & Dillmann, R. (2012). The KIT object models database: An object model database for object recognition, localization and manipulation in service robotics. IJRR, 31(8), 927–934.
Zurück zum Zitat Kriegel, S., Brucker, M., Marton, Z. C., Bodenmüller, T., & Suppa, M. (2013). Combining object modeling and recognition for active scene exploration. In IEEE international conference on intelligent robots and systems (IROS), Tokyo, Japan. Kriegel, S., Brucker, M., Marton, Z. C., Bodenmüller, T., & Suppa, M. (2013). Combining object modeling and recognition for active scene exploration. In IEEE international conference on intelligent robots and systems (IROS), Tokyo, Japan.
Zurück zum Zitat Kriegel, S., Rink, C., Bodenmüller, T., & Suppa, M. (2015). Efficient next-best-scan planning for autonomous 3d surface reconstruction of unknown objects. JRTIP, 10(4), 611–631. Kriegel, S., Rink, C., Bodenmüller, T., & Suppa, M. (2015). Efficient next-best-scan planning for autonomous 3d surface reconstruction of unknown objects. JRTIP, 10(4), 611–631.
Zurück zum Zitat Lynott, D., & Connell, L. (2009). Modality exclusivity norms for 423 object properties. Behavior Research Methods, 41(2), 558–564.CrossRef Lynott, D., & Connell, L. (2009). Modality exclusivity norms for 423 object properties. Behavior Research Methods, 41(2), 558–564.CrossRef
Zurück zum Zitat Marton, Z. C., Pangercic, D., Blodow, N., & Beetz, M. (2011). Combined 2D–3D categorization and classification for multimodal perception systems. The International Journal of Robotics Research, 30(11), 1378–1402.CrossRef Marton, Z. C., Pangercic, D., Blodow, N., & Beetz, M. (2011). Combined 2D–3D categorization and classification for multimodal perception systems. The International Journal of Robotics Research, 30(11), 1378–1402.CrossRef
Zurück zum Zitat Marton, Z. C., Seidel, F., Balint-Benczedi, F., & Beetz, M. (2012). Ensembles of strong learners for multi-cue classification. Pattern Recognition Letters (PRL), Special Isssue on Scene Understandings and Behaviours Analysis. Marton, Z. C., Seidel, F., Balint-Benczedi, F., & Beetz, M. (2012). Ensembles of strong learners for multi-cue classification. Pattern Recognition Letters (PRL), Special Isssue on Scene Understandings and Behaviours Analysis.
Zurück zum Zitat Mian, A., Bennamoun, M., & Owens, R. (2010). On the repeatability and quality of keypoints for local feature-based 3d object retrieval from cluttered scenes. International Journal of Computer Vision, 89(2–3), 348–361. doi:10.1007/s11263-009-0296-z.CrossRef Mian, A., Bennamoun, M., & Owens, R. (2010). On the repeatability and quality of keypoints for local feature-based 3d object retrieval from cluttered scenes. International Journal of Computer Vision, 89(2–3), 348–361. doi:10.​1007/​s11263-009-0296-z.CrossRef
Zurück zum Zitat Mian, A. S., Bennamoun, M., & Owens, R. (2006). Three-dimensional model-based object recognition and segmentation in cluttered scenes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), 1584–1601. doi:10.1109/TPAMI.2006.213. Mian, A. S., Bennamoun, M., & Owens, R. (2006). Three-dimensional model-based object recognition and segmentation in cluttered scenes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(10), 1584–1601. doi:10.​1109/​TPAMI.​2006.​213.
Zurück zum Zitat Novatnack, J., & Nishino, K. (2008). Scale-dependent/invariant local 3d shape descriptors for fully automatic registration of multiple sets of range images. ECCV, 3, 440–453. Novatnack, J., & Nishino, K. (2008). Scale-dependent/invariant local 3d shape descriptors for fully automatic registration of multiple sets of range images. ECCV, 3, 440–453.
Zurück zum Zitat Papazov, C., Haddadin, S., Parusel, S., Krieger, K., & Burschka, D. (2012). Rigid 3d geometry matching for grasping of known objects in cluttered scenes. The International Journal of Robotics Research, 31(4), 538–553.CrossRef Papazov, C., Haddadin, S., Parusel, S., Krieger, K., & Burschka, D. (2012). Rigid 3d geometry matching for grasping of known objects in cluttered scenes. The International Journal of Robotics Research, 31(4), 538–553.CrossRef
Zurück zum Zitat Pronobis, A., Mozos, O. M., Caputo, B., & Jensfelt, P. (2010). Multi-modal semantic place classification. The International Journal of Robotics Research (IJRR), 29(2–3), 298–320. doi:10.1177/0278364909356483. Pronobis, A., Mozos, O. M., Caputo, B., & Jensfelt, P. (2010). Multi-modal semantic place classification. The International Journal of Robotics Research (IJRR), 29(2–3), 298–320. doi:10.​1177/​0278364909356483​.
Zurück zum Zitat Riedel, S., Marton, Z. C., & Kriegel, S. (2016). Multi-view orientation estimation using Bingham mixture models. In: 2016 IEEE international conference on automation, quality and testing, robotics (AQTR). doi:10.1109/AQTR.2016.7501381 Riedel, S., Marton, Z. C., & Kriegel, S. (2016). Multi-view orientation estimation using Bingham mixture models. In: 2016 IEEE international conference on automation, quality and testing, robotics (AQTR). doi:10.​1109/​AQTR.​2016.​7501381
Zurück zum Zitat Rink, C., Kriegel, S., Seth, D., Denninger, M., Marton, Z. C., & Bodenmüller, T. (2016). Monte Carlo registration and its application with autonomous robots. Journal of Sensors. doi:10.1155/2016/2546819 Rink, C., Kriegel, S., Seth, D., Denninger, M., Marton, Z. C., & Bodenmüller, T. (2016). Monte Carlo registration and its application with autonomous robots. Journal of Sensors. doi:10.​1155/​2016/​2546819
Zurück zum Zitat Rink, C., Marton, Z. C., Seth, D., Bodenmüller, T., & Suppa, M. (2013). Feature based particle filter registration of 3D surface models and its application in robotics. In IEEE international conference on intelligent robots and systems (IROS), Tokyo, Japan. Rink, C., Marton, Z. C., Seth, D., Bodenmüller, T., & Suppa, M. (2013). Feature based particle filter registration of 3D surface models and its application in robotics. In IEEE international conference on intelligent robots and systems (IROS), Tokyo, Japan.
Zurück zum Zitat Romea, A., & Srinivasa, S. (2010). Efficient multi-view object recognition and full pose estimation. In 2010 IEEE international conference on robotics and automation (ICRA 2010). Romea, A., & Srinivasa, S. (2010). Efficient multi-view object recognition and full pose estimation. In 2010 IEEE international conference on robotics and automation (ICRA 2010).
Zurück zum Zitat Roy, S. D., Chaudhury, S., & Banerjee, S. (2004). Active recognition through next view planning: A survey. Pattern Recognition, 37(3), 429–446.CrossRef Roy, S. D., Chaudhury, S., & Banerjee, S. (2004). Active recognition through next view planning: A survey. Pattern Recognition, 37(3), 429–446.CrossRef
Zurück zum Zitat Rusu, R., Blodow, N., & Beetz, M. (2009). Fast point feature histograms (fpfh) for 3d registration. In IEEE international conference on robotics and automation, 2009. ICRA ’09 (pp. 3212–3217). doi:10.1109/ROBOT.2009.5152473 Rusu, R., Blodow, N., & Beetz, M. (2009). Fast point feature histograms (fpfh) for 3d registration. In IEEE international conference on robotics and automation, 2009. ICRA ’09 (pp. 3212–3217). doi:10.​1109/​ROBOT.​2009.​5152473
Zurück zum Zitat Rusu, R., Blodow, N., Marton, Z., & Beetz, M. (2008). Aligning point cloud views using persistent feature histograms. In IEEE/RSJ international conference on intelligent robots and systems, 2008. IROS 2008 (pp. 3384–3391). doi:10.1109/IROS.2008.4650967 Rusu, R., Blodow, N., Marton, Z., & Beetz, M. (2008). Aligning point cloud views using persistent feature histograms. In IEEE/RSJ international conference on intelligent robots and systems, 2008. IROS 2008 (pp. 3384–3391). doi:10.​1109/​IROS.​2008.​4650967
Zurück zum Zitat Selinger, A., & Nelson, R. (2001). Appearance-based object recognition using multiple views. In IN CVPR01 (pp. 905–911). Selinger, A., & Nelson, R. (2001). Appearance-based object recognition using multiple views. In IN CVPR01 (pp. 905–911).
Zurück zum Zitat Thrun, S., Burgard, W., & Fox, D. (2005). Probabilistic robotics. Cambridge, MA: MIT Press.MATH Thrun, S., Burgard, W., & Fox, D. (2005). Probabilistic robotics. Cambridge, MA: MIT Press.MATH
Zurück zum Zitat Tombari, F., & Di Stefano, F. (2012). Hough voting for 3d object recognition under occlusion and clutter. IPSJ Transactions on Computer Vision and Applications, 4, 20–29.CrossRef Tombari, F., & Di Stefano, F. (2012). Hough voting for 3d object recognition under occlusion and clutter. IPSJ Transactions on Computer Vision and Applications, 4, 20–29.CrossRef
Zurück zum Zitat Voit, M., & Stiefelhagen, R. (2006). A bayesian approach for multi-view head pose estimation. In IEEE international conference on multisensor fusion and integration for intelligent systems - MFI06. Heidelberg, Germany. Voit, M., & Stiefelhagen, R. (2006). A bayesian approach for multi-view head pose estimation. In IEEE international conference on multisensor fusion and integration for intelligent systems - MFI06. Heidelberg, Germany.
Zurück zum Zitat Wahl, E., Hillenbrand, U., & Hirzinger, G. (2003). Surflet-pair-relation histograms: A statistical 3D-shape representation for rapid classification. In 3DIM (pp. 474–481). Wahl, E., Hillenbrand, U., & Hirzinger, G. (2003). Surflet-pair-relation histograms: A statistical 3D-shape representation for rapid classification. In 3DIM (pp. 474–481).
Metadaten
Titel
Improving object orientation estimates by considering multiple viewpoints
Orientation histograms of symmetries and measurement models for view selection
verfasst von
Zoltán Csaba Márton
Serkan Türker
Christian Rink
Manuel Brucker
Simon Kriegel
Tim Bodenmüller
Sebastian Riedel
Publikationsdatum
18.04.2017
Verlag
Springer US
Erschienen in
Autonomous Robots / Ausgabe 2/2018
Print ISSN: 0929-5593
Elektronische ISSN: 1573-7527
DOI
https://doi.org/10.1007/s10514-017-9633-1

Weitere Artikel der Ausgabe 2/2018

Autonomous Robots 2/2018 Zur Ausgabe

Neuer Inhalt