Skip to main content

2016 | OriginalPaper | Buchkapitel

Recursive Inference for Prediction of Objects in Urban Environments

verfasst von : Cesar Cadena, Jana Košecká

Erschienen in: Robotics Research

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Future advancements in robotic navigation and mapping rest to a large extent on robust, efficient and more advanced semantic understanding of the surrounding environment. The existing semantic mapping approaches typically consider small number of semantic categories, require complex inference or large number of training examples to achieve desirable performance. In the proposed work we present an efficient approach for predicting locations of generic objects in urban environments by means of semantic segmentation of a video into object and non-object categories. We exploit widely available exemplars of non-object categories (such as road, buildings, vegetation) and use geometric cues which are indicative of the presence of object boundaries to gather the evidence about objects regardless of their category. We formulate the object/non-object semantic segmentation problem in the Conditional Random Field framework, where the structure of the graph is induced by a minimum spanning tree computed over a 3D point cloud, yielding an efficient algorithm for an exact inference. The chosen 3D representation naturally lends itself for on-line recursive belief updates with a simple soft data association mechanism. We carry out extensive experiments on videos of urban environments acquired by a moving vehicle and show quantitatively and qualitatively the benefits of our proposal.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
Literatur
1.
Zurück zum Zitat Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Süsstrunk, S.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Patt. Anal. Mach. Intell. 34(11), 2274–2282 (2012)CrossRef Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Süsstrunk, S.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Patt. Anal. Mach. Intell. 34(11), 2274–2282 (2012)CrossRef
2.
Zurück zum Zitat Alexe, B., Deselaers, T., Ferrari, V.: What is an object?. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 73–80, June 2010 Alexe, B., Deselaers, T., Ferrari, V.: What is an object?. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 73–80, June 2010
3.
Zurück zum Zitat Ayvaci, A., Soatto, S.: Detachable object detection: segmentation and depth ordering from short-baseline video. IEEE Trans. Patt. Anal. Mach. Intell. 34(10):1942–1951 (2012) Ayvaci, A., Soatto, S.: Detachable object detection: segmentation and depth ordering from short-baseline video. IEEE Trans. Patt. Anal. Mach. Intell. 34(10):1942–1951 (2012)
4.
Zurück zum Zitat Buchanan, A.M., Fitzgibbon, A.W.: Interactive feature tracking using K-D trees and dynamic programming. IEEE Conf. Comput. Vis. Patt. Recognit. 1, 626–633 (2006) Buchanan, A.M., Fitzgibbon, A.W.: Interactive feature tracking using K-D trees and dynamic programming. IEEE Conf. Comput. Vis. Patt. Recognit. 1, 626–633 (2006)
5.
Zurück zum Zitat Douillard, B., Fox, D., Ramos, F., Durrant-Whyte, H.: Classification and semantic mapping of urban environments. Int. J. Rob. Res. 30, 5–32 (2011)CrossRef Douillard, B., Fox, D., Ramos, F., Durrant-Whyte, H.: Classification and semantic mapping of urban environments. Int. J. Rob. Res. 30, 5–32 (2011)CrossRef
6.
Zurück zum Zitat Eigen, D., Fergus, R.: Nonparametric image parsing using adaptive neighbor sets. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2799–2806, June 2012 Eigen, D., Fergus, R.: Nonparametric image parsing using adaptive neighbor sets. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2799–2806, June 2012
7.
Zurück zum Zitat Floros, G., Leibe, B.: Joint 2d–3d temporally consistent semantic segmentation of street scenes. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2823–2830, 2012 Floros, G., Leibe, B.: Joint 2d–3d temporally consistent semantic segmentation of street scenes. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2823–2830, 2012
8.
Zurück zum Zitat Galvez-Lopez, D., Tardos, J.D.: Bags of binary words for fast place recognition in image sequences. IEEE Trans. Robot. 28(5), 1188–1197 (2012)CrossRef Galvez-Lopez, D., Tardos, J.D.: Bags of binary words for fast place recognition in image sequences. IEEE Trans. Robot. 28(5), 1188–1197 (2012)CrossRef
9.
Zurück zum Zitat Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: Conference on Computer Vision and PatternRecognition (CVPR), 2012 Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: Conference on Computer Vision and PatternRecognition (CVPR), 2012
10.
Zurück zum Zitat Geiger, A., Ziegler, J., Stiller, C.: Stereoscan: dense 3d reconstruction in real-time. In: Intelligent Vehicles Symposium (IV), 2011 Geiger, A., Ziegler, J., Stiller, C.: Stereoscan: dense 3d reconstruction in real-time. In: Intelligent Vehicles Symposium (IV), 2011
11.
Zurück zum Zitat Klasing, K.: Aspects of 3D perception, abstraction, and interpretation in autonomous mobile robotics. Ph.D. thesis, Technical Univeristy of Munich, Germany (2010) Klasing, K.: Aspects of 3D perception, abstraction, and interpretation in autonomous mobile robotics. Ph.D. thesis, Technical Univeristy of Munich, Germany (2010)
12.
Zurück zum Zitat Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press, USA (2009) Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press, USA (2009)
13.
Zurück zum Zitat Kümmerle, R., Grisetti, G., Strasdat, H., Konolige, K., Burgard, W.: g2o: A general framework for graph optimization. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China, May 2011 Kümmerle, R., Grisetti, G., Strasdat, H., Konolige, K., Burgard, W.: g2o: A general framework for graph optimization. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China, May 2011
14.
Zurück zum Zitat Ladický, L., Sturgess, P., Alahari, K., Russell, C., Torr, P.H.S.: What, where and how many? combining object detectors and CRFs. In: Computer Vision—ECCV 2010. Springer, Berlin (2010) Ladický, L., Sturgess, P., Alahari, K., Russell, C., Torr, P.H.S.: What, where and how many? combining object detectors and CRFs. In: Computer Vision—ECCV 2010. Springer, Berlin (2010)
15.
Zurück zum Zitat Ladický, L., Sturgess, P., Russell, C., Sengupta, S., Bastanlar, Y., Clocksin, W., Torr, P.H.S.: Joint optimization for object class segmentation and dense stereo reconstruction. Int. J. Comput. Vis. 100(2), 122–133 (2012)MathSciNetCrossRef Ladický, L., Sturgess, P., Russell, C., Sengupta, S., Bastanlar, Y., Clocksin, W., Torr, P.H.S.: Joint optimization for object class segmentation and dense stereo reconstruction. Int. J. Comput. Vis. 100(2), 122–133 (2012)MathSciNetCrossRef
16.
Zurück zum Zitat Leibe, B., Cornelis, N., Cornelis, K., Van Gool, L.: Dynamic 3d scene analysis from a moving vehicle. In: IEEE Conference on Computer Vision and Pattern Recognition, 2007. CVPR ’07. pp. 1–8 (2007) Leibe, B., Cornelis, N., Cornelis, K., Van Gool, L.: Dynamic 3d scene analysis from a moving vehicle. In: IEEE Conference on Computer Vision and Pattern Recognition, 2007. CVPR ’07. pp. 1–8 (2007)
17.
Zurück zum Zitat Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008. pp. 1–8 (2008) Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008. pp. 1–8 (2008)
18.
Zurück zum Zitat Micusik, B., Košecká, J.: Semantic segmentation of street scenes by superpixel co-occurrence and 3d geometry. In: 2009 IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), 625–632 Oct 2009 Micusik, B., Košecká, J.: Semantic segmentation of street scenes by superpixel co-occurrence and 3d geometry. In: 2009 IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), 625–632 Oct 2009
19.
Zurück zum Zitat Moosmann, F., Stiller, C.: Joint self-localization and tracking of generic objects in 3d range data. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 1138–1144, Karlsruhe, Germany (2013) Moosmann, F., Stiller, C.: Joint self-localization and tracking of generic objects in 3d range data. In: Proceedings of the IEEE International Conference on Robotics and Automation, pp. 1138–1144, Karlsruhe, Germany (2013)
20.
Zurück zum Zitat Posner, I., Cummins, M., Newman, P.: A generative framework for fast urban labeling using spatial and temporal context. Auton. Robot. 26, 153–170 (2009)CrossRef Posner, I., Cummins, M., Newman, P.: A generative framework for fast urban labeling using spatial and temporal context. Auton. Robot. 26, 153–170 (2009)CrossRef
21.
Zurück zum Zitat Ren, C.Y., Reid, I.: gSLIC: a real-time implementation of SLIC superpixel segmentation. Technical report, University of Oxford, Department of Engineering (2011) Ren, C.Y., Reid, I.: gSLIC: a real-time implementation of SLIC superpixel segmentation. Technical report, University of Oxford, Department of Engineering (2011)
22.
Zurück zum Zitat Sengupta, S., Greveson, E., Shahrokni, A., Torr, P.H.S.: Urban 3D Semantic Modelling Using Stereo Vision. In: ICRA (2013) Sengupta, S., Greveson, E., Shahrokni, A., Torr, P.H.S.: Urban 3D Semantic Modelling Using Stereo Vision. In: ICRA (2013)
23.
Zurück zum Zitat Tighe, J., Lazebnik, S.: Superparsing: scalable nonparametric image parsing with superpixels. In: Computer Vision—ECCV 2010. Springer, Berlin (2010) Tighe, J., Lazebnik, S.: Superparsing: scalable nonparametric image parsing with superpixels. In: Computer Vision—ECCV 2010. Springer, Berlin (2010)
24.
Zurück zum Zitat Tighe, J., Lazebnik, S.: Finding things: image parsing with regions and per-exemplar detectors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2013) Tighe, J., Lazebnik, S.: Finding things: image parsing with regions and per-exemplar detectors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2013)
25.
Zurück zum Zitat Triebel, R.A., Paul, R., Rus, D., Newman, P.: Parsing outdoor scenes from streamed 3d laser data using online clustering and incremental belief updates. In: Twenty-Sixth AAAI Conference on Artificial Intelligence (2012) Triebel, R.A., Paul, R., Rus, D., Newman, P.: Parsing outdoor scenes from streamed 3d laser data using online clustering and incremental belief updates. In: Twenty-Sixth AAAI Conference on Artificial Intelligence (2012)
27.
Zurück zum Zitat Wang, D., Posner, I., Newman, P.: What could move? finding cars, pedestrians and bicyclists in 3d laser data. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Minnesota, USA (2012) Wang, D., Posner, I., Newman, P.: What could move? finding cars, pedestrians and bicyclists in 3d laser data. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Minnesota, USA (2012)
28.
Zurück zum Zitat Xiao, J., Quan, L.: Multiple view semantic segmentation for street view images. In: 2009 IEEE 12th International Conference on Computer Vision, pp. 686–693 Oct 2009 Xiao, J., Quan, L.: Multiple view semantic segmentation for street view images. In: 2009 IEEE 12th International Conference on Computer Vision, pp. 686–693 Oct 2009
29.
Zurück zum Zitat Zhang, C., Wang, L., Yang, R.: Semantic segmentation of urban scenes using dense depth maps. In: Computer Vision—ECCV 2010. Springer, Berlin (2010) Zhang, C., Wang, L., Yang, R.: Semantic segmentation of urban scenes using dense depth maps. In: Computer Vision—ECCV 2010. Springer, Berlin (2010)
Metadaten
Titel
Recursive Inference for Prediction of Objects in Urban Environments
verfasst von
Cesar Cadena
Jana Košecká
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-28872-7_31

Neuer Inhalt