
2016 | OriginalPaper | Chapter

Recursive Inference for Prediction of Objects in Urban Environments

Authors: Cesar Cadena, Jana Košecká

Published in: Robotics Research

Publisher: Springer International Publishing


Abstract

Future advancements in robotic navigation and mapping rest to a large extent on robust, efficient and more advanced semantic understanding of the surrounding environment. Existing semantic mapping approaches typically consider a small number of semantic categories and require complex inference or a large number of training examples to achieve desirable performance. In this work we present an efficient approach for predicting the locations of generic objects in urban environments by means of semantic segmentation of a video into object and non-object categories. We exploit widely available exemplars of non-object categories (such as road, buildings and vegetation) and use geometric cues that are indicative of the presence of object boundaries to gather evidence about objects regardless of their category. We formulate the object/non-object semantic segmentation problem in the Conditional Random Field framework, where the structure of the graph is induced by a minimum spanning tree computed over a 3D point cloud, yielding an efficient algorithm for exact inference. The chosen 3D representation naturally lends itself to on-line recursive belief updates with a simple soft data association mechanism. We carry out extensive experiments on videos of urban environments acquired by a moving vehicle and show quantitatively and qualitatively the benefits of our proposal.
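To make the central idea of the abstract concrete, the following minimal Python sketch shows why a tree-structured graph admits exact inference. It is not the authors' implementation. It assumes binary labels (0 = non-object, 1 = object), a Potts-style pairwise penalty, and made-up unary costs. The function names `minimum_spanning_tree` and `map_labels` and all numeric values are hypothetical and chosen only for illustration.

```python
import math


def minimum_spanning_tree(points):
    """Prim's algorithm on the complete Euclidean graph over 3D points.

    Returns (parent, child) edges in the order nodes join the tree,
    so every parent appears before its children. Node 0 is the root.
    """
    n = len(points)
    in_tree = [False] * n
    best_dist = [math.inf] * n
    best_parent = [-1] * n
    best_dist[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the closest node that is not yet in the tree.
        u = min((i for i in range(n) if not in_tree[i]),
                key=lambda i: best_dist[i])
        in_tree[u] = True
        if best_parent[u] >= 0:
            edges.append((best_parent[u], u))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_dist[v]:
                    best_dist[v] = d
                    best_parent[v] = u
    return edges


def map_labels(unary, edges, pairwise_cost):
    """Exact minimum-energy labeling on a tree via min-sum dynamic programming.

    unary[i][l] is the cost of giving node i label l (illustrative values).
    pairwise_cost is paid whenever two connected nodes disagree (Potts model,
    an assumption of this sketch).
    """
    n = len(unary)
    labels = range(len(unary[0]))
    cost = [list(u) for u in unary]  # accumulates the cost of each subtree
    best_child_label = {}
    # Upward pass: leaves to root (children are processed before parents).
    for parent, child in reversed(edges):
        choice = []
        for lp in labels:
            best = min(labels, key=lambda lc: cost[child][lc]
                       + (pairwise_cost if lc != lp else 0.0))
            cost[parent][lp] += cost[child][best] + (
                pairwise_cost if best != lp else 0.0)
            choice.append(best)
        best_child_label[child] = choice
    # Downward pass: fix the root, then read off each child's best label.
    result = [0] * n
    result[0] = min(labels, key=lambda l: cost[0][l])
    for parent, child in edges:
        result[child] = best_child_label[child][result[parent]]
    return result


# Four 3D points: three close together and one far away.
points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
# Per-point costs for [non-object, object]; point 1 is a weak, noisy vote.
unary = [[0.2, 1.0], [0.6, 0.5], [0.2, 1.0], [1.5, 0.1]]

edges = minimum_spanning_tree(points)
labels = map_labels(unary, edges, pairwise_cost=0.3)
print(edges)   # tree edges as (parent, child) pairs
print(labels)  # one label per point
```

Because the graph is a tree, two passes over its edges give the exact minimum-energy labeling, with no need for approximate or iterative inference. In this toy input the weak "object" vote at point 1 is overruled by its neighbours, while the strong vote at point 3 survives the smoothing.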

Metadata
Title
Recursive Inference for Prediction of Objects in Urban Environments
Authors
Cesar Cadena
Jana Košecká
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-28872-7_31