Skip to main content

2015 | OriginalPaper | Buchkapitel

Scene Parsing and Fusion-Based Continuous Traversable Region Formation

verfasst von : Xuhong Xiao, Gee Wah Ng, Yuan Sin Tan, Yeo Ye Chuan

Erschienen in: Computer Vision - ACCV 2014 Workshops

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Determining the categories of different parts of a scene and generating a continuous traversable region map in the physical coordinate system are crucial for autonomous vehicle navigation. This paper presents our efforts in these two aspects for an autonomous vehicle operating in open terrain environment. Driven by the ideas that have been proposed in our Cognitive Architecture, we have designed novel strategies for the top-down facilitation process to explicitly interpret spatial relationship between objects in the scene, and have incorporated a visual attention mechanism into the image-based scene parsing module. The scene parsing module is able to process images fast enough for real-time vehicle navigation applications. To alleviate the challenges in using sparse 3D occupancy grids for path planning, we are proposing an approach to interpolate the category of occupancy grids not hit by 3D LIDAR, with reference to the aligned image-based scene parsing result, so that a continuous \(2\frac{1}{2}D\) traversable region map can be formed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Goodale, M.A., Milner, A.D.: Separate visual pathways for perception and action. trends Neurosci. 15(1), 20–25 (1992)CrossRef Goodale, M.A., Milner, A.D.: Separate visual pathways for perception and action. trends Neurosci. 15(1), 20–25 (1992)CrossRef
2.
3.
Zurück zum Zitat Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the International Conference on Computer Vision, pp. 1150–1157 (1999) Lowe, D.G.: Object recognition from local scale-invariant features. In: Proceedings of the International Conference on Computer Vision, pp. 1150–1157 (1999)
4.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
5.
Zurück zum Zitat Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in Cortex. Nature Neurosci. 2, 1019–1025 (1999)CrossRef Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in Cortex. Nature Neurosci. 2, 1019–1025 (1999)CrossRef
6.
Zurück zum Zitat Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 29(3), 411–426 (2007)CrossRef Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust object recognition with cortex-like mechanisms. IEEE Trans. Pattern Anal. Mach. Intell. 29(3), 411–426 (2007)CrossRef
7.
Zurück zum Zitat Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained multiscale deformable part model. In: CVPR (2008) Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained multiscale deformable part model. In: CVPR (2008)
8.
Zurück zum Zitat Viola, P., Michael J.J.: Rapid object detection using a boosted cascade of simple features. In: CVPR (2001) Viola, P., Michael J.J.: Rapid object detection using a boosted cascade of simple features. In: CVPR (2001)
9.
Zurück zum Zitat Felzenszwalb, P., Girshick, R. McAllester, D.: Cascade object detection with deformable part models. In: CVPR (2010) Felzenszwalb, P., Girshick, R. McAllester, D.: Cascade object detection with deformable part models. In: CVPR (2010)
10.
Zurück zum Zitat Laxebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006) Laxebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
11.
Zurück zum Zitat Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. Comput. Vis. 42(3), 145–175 (2001)CrossRefMATH Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. Comput. Vis. 42(3), 145–175 (2001)CrossRefMATH
12.
Zurück zum Zitat Torralba, A., Murphy, K., P., Freeman, W.T., Rubin, M. A.: Context-based vision system for place and object recognition. In: ICCV, pp. 1023–1029 (2003) Torralba, A., Murphy, K., P., Freeman, W.T., Rubin, M. A.: Context-based vision system for place and object recognition. In: ICCV, pp. 1023–1029 (2003)
13.
Zurück zum Zitat Siagian, C., Itti, L.: Rapid biologically-inspired scene classication using features shared with visual attention. PAMI 29(2), 300–312 (2007)CrossRef Siagian, C., Itti, L.: Rapid biologically-inspired scene classication using features shared with visual attention. PAMI 29(2), 300–312 (2007)CrossRef
14.
Zurück zum Zitat Renniger, L., Malik, J.: When is scene identification just texture recognition? Vis. Res. 44, 2301–2311 (2004)CrossRef Renniger, L., Malik, J.: When is scene identification just texture recognition? Vis. Res. 44, 2301–2311 (2004)CrossRef
15.
Zurück zum Zitat Tighe, J., Lazebnik, S.: SuperParsing: scalable nonparametric image parsing with superpixels. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 352–365. Springer, Heidelberg (2010) CrossRef Tighe, J., Lazebnik, S.: SuperParsing: scalable nonparametric image parsing with superpixels. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 352–365. Springer, Heidelberg (2010) CrossRef
16.
Zurück zum Zitat Li, L.J., Socher, R., Li, F.F.: Towards total scene understanding: classification, annotation and segmentation in an automatic framework. In: CVPR (2009) Li, L.J., Socher, R., Li, F.F.: Towards total scene understanding: classification, annotation and segmentation in an automatic framework. In: CVPR (2009)
17.
Zurück zum Zitat Du, L., Ren, L., Dunson, D., B., Carin, L.: A Bayesian model for simultaneous image clustering, annotation and object segmentation. In: NIPS (2009) Du, L., Ren, L., Dunson, D., B., Carin, L.: A Bayesian model for simultaneous image clustering, annotation and object segmentation. In: NIPS (2009)
18.
Zurück zum Zitat Rabinovich, A., Vedaldi, A., Galleguillos, C.: Object in context. In: ICCV (2007) Rabinovich, A., Vedaldi, A., Galleguillos, C.: Object in context. In: ICCV (2007)
19.
Zurück zum Zitat Galleguillos, C., Belongie, S.: Context-based object categorization: a critical survey. J. Comput. Vis. Image Underst. 114(6), 712–722 (2010)CrossRef Galleguillos, C., Belongie, S.: Context-based object categorization: a critical survey. J. Comput. Vis. Image Underst. 114(6), 712–722 (2010)CrossRef
20.
Zurück zum Zitat He, X., Zemel, R., Carreira-Perpindn, M.A.: Multiscale conditional random fields for image labelling. In: CVPR, pp. 695–702 (2004) He, X., Zemel, R., Carreira-Perpindn, M.A.: Multiscale conditional random fields for image labelling. In: CVPR, pp. 695–702 (2004)
21.
Zurück zum Zitat Kumar, S., Hebert, M.: A hierarchical field framework for unified context-based classification. In: ICCV, pp. 1284–1291 (2005) Kumar, S., Hebert, M.: A hierarchical field framework for unified context-based classification. In: ICCV, pp. 1284–1291 (2005)
22.
Zurück zum Zitat Verbeek, J., Triggs, B.: Scene segmentation with conditional random fields learned from partially labeled images. In: NIPS (2008) Verbeek, J., Triggs, B.: Scene segmentation with conditional random fields learned from partially labeled images. In: NIPS (2008)
24.
Zurück zum Zitat Vandapel, N., Huber, D.F., Kapuria, A., Hebert, M.: Natural terrain classification using three-dimensional Ladar data for ground robot mobility. J. Field Robot. 23(10), 839–861 (2006)CrossRef Vandapel, N., Huber, D.F., Kapuria, A., Hebert, M.: Natural terrain classification using three-dimensional Ladar data for ground robot mobility. J. Field Robot. 23(10), 839–861 (2006)CrossRef
25.
Zurück zum Zitat Himmelsbach, M., Luettel, T., Wuensche, H.J.: Real-time object classification in 3D point clouds using point feature histograms. In: Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, USA (2009) Himmelsbach, M., Luettel, T., Wuensche, H.J.: Real-time object classification in 3D point clouds using point feature histograms. In: Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems, USA (2009)
26.
Zurück zum Zitat Thrun, S., et al.: Stanley: the robot that won the DARPA grand challenge. J. Robot. Syst. 23(9), 661–692 (2006) Thrun, S., et al.: Stanley: the robot that won the DARPA grand challenge. J. Robot. Syst. 23(9), 661–692 (2006)
27.
Zurück zum Zitat Rasmussen, C.: A hybrid vision+Ladar rural road follower. In: Proceedings of the IEEE Conference on Robotics and Automation, pp. 156–161 (2006) Rasmussen, C.: A hybrid vision+Ladar rural road follower. In: Proceedings of the IEEE Conference on Robotics and Automation, pp. 156–161 (2006)
28.
Zurück zum Zitat Manz, M., Himmelsbach, M., Luettel, T., Wuensche, H.: Detection and tracking of road networks in rural terrain by fusing vision and LIDAR. In: Proceedings IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4562–4568 (2011) Manz, M., Himmelsbach, M., Luettel, T., Wuensche, H.: Detection and tracking of road networks in rural terrain by fusing vision and LIDAR. In: Proceedings IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4562–4568 (2011)
29.
Zurück zum Zitat Ng, G.W., Xiao, X., Chan, R.Z., Tan, Y.S.: Scene understanding using DSO cognitive architecture. In: Proceedings of the 15th International Conference on Information Fusion (2012) Ng, G.W., Xiao, X., Chan, R.Z., Tan, Y.S.: Scene understanding using DSO cognitive architecture. In: Proceedings of the 15th International Conference on Information Fusion (2012)
30.
Zurück zum Zitat Zhao, G., Xiao, X., Yuan, J., Ng, G.W.: Fusion of 3D-LIDAR and camera data for scene parsing. J. Vis. Commun. Image Represent. 25(1), 165–183 (2013)CrossRef Zhao, G., Xiao, X., Yuan, J., Ng, G.W.: Fusion of 3D-LIDAR and camera data for scene parsing. J. Vis. Commun. Image Represent. 25(1), 165–183 (2013)CrossRef
31.
Zurück zum Zitat Hochstein, S., Ahissar, M.: View from the top: hierarchies and reverse hierarchies in the visual system. Neuron 36, 791–804 (2002)CrossRef Hochstein, S., Ahissar, M.: View from the top: hierarchies and reverse hierarchies in the visual system. Neuron 36, 791–804 (2002)CrossRef
32.
Zurück zum Zitat Bar, M.: A cortical mechanism for triggering top-down facilitation in visual object recognition. J. Cogn. Neurosci. 15(4), 600–609 (2003)CrossRefMathSciNet Bar, M.: A cortical mechanism for triggering top-down facilitation in visual object recognition. J. Cogn. Neurosci. 15(4), 600–609 (2003)CrossRefMathSciNet
33.
Zurück zum Zitat Yao, J., Fidler, S., and Urtasun, R.: Describing the scene as a whole: joint object detection, scene classfication and semantic segmentation. In: CVPR (2012) Yao, J., Fidler, S., and Urtasun, R.: Describing the scene as a whole: joint object detection, scene classfication and semantic segmentation. In: CVPR (2012)
34.
Zurück zum Zitat Kasther, S., Ungerleider, G.: Mechanisms of visual attention in the human cortex. Annu. Rev. Neural Sci. 23, 315–341 (2000) Kasther, S., Ungerleider, G.: Mechanisms of visual attention in the human cortex. Annu. Rev. Neural Sci. 23, 315–341 (2000)
35.
Zurück zum Zitat Felzenszwalb, P., Huttenlocker, D.: Efficient graph-Based imagesegmentation. IJCV 2, 167–181 (2004)CrossRef Felzenszwalb, P., Huttenlocker, D.: Efficient graph-Based imagesegmentation. IJCV 2, 167–181 (2004)CrossRef
38.
Zurück zum Zitat Ojala, T., Pietikainen, M., Maenpaa, T.: Multi-resolution gray-scaleand rotation invariant texture classification with local binary patterns. PAMI 24(7), 971–986 (2002)CrossRef Ojala, T., Pietikainen, M., Maenpaa, T.: Multi-resolution gray-scaleand rotation invariant texture classification with local binary patterns. PAMI 24(7), 971–986 (2002)CrossRef
39.
Zurück zum Zitat Fenske, M.J., Aminoff, E., Gronau, N., Bar, M.: Top-down facilitation of visual object recognition: object-based and context-based contributions. Prog. Brain Res. 155, 3–21 (2006)CrossRef Fenske, M.J., Aminoff, E., Gronau, N., Bar, M.: Top-down facilitation of visual object recognition: object-based and context-based contributions. Prog. Brain Res. 155, 3–21 (2006)CrossRef
40.
Zurück zum Zitat Oliva, A., Torralba, A.: The role of context in object recognition. Trends Cogn. Sci. 11(2), 520–527 (2007)CrossRef Oliva, A., Torralba, A.: The role of context in object recognition. Trends Cogn. Sci. 11(2), 520–527 (2007)CrossRef
41.
Zurück zum Zitat Desai, C., Ramanan, D., Fowlkes, C.C.: Discriminative models for multi-class object layout. IJCV 2, 169–176 (2012) Desai, C., Ramanan, D., Fowlkes, C.C.: Discriminative models for multi-class object layout. IJCV 2, 169–176 (2012)
42.
Zurück zum Zitat Achanta, R., Hemami, S., Estrada, F., Susstrunk, S.: Frequency-tuned Salient Region Detection. In: CVPR (2009) Achanta, R., Hemami, S., Estrada, F., Susstrunk, S.: Frequency-tuned Salient Region Detection. In: CVPR (2009)
43.
Zurück zum Zitat Rensink, R.A.: The dynamic representation of scenes. Visual Cognition 7(1/2/3), 17–42 (2000)CrossRef Rensink, R.A.: The dynamic representation of scenes. Visual Cognition 7(1/2/3), 17–42 (2000)CrossRef
44.
Zurück zum Zitat Nistér, D., Stewénius, H.: Linear time maximally stable extremal regions. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 183–196. Springer, Heidelberg (2008) CrossRef Nistér, D., Stewénius, H.: Linear time maximally stable extremal regions. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 183–196. Springer, Heidelberg (2008) CrossRef
45.
Zurück zum Zitat Matas, J., Chum, O., Urban, M., Pajdla, T: Robust wide baseline stereo from maximally stable extremal regions. In: BMVC (2002) Matas, J., Chum, O., Urban, M., Pajdla, T: Robust wide baseline stereo from maximally stable extremal regions. In: BMVC (2002)
Metadaten
Titel
Scene Parsing and Fusion-Based Continuous Traversable Region Formation
verfasst von
Xuhong Xiao
Gee Wah Ng
Yuan Sin Tan
Yeo Ye Chuan
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-16628-5_28

Premium Partner