Skip to main content

2015 | OriginalPaper | Buchkapitel

3D-Guided Multiscale Sliding Window for Pedestrian Detection

verfasst von : Alejandro González, Gabriel Villalonga, German Ros, David Vázquez, Antonio M. López

Erschienen in: Pattern Recognition and Image Analysis

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The most relevant modules of a pedestrian detector are the candidate generation and the candidate classification. The former aims at presenting image windows to the latter so that they are classified as containing a pedestrian or not. Much attention has being paid to the classification module, while candidate generation has mainly relied on (multiscale) sliding window pyramid. However, candidate generation is critical for achieving real-time. In this paper we assume a context of autonomous driving based on stereo vision. Accordingly, we evaluate the effect of taking into account the 3D information (derived from the stereo) in order to prune the hundred of thousands windows per image generated by classical pyramidal sliding window. For our study we use a multi-modal (RGB, disparity) and multi-descriptor (HOG, LBP, HOG+LBP) holistic ensemble based on linear SVM. Evaluation on data from the challenging KITTI benchmark suite shows the effectiveness of using 3D information to dramatically reduce the number of candidate windows, even improving the overall pedestrian detection accuracy.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alonso, I.P., Llorca, D.F., Sotelo, M.A., Bergasa, L.M., de Toro, P.R., Nuevo, J., Ocana, M., Garrido, M.A.: Combination of feature extraction methods for svm pedestrian detection. Trans. Intell. Transport. Sys. 8(2), 292–307 (2007)CrossRef Alonso, I.P., Llorca, D.F., Sotelo, M.A., Bergasa, L.M., de Toro, P.R., Nuevo, J., Ocana, M., Garrido, M.A.: Combination of feature extraction methods for svm pedestrian detection. Trans. Intell. Transport. Sys. 8(2), 292–307 (2007)CrossRef
2.
Zurück zum Zitat Benenson, R., Mathias, M., Timofte, R., Van Gool, L.: Pedestrian detection at 100 frames per second. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA (2012) Benenson, R., Mathias, M., Timofte, R., Van Gool, L.: Pedestrian detection at 100 frames per second. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA (2012)
3.
Zurück zum Zitat Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: Proceedings of the British Machine Vision Conference, London, UK (2009) Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: Proceedings of the British Machine Vision Conference, London, UK (2009)
4.
Zurück zum Zitat Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)CrossRef Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)CrossRef
5.
Zurück zum Zitat Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: CVPR 2012 (2012) Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the kitti vision benchmark suite. In: CVPR 2012 (2012)
6.
Zurück zum Zitat Gerónimo, D., López, A.: Vision-based Pedestrian Protection Systems for Intelligent Vehicles. Springer Briefs in Computer Science. Springer, New York (2013) Gerónimo, D., López, A.: Vision-based Pedestrian Protection Systems for Intelligent Vehicles. Springer Briefs in Computer Science. Springer, New York (2013)
7.
Zurück zum Zitat Gerónimo, D., Sappa, A., Ponsa, D., López, A.: 2D–3D based on-board pedestrian detection system. J. Comput. Vis. Image Underst. 114(5), 583–595 (2010)CrossRef Gerónimo, D., Sappa, A., Ponsa, D., López, A.: 2D–3D based on-board pedestrian detection system. J. Comput. Vis. Image Underst. 114(5), 583–595 (2010)CrossRef
8.
Zurück zum Zitat Gu, C., Lim, J.J., Arbelez, P., Malik, J.: Recognition using regions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2009) Gu, C., Lim, J.J., Arbelez, P., Malik, J.: Recognition using regions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2009)
9.
Zurück zum Zitat Hosang, J., Benenson, R., Schiele, B.: How good are detection proposals, really? In: Proceedings of the British Machine Vision Conference (2014) Hosang, J., Benenson, R., Schiele, B.: How good are detection proposals, really? In: Proceedings of the British Machine Vision Conference (2014)
10.
Zurück zum Zitat Labayrade, R., Aubert, D., Tarel, J.P.: Real time obstacle detection in stereovision on non flat road geometry through “v-disparity" representation. In: IEEE Intelligent Vehicle Symposium (2002) Labayrade, R., Aubert, D., Tarel, J.P.: Real time obstacle detection in stereovision on non flat road geometry through “v-disparity" representation. In: IEEE Intelligent Vehicle Symposium (2002)
11.
Zurück zum Zitat Marin, J., Vázquez, D., López, A., Amores, J., Leibe, B.: Random forests of local experts for pedestrian detection. In: Proceedings of the IEEE International Conference on Computer Vision (2013) Marin, J., Vázquez, D., López, A., Amores, J., Leibe, B.: Random forests of local experts for pedestrian detection. In: Proceedings of the IEEE International Conference on Computer Vision (2013)
12.
Zurück zum Zitat Ros, G., Ramos, S., Granados, M., Bakhtiary, A., Vazquez, D., Lopez, A.: Vision-based offline-online paradigm for autonomous driving. In: Winter Conference on Applications of Computer Vision (WACV) (2015) Ros, G., Ramos, S., Granados, M., Bakhtiary, A., Vazquez, D., Lopez, A.: Vision-based offline-online paradigm for autonomous driving. In: Winter Conference on Applications of Computer Vision (WACV) (2015)
13.
Zurück zum Zitat Uijlings, J.R.R., van de Sande, K.E.A., Gevers, T., Smeulders, A.W.M.: Selective search for object recognition. Int. J. Comput. Vision 104(2), 154–171 (2013)CrossRef Uijlings, J.R.R., van de Sande, K.E.A., Gevers, T., Smeulders, A.W.M.: Selective search for object recognition. Int. J. Comput. Vision 104(2), 154–171 (2013)CrossRef
14.
Zurück zum Zitat van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: Proceedings of the IEEE International Conference on Computer Vision (2011) van de Sande, K.E.A., Uijlings, J.R.R., Gevers, T., Smeulders, A.W.M.: Segmentation as selective search for object recognition. In: Proceedings of the IEEE International Conference on Computer Vision (2011)
15.
Zurück zum Zitat Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: Proceedings of the IEEE International Conference on Computer Vision, Kyoto, Japan (2009) Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: Proceedings of the IEEE International Conference on Computer Vision, Kyoto, Japan (2009)
16.
Zurück zum Zitat Zitnick, C.L., Dollár, P.: Edge boxes: locating object proposals from edges. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 391–405. Springer, Heidelberg (2014) CrossRef Zitnick, C.L., Dollár, P.: Edge boxes: locating object proposals from edges. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 391–405. Springer, Heidelberg (2014) CrossRef
Metadaten
Titel
3D-Guided Multiscale Sliding Window for Pedestrian Detection
verfasst von
Alejandro González
Gabriel Villalonga
German Ros
David Vázquez
Antonio M. López
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-19390-8_63

Premium Partner