Skip to main content
Erschienen in: Pattern Analysis and Applications 4/2018

31.05.2017 | Theoretical Advances

Constant-time monocular object detection using scene geometry

verfasst von: Marcos Nieto, Juan Diego Ortega, Peter Leškovský, Orti Senderos

Erschienen in: Pattern Analysis and Applications | Ausgabe 4/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a structured approach for efficiently exploiting the perspective information of a scene to enhance the detection of objects in monocular systems. It defines a finite grid of 3D positions on the dominant ground plane and computes occupancy maps from which object location estimates are extracted . This method works on the top of any detection method, either pixel-wise (e.g. background subtraction) or region-wise (e.g. detection-by-classification) technique, which can be linked to the proposed scheme with minimal fine tuning. Its flexibility thus allows for applying this approach in a wide variety of applications and sectors, such as surveillance applications (e.g. person detection) or driver assistance systems (e.g. vehicle or pedestrian detection). Extensive results provide evidence of its excellent performance and its ease of use in combination with different image processing techniques.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Sobral A, Bouwmans T (2014) BGS library: a library framework for algorithm’s evaluation in foreground/background segmentation. In: Bouwmans T et al (eds) Background modeling and foreground detection for video surveillance. Chapman and Hall/CRC, UK. doi:10.1201/b17223-29 CrossRef Sobral A, Bouwmans T (2014) BGS library: a library framework for algorithm’s evaluation in foreground/background segmentation. In: Bouwmans T et al (eds) Background modeling and foreground detection for video surveillance. Chapman and Hall/CRC, UK. doi:10.​1201/​b17223-29 CrossRef
2.
Zurück zum Zitat Bouwmans T (2015) Traditional and recent approaches in background modeling for foreground detection: an overview. Comput Sci Rev 11–12:31–36MATH Bouwmans T (2015) Traditional and recent approaches in background modeling for foreground detection: an overview. Comput Sci Rev 11–12:31–36MATH
3.
Zurück zum Zitat Cheng L, Gong M (2009) Real time background subtraction from dynamics scenes. In: International conference on computer vision (ICCV). pp 2066–2073 Cheng L, Gong M (2009) Real time background subtraction from dynamics scenes. In: International conference on computer vision (ICCV). pp 2066–2073
4.
Zurück zum Zitat Kryjak T, Komorkiewicz M, Gorgon M (2012) Real-time background generation and foreground object segmentation for high-definition colour video stream in FPGA device. J Real Time Image Proc 9(1):61–77CrossRef Kryjak T, Komorkiewicz M, Gorgon M (2012) Real-time background generation and foreground object segmentation for high-definition colour video stream in FPGA device. J Real Time Image Proc 9(1):61–77CrossRef
5.
Zurück zum Zitat Del Bimbo A, Lisanti G, Masi I, Pernici F (2010) Person detection using temporal and geometric context with a pan tilt zoom camera. In: 20th International conference on pattern recognition (ICPR). pp 3886–3889 Del Bimbo A, Lisanti G, Masi I, Pernici F (2010) Person detection using temporal and geometric context with a pan tilt zoom camera. In: 20th International conference on pattern recognition (ICPR). pp 3886–3889
6.
Zurück zum Zitat Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645CrossRef Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645CrossRef
7.
Zurück zum Zitat Ortega JD, Nieto M, Cortes A, Florez J (2013) Perspective multiscale detection of vehicles for real-time forward collision avoidance systems. In: Advanced concepts for intelligent vision systems. Lecture notes in computer science, vol 8192. pp 645–656CrossRef Ortega JD, Nieto M, Cortes A, Florez J (2013) Perspective multiscale detection of vehicles for real-time forward collision avoidance systems. In: Advanced concepts for intelligent vision systems. Lecture notes in computer science, vol 8192. pp 645–656CrossRef
8.
Zurück zum Zitat Carr P, Sheikh Y, Matthews I (2012) Monocular object detection using 3D geometric primitives. In: European conference on computer vision (ECCV). Lecture notes in computer science, vol 7572. pp 864–878CrossRef Carr P, Sheikh Y, Matthews I (2012) Monocular object detection using 3D geometric primitives. In: European conference on computer vision (ECCV). Lecture notes in computer science, vol 7572. pp 864–878CrossRef
9.
Zurück zum Zitat Buch N, Cracknell M, Orwell J, Velastin SA (2009) Vehicle localisation and classification in urban CCTV streams. In: 16th World congress on intelligent transport systems Buch N, Cracknell M, Orwell J, Velastin SA (2009) Vehicle localisation and classification in urban CCTV streams. In: 16th World congress on intelligent transport systems
10.
Zurück zum Zitat Gonzalez A, Villalonga G, Ros G, Vazquez D, Lopez AM (2015) 3D-guided multiscale sliding window for pedestrian detection. In: Pattern recognition and image analysis. Lecture notes in computer science, vol 9117. pp 560–568CrossRef Gonzalez A, Villalonga G, Ros G, Vazquez D, Lopez AM (2015) 3D-guided multiscale sliding window for pedestrian detection. In: Pattern recognition and image analysis. Lecture notes in computer science, vol 9117. pp 560–568CrossRef
11.
Zurück zum Zitat Brown L, Feris R, Pankanti S (2014) Temporal non-maximum suppression for pedestrian detection using self-calibration. In: 22nd International conference on pattern recognition (ICPR). pp 2239–2244 Brown L, Feris R, Pankanti S (2014) Temporal non-maximum suppression for pedestrian detection using self-calibration. In: 22nd International conference on pattern recognition (ICPR). pp 2239–2244
12.
Zurück zum Zitat Hoeim D, Efros AA, Hebert M (2008) Putting objects in perspective. Int J Comput Vis 80(1):3–15CrossRef Hoeim D, Efros AA, Hebert M (2008) Putting objects in perspective. Int J Comput Vis 80(1):3–15CrossRef
13.
Zurück zum Zitat Pan J, Kanade T (2013) Coherent object detection with 3D geometric context from a single image. In: IEEE international conference on computer vision (ICCV). pp 2576–2583 Pan J, Kanade T (2013) Coherent object detection with 3D geometric context from a single image. In: IEEE international conference on computer vision (ICCV). pp 2576–2583
14.
Zurück zum Zitat Bartoli F, Lisanti G, Karaman S, Bagdanov A, Del Bimbo A (2014) Unsupervised scene adaptation for faster multi-scale pedestrian detection. In: 22nd International conference on pattern recognition (ICPR). pp 3534–3539 Bartoli F, Lisanti G, Karaman S, Bagdanov A, Del Bimbo A (2014) Unsupervised scene adaptation for faster multi-scale pedestrian detection. In: 22nd International conference on pattern recognition (ICPR). pp 3534–3539
15.
Zurück zum Zitat Cai Y (2006) Robust visual tracking for multiple targets. In: European conference on computer vision (ECCV). pp 107–118CrossRef Cai Y (2006) Robust visual tracking for multiple targets. In: European conference on computer vision (ECCV). pp 107–118CrossRef
16.
Zurück zum Zitat Broggi A, Bertozzi M, Fascioli A (2001) Self-calibration of a stereo vision system for automotive applications. IEEE Int Conf Robot Autom (ICRA) 4:3698–3703 Broggi A, Bertozzi M, Fascioli A (2001) Self-calibration of a stereo vision system for automotive applications. IEEE Int Conf Robot Autom (ICRA) 4:3698–3703
17.
Zurück zum Zitat Fleuret F, Berclaz J, Lengagne R, Fua P (2008) Multicamera people tracking with a probabilistic occupancy map. IEEE Trans Pattern Anal Mach Intell 30(2):267–282CrossRef Fleuret F, Berclaz J, Lengagne R, Fua P (2008) Multicamera people tracking with a probabilistic occupancy map. IEEE Trans Pattern Anal Mach Intell 30(2):267–282CrossRef
18.
Zurück zum Zitat Benenson R, Omran M, Hosang J, Schiele B (2014) Ten years of pedestrian detection, what have we learned? In: ECCV, CVRSUAD workshop Benenson R, Omran M, Hosang J, Schiele B (2014) Ten years of pedestrian detection, what have we learned? In: ECCV, CVRSUAD workshop
19.
Zurück zum Zitat Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. IEEE Conf Comput Vis Pattern Recognit (CVPR) 1:886–893 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. IEEE Conf Comput Vis Pattern Recognit (CVPR) 1:886–893
20.
Zurück zum Zitat Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545CrossRef Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545CrossRef
21.
Zurück zum Zitat Benenson R, Mathias M, Timofte R, Van Gool L (2012) Pedestrian detection at 100 frames per second. In: IEEE conference on computer vision and pattern recognition (CVPR). pp 2903–2910 Benenson R, Mathias M, Timofte R, Van Gool L (2012) Pedestrian detection at 100 frames per second. In: IEEE conference on computer vision and pattern recognition (CVPR). pp 2903–2910
22.
Zurück zum Zitat LeCun Y, Bengio Y, Hinton G (2005) Deep learning. Nature 521(7553):436–444CrossRef LeCun Y, Bengio Y, Hinton G (2005) Deep learning. Nature 521(7553):436–444CrossRef
24.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Conference on neural information processing systems (NIPS) Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Conference on neural information processing systems (NIPS)
25.
Zurück zum Zitat Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks. In: Conference on neural information processing systems (NIPS) Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks. In: Conference on neural information processing systems (NIPS)
26.
Zurück zum Zitat Nieto M, Ortega JD, Cortes A, Gaines S (2014) Perspective multiscale detection and tracking of persons. In: Multimedia modeling. Lecture notes in computer science, vol 8326. pp 92–103CrossRef Nieto M, Ortega JD, Cortes A, Gaines S (2014) Perspective multiscale detection and tracking of persons. In: Multimedia modeling. Lecture notes in computer science, vol 8326. pp 92–103CrossRef
27.
Zurück zum Zitat Hartley RI, Zisserman A (2004) Multiple view geometry in computer vision. Cambridge University Press, CambridgeCrossRef Hartley RI, Zisserman A (2004) Multiple view geometry in computer vision. Cambridge University Press, CambridgeCrossRef
28.
Zurück zum Zitat Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154CrossRef Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154CrossRef
29.
Zurück zum Zitat Satzoda RK, Trivedi MM (2014) Efficient lane and vehicle detection with integrated synergies (ELVIS). In: IEEE conference on computer vision and pattern recognition (CVPR) workshops Satzoda RK, Trivedi MM (2014) Efficient lane and vehicle detection with integrated synergies (ELVIS). In: IEEE conference on computer vision and pattern recognition (CVPR) workshops
30.
Zurück zum Zitat Benfold B, Reid I (2011) Stable multi-target tracking in real-time surveillance video. In: IEEE conference on computer vision and pattern recognition (CVPR). pp 3457–3464 Benfold B, Reid I (2011) Stable multi-target tracking in real-time surveillance video. In: IEEE conference on computer vision and pattern recognition (CVPR). pp 3457–3464
31.
Zurück zum Zitat D’Orazio T, Leo M, Mosca N, Spagnolo P, Mazzeo PL (2009) A semi-automatic system for ground truth generation of soccer video sequences. In: Sixth IEEE international conference on advanced video and signal based surveillance (AVSS). pp 559–564 D’Orazio T, Leo M, Mosca N, Spagnolo P, Mazzeo PL (2009) A semi-automatic system for ground truth generation of soccer video sequences. In: Sixth IEEE international conference on advanced video and signal based surveillance (AVSS). pp 559–564
32.
Zurück zum Zitat Blunsden SJ, Fisher RB (2010) The BEHAVE video dataset: ground truthed video for multi-person behavior classification. Ann BMVA 4:1–12CrossRef Blunsden SJ, Fisher RB (2010) The BEHAVE video dataset: ground truthed video for multi-person behavior classification. Ann BMVA 4:1–12CrossRef
33.
Zurück zum Zitat Zivkovic Z (2004) Improved adaptive Gaussian mixture model for background subtraction. In: 17th International conference on pattern recognition (ICPR). pp 28–31 Zivkovic Z (2004) Improved adaptive Gaussian mixture model for background subtraction. In: 17th International conference on pattern recognition (ICPR). pp 28–31
34.
Zurück zum Zitat MacFarlane NJB, Schofield CP (1995) Segmentation and tracking of piglets in images. Mach Vis Appl 8(3):187–193CrossRef MacFarlane NJB, Schofield CP (1995) Segmentation and tracking of piglets in images. Mach Vis Appl 8(3):187–193CrossRef
35.
Zurück zum Zitat Godbehere AB, Matsukawa A, Goldberg K (2012) Visual tracking of human visitors under variable-lighting conditions for a responsive audio art installation. In: American control conference (ACC). pp 4305–4312 Godbehere AB, Matsukawa A, Goldberg K (2012) Visual tracking of human visitors under variable-lighting conditions for a responsive audio art installation. In: American control conference (ACC). pp 4305–4312
36.
Zurück zum Zitat Garrido-Jurado S, Muñoz-Salinas R, Madrid-Cuevas FJ, Marín-Jiménez MJ (2014) Automatic generation and detection of highly reliable fiducial markers under occlusion. Pattern Recogn 47(6):2280–2292CrossRef Garrido-Jurado S, Muñoz-Salinas R, Madrid-Cuevas FJ, Marín-Jiménez MJ (2014) Automatic generation and detection of highly reliable fiducial markers under occlusion. Pattern Recogn 47(6):2280–2292CrossRef
Metadaten
Titel
Constant-time monocular object detection using scene geometry
verfasst von
Marcos Nieto
Juan Diego Ortega
Peter Leškovský
Orti Senderos
Publikationsdatum
31.05.2017
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 4/2018
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-017-0625-8

Weitere Artikel der Ausgabe 4/2018

Pattern Analysis and Applications 4/2018 Zur Ausgabe