Skip to main content

2020 | OriginalPaper | Buchkapitel

Monocular Vision-Based Real-Time Vehicle Detection at Container Terminals

verfasst von : Zijian Liu, Tianlei Zhang, Bei He, Yu Liu, Li Sun, Wenyang Tang

Erschienen in: Proceedings of China SAE Congress 2018: Selected Papers

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We present a practical approach to vehicle detection at container terminals based on a single camera and prevailing convolutional neural network models in computer vision domain. Aiming at container terminal scenarios, we introduce a specialized data labelling strategy for network training, as well as an optimized setting of crucial hyperparameters, leading to a significant improvement on results. Our solution achieves 83% precision with 90% recall for semitrailer trunks within 30 m ahead of the vehicle-mounted monocular camera, at a speed of 32 frames per second (FPS) on a Nvidia Titan X for 416 × 416 image input, also providing more alternatives of fairly easy speed/accuracy trade-off. Compared to traditional lidar-based vehicle detection method for autonomous driving, our solution is more robust for particular container terminal scenarios while still maintaining a real-time performance by completely eliminating the region proposal stage.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks[C]. In: International conference on neural information processing systems. Curran Associates Inc., pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks[C]. In: International conference on neural information processing systems. Curran Associates Inc., pp 1097–1105
2.
Zurück zum Zitat Everingham M, Gool LV, Williams CKI et al (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef Everingham M, Gool LV, Williams CKI et al (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef
3.
Zurück zum Zitat Lin T, Maire M, Belongie SJ et al (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755 Lin T, Maire M, Belongie SJ et al (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755
4.
Zurück zum Zitat Felzenszwalb P, Mcallester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer vision and pattern recognition, 2008. CVPR 2008. IEEE conference on. IEEE, pp 1–8 Felzenszwalb P, Mcallester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer vision and pattern recognition, 2008. CVPR 2008. IEEE conference on. IEEE, pp 1–8
5.
Zurück zum Zitat Girshick RB, Donahue J, Darrell T et al (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer vision and pattern recognition, pp 580–587 Girshick RB, Donahue J, Darrell T et al (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer vision and pattern recognition, pp 580–587
6.
Zurück zum Zitat Girshick R (2015) Fast R-CNN[C]. In: IEEE international conference on computer vision. IEEE Computer Society, pp 1440–1448 Girshick R (2015) Fast R-CNN[C]. In: IEEE international conference on computer vision. IEEE Computer Society, pp 1440–1448
7.
Zurück zum Zitat Ren S, Girshick R, Girshick R et al (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149CrossRef Ren S, Girshick R, Girshick R et al (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149CrossRef
8.
Zurück zum Zitat Liu W, Anguelov D, Erhan D et al (2016) SSD: single shot MultiBox detector[C]. In: European conference on computer vision. Springer, Cham, pp 21–37 Liu W, Anguelov D, Erhan D et al (2016) SSD: single shot MultiBox detector[C]. In: European conference on computer vision. Springer, Cham, pp 21–37
9.
Zurück zum Zitat Redmon J, Divvala S, Girshick R et al (2016) You only look once: unified, real-time object detection[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 779–788 Redmon J, Divvala S, Girshick R et al (2016) You only look once: unified, real-time object detection[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 779–788
10.
Zurück zum Zitat Geiger A (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 3354–3361 Geiger A (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 3354–3361
11.
Zurück zum Zitat Uijlings JRR, Sande KEAVD, Gevers T et al (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef Uijlings JRR, Sande KEAVD, Gevers T et al (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef
12.
Zurück zum Zitat Erhan D, Szegedy C, Toshev A et al (2014) Scalable object detection using deep neural networks[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 2155–2162 Erhan D, Szegedy C, Toshev A et al (2014) Scalable object detection using deep neural networks[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 2155–2162
13.
Zurück zum Zitat Howard AG, Zhu M, Chen B et al (2017) MobileNets: efficient convolutional neural networks for mobile vision applications Howard AG, Zhu M, Chen B et al (2017) MobileNets: efficient convolutional neural networks for mobile vision applications
14.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci
15.
Zurück zum Zitat Chen X, Ma H, Wan J et al (2017) Multi-view 3D object detection network for autonomous driving. IEEE CVPR Chen X, Ma H, Wan J et al (2017) Multi-view 3D object detection network for autonomous driving. IEEE CVPR
Metadaten
Titel
Monocular Vision-Based Real-Time Vehicle Detection at Container Terminals
verfasst von
Zijian Liu
Tianlei Zhang
Bei He
Yu Liu
Li Sun
Wenyang Tang
Copyright-Jahr
2020
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9718-9_63

    Premium Partner