Skip to main content
Top

2020 | OriginalPaper | Chapter

Monocular Vision-Based Real-Time Vehicle Detection at Container Terminals

Authors : Zijian Liu, Tianlei Zhang, Bei He, Yu Liu, Li Sun, Wenyang Tang

Published in: Proceedings of China SAE Congress 2018: Selected Papers

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We present a practical approach to vehicle detection at container terminals based on a single camera and prevailing convolutional neural network models in computer vision domain. Aiming at container terminal scenarios, we introduce a specialized data labelling strategy for network training, as well as an optimized setting of crucial hyperparameters, leading to a significant improvement on results. Our solution achieves 83% precision with 90% recall for semitrailer trunks within 30 m ahead of the vehicle-mounted monocular camera, at a speed of 32 frames per second (FPS) on a Nvidia Titan X for 416 × 416 image input, also providing more alternatives of fairly easy speed/accuracy trade-off. Compared to traditional lidar-based vehicle detection method for autonomous driving, our solution is more robust for particular container terminal scenarios while still maintaining a real-time performance by completely eliminating the region proposal stage.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks[C]. In: International conference on neural information processing systems. Curran Associates Inc., pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks[C]. In: International conference on neural information processing systems. Curran Associates Inc., pp 1097–1105
2.
go back to reference Everingham M, Gool LV, Williams CKI et al (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef Everingham M, Gool LV, Williams CKI et al (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef
3.
go back to reference Lin T, Maire M, Belongie SJ et al (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755 Lin T, Maire M, Belongie SJ et al (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755
4.
go back to reference Felzenszwalb P, Mcallester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer vision and pattern recognition, 2008. CVPR 2008. IEEE conference on. IEEE, pp 1–8 Felzenszwalb P, Mcallester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: Computer vision and pattern recognition, 2008. CVPR 2008. IEEE conference on. IEEE, pp 1–8
5.
go back to reference Girshick RB, Donahue J, Darrell T et al (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer vision and pattern recognition, pp 580–587 Girshick RB, Donahue J, Darrell T et al (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer vision and pattern recognition, pp 580–587
6.
go back to reference Girshick R (2015) Fast R-CNN[C]. In: IEEE international conference on computer vision. IEEE Computer Society, pp 1440–1448 Girshick R (2015) Fast R-CNN[C]. In: IEEE international conference on computer vision. IEEE Computer Society, pp 1440–1448
7.
go back to reference Ren S, Girshick R, Girshick R et al (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149CrossRef Ren S, Girshick R, Girshick R et al (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149CrossRef
8.
go back to reference Liu W, Anguelov D, Erhan D et al (2016) SSD: single shot MultiBox detector[C]. In: European conference on computer vision. Springer, Cham, pp 21–37 Liu W, Anguelov D, Erhan D et al (2016) SSD: single shot MultiBox detector[C]. In: European conference on computer vision. Springer, Cham, pp 21–37
9.
go back to reference Redmon J, Divvala S, Girshick R et al (2016) You only look once: unified, real-time object detection[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 779–788 Redmon J, Divvala S, Girshick R et al (2016) You only look once: unified, real-time object detection[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 779–788
10.
go back to reference Geiger A (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 3354–3361 Geiger A (2012) Are we ready for autonomous driving? The KITTI vision benchmark suite[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 3354–3361
11.
go back to reference Uijlings JRR, Sande KEAVD, Gevers T et al (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef Uijlings JRR, Sande KEAVD, Gevers T et al (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171CrossRef
12.
go back to reference Erhan D, Szegedy C, Toshev A et al (2014) Scalable object detection using deep neural networks[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 2155–2162 Erhan D, Szegedy C, Toshev A et al (2014) Scalable object detection using deep neural networks[C]. In: IEEE conference on computer vision and pattern recognition. IEEE Computer Society, pp 2155–2162
13.
go back to reference Howard AG, Zhu M, Chen B et al (2017) MobileNets: efficient convolutional neural networks for mobile vision applications Howard AG, Zhu M, Chen B et al (2017) MobileNets: efficient convolutional neural networks for mobile vision applications
14.
go back to reference Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Sci
15.
go back to reference Chen X, Ma H, Wan J et al (2017) Multi-view 3D object detection network for autonomous driving. IEEE CVPR Chen X, Ma H, Wan J et al (2017) Multi-view 3D object detection network for autonomous driving. IEEE CVPR
Metadata
Title
Monocular Vision-Based Real-Time Vehicle Detection at Container Terminals
Authors
Zijian Liu
Tianlei Zhang
Bei He
Yu Liu
Li Sun
Wenyang Tang
Copyright Year
2020
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-9718-9_63

Premium Partner