nach oben

Machine Vision and Applications

Erschienen in:

01.09.2020 | Original Paper

Detection and pose estimation of auto-rickshaws from traffic images

verfasst von: Blossom Treesa Bastian, Jiji Charangatt Victor

Erschienen in: Machine Vision and Applications | Ausgabe 6/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

In intelligent transport systems, detection and identification of vehicle types enact a substantial role. In this context, this paper addresses the detection and pose classification of a specific vehicle type: auto-rickshaws which have been heavily neglected by the publicly available vehicle datasets, but remains the most commonly used and cheap form of transportation in south Asian countries. Here, we introduce a dataset for auto-rickshaws which consists of instances of varying shape, orientation, size, scale, colour, viewpoint and many more. Further, we carry out a detailed analysis on the performance of state-of-the-art detection algorithms based on both hand-designed and deep features on the proposed dataset. The introduction of pose classification along with the detection eventually results in better understanding of road scenes involving auto-rickshaws. As a matter of fact, we came up with revisions for the currently employed detection algorithms to achieve a low miss rate on the validation sets. It is evident that the findings of this study are tangible and enormously consequential to the road scene understanding and intelligent transportation of developing countries where auto-rickshaws play a pivotal role in public transportation.

Vorheriger Artikel Deep learning applications in pulmonary medical imaging: recent updates and insights on COVID-19

Nächster Artikel Pedestrian detection using multi-scale squeeze-and-excitation module

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

https://github.com/BastianBlossom/Detection-and-Pose-estimation-of-Auto-rickshaws.

Bai, S., Liu, Z., Yao, C.: Classify vehicles in traffic scene images with deformable part-based models. Mach. Vis. Appl. 29(3), 393–403 (2018)CrossRef

Benenson, R., Mathias, M., Tuytelaars, T., Van Gool, L.: Seeking the strongest rigid detector. In: CVPR (2013)

Betke, M., Haritaoglu, E., Davis, L.S.: Real-time multiple vehicle detection and tracking from a moving vehicle. Mach. Vis. Appl. 12(2), 69–83 (2000)CrossRef

Cai, Z., Fan, Q., Feris, R.S., Vasconcelos, N.: A unified multi-scale deep convolutional neural network for fast object detection. In: European Conference on Computer Vision, pp. 354–370. Springer, Berlin (2016)

Chen, D.Y., Lin, Y.H., Peng, Y.J.: Nighttime brake-light detection by Nakagami imaging. IEEE Trans. Intell. Transp. Syst. 13(4), 1627–1637 (2012)CrossRef

Chen, L., Zhang, Z., Peng, L.: Fast single shot multibox detector and its application on vehicle counting system. IET Intell. Transp. Syst. 12(10), 1406–1413 (2018)CrossRef

Cheon, M., Lee, W., Yoon, C., Park, M.: Vision-based vehicle detection system with consideration of the detecting location. IEEE Trans. Intell. Transp. Syst. 13(3), 1243–1252 (2012)CrossRef

Chu, W., Liu, Y., Shen, C., Cai, D., Hua, X.S.: Multi-task vehicle detection with region-of-interest voting. IEEE Trans. Image Process. 27(1), 432–441 (2018)MathSciNetMATHCrossRef

CVIT, I.: Auto-Rickshaw Detection Challenge. http://cvit.iiit.ac.in/autorickshaw_detection/ (2017)

10.

Dai, J., Li, Y., He, K., Sun, J.: R-FCN: Object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 379–387 (2016)

11.

Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005 (CVPR 2005), vol. 1, pp. 886–893. IEEE (2005)

12.

Deng, J., Dong, W., Socher, R., jia Li, L., Li, K., Fei-fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR (2009)

13.

Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)CrossRef

14.

Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral Channel Features (2009)

15.

Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)CrossRef

16.

Espinosa, J.E., Velastin, S.A., Branch, J.W.: Motorcycle Detection and Classification in Urban Scenarios Using a Model Based on Faster r-CNN (2018)

17.

Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef

18.

Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008 (CVPR 2008), pp. 1–8. IEEE (2008)

19.

Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The Kitti vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012)

20.

Girshick, R.: Fast R-CNN. In: Proceedings of the International Conference on Computer Vision (ICCV) (2015)

21.

Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2016)CrossRef

22.

Haselhoff, A., Kummert, A.: A vehicle detection system based on Haar and triangle features. In: 2009 IEEE Intelligent Vehicles Symposium, pp. 261–266. IEEE (2009)

23.

He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference On Computer Vision and Pattern Recognition, pp. 770–778 (2016)

24.

Hu, Q., Paisitkriangkrai, S., Shen, C., van den Hengel, A., Porikli, F.: Fast detection of multiple objects in traffic scenes with a common detection framework. IEEE Trans. Intell. Transp. Syst. 17(4), 1002–1014 (2016)CrossRef

25.

Hwang, S., Kim, N., Choi, Y., Lee, S., Kweon, I.S.: Fast multiple objects detection and tracking fusing color camera and 3D Lidar for intelligent vehicles. In: 2016 13th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI), pp. 234–239. IEEE (2016)

26.

Jiang, Y., Ma, J.: Combination features and models for human detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 07–12-June, pp. 240–248 (2015). https://doi.org/10.1109/CVPR.2015.7298620

27.

Khairdoost, N., Monadjemi, S.A., Jamshidi, K.: Front and rear vehicle detection using hypothesis generation and verification. Signal Image Process. 4(4), 31 (2013)

28.

Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

29.

Liang, J., Chen, X., He, M.L., Chen, L., Cai, T., Zhu, N.: Car detection and classification using cascade model. IET Intell. Transp. Syst. 12(10), 1201–1209 (2018)CrossRef

30.

Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: CVPR, vol. 1, p. 4 (2017)

31.

Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. arXiv preprint arXiv:1708.02002 (2017)

32.

Lin, T.Y., Maire, M., Belongie, S.J., Bourdev, L.D., Girshick, R.B., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft coco: Common objects in context. In: ECCV (2014)

33.

Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef

34.

Mani, A., Pai, M., Aggarwal, R.A.: Sustainable Urban Transport in India Role of the Auto-Rickshaw Sector (2012)

35.

Mu, K., Hui, F., Zhao, X., Prehofer, C.: Multiscale edge fusion for vehicle detection based on difference of Gaussian. Optik Int. J. Light Electron Opt. 127(11), 4794–4798 (2016)CrossRef

36.

Nam, W., Dollár, P., Han, J.H.: Local decorrelation for improved pedestrian detection. In: Advances in Neural Information Processing Systems, pp. 424–432 (2014)

37.

Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)

38.

Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: Neural Information Processing Systems (NIPS) (2015)

39.

Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR arXiv:1409.1556 (2014)

40.

Sivaraman, S., Trivedi, M.M.: A general active-learning framework for on-road vehicle recognition and tracking. IEEE Trans. Intell. Transp. Syst. 11(2), 267–276 (2010)CrossRef

41.

Sivaraman, S., Trivedi, M.M.: Active learning for on-road vehicle detection: a comparative study. Mach. Vis. Appl. 25(3), 599–611 (2014)CrossRef

42.

Teoh, S.S., Bräunl, T.: Symmetry-based monocular vehicle detection system. Mach. Vis. Appl. 23(5), 831–842 (2012)CrossRef

43.

Uijlings, J.R., Van De Sande, K.E., Gevers, T., Smeulders, A.W.: Selective search for object recognition. Int. J. Comput. Vis. 104(2), 154–171 (2013)CrossRef

44.

Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001 (CVPR 2001), vol. 1, pp. I–I. IEEE (2001)

45.

Wada, K.: Image polygonal annotation with python. https://github.com/wkentaro/labelme

46.

Wen, X., Shao, L., Fang, W., Xue, Y.: Efficient feature selection and classification for vehicle detection. IEEE Trans. Circuits Syst. Video Technol. 25(3), 508–517 (2015) CrossRef

47.

Zhang, F., Clarke, D., Knoll, A.: Vehicle detection based on Lidar and camera fusion. In: 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 1620–1625. IEEE (2014)

48.

Zhang, L., Lin, L., Liang, X., He, K.: Is faster R-CNN doing well for pedestrian detection? arXiv preprint arXiv:1607.07032 (2016)

49.

Zhang, S., Bauckhage, C., Cremers, A.B.: Informed Haar-like features improve pedestrian detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 947–954 (2014)

50.

Zhou, Y., Liu, L., Shao, L., Mellor, M.: Fast automatic vehicle annotation for urban traffic surveillance. IEEE Trans. Intell. Transp. Syst. 19(6), 1973–1984 (2017)CrossRef

Titel: Detection and pose estimation of auto-rickshaws from traffic images
verfasst von: Blossom Treesa Bastian
Jiji Charangatt Victor
Publikationsdatum: 01.09.2020
Verlag: Springer Berlin Heidelberg
Erschienen in: Machine Vision and Applications / Ausgabe 6/2020
Print ISSN: 0932-8092
Elektronische ISSN: 1432-1769
DOI: https://doi.org/10.1007/s00138-020-01106-0

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 6/2020

Efficient use of recent progresses for Real-time Semantic segmentation

Root identification in minirhizotron imagery with multiple instance learning

Semi-supervised learning using adversarial training with good and bad samples

WatchNet++: efficient and accurate depth-based network for detecting people attacks and intrusion

Boosting binary masks for multi-domain learning through affine transformations

3D video semantic segmentation for wildfire smoke

Premium Partner