Skip to main content
Erschienen in: Machine Vision and Applications 6/2020

01.09.2020 | Original Paper

Detection and pose estimation of auto-rickshaws from traffic images

verfasst von: Blossom Treesa Bastian, Jiji Charangatt Victor

Erschienen in: Machine Vision and Applications | Ausgabe 6/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In intelligent transport systems, detection and identification of vehicle types enact a substantial role. In this context, this paper addresses the detection and pose classification of a specific vehicle type: auto-rickshaws which have been heavily neglected by the publicly available vehicle datasets, but remains the most commonly used and cheap form of transportation in south Asian countries. Here, we introduce a dataset for auto-rickshaws which consists of instances of varying shape, orientation, size, scale, colour, viewpoint and many more. Further, we carry out a detailed analysis on the performance of state-of-the-art detection algorithms based on both hand-designed and deep features on the proposed dataset. The introduction of pose classification along with the detection eventually results in better understanding of road scenes involving auto-rickshaws. As a matter of fact, we came up with revisions for the currently employed detection algorithms to achieve a low miss rate on the validation sets. It is evident that the findings of this study are tangible and enormously consequential to the road scene understanding and intelligent transportation of developing countries where auto-rickshaws play a pivotal role in public transportation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bai, S., Liu, Z., Yao, C.: Classify vehicles in traffic scene images with deformable part-based models. Mach. Vis. Appl. 29(3), 393–403 (2018)CrossRef Bai, S., Liu, Z., Yao, C.: Classify vehicles in traffic scene images with deformable part-based models. Mach. Vis. Appl. 29(3), 393–403 (2018)CrossRef
2.
Zurück zum Zitat Benenson, R., Mathias, M., Tuytelaars, T., Van Gool, L.: Seeking the strongest rigid detector. In: CVPR (2013) Benenson, R., Mathias, M., Tuytelaars, T., Van Gool, L.: Seeking the strongest rigid detector. In: CVPR (2013)
3.
Zurück zum Zitat Betke, M., Haritaoglu, E., Davis, L.S.: Real-time multiple vehicle detection and tracking from a moving vehicle. Mach. Vis. Appl. 12(2), 69–83 (2000)CrossRef Betke, M., Haritaoglu, E., Davis, L.S.: Real-time multiple vehicle detection and tracking from a moving vehicle. Mach. Vis. Appl. 12(2), 69–83 (2000)CrossRef
4.
Zurück zum Zitat Cai, Z., Fan, Q., Feris, R.S., Vasconcelos, N.: A unified multi-scale deep convolutional neural network for fast object detection. In: European Conference on Computer Vision, pp. 354–370. Springer, Berlin (2016) Cai, Z., Fan, Q., Feris, R.S., Vasconcelos, N.: A unified multi-scale deep convolutional neural network for fast object detection. In: European Conference on Computer Vision, pp. 354–370. Springer, Berlin (2016)
5.
Zurück zum Zitat Chen, D.Y., Lin, Y.H., Peng, Y.J.: Nighttime brake-light detection by Nakagami imaging. IEEE Trans. Intell. Transp. Syst. 13(4), 1627–1637 (2012)CrossRef Chen, D.Y., Lin, Y.H., Peng, Y.J.: Nighttime brake-light detection by Nakagami imaging. IEEE Trans. Intell. Transp. Syst. 13(4), 1627–1637 (2012)CrossRef
6.
Zurück zum Zitat Chen, L., Zhang, Z., Peng, L.: Fast single shot multibox detector and its application on vehicle counting system. IET Intell. Transp. Syst. 12(10), 1406–1413 (2018)CrossRef Chen, L., Zhang, Z., Peng, L.: Fast single shot multibox detector and its application on vehicle counting system. IET Intell. Transp. Syst. 12(10), 1406–1413 (2018)CrossRef
7.
Zurück zum Zitat Cheon, M., Lee, W., Yoon, C., Park, M.: Vision-based vehicle detection system with consideration of the detecting location. IEEE Trans. Intell. Transp. Syst. 13(3), 1243–1252 (2012)CrossRef Cheon, M., Lee, W., Yoon, C., Park, M.: Vision-based vehicle detection system with consideration of the detecting location. IEEE Trans. Intell. Transp. Syst. 13(3), 1243–1252 (2012)CrossRef
8.
Zurück zum Zitat Chu, W., Liu, Y., Shen, C., Cai, D., Hua, X.S.: Multi-task vehicle detection with region-of-interest voting. IEEE Trans. Image Process. 27(1), 432–441 (2018)MathSciNetMATHCrossRef Chu, W., Liu, Y., Shen, C., Cai, D., Hua, X.S.: Multi-task vehicle detection with region-of-interest voting. IEEE Trans. Image Process. 27(1), 432–441 (2018)MathSciNetMATHCrossRef
10.
Zurück zum Zitat Dai, J., Li, Y., He, K., Sun, J.: R-FCN: Object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 379–387 (2016) Dai, J., Li, Y., He, K., Sun, J.: R-FCN: Object detection via region-based fully convolutional networks. In: Advances in Neural Information Processing Systems, pp. 379–387 (2016)
11.
Zurück zum Zitat Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005 (CVPR 2005), vol. 1, pp. 886–893. IEEE (2005) Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005 (CVPR 2005), vol. 1, pp. 886–893. IEEE (2005)
12.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., jia Li, L., Li, K., Fei-fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR (2009) Deng, J., Dong, W., Socher, R., jia Li, L., Li, K., Fei-fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR (2009)
13.
Zurück zum Zitat Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)CrossRef Dollár, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)CrossRef
14.
Zurück zum Zitat Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral Channel Features (2009) Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral Channel Features (2009)
15.
Zurück zum Zitat Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)CrossRef Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)CrossRef
16.
Zurück zum Zitat Espinosa, J.E., Velastin, S.A., Branch, J.W.: Motorcycle Detection and Classification in Urban Scenarios Using a Model Based on Faster r-CNN (2018) Espinosa, J.E., Velastin, S.A., Branch, J.W.: Motorcycle Detection and Classification in Urban Scenarios Using a Model Based on Faster r-CNN (2018)
17.
Zurück zum Zitat Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)CrossRef
18.
Zurück zum Zitat Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008 (CVPR 2008), pp. 1–8. IEEE (2008) Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: IEEE Conference on Computer Vision and Pattern Recognition, 2008 (CVPR 2008), pp. 1–8. IEEE (2008)
19.
Zurück zum Zitat Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The Kitti vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012) Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? The Kitti vision benchmark suite. In: Conference on Computer Vision and Pattern Recognition (CVPR) (2012)
20.
Zurück zum Zitat Girshick, R.: Fast R-CNN. In: Proceedings of the International Conference on Computer Vision (ICCV) (2015) Girshick, R.: Fast R-CNN. In: Proceedings of the International Conference on Computer Vision (ICCV) (2015)
21.
Zurück zum Zitat Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2016)CrossRef Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2016)CrossRef
22.
Zurück zum Zitat Haselhoff, A., Kummert, A.: A vehicle detection system based on Haar and triangle features. In: 2009 IEEE Intelligent Vehicles Symposium, pp. 261–266. IEEE (2009) Haselhoff, A., Kummert, A.: A vehicle detection system based on Haar and triangle features. In: 2009 IEEE Intelligent Vehicles Symposium, pp. 261–266. IEEE (2009)
23.
Zurück zum Zitat He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference On Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference On Computer Vision and Pattern Recognition, pp. 770–778 (2016)
24.
Zurück zum Zitat Hu, Q., Paisitkriangkrai, S., Shen, C., van den Hengel, A., Porikli, F.: Fast detection of multiple objects in traffic scenes with a common detection framework. IEEE Trans. Intell. Transp. Syst. 17(4), 1002–1014 (2016)CrossRef Hu, Q., Paisitkriangkrai, S., Shen, C., van den Hengel, A., Porikli, F.: Fast detection of multiple objects in traffic scenes with a common detection framework. IEEE Trans. Intell. Transp. Syst. 17(4), 1002–1014 (2016)CrossRef
25.
Zurück zum Zitat Hwang, S., Kim, N., Choi, Y., Lee, S., Kweon, I.S.: Fast multiple objects detection and tracking fusing color camera and 3D Lidar for intelligent vehicles. In: 2016 13th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI), pp. 234–239. IEEE (2016) Hwang, S., Kim, N., Choi, Y., Lee, S., Kweon, I.S.: Fast multiple objects detection and tracking fusing color camera and 3D Lidar for intelligent vehicles. In: 2016 13th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI), pp. 234–239. IEEE (2016)
27.
Zurück zum Zitat Khairdoost, N., Monadjemi, S.A., Jamshidi, K.: Front and rear vehicle detection using hypothesis generation and verification. Signal Image Process. 4(4), 31 (2013) Khairdoost, N., Monadjemi, S.A., Jamshidi, K.: Front and rear vehicle detection using hypothesis generation and verification. Signal Image Process. 4(4), 31 (2013)
28.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
29.
Zurück zum Zitat Liang, J., Chen, X., He, M.L., Chen, L., Cai, T., Zhu, N.: Car detection and classification using cascade model. IET Intell. Transp. Syst. 12(10), 1201–1209 (2018)CrossRef Liang, J., Chen, X., He, M.L., Chen, L., Cai, T., Zhu, N.: Car detection and classification using cascade model. IET Intell. Transp. Syst. 12(10), 1201–1209 (2018)CrossRef
30.
Zurück zum Zitat Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: CVPR, vol. 1, p. 4 (2017) Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: CVPR, vol. 1, p. 4 (2017)
31.
Zurück zum Zitat Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. arXiv preprint arXiv:1708.02002 (2017) Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. arXiv preprint arXiv:​1708.​02002 (2017)
32.
Zurück zum Zitat Lin, T.Y., Maire, M., Belongie, S.J., Bourdev, L.D., Girshick, R.B., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft coco: Common objects in context. In: ECCV (2014) Lin, T.Y., Maire, M., Belongie, S.J., Bourdev, L.D., Girshick, R.B., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft coco: Common objects in context. In: ECCV (2014)
33.
Zurück zum Zitat Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
34.
Zurück zum Zitat Mani, A., Pai, M., Aggarwal, R.A.: Sustainable Urban Transport in India Role of the Auto-Rickshaw Sector (2012) Mani, A., Pai, M., Aggarwal, R.A.: Sustainable Urban Transport in India Role of the Auto-Rickshaw Sector (2012)
35.
Zurück zum Zitat Mu, K., Hui, F., Zhao, X., Prehofer, C.: Multiscale edge fusion for vehicle detection based on difference of Gaussian. Optik Int. J. Light Electron Opt. 127(11), 4794–4798 (2016)CrossRef Mu, K., Hui, F., Zhao, X., Prehofer, C.: Multiscale edge fusion for vehicle detection based on difference of Gaussian. Optik Int. J. Light Electron Opt. 127(11), 4794–4798 (2016)CrossRef
36.
Zurück zum Zitat Nam, W., Dollár, P., Han, J.H.: Local decorrelation for improved pedestrian detection. In: Advances in Neural Information Processing Systems, pp. 424–432 (2014) Nam, W., Dollár, P., Han, J.H.: Local decorrelation for improved pedestrian detection. In: Advances in Neural Information Processing Systems, pp. 424–432 (2014)
37.
Zurück zum Zitat Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016) Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
38.
Zurück zum Zitat Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: Neural Information Processing Systems (NIPS) (2015) Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards real-time object detection with region proposal networks. In: Neural Information Processing Systems (NIPS) (2015)
39.
Zurück zum Zitat Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR arXiv:1409.1556 (2014) Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR arXiv:​1409.​1556 (2014)
40.
Zurück zum Zitat Sivaraman, S., Trivedi, M.M.: A general active-learning framework for on-road vehicle recognition and tracking. IEEE Trans. Intell. Transp. Syst. 11(2), 267–276 (2010)CrossRef Sivaraman, S., Trivedi, M.M.: A general active-learning framework for on-road vehicle recognition and tracking. IEEE Trans. Intell. Transp. Syst. 11(2), 267–276 (2010)CrossRef
41.
Zurück zum Zitat Sivaraman, S., Trivedi, M.M.: Active learning for on-road vehicle detection: a comparative study. Mach. Vis. Appl. 25(3), 599–611 (2014)CrossRef Sivaraman, S., Trivedi, M.M.: Active learning for on-road vehicle detection: a comparative study. Mach. Vis. Appl. 25(3), 599–611 (2014)CrossRef
42.
Zurück zum Zitat Teoh, S.S., Bräunl, T.: Symmetry-based monocular vehicle detection system. Mach. Vis. Appl. 23(5), 831–842 (2012)CrossRef Teoh, S.S., Bräunl, T.: Symmetry-based monocular vehicle detection system. Mach. Vis. Appl. 23(5), 831–842 (2012)CrossRef
43.
Zurück zum Zitat Uijlings, J.R., Van De Sande, K.E., Gevers, T., Smeulders, A.W.: Selective search for object recognition. Int. J. Comput. Vis. 104(2), 154–171 (2013)CrossRef Uijlings, J.R., Van De Sande, K.E., Gevers, T., Smeulders, A.W.: Selective search for object recognition. Int. J. Comput. Vis. 104(2), 154–171 (2013)CrossRef
44.
Zurück zum Zitat Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001 (CVPR 2001), vol. 1, pp. I–I. IEEE (2001) Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001 (CVPR 2001), vol. 1, pp. I–I. IEEE (2001)
46.
Zurück zum Zitat Wen, X., Shao, L., Fang, W., Xue, Y.: Efficient feature selection and classification for vehicle detection. IEEE Trans. Circuits Syst. Video Technol. 25(3), 508–517 (2015) CrossRef Wen, X., Shao, L., Fang, W., Xue, Y.: Efficient feature selection and classification for vehicle detection. IEEE Trans. Circuits Syst. Video Technol. 25(3), 508–517 (2015) CrossRef
47.
Zurück zum Zitat Zhang, F., Clarke, D., Knoll, A.: Vehicle detection based on Lidar and camera fusion. In: 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 1620–1625. IEEE (2014) Zhang, F., Clarke, D., Knoll, A.: Vehicle detection based on Lidar and camera fusion. In: 17th International IEEE Conference on Intelligent Transportation Systems (ITSC), pp. 1620–1625. IEEE (2014)
48.
Zurück zum Zitat Zhang, L., Lin, L., Liang, X., He, K.: Is faster R-CNN doing well for pedestrian detection? arXiv preprint arXiv:1607.07032 (2016) Zhang, L., Lin, L., Liang, X., He, K.: Is faster R-CNN doing well for pedestrian detection? arXiv preprint arXiv:​1607.​07032 (2016)
49.
Zurück zum Zitat Zhang, S., Bauckhage, C., Cremers, A.B.: Informed Haar-like features improve pedestrian detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 947–954 (2014) Zhang, S., Bauckhage, C., Cremers, A.B.: Informed Haar-like features improve pedestrian detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 947–954 (2014)
50.
Zurück zum Zitat Zhou, Y., Liu, L., Shao, L., Mellor, M.: Fast automatic vehicle annotation for urban traffic surveillance. IEEE Trans. Intell. Transp. Syst. 19(6), 1973–1984 (2017)CrossRef Zhou, Y., Liu, L., Shao, L., Mellor, M.: Fast automatic vehicle annotation for urban traffic surveillance. IEEE Trans. Intell. Transp. Syst. 19(6), 1973–1984 (2017)CrossRef
Metadaten
Titel
Detection and pose estimation of auto-rickshaws from traffic images
verfasst von
Blossom Treesa Bastian
Jiji Charangatt Victor
Publikationsdatum
01.09.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
Machine Vision and Applications / Ausgabe 6/2020
Print ISSN: 0932-8092
Elektronische ISSN: 1432-1769
DOI
https://doi.org/10.1007/s00138-020-01106-0

Weitere Artikel der Ausgabe 6/2020

Machine Vision and Applications 6/2020 Zur Ausgabe

Premium Partner