Skip to main content
Top

2017 | OriginalPaper | Chapter

4. Deep Learning Techniques for Roadside Video Data Analysis

Authors : Brijesh Verma, Ligang Zhang, David Stockwell

Published in: Roadside Video Data Analysis

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this chapter, we describe deep learning techniques that are proposed for roadside video data analysis. We firstly present an introduction to deep learning concepts, and a short review of several typical types of CNN.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
4.
go back to reference J. Donahue, L. Anne Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, et al., Long-term recurrent convolutional networks for visual recognition and description, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 2625–2634 J. Donahue, L. Anne Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, et al., Long-term recurrent convolutional networks for visual recognition and description, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 2625–2634
6.
go back to reference D.V. Nguyen, L. Kuhnert, K.D. Kuhnert, Structure overview of vegetation detection. A novel approach for efficient vegetation detection using an active lighting system. Robot. Auton. Syst. 60, 498–508 (2012)CrossRef D.V. Nguyen, L. Kuhnert, K.D. Kuhnert, Structure overview of vegetation detection. A novel approach for efficient vegetation detection using an active lighting system. Robot. Auton. Syst. 60, 498–508 (2012)CrossRef
7.
go back to reference I. Lenz, H. Lee, A. Saxena, Deep learning for detecting robotic grasps. Int. J. Robot. Res. 34, 705–724 (2015)CrossRef I. Lenz, H. Lee, A. Saxena, Deep learning for detecting robotic grasps. Int. J. Robot. Res. 34, 705–724 (2015)CrossRef
8.
go back to reference L. Romaszko, A deep learning approach with an ensemble-based neural network classifier for black box ICML 2013 contest, in Workshop on Challenges in Representation Learning, International Conference on Machine Learning (ICML) (2013), pp. 1–3 L. Romaszko, A deep learning approach with an ensemble-based neural network classifier for black box ICML 2013 contest, in Workshop on Challenges in Representation Learning, International Conference on Machine Learning (ICML) (2013), pp. 1–3
9.
go back to reference S. Ahmad Radzi, K.-H. Mohamad, S.S. Liew, R. Bakhteri, Convolutional neural network for face recognition with pose and illumination variation. Int. J. Eng. Technol. (IJET) 6, 44–57 (2014) S. Ahmad Radzi, K.-H. Mohamad, S.S. Liew, R. Bakhteri, Convolutional neural network for face recognition with pose and illumination variation. Int. J. Eng. Technol. (IJET) 6, 44–57 (2014)
10.
go back to reference F. Shaheen, B. Verma, M. Asafuddoula, Impact of automatic feature extraction in deep learning architecture, in Digital Image Computing: Techniques and Applications (DICTA), International Conference on (2016), pp. 1–8 F. Shaheen, B. Verma, M. Asafuddoula, Impact of automatic feature extraction in deep learning architecture, in Digital Image Computing: Techniques and Applications (DICTA), International Conference on (2016), pp. 1–8
13.
go back to reference Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)CrossRef Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)CrossRef
14.
go back to reference D.V. Nguyen, L. Kuhnert, K.D. Kuhnert, Spreading algorithm for efficient vegetation detection in cluttered outdoor environments. Robot. Auton. Syst. 60, 1498–1507 (2012)CrossRef D.V. Nguyen, L. Kuhnert, K.D. Kuhnert, Spreading algorithm for efficient vegetation detection in cluttered outdoor environments. Robot. Auton. Syst. 60, 1498–1507 (2012)CrossRef
15.
go back to reference D.V. Nguyen, L. Kuhnert, T. Jiang, S. Thamke, K.D. Kuhnert, Vegetation detection for outdoor automobile guidance, in Industrial Technology (ICIT), IEEE International Conference on (2011), pp. 358–364 D.V. Nguyen, L. Kuhnert, T. Jiang, S. Thamke, K.D. Kuhnert, Vegetation detection for outdoor automobile guidance, in Industrial Technology (ICIT), IEEE International Conference on (2011), pp. 358–364
16.
go back to reference A. Bosch, X. Muñoz, J. Freixenet, Segmentation and description of natural outdoor scenes. Image Vis. Comput. 25, 727–740 (2007)CrossRef A. Bosch, X. Muñoz, J. Freixenet, Segmentation and description of natural outdoor scenes. Image Vis. Comput. 25, 727–740 (2007)CrossRef
17.
go back to reference W. Guo, U.K. Rage, S. Ninomiya, Illumination invariant segmentation of vegetation for time series wheat images based on decision tree model. Comput. Electron. Agri. 96, 58–66 (2013)CrossRef W. Guo, U.K. Rage, S. Ninomiya, Illumination invariant segmentation of vegetation for time series wheat images based on decision tree model. Comput. Electron. Agri. 96, 58–66 (2013)CrossRef
18.
go back to reference F. Shaheen, B. Verma, An ensemble of deep learning architectures for automatic feature extraction, in Computational Intelligence (ISSCI), IEEE Symposium Series on (2016) (in Press) F. Shaheen, B. Verma, An ensemble of deep learning architectures for automatic feature extraction, in Computational Intelligence (ISSCI), IEEE Symposium Series on (2016) (in Press)
19.
go back to reference D.-X. Liu, T. Wu, B. Dai, Fusing ladar and color image for detection grass off-road scenario, in Vehicular Electronics and Safety (ICVES), IEEE International Conference on (2007), pp. 1–4 D.-X. Liu, T. Wu, B. Dai, Fusing ladar and color image for detection grass off-road scenario, in Vehicular Electronics and Safety (ICVES), IEEE International Conference on (2007), pp. 1–4
20.
go back to reference R. Mottaghi, S. Fidler, A. Yuille, R. Urtasun, D. Parikh, Human-machine CRFS for identifying bottlenecks in scene understanding. Pattern Anal. Mach. Intell. IEEE Trans. 38, 74–87 (2016)CrossRef R. Mottaghi, S. Fidler, A. Yuille, R. Urtasun, D. Parikh, Human-machine CRFS for identifying bottlenecks in scene understanding. Pattern Anal. Mach. Intell. IEEE Trans. 38, 74–87 (2016)CrossRef
21.
go back to reference J. Shotton, J. Winn, C. Rother, A. Criminisi, Textonboost for image understanding: multi-class object recognition and segmentation by jointly modeling texture, layout, and context. Int. J. Comput. Vis. 81, 2–23 (2009)CrossRef J. Shotton, J. Winn, C. Rother, A. Criminisi, Textonboost for image understanding: multi-class object recognition and segmentation by jointly modeling texture, layout, and context. Int. J. Comput. Vis. 81, 2–23 (2009)CrossRef
22.
go back to reference S. Gould, J. Rodgers, D. Cohen, G. Elidan, D. Koller, Multi-class segmentation with relative location prior. Int. J. Comput. Vis. 80, 300–316 (2008)CrossRef S. Gould, J. Rodgers, D. Cohen, G. Elidan, D. Koller, Multi-class segmentation with relative location prior. Int. J. Comput. Vis. 80, 300–316 (2008)CrossRef
23.
go back to reference Y. Jimei, B. Price, S. Cohen, Y. Ming-Hsuan, Context driven scene parsing with attention to rare classes, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2014), pp. 3294–3301 Y. Jimei, B. Price, S. Cohen, Y. Ming-Hsuan, Context driven scene parsing with attention to rare classes, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2014), pp. 3294–3301
24.
go back to reference A. Singhal, L. Jiebo, Z. Weiyu, Probabilistic spatial context models for scene content understanding, in Computer Vision and Pattern Recognition, (CVPR), IEEE Conference on (2003), pp. 235–241 A. Singhal, L. Jiebo, Z. Weiyu, Probabilistic spatial context models for scene content understanding, in Computer Vision and Pattern Recognition, (CVPR), IEEE Conference on (2003), pp. 235–241
25.
go back to reference B. Micusik, J. Kosecka, Semantic segmentation of street scenes by superpixel co-occurrence and 3D geometry, in Computer Vision Workshops (ICCV Workshops), IEEE 12th International Conference on (2009), pp. 625–632 B. Micusik, J. Kosecka, Semantic segmentation of street scenes by superpixel co-occurrence and 3D geometry, in Computer Vision Workshops (ICCV Workshops), IEEE 12th International Conference on (2009), pp. 625–632
26.
go back to reference C. Farabet, C. Couprie, L. Najman, Y. LeCun, Learning hierarchical features for scene labeling. Pattern Anal. Mach. Intell. IEEE Trans. 35, 1915–1929 (2013)CrossRef C. Farabet, C. Couprie, L. Najman, Y. LeCun, Learning hierarchical features for scene labeling. Pattern Anal. Mach. Intell. IEEE Trans. 35, 1915–1929 (2013)CrossRef
27.
go back to reference M. Seyedhosseini, T. Tasdizen, Semantic image segmentation with contextual hierarchical models. Pattern Anal. Mach. Intell. IEEE Trans. 38(5), 951–964 (2015)CrossRef M. Seyedhosseini, T. Tasdizen, Semantic image segmentation with contextual hierarchical models. Pattern Anal. Mach. Intell. IEEE Trans. 38(5), 951–964 (2015)CrossRef
28.
go back to reference D. Batra, R. Sukthankar, C. Tsuhan, Learning class-specific affinities for image labelling, in Computer Vision and Pattern Recognition, (CVPR), IEEE Conference on (2008), pp. 1–8 D. Batra, R. Sukthankar, C. Tsuhan, Learning class-specific affinities for image labelling, in Computer Vision and Pattern Recognition, (CVPR), IEEE Conference on (2008), pp. 1–8
29.
go back to reference Z. Lei, J. Qiang, Image segmentation with a unified graphical model. Pattern Anal. Mach. Intell. IEEE Trans. 32, 1406–1425 (2010)CrossRef Z. Lei, J. Qiang, Image segmentation with a unified graphical model. Pattern Anal. Mach. Intell. IEEE Trans. 32, 1406–1425 (2010)CrossRef
30.
go back to reference R. Xiaofeng, B. Liefeng, D. Fox, RGB-(D) scene labeling: features and algorithms, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2012), pp. 2759–2766 R. Xiaofeng, B. Liefeng, D. Fox, RGB-(D) scene labeling: features and algorithms, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2012), pp. 2759–2766
32.
go back to reference S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, et al., Conditional random fields as recurrent neural networks. arXiv preprint arXiv:1502.03240 (2015) S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, et al., Conditional random fields as recurrent neural networks. arXiv preprint arXiv:​1502.​03240 (2015)
33.
34.
go back to reference A. Sharma, O. Tuzel, M.-Y. Liu, Recursive context propagation network for semantic scene labeling, in Advances in Neural Information Processing Systems (2014), pp. 2447–2455 A. Sharma, O. Tuzel, M.-Y. Liu, Recursive context propagation network for semantic scene labeling, in Advances in Neural Information Processing Systems (2014), pp. 2447–2455
35.
go back to reference A. Sharma, O. Tuzel, D.W. Jacobs, Deep hierarchical parsing for semantic segmentation, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 530–538 A. Sharma, O. Tuzel, D.W. Jacobs, Deep hierarchical parsing for semantic segmentation, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 530–538
36.
go back to reference S. Ling, L. Li, L. Xuelong, Feature learning for image classification via multiobjective genetic programming. Neural Netw. Learn. Syst. IEEE Trans. 25, 1359–1371 (2014)CrossRef S. Ling, L. Li, L. Xuelong, Feature learning for image classification via multiobjective genetic programming. Neural Netw. Learn. Syst. IEEE Trans. 25, 1359–1371 (2014)CrossRef
37.
go back to reference L. Zhang, B. Verma, D. Stockwell, S. Chowdhury, Spatially constrained location prior for scene parsing, in Neural Networks (IJCNN), International Joint Conference on (2016), pp. 1480–1486 L. Zhang, B. Verma, D. Stockwell, S. Chowdhury, Spatially constrained location prior for scene parsing, in Neural Networks (IJCNN), International Joint Conference on (2016), pp. 1480–1486
38.
go back to reference J. Tighe, S. Lazebnik, Superparsing: scalable nonparametric image parsing with superpixels, in Computer Vision (ECCV), European Conference on (2010), pp. 352–365 J. Tighe, S. Lazebnik, Superparsing: scalable nonparametric image parsing with superpixels, in Computer Vision (ECCV), European Conference on (2010), pp. 352–365
39.
go back to reference P. Hanchuan, L. Fuhui, C. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. Pattern Anal. Mach. Intell. IEEE Trans. 27, 1226–1238 (2005)CrossRef P. Hanchuan, L. Fuhui, C. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. Pattern Anal. Mach. Intell. IEEE Trans. 27, 1226–1238 (2005)CrossRef
40.
go back to reference P. Felzenszwalb, D. Huttenlocher, Efficient graph-based image segmentation. Int. J. Comput. Vis. 59, 167–181 (2004)CrossRef P. Felzenszwalb, D. Huttenlocher, Efficient graph-based image segmentation. Int. J. Comput. Vis. 59, 167–181 (2004)CrossRef
41.
go back to reference S. Gould, R. Fulton, D. Koller, Decomposing a scene into geometric and semantically consistent regions, in Computer Vision (ICCV), IEEE 12th International Conference on (2009), pp. 1–8 S. Gould, R. Fulton, D. Koller, Decomposing a scene into geometric and semantically consistent regions, in Computer Vision (ICCV), IEEE 12th International Conference on (2009), pp. 1–8
42.
go back to reference L. Ce, J. Yuen, A. Torralba, Nonparametric scene parsing: label transfer via dense scene alignment, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2009), pp. 1972–1979 L. Ce, J. Yuen, A. Torralba, Nonparametric scene parsing: label transfer via dense scene alignment, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2009), pp. 1972–1979
43.
go back to reference V. Lempitsky, A. Vedaldi, A. Zisserman, Pylon model for semantic segmentation, in Advances in Neural Information Processing Systems (2011), pp. 1485–1493 V. Lempitsky, A. Vedaldi, A. Zisserman, Pylon model for semantic segmentation, in Advances in Neural Information Processing Systems (2011), pp. 1485–1493
44.
go back to reference D. Munoz, J.A. Bagnell, M. Hebert, Stacked hierarchical labeling, in Computer Vision (ECCV), European Conference on (2010), pp. 57–70 D. Munoz, J.A. Bagnell, M. Hebert, Stacked hierarchical labeling, in Computer Vision (ECCV), European Conference on (2010), pp. 57–70
45.
go back to reference L. Ladicky, C. Russell, P. Kohli, P.H.S. Torr, Associative hierarchical random fields. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1056–1077 (2014)CrossRef L. Ladicky, C. Russell, P. Kohli, P.H.S. Torr, Associative hierarchical random fields. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1056–1077 (2014)CrossRef
46.
go back to reference A. Krizhevsky, I. Sutskever, G.E. Hinton, ImageNet classification with deep convolutional neural networks, in Advances in Neural Information Processing Systems (2012), pp. 1097–1105 A. Krizhevsky, I. Sutskever, G.E. Hinton, ImageNet classification with deep convolutional neural networks, in Advances in Neural Information Processing Systems (2012), pp. 1097–1105
47.
go back to reference M.D. Zeiler, R. Fergus, Visualizing and understanding convolutional networks, in European Conference on Computer Vision (2014), pp. 818–833 M.D. Zeiler, R. Fergus, Visualizing and understanding convolutional networks, in European Conference on Computer Vision (2014), pp. 818–833
48.
go back to reference C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, et al., Going deeper with convolutions, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 1–9 C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, et al., Going deeper with convolutions, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 1–9
49.
go back to reference K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014) K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556 (2014)
51.
go back to reference S. Bing, W. Gang, Z. Zhen, W. Bing, Z. Lifan, Integrating parametric and non-parametric models for scene labeling, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 4249–4258 S. Bing, W. Gang, Z. Zhen, W. Bing, Z. Lifan, Integrating parametric and non-parametric models for scene labeling, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2015), pp. 4249–4258
52.
go back to reference J. Shotton, J. Winn, C. Rother, A. Criminisi, Textonboost: joint appearance, shape and context modeling for multi-class object recognition and segmentation, in Computer Vision (ECCV), European Conference on (2006), pp. 1–15 J. Shotton, J. Winn, C. Rother, A. Criminisi, Textonboost: joint appearance, shape and context modeling for multi-class object recognition and segmentation, in Computer Vision (ECCV), European Conference on (2006), pp. 1–15
53.
go back to reference Z. Long, C. Yuanhao, L. Yuan, L. Chenxi, A. Yuille, Recursive segmentation and recognition templates for image parsing. Pattern Anal. Mach. Intell. IEEE Trans. 34, 359–371 (2012)CrossRef Z. Long, C. Yuanhao, L. Yuan, L. Chenxi, A. Yuille, Recursive segmentation and recognition templates for image parsing. Pattern Anal. Mach. Intell. IEEE Trans. 34, 359–371 (2012)CrossRef
54.
go back to reference E. Akbas, N. Ahuja, Low-level hierarchical multiscale segmentation statistics of natural images. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1900–1906 (2014)CrossRef E. Akbas, N. Ahuja, Low-level hierarchical multiscale segmentation statistics of natural images. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1900–1906 (2014)CrossRef
55.
go back to reference A. Lucchi, L. Yunpeng, P. Fua, Learning for structured prediction using approximate subgradient descent with working sets, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2013), pp. 1987–1994 A. Lucchi, L. Yunpeng, P. Fua, Learning for structured prediction using approximate subgradient descent with working sets, in Computer Vision and Pattern Recognition (CVPR), IEEE Conference on (2013), pp. 1987–1994
56.
go back to reference C. Gatta, F. Ciompi, Stacked sequential scale-spacetaylor context. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1694–1700 (2014)CrossRef C. Gatta, F. Ciompi, Stacked sequential scale-spacetaylor context. Pattern Anal. Mach. Intell. IEEE Trans. 36, 1694–1700 (2014)CrossRef
57.
go back to reference M. Najafi, S.T. Namin, M. Salzmann, L. Petersson, Sample and filter: nonparametric scene parsing via efficient filtering. arXiv preprint arXiv:1511.04960 (2015) M. Najafi, S.T. Namin, M. Salzmann, L. Petersson, Sample and filter: nonparametric scene parsing via efficient filtering. arXiv preprint arXiv:​1511.​04960 (2015)
58.
go back to reference T.V. Nguyen, L. Canyi, J. Sepulveda, Y. Shuicheng, Adaptive nonparametric image parsing. Circ. Syst. Video Technol. IEEE Trans. 25, 1565–1575 (2015)CrossRef T.V. Nguyen, L. Canyi, J. Sepulveda, Y. Shuicheng, Adaptive nonparametric image parsing. Circ. Syst. Video Technol. IEEE Trans. 25, 1565–1575 (2015)CrossRef
Metadata
Title
Deep Learning Techniques for Roadside Video Data Analysis
Authors
Brijesh Verma
Ligang Zhang
David Stockwell
Copyright Year
2017
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-4539-4_4