Skip to main content
Top
Published in: Neural Computing and Applications 2/2019

05-07-2017 | Original Article

Object detection using hybridization of static and dynamic feature spaces and its exploitation by ensemble classification

Authors: Iqbal Murtza, Asifullah Khan, Naeem Akhtar

Published in: Neural Computing and Applications | Issue 2/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper presents a learning mechanism based on hybridization of static and dynamic learning. Realizing the detection performances offered by the state-of-the-art deep learning techniques and the competitive performances offered by the conventional static learning techniques, we propose the idea of exploitation of the concatenated (parallel) hybridization of the static and dynamic learning-based feature spaces. This is contrary to the cascaded (series) hybridization topology in which the initial feature space (provided by the conventional, static, and handcrafted feature extraction technique) is explored using deep, dynamic, and automated learning technique. Consequently, the characteristics already suppressed by the conventional representation cannot be explored by the dynamic learning technique. Instead, the proposed technique combines the conventional static and deep dynamic representation in concatenated (parallel) topology to generate an information-rich hybrid feature space. Thus, this hybrid feature space may aggregate the good characteristics of both conventional and deep representations, which are then explored using an appropriate classification technique. We also hypothesize that ensemble classification may better exploit this parallel hybrid perspective of the feature spaces. For this purpose, pyramid histogram of oriented gradients-based static learning has been incorporated in conjunction with convolution neural network-based deep learning to produce concatenated hybrid feature space. This hybrid space is then explored with various state-of-the-art ensemble classification techniques. We have considered the publicly available INRIA person and Caltech pedestrian standard image datasets to assess the performance of the proposed hybrid learning system. Furthermore, McNemar’s test has been used to statistically validate the outperformance of the proposed technique over various contemporary techniques. The validated experimental results show that the employment of the proposed hybrid representation results in effective detection performance (an AUC of 0.9996 for INRIA person and 0.9985 for Caltech pedestrian datasets) as compared to the individual static and dynamic representations.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Footnotes
1
In the context of this paper, conventional, static, and handcrafted are interchangeably used. Likewise, the deep, dynamic, and automated are also interchangeably used.
 
Literature
1.
go back to reference Deng L (2014) A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA Trans Signal Inf Process 3:e2CrossRef Deng L (2014) A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA Trans Signal Inf Process 3:e2CrossRef
2.
go back to reference Szarvas M, Yoshizawa A, Yamamoto M, Ogata J (2005) Pedestrian detection with convolutional neural networks. In: Proceedings of the intelligent vehicles symposium. IEEE, pp 224–229 Szarvas M, Yoshizawa A, Yamamoto M, Ogata J (2005) Pedestrian detection with convolutional neural networks. In: Proceedings of the intelligent vehicles symposium. IEEE, pp 224–229
3.
go back to reference Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324CrossRef Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324CrossRef
4.
go back to reference LeCun Y, Kavukcuoglu K, Farabet C (2010) Convolutional networks and applications in vision. In: Proceedings of 2010 IEEE international symposium on circuits and systems (ISCAS), pp 253–256 LeCun Y, Kavukcuoglu K, Farabet C (2010) Convolutional networks and applications in vision. In: Proceedings of 2010 IEEE international symposium on circuits and systems (ISCAS), pp 253–256
5.
go back to reference Ciresan DC, Meier U, Gambardella LM, Schmidhuber J (2011) Convolutional neural network committees for handwritten character classification. In: International conference on document analysis and recognition (ICDAR), pp 1135–1139 Ciresan DC, Meier U, Gambardella LM, Schmidhuber J (2011) Convolutional neural network committees for handwritten character classification. In: International conference on document analysis and recognition (ICDAR), pp 1135–1139
6.
go back to reference Tao W, Wu DJ, Coates A, Ng AY (2012) End-to-end text recognition with convolutional neural networks. In: 21st international conference on pattern recognition (ICPR), pp 3304–3308 Tao W, Wu DJ, Coates A, Ng AY (2012) End-to-end text recognition with convolutional neural networks. In: 21st international conference on pattern recognition (ICPR), pp 3304–3308
7.
go back to reference Simard PY, Steinkraus D, Platt JC (2003) Best practices for convolutional neural networks applied to visual document analysis. In: Proceedings of the seventh international conference on document analysis and recognition, pp 958–963 Simard PY, Steinkraus D, Platt JC (2003) Best practices for convolutional neural networks applied to visual document analysis. In: Proceedings of the seventh international conference on document analysis and recognition, pp 958–963
8.
go back to reference Chellapilla K, Puri S, Simard P (2006) High performance convolutional neural networks for document processing. In: Tenth international workshop on frontiers in handwriting recognition, La Baule Chellapilla K, Puri S, Simard P (2006) High performance convolutional neural networks for document processing. In: Tenth international workshop on frontiers in handwriting recognition, La Baule
9.
go back to reference Lawrence S, Giles CL, Ah Chung T, Back AD (1997) Face recognition: a convolutional neural-network approach. IEEE Trans Neural Netw 8:98–113CrossRef Lawrence S, Giles CL, Ah Chung T, Back AD (1997) Face recognition: a convolutional neural-network approach. IEEE Trans Neural Netw 8:98–113CrossRef
10.
go back to reference Fasel B (2002) Robust face analysis using convolutional neural networks. In: Proceedings of the 16th international conference on pattern recognition, vol. 2, pp 40–43 Fasel B (2002) Robust face analysis using convolutional neural networks. In: Proceedings of the 16th international conference on pattern recognition, vol. 2, pp 40–43
11.
go back to reference Abdel-Hamid O, Mohamed AR, Hui J, Li D, Penn G, Dong Y (2014) Convolutional Neural networks for speech recognition. IEEE/ACM Trans Audio Speech Lang Process 22:1533–1545CrossRef Abdel-Hamid O, Mohamed AR, Hui J, Li D, Penn G, Dong Y (2014) Convolutional Neural networks for speech recognition. IEEE/ACM Trans Audio Speech Lang Process 22:1533–1545CrossRef
12.
go back to reference Jialue F, Wei X, Ying W, Yihong G (2010) Human tracking using convolutional neural networks. IEEE Trans Neural Netw 21:1610–1623CrossRef Jialue F, Wei X, Ying W, Yihong G (2010) Human tracking using convolutional neural networks. IEEE Trans Neural Netw 21:1610–1623CrossRef
13.
go back to reference Shuiwang J, Wei X, Ming Y, Kai Y (2013) 3D convolutional neural networks for human action recognition. IEEE Trans Pattern Anal Mach Intell 35:221–231CrossRef Shuiwang J, Wei X, Ming Y, Kai Y (2013) 3D convolutional neural networks for human action recognition. IEEE Trans Pattern Anal Mach Intell 35:221–231CrossRef
14.
go back to reference Cireşan DC, Giusti A, Gambardella LM, Schmidhuber J (2013) Mitosis Detection in breast cancer histology images with deep neural networks. In: Mori K, Sakuma I, Sato Y, Barillot C, Navab N (eds) Proceedings, Part II medical image computing and computer-assisted intervention—MICCAI 2013: 16th international conference, Nagoya. Springer, Berlin, pp 411–418, 22–26 Sept 2013 Cireşan DC, Giusti A, Gambardella LM, Schmidhuber J (2013) Mitosis Detection in breast cancer histology images with deep neural networks. In: Mori K, Sakuma I, Sato Y, Barillot C, Navab N (eds) Proceedings, Part II medical image computing and computer-assisted intervention—MICCAI 2013: 16th international conference, Nagoya. Springer, Berlin, pp 411–418, 22–26 Sept 2013
15.
go back to reference Biglari O, Ahsan R, Rahi M (2014) Human detection using SURF and SIFT feature extraction methods in different color spaces. J Math Comput Sci (JMCS) 11:111–122CrossRef Biglari O, Ahsan R, Rahi M (2014) Human detection using SURF and SIFT feature extraction methods in different color spaces. J Math Comput Sci (JMCS) 11:111–122CrossRef
16.
go back to reference Lowe DG (1999) Object recognition from local scale-invariant features. In: The proceedings of the seventh IEEE international conference on computer vision, vol. 2, pp 1150–1157 Lowe DG (1999) Object recognition from local scale-invariant features. In: The proceedings of the seventh IEEE international conference on computer vision, vol. 2, pp 1150–1157
17.
go back to reference Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110CrossRef Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110CrossRef
18.
go back to reference Bay H, Tuytelaars T, Gool L (2006) SURF: speeded up robust features. In: Leonardis A, Bischof H, Pinz A (eds) Proceedings, Part I computer vision—ECCV 2006: 9th European conference on computer vision, Graz. Springer, Berlin, pp 404–417, 7–13 May 2006 Bay H, Tuytelaars T, Gool L (2006) SURF: speeded up robust features. In: Leonardis A, Bischof H, Pinz A (eds) Proceedings, Part I computer vision—ECCV 2006: 9th European conference on computer vision, Graz. Springer, Berlin, pp 404–417, 7–13 May 2006
19.
go back to reference Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110:346–359CrossRef Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110:346–359CrossRef
20.
go back to reference Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), vol. 1, pp 886–893 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE computer society conference on computer vision and pattern recognition (CVPR), vol. 1, pp 886–893
21.
go back to reference Anna B, Andrew Z, Xavier M (2007) Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM international conference on Image and video retrieval, Amsterdam, pp 401–408 Anna B, Andrew Z, Xavier M (2007) Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM international conference on Image and video retrieval, Amsterdam, pp 401–408
22.
go back to reference Murtza I, Abdullah D, Khan A, Arif M, Mirza S (2015) Cortex-inspired multilayer hierarchy based object detection system using PHOG descriptors and ensemble classification. Vis Comput 33:99–112CrossRef Murtza I, Abdullah D, Khan A, Arif M, Mirza S (2015) Cortex-inspired multilayer hierarchy based object detection system using PHOG descriptors and ensemble classification. Vis Comput 33:99–112CrossRef
23.
go back to reference Jin W, Ping L, She MFH, Kouzani A, Nahavandi S (2011) Human action recognition based on Pyramid Histogram of Oriented Gradients. In: 2011 IEEE international conference on systems, man, and cybernetics (SMC), pp 2449–2454 Jin W, Ping L, She MFH, Kouzani A, Nahavandi S (2011) Human action recognition based on Pyramid Histogram of Oriented Gradients. In: 2011 IEEE international conference on systems, man, and cybernetics (SMC), pp 2449–2454
24.
go back to reference Serre T, Wolf L, Bileschi S, Riesenhuber M, Poggio T (2007) Robust object recognition with cortex-like mechanisms. IEEE Trans Pattern Anal Mach Intell (PAMI) 29:411–426CrossRef Serre T, Wolf L, Bileschi S, Riesenhuber M, Poggio T (2007) Robust object recognition with cortex-like mechanisms. IEEE Trans Pattern Anal Mach Intell (PAMI) 29:411–426CrossRef
25.
go back to reference Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In IEEE computer society conference on computer vision and pattern recognition (CVPR), vol. 2, pp 994–1000 Serre T, Wolf L, Poggio T (2005) Object recognition with features inspired by visual cortex. In IEEE computer society conference on computer vision and pattern recognition (CVPR), vol. 2, pp 994–1000
26.
go back to reference Watanabe T, Ito S, Yokoi K (2009) Co-occurrence histograms of oriented gradients for pedestrian detection. In: Wada T, Huang F, Lin S (eds) Advances in image and video technology, vol 5414. Springer, Berlin, pp 37–47 Watanabe T, Ito S, Yokoi K (2009) Co-occurrence histograms of oriented gradients for pedestrian detection. In: Wada T, Huang F, Lin S (eds) Advances in image and video technology, vol 5414. Springer, Berlin, pp 37–47
27.
go back to reference Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: Leonardis A, Bischof H, Pinz A (eds) Proceedings, Part II computer vision—ECCV 2006: 9th European conference on computer vision, Graz. Springer, Berlin, pp 428–441, 7–13 May 2006 Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: Leonardis A, Bischof H, Pinz A (eds) Proceedings, Part II computer vision—ECCV 2006: 9th European conference on computer vision, Graz. Springer, Berlin, pp 428–441, 7–13 May 2006
28.
go back to reference Hong H, Minglei T (2013) Human detection based on optical flow and spare geometric flow. In: 2013 seventh international conference on image and graphics (ICIG), pp 459–464 Hong H, Minglei T (2013) Human detection based on optical flow and spare geometric flow. In: 2013 seventh international conference on image and graphics (ICIG), pp 459–464
29.
30.
31.
go back to reference Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. In Thirteenth international conference on machine learning, pp 148–156 Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. In Thirteenth international conference on machine learning, pp 148–156
32.
go back to reference van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating Color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell (PAMI) 32:1582–1596CrossRef van de Sande KEA, Gevers T, Snoek CGM (2010) Evaluating Color descriptors for object and scene recognition. IEEE Trans Pattern Anal Mach Intell (PAMI) 32:1582–1596CrossRef
33.
34.
go back to reference Yanwei P, He Y, Yuan Y, Kongqiao W (2012) Robust CoHOG feature extraction in human-centered image/video management system. IEEE Trans Syst Man Cybern B Cybern 42:458–468CrossRef Yanwei P, He Y, Yuan Y, Kongqiao W (2012) Robust CoHOG feature extraction in human-centered image/video management system. IEEE Trans Syst Man Cybern B Cybern 42:458–468CrossRef
35.
go back to reference Azar AT (2013) Fast neural network learning algorithms for medical applications. Neural Comput Appl 23:1019–1034CrossRef Azar AT (2013) Fast neural network learning algorithms for medical applications. Neural Comput Appl 23:1019–1034CrossRef
36.
go back to reference Ebied HM, Revett K, Tolba MF (2013) Evaluation of unsupervised feature extraction neural networks for face recognition. Neural Comput Appl 22:1211–1222CrossRef Ebied HM, Revett K, Tolba MF (2013) Evaluation of unsupervised feature extraction neural networks for face recognition. Neural Comput Appl 22:1211–1222CrossRef
37.
38.
go back to reference Rebai I, BenAyed Y, Mahdi W (2016) Deep multilayer multiple kernel learning. Neural Comput Appl 27:2305–2314CrossRef Rebai I, BenAyed Y, Mahdi W (2016) Deep multilayer multiple kernel learning. Neural Comput Appl 27:2305–2314CrossRef
39.
go back to reference Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE (2017) A survey of deep neural network architectures and their applications. Neurocomputing 234:11–26CrossRef Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE (2017) A survey of deep neural network architectures and their applications. Neurocomputing 234:11–26CrossRef
40.
go back to reference He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37:1904–1916CrossRef He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37:1904–1916CrossRef
41.
go back to reference Haykin S (2007) Neural networks: a comprehensive foundation, 3rd edn. Prentice-Hall Inc., Upper Saddle River, NJ, USAMATH Haykin S (2007) Neural networks: a comprehensive foundation, 3rd edn. Prentice-Hall Inc., Upper Saddle River, NJ, USAMATH
42.
go back to reference Zhang GP (2000) Neural networks for classification: a survey. IEEE Trans Syst Man Cybern C (Appl Rev) 30:451–462CrossRef Zhang GP (2000) Neural networks for classification: a survey. IEEE Trans Syst Man Cybern C (Appl Rev) 30:451–462CrossRef
43.
go back to reference Vedaldi A, Lenc K (2015) MatConvNet—convolutional neural networks for MATLAB. In: Proceeding of association of computing machinery (ACM) international conference on multimedia Vedaldi A, Lenc K (2015) MatConvNet—convolutional neural networks for MATLAB. In: Proceeding of association of computing machinery (ACM) international conference on multimedia
44.
go back to reference Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25 (NIPS 2012), pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems 25 (NIPS 2012), pp 1097–1105
45.
go back to reference Sutton RS, Barto AG (1998) Softmax action selection. In: Reinforcement learning: an introduction, Adaptive Computation and Machine Learning series, vol 17, no 2. MIT Press, Cambridge, pp 229–235. ISBN 0-262-19398-1 Sutton RS, Barto AG (1998) Softmax action selection. In: Reinforcement learning: an introduction, Adaptive Computation and Machine Learning series, vol 17, no 2. MIT Press, Cambridge, pp 229–235. ISBN 0-262-19398-1
46.
48.
go back to reference Sumner M, Frank E, Hall M (2005) Speeding up logistic model tree induction. In: Jorge AM, Torgo L, Brazdil P, Camacho R, Gama J (eds) Proceedings of the knowledge discovery in databases: PKDD 2005: 9th European conference on principles and practice of knowledge discovery in databases, Porto. Springer, Berlin, pp 675–683, 3–7 Oct 2005 Sumner M, Frank E, Hall M (2005) Speeding up logistic model tree induction. In: Jorge AM, Torgo L, Brazdil P, Camacho R, Gama J (eds) Proceedings of the knowledge discovery in databases: PKDD 2005: 9th European conference on principles and practice of knowledge discovery in databases, Porto. Springer, Berlin, pp 675–683, 3–7 Oct 2005
49.
go back to reference Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques with Java implementations, 3rd edn. Morgan Kaufmann Publishers Inc., Burlington, USA Witten IH, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques with Java implementations, 3rd edn. Morgan Kaufmann Publishers Inc., Burlington, USA
50.
go back to reference Zhao L, Chen Y, Schaffner DW (2001) Comparison of logistic regression and linear regression in modeling percentage data. Appl Environ Microbiol 67: 2129–2135, Received 11 March Accepted 27 Feb 2001 Zhao L, Chen Y, Schaffner DW (2001) Comparison of logistic regression and linear regression in modeling percentage data. Appl Environ Microbiol 67: 2129–2135, Received 11 March Accepted 27 Feb 2001
51.
go back to reference Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11:10–18CrossRef Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11:10–18CrossRef
52.
go back to reference Dalal N (2006) Finding people in images and videos. Doctoral Dissertation, Grenoble Institute of Technology, Grenoble, France Dalal N (2006) Finding people in images and videos. Doctoral Dissertation, Grenoble Institute of Technology, Grenoble, France
54.
go back to reference Dollar P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: a benchmark. In: IEEE conference on computer vision and pattern recognition CVPR 2009, pp 304–311 Dollar P, Wojek C, Schiele B, Perona P (2009) Pedestrian detection: a benchmark. In: IEEE conference on computer vision and pattern recognition CVPR 2009, pp 304–311
55.
go back to reference Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34:743–761CrossRef Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34:743–761CrossRef
56.
go back to reference McNemar Q (1947) Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12:153–157CrossRef McNemar Q (1947) Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12:153–157CrossRef
57.
Metadata
Title
Object detection using hybridization of static and dynamic feature spaces and its exploitation by ensemble classification
Authors
Iqbal Murtza
Asifullah Khan
Naeem Akhtar
Publication date
05-07-2017
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 2/2019
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-017-3050-4

Other articles of this Issue 2/2019

Neural Computing and Applications 2/2019 Go to the issue

Premium Partner