Skip to main content
Erschienen in: Neural Processing Letters 2/2017

03.04.2017

Scene Classification Using Multi-Resolution WAHOLB Features and Neural Network Classifier

verfasst von: Gholam Ali Montazer, Davar Giveki

Erschienen in: Neural Processing Letters | Ausgabe 2/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This article approaches scene classification problem by proposing an enhanced bag of features (BoF) model and a modified radial basis function neural network (RBFNN) classifier. The proposed BoF model integrates the image features extracted by histogram of oriented gradients, local binary pattern and wavelet coefficients. The extracted features are obtained in a hierarchical multi-resolution manner. The proposed approach is able to capture multi-level (the pixel-, patch-, and image-level) features. The histograms of features constructed by BoF model are then used for training a modified RBFNN classifier. As a modification, we propose using a new variant of particle swarm optimization, in which the parameters are updated adaptively, for determining the center of Gaussian functions in RBFNN. Experimental results demonstrate that our proposed approach significantly outperforms the state-of-the-art methods on scene classification of OT, FP, and LSP benchmark datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alexandridis A, Chondrodima E, Sarimveis H (2013) Radial basis function network training using a nonsymmetric partition of the input space and particle swarm optimization. IEEE Trans Neural Netw Learn 24(2):219–230. doi:10.1109/Tnnls.2012.2227794 CrossRef Alexandridis A, Chondrodima E, Sarimveis H (2013) Radial basis function network training using a nonsymmetric partition of the input space and particle swarm optimization. IEEE Trans Neural Netw Learn 24(2):219–230. doi:10.​1109/​Tnnls.​2012.​2227794 CrossRef
4.
Zurück zum Zitat Bosch A, Zisserman A, Munoz X (2006) Scene classification via pLSA. In: Proceedings of computer vision-Eccv 2006, Pt 4, 3954: 517–530 Bosch A, Zisserman A, Munoz X (2006) Scene classification via pLSA. In: Proceedings of computer vision-Eccv 2006, Pt 4, 3954: 517–530
5.
Zurück zum Zitat Bosch A, Zisserman A, Munoz X (2007) Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM international conference on image and video retrieval. ACM, pp 401–408 Bosch A, Zisserman A, Munoz X (2007) Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM international conference on image and video retrieval. ACM, pp 401–408
6.
Zurück zum Zitat Box GE, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: forecasting and control. Wiley, HobokenMATH Box GE, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: forecasting and control. Wiley, HobokenMATH
8.
Zurück zum Zitat Chan T, Jia K, Gao S, Lu J, Zeng Z, Ma YP (2014) A simple deep learning baseline for image classification? arXiv preprint. arXiv preprint arXiv:1404.3606 1(3) Chan T, Jia K, Gao S, Lu J, Zeng Z, Ma YP (2014) A simple deep learning baseline for image classification? arXiv preprint. arXiv preprint arXiv:​1404.​3606 1(3)
9.
Zurück zum Zitat Chen XY (2007) Deformation measurement of the large flexible surface by improved RBFNN algorithm and BPNN algorithm. In: Proceedings advances in neural networks-ISNN 2007, Pt 3, 4493: 41–48 Chen XY (2007) Deformation measurement of the large flexible surface by improved RBFNN algorithm and BPNN algorithm. In: Proceedings advances in neural networks-ISNN 2007, Pt 3, 4493: 41–48
10.
Zurück zum Zitat Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of CVPR IEEE, pp 886–893 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of CVPR IEEE, pp 886–893
11.
Zurück zum Zitat Dong C-R, Chan PP, Ng WW, Yeung DS (2011) A survey of the initialization of centers and widths in radial basis function network for classification. In: 2011 IEEE international conference on machine learning and cybernetics (ICMLC), pp 1082–1087 Dong C-R, Chan PP, Ng WW, Yeung DS (2011) A survey of the initialization of centers and widths in radial basis function network for classification. In: 2011 IEEE international conference on machine learning and cybernetics (ICMLC), pp 1082–1087
12.
Zurück zum Zitat Fan R-E, Chang K-W, Hsieh C-J, Wang X-R, Lin C-J (2008) LIBLINEAR: a library for large linear classification. J Mach Learn Res 9:1871–1874MATH Fan R-E, Chang K-W, Hsieh C-J, Wang X-R, Lin C-J (2008) LIBLINEAR: a library for large linear classification. J Mach Learn Res 9:1871–1874MATH
13.
Zurück zum Zitat Fan H, Zhou E (2016) Approaching human level facial landmark localization by deep learning. Image Vis Comput 47:27–35CrossRef Fan H, Zhou E (2016) Approaching human level facial landmark localization by deep learning. Image Vis Comput 47:27–35CrossRef
14.
Zurück zum Zitat Farhidzadeh H, Zhou M, Goldgof DB, Hall LO, Raghavan M, Gatenby RA (2014) Prediction of treatment response and metastatic disease in soft tissue sarcoma. Med Imaging Comput Aided Diagn. doi:10.1117/12.2043792 Farhidzadeh H, Zhou M, Goldgof DB, Hall LO, Raghavan M, Gatenby RA (2014) Prediction of treatment response and metastatic disease in soft tissue sarcoma. Med Imaging Comput Aided Diagn. doi:10.​1117/​12.​2043792
15.
Zurück zum Zitat Farhidzadeh H, Chaudhury B, Zhou M, Goldgof DB, Hall LO, Gatenby RA, Gillies RJ, Raghavan M (2015) Prediction of treatment outcome in soft tissue sarcoma based on radiologically defined habitats. Proc Spie. doi:10.1117/12.2082324 Farhidzadeh H, Chaudhury B, Zhou M, Goldgof DB, Hall LO, Gatenby RA, Gillies RJ, Raghavan M (2015) Prediction of treatment outcome in soft tissue sarcoma based on radiologically defined habitats. Proc Spie. doi:10.​1117/​12.​2082324
16.
Zurück zum Zitat Farhidzadeh H, Goldgof DB, Hall LO, Gatenby RA, Gillies RJ, Raghavan M (2015) Texture feature analysis to predict metastatic and necrotic soft tissue sarcomas. IEEE Syst Man Cybern. doi:10.1109/Smc.2015.488 Farhidzadeh H, Goldgof DB, Hall LO, Gatenby RA, Gillies RJ, Raghavan M (2015) Texture feature analysis to predict metastatic and necrotic soft tissue sarcomas. IEEE Syst Man Cybern. doi:10.​1109/​Smc.​2015.​488
17.
Zurück zum Zitat Farhidzadeh H, Kim JY, Scott JG, Goldgof DB, Hall LO, Harrison LB (2016) Classification of progression free survivalwith nasopharyngeal carcinoma tumors. In: SPIE medical imaging, international society for optics and photonics Farhidzadeh H, Kim JY, Scott JG, Goldgof DB, Hall LO, Harrison LB (2016) Classification of progression free survivalwith nasopharyngeal carcinoma tumors. In: SPIE medical imaging, international society for optics and photonics
19.
Zurück zum Zitat Fei-Fei L, Perona P (2005) A Bayesian hierarchical model for learning natural scene categories 2005. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition, 2: 524–531 Fei-Fei L, Perona P (2005) A Bayesian hierarchical model for learning natural scene categories 2005. In: Proceedings of IEEE computer society conference on computer vision and pattern recognition, 2: 524–531
22.
24.
Zurück zum Zitat Huang X, Li SZ, Wang Y (2004) Shape localization based on statistical method using extended local binary pattern. In: Third international conference on IEEE image and graphics (ICIG’04), pp 184–187 Huang X, Li SZ, Wang Y (2004) Shape localization based on statistical method using extended local binary pattern. In: Third international conference on IEEE image and graphics (ICIG’04), pp 184–187
25.
Zurück zum Zitat Jain AK, Farrokhnia F (1990) Unsupervised texture segmentation using gabor filters. In: 1990 IEEE international conference on systems, man, and cybernetics: 14–19. doi:10.1109/Icsmc.1990.142050 Jain AK, Farrokhnia F (1990) Unsupervised texture segmentation using gabor filters. In: 1990 IEEE international conference on systems, man, and cybernetics: 14–19. doi:10.​1109/​Icsmc.​1990.​142050
26.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems. pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems. pp 1097–1105
27.
Zurück zum Zitat Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR’06). pp 2169–2178 Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR’06). pp 2169–2178
28.
Zurück zum Zitat Lin IC, Liou CY (2007) Least-mean-square training of cluster-weighted modeling. In: Proceedings artificial neural networks-ICANN, Pt 2, 4669: 301–310 Lin IC, Liou CY (2007) Least-mean-square training of cluster-weighted modeling. In: Proceedings artificial neural networks-ICANN, Pt 2, 4669: 301–310
29.
Zurück zum Zitat Loo CK, Rajeswari M, Rao MVC (2004) Novel direct and self-regulating approaches to determine optimum growing multi-experts network structure. IEEE Trans Neural Netw 15(6):1378–1395. doi:10.1109/Tnn.2004.837779 CrossRef Loo CK, Rajeswari M, Rao MVC (2004) Novel direct and self-regulating approaches to determine optimum growing multi-experts network structure. IEEE Trans Neural Netw 15(6):1378–1395. doi:10.​1109/​Tnn.​2004.​837779 CrossRef
31.
Zurück zum Zitat Meng X, Wang Z, Wu L (2012) Building global image features for scene recognition. Pattern Recognit 45(1):373–380CrossRef Meng X, Wang Z, Wu L (2012) Building global image features for scene recognition. Pattern Recognit 45(1):373–380CrossRef
35.
Zurück zum Zitat Montazer GA, Giveki D (2015) Content based image retrieval system using clustered scale invariant feature transforms. Opt Int J Light Electron Opt 126(18):1695–1699CrossRef Montazer GA, Giveki D (2015) Content based image retrieval system using clustered scale invariant feature transforms. Opt Int J Light Electron Opt 126(18):1695–1699CrossRef
39.
Zurück zum Zitat Patrinos P, Alexandridis A, Ninos K, Sarimveis H (2010) Variable selection in nonlinear modeling based on RBF networks and evolutionary computation. Int J Neural Syst 20(05):365–379CrossRef Patrinos P, Alexandridis A, Ninos K, Sarimveis H (2010) Variable selection in nonlinear modeling based on RBF networks and evolutionary computation. Int J Neural Syst 20(05):365–379CrossRef
40.
Zurück zum Zitat Prechelt L (1994) Proben1: A set of neural network benchmark problems and benchmarking rules Prechelt L (1994) Proben1: A set of neural network benchmark problems and benchmarking rules
41.
Zurück zum Zitat Qin J, Yung NH (2010) Scene categorization via contextual visual words. Pattern Recognit 43(5):1874–1888CrossRefMATH Qin J, Yung NH (2010) Scene categorization via contextual visual words. Pattern Recognit 43(5):1874–1888CrossRefMATH
42.
Zurück zum Zitat Qin J, Yung NH (2012) Feature fusion within local region using localized maximum-margin learning for scene categorization. Pattern Recognit 45(4):1671–1683CrossRef Qin J, Yung NH (2012) Feature fusion within local region using localized maximum-margin learning for scene categorization. Pattern Recognit 45(4):1671–1683CrossRef
43.
Zurück zum Zitat Quelhas P, Monay F, Odobez JM, Gatica-Perez D, Tuytelaars T, Van Gool L (2005) Modeling scenes with local descriptors and latent aspects. In: IEEE international conference on computer vision: 883–890 Quelhas P, Monay F, Odobez JM, Gatica-Perez D, Tuytelaars T, Van Gool L (2005) Modeling scenes with local descriptors and latent aspects. In: IEEE international conference on computer vision: 883–890
44.
Zurück zum Zitat Samad T (1991) Back propagation with expected source values. Neural Netw 4(5):615–618CrossRef Samad T (1991) Back propagation with expected source values. Neural Netw 4(5):615–618CrossRef
46.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:​1409.​1556
48.
Zurück zum Zitat Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038CrossRef Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038CrossRef
50.
Zurück zum Zitat Tian Y, Luo P, Wang X, Tang X (2015) Deep learning strong parts for pedestrian detection. In: Proceedings of the IEEE international conference on computer vision. pp 1904–1912 Tian Y, Luo P, Wang X, Tang X (2015) Deep learning strong parts for pedestrian detection. In: Proceedings of the IEEE international conference on computer vision. pp 1904–1912
52.
Zurück zum Zitat Wang XY, Han TX, Yan SC (2009) An HOG-LBP human detector with partial occlusion handling. In: 2009 IEEE 12th international conference on computer vision (ICCV), pp 32–39. doi:10.1109/Iccv.2009.5459207 Wang XY, Han TX, Yan SC (2009) An HOG-LBP human detector with partial occlusion handling. In: 2009 IEEE 12th international conference on computer vision (ICCV), pp 32–39. doi:10.​1109/​Iccv.​2009.​5459207
53.
Zurück zum Zitat Wang W, Yang X, Ooi BC, Zhang D, Zhuang Y (2016) Effective deep learning-based multi-modal retrieval. VLDB J 25(1):79–101CrossRef Wang W, Yang X, Ooi BC, Zhang D, Zhuang Y (2016) Effective deep learning-based multi-modal retrieval. VLDB J 25(1):79–101CrossRef
54.
Zurück zum Zitat Wang Y, Gong S (2007) Conditional random field for natural scene categorization. In: BMVC. Citeseer, pp 1–10 Wang Y, Gong S (2007) Conditional random field for natural scene categorization. In: BMVC. Citeseer, pp 1–10
55.
Zurück zum Zitat Wang R, Tao D (2016) Non-local auto-encoder with collaborative stabilization for image restoration. IEEE Trans Image Process 25(5):2117–2129MathSciNetCrossRef Wang R, Tao D (2016) Non-local auto-encoder with collaborative stabilization for image restoration. IEEE Trans Image Process 25(5):2117–2129MathSciNetCrossRef
56.
Zurück zum Zitat Wang S, Wang Y, Zhu S-C (2012) Hierarchical space tiling for scene modeling. In: Asian conference on computer vision. Springer, pp 796–810 Wang S, Wang Y, Zhu S-C (2012) Hierarchical space tiling for scene modeling. In: Asian conference on computer vision. Springer, pp 796–810
57.
Zurück zum Zitat Wang N, Yeung D-Y (2013) Learning a deep compact image representation for visual tracking. In: Advances in neural information processing systems. pp 809–817 Wang N, Yeung D-Y (2013) Learning a deep compact image representation for visual tracking. In: Advances in neural information processing systems. pp 809–817
61.
Zurück zum Zitat Yu J, Rui Y, Tang YY, Tao D (2014) High-order distance-based multiview stochastic learning in image classification. IEEE Trans Cybern 44(12):2431–2442CrossRef Yu J, Rui Y, Tang YY, Tao D (2014) High-order distance-based multiview stochastic learning in image classification. IEEE Trans Cybern 44(12):2431–2442CrossRef
62.
Zurück zum Zitat Zhang WC, Shan SG, Gao W, Chen XL, Zhang HM (2005) Local gabor binary pattern histogram sequence (LGBPHS): a novel non-statistical model for face representation and recognition. IEEE Int Conf Comput Vis 1:786–791 Zhang WC, Shan SG, Gao W, Chen XL, Zhang HM (2005) Local gabor binary pattern histogram sequence (LGBPHS): a novel non-statistical model for face representation and recognition. IEEE Int Conf Comput Vis 1:786–791
63.
Zurück zum Zitat Zhang S, Tian Q, Hua G, Huang Q, Gao W (2014) ObjectPatchNet: towards scalable and semantic image annotation and retrieval. Comput Vis Image Underst 118:16–29CrossRef Zhang S, Tian Q, Hua G, Huang Q, Gao W (2014) ObjectPatchNet: towards scalable and semantic image annotation and retrieval. Comput Vis Image Underst 118:16–29CrossRef
64.
Zurück zum Zitat Zheng YB, Huang XS, Feng SJ (2010) An image matching algorithm based on combination of SIFT and the rotation invariant LBP. J Comput Aided Design Comput Gr 22(2):286–292 Zheng YB, Huang XS, Feng SJ (2010) An image matching algorithm based on combination of SIFT and the rotation invariant LBP. J Comput Aided Design Comput Gr 22(2):286–292
65.
Zurück zum Zitat Zhou L, Zhou Z, Hu D (2013) Scene classification using a multi-resolution bag-of-features model. Pattern Recognit 46(1):424–433CrossRef Zhou L, Zhou Z, Hu D (2013) Scene classification using a multi-resolution bag-of-features model. Pattern Recognit 46(1):424–433CrossRef
66.
Zurück zum Zitat Zhu Z, Wang X, Bai S, Yao C, Bai X (2016) Deep learning representation using autoencoder for 3d shape retrieval. Neurocomputing 204:41–50CrossRef Zhu Z, Wang X, Bai S, Yao C, Bai X (2016) Deep learning representation using autoencoder for 3d shape retrieval. Neurocomputing 204:41–50CrossRef
Metadaten
Titel
Scene Classification Using Multi-Resolution WAHOLB Features and Neural Network Classifier
verfasst von
Gholam Ali Montazer
Davar Giveki
Publikationsdatum
03.04.2017
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 2/2017
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-017-9614-6

Weitere Artikel der Ausgabe 2/2017

Neural Processing Letters 2/2017 Zur Ausgabe