Skip to main content
Top

2017 | OriginalPaper | Chapter

A Novel Method for Scene Classification Feeding Mid-Level Image Patch to Convolutional Neural Networks

Authors : Fei Yang, Jinfu Yang, Ying Wang, Gaoming Zhang

Published in: Information Technology and Intelligent Transportation Systems

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Scene classification is an important task for computer vision, and Convolutional Neural Networks, a model of deep learning, is widely used for object classification. However, they rely on pooling and large fully connected layers to combine information from spatially disparate regions; these operations can throw away useful fine-grained information, and in natural scenes, there are many useless information which will increase computation cost. In this paper, mid-level discriminative patches are utilized to pre-process the full images. The proposed method which combines mid-level discriminative patches for preprocessing with CNN for feature extraction improved the efficiency of computation and are more suitable for classifying scenes. Firstly, full images are divided into discriminative parts. Then utilize these patches to go through CNN for feature extraction. Finally, a support vector machine will be used to classify the scenes. Experimental evaluations using MIT 67 indoor dataset performs well and proved that proposed method can be applied to scene classification.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Dalal N, Triggs B (2015) Histograms of oriented gradients for human detection. In: CVPR Dalal N, Triggs B (2015) Histograms of oriented gradients for human detection. In: CVPR
2.
go back to reference Lowe DG (2003) Distinctive image features from scale-invariant keypoints Lowe DG (2003) Distinctive image features from scale-invariant keypoints
3.
go back to reference Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: ECCV workshop Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: ECCV workshop
4.
go back to reference Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR
5.
go back to reference Perronnin F, Sanchez J, Mensink T (2010) Improving the fisher kernel for large-scale image classification. In: ECCV Perronnin F, Sanchez J, Mensink T (2010) Improving the fisher kernel for large-scale image classification. In: ECCV
6.
go back to reference Pandey M, Lazebnik S (2011) Scene recognition and weakly supervised object localization with deformable part-based models. In: ICCV Pandey M, Lazebnik S (2011) Scene recognition and weakly supervised object localization with deformable part-based models. In: ICCV
7.
go back to reference LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
8.
go back to reference Gong Y, Wang L, Guo R, Lazebnik S (2014) Multi-scale orderless pooling of deep convo-lutional activation features. In: ECCV Gong Y, Wang L, Guo R, Lazebnik S (2014) Multi-scale orderless pooling of deep convo-lutional activation features. In: ECCV
9.
go back to reference Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: NIPS Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: NIPS
10.
go back to reference Doersch C, Gupta A, Efros A (2013) Mid-level visual element discovery as discriminative mode seeking. In: NIPS Doersch C, Gupta A, Efros A (2013) Mid-level visual element discovery as discriminative mode seeking. In: NIPS
11.
go back to reference Dixit M, Chen S (2015) Scene classification with semantic fisher vectors. In: CVPR Dixit M, Chen S (2015) Scene classification with semantic fisher vectors. In: CVPR
12.
go back to reference Liu L, Shen C, van den Hengel A (2015) The treasure beneath convolutional layers: cross-convolutional-layer pooling for image classification. In: CVPR Liu L, Shen C, van den Hengel A (2015) The treasure beneath convolutional layers: cross-convolutional-layer pooling for image classification. In: CVPR
13.
go back to reference He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional net-works for visual recognition. In: ECCV He K, Zhang X, Ren S, Sun J (2014) Spatial pyramid pooling in deep convolutional net-works for visual recognition. In: ECCV
14.
go back to reference Dai J, He K, Sun J (2015) Convolutional feature masking for joint object and stuff segmentation. In: CVPR Dai J, He K, Sun J (2015) Convolutional feature masking for joint object and stuff segmentation. In: CVPR
15.
go back to reference Ciresan DC, Giusti A, Gambardella LM, Schmidhuber J (2012) Deep neural networks segment neuronal membranes in electron microscopy images. In: NIPS, pp 2852–2860 Ciresan DC, Giusti A, Gambardella LM, Schmidhuber J (2012) Deep neural networks segment neuronal membranes in electron microscopy images. In: NIPS, pp 2852–2860
16.
go back to reference Farabet C, Couprie C, Najman L, LeCun Y (2013) Learning hierarchical features for scene labeling. IEEE transactions on pattern analysis and machine intelligence Farabet C, Couprie C, Najman L, LeCun Y (2013) Learning hierarchical features for scene labeling. IEEE transactions on pattern analysis and machine intelligence
17.
go back to reference Hariharan B, Arbelaez P, Girshick R, Malik J (2014) Simultaneous detection and segmentation. In: european conference on computer vision (ECCV) Hariharan B, Arbelaez P, Girshick R, Malik J (2014) Simultaneous detection and segmentation. In: european conference on computer vision (ECCV)
18.
go back to reference Pinheiro PH (2014) Recurrent convolutional neural networks for scene labelling. In: ICML Pinheiro PH (2014) Recurrent convolutional neural networks for scene labelling. In: ICML
19.
go back to reference Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2014) Overfeat: integrated recognition, localization and detection using convolutional networks. In: ICLR Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2014) Overfeat: integrated recognition, localization and detection using convolutional networks. In: ICLR
20.
go back to reference Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR
21.
go back to reference Shen W, Wang X, Wang Y (2015) DeepContour: a deep convolutional feature learned by positive-sharing loss for contour detection. In: CVPR Shen W, Wang X, Wang Y (2015) DeepContour: a deep convolutional feature learned by positive-sharing loss for contour detection. In: CVPR
22.
go back to reference Uijlings JRR, Ferrari V (2015) Situational object boundary detection. In: CVPR Uijlings JRR, Ferrari V (2015) Situational object boundary detection. In: CVPR
23.
go back to reference Albaradei S, Wang Y (2014) Learning mid-level features from object hierarchy for image classification. In: WACV Albaradei S, Wang Y (2014) Learning mid-level features from object hierarchy for image classification. In: WACV
24.
go back to reference Zhao R, Ouyang W, Wang X (2014) Learning mid-level filters for person reidentification. In: IEEE conference on computer vision and pattern recognition Zhao R, Ouyang W, Wang X (2014) Learning mid-level filters for person reidentification. In: IEEE conference on computer vision and pattern recognition
25.
go back to reference Singh S, Gupta A, Efros AA (2013) Representing videos using mid-level discriminative patches. In: CVPR Singh S, Gupta A, Efros AA (2013) Representing videos using mid-level discriminative patches. In: CVPR
26.
go back to reference Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: CVPR Boureau Y-L, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: CVPR
27.
go back to reference Quattoni A, Torralba A (2009) Recognizing indoor scenes. In: CVPR Quattoni A, Torralba A (2009) Recognizing indoor scenes. In: CVPR
28.
go back to reference Li L.-J, Su H, Fei-Fei L, Xing EP (2010) Object bank: a high-level image representation for scene classification and semantic feature sparsification. In: NIPS Li L.-J, Su H, Fei-Fei L, Xing EP (2010) Object bank: a high-level image representation for scene classification and semantic feature sparsification. In: NIPS
29.
go back to reference Singh S, Gupta A, Efros AA (2012) Unsupervised discovery of mid-level discriminative patches. In: ECCV Singh S, Gupta A, Efros AA (2012) Unsupervised discovery of mid-level discriminative patches. In: ECCV
30.
go back to reference Lin D, Lu C, Liao R, Jia J (2014) Learning important spatial pooling regions for scene classification. In: IEEE conference on computer vision and pattern recognition Lin D, Lu C, Liao R, Jia J (2014) Learning important spatial pooling regions for scene classification. In: IEEE conference on computer vision and pattern recognition
31.
go back to reference Cheng G, Han J, Guo L, Liu T (2015) Learning coarse-to-fine sparselets for efficient object detection and scene classification. In: CVPR Cheng G, Han J, Guo L, Liu T (2015) Learning coarse-to-fine sparselets for efficient object detection and scene classification. In: CVPR
32.
go back to reference Sivic J, Zisserman A (2003) Video Google: a text retrieval approach to object matching in videos. In: ICCV Sivic J, Zisserman A (2003) Video Google: a text retrieval approach to object matching in videos. In: ICCV
33.
go back to reference Kim G, Torralba A (2009) Unsupervised detection of regions of interest using iterative link analysis. In: NIPS Kim G, Torralba A (2009) Unsupervised detection of regions of interest using iterative link analysis. In: NIPS
Metadata
Title
A Novel Method for Scene Classification Feeding Mid-Level Image Patch to Convolutional Neural Networks
Authors
Fei Yang
Jinfu Yang
Ying Wang
Gaoming Zhang
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-38771-0_34

Premium Partner