Skip to main content

2015 | OriginalPaper | Buchkapitel

Orthogonal and Smooth Subspace Based on Sparse Coding for Image Classification

verfasst von : Fushuang Dai, Yao Zhao, Dongxia Chang, Chunyu Lin

Erschienen in: Advances in Multimedia Information Processing -- PCM 2015

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many real-world problems usually deal with high-dimensional data, such as images, videos, text, web documents and so on. In fact, the classification algorithms used to process these high-dimensional data often suffer from the low accuracy and high computational complexity. Therefore, we propose a framework of transforming images from a high-dimensional image space to a low-dimensional target image space, based on learning an orthogonal smooth subspace for the SIFT sparse codes (SC-OSS). It is a two stage framework for subspace learning. Firstly, a sparse coding followed by spatial pyramid max pooling is used to get the image representation. Then, the image descriptor is mapped into an orthonormal and smooth subspace to classify images in low dimension. The proposed algorithm adds the orthogonality and a Laplacian smoothing penalty to constrain the projective function coefficient to be orthogonal and spatially smooth. The experimental results on the public datasets have shown that the proposed algorithm outperforms other subspace methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)CrossRef
2.
Zurück zum Zitat Bai, E.: Big data: The curse of dimensionality in modeling. In: Control Conference, pp.6–13. IEEE press, Chinese (2014) Bai, E.: Big data: The curse of dimensionality in modeling. In: Control Conference, pp.6–13. IEEE press, Chinese (2014)
3.
Zurück zum Zitat Lou, X., Huang, D., Fan, L., et al.: An image classification algorithm based on bag of visual words and multi-kernel learning. J. Multimedia 9(2), 269–277 (2014)CrossRef Lou, X., Huang, D., Fan, L., et al.: An image classification algorithm based on bag of visual words and multi-kernel learning. J. Multimedia 9(2), 269–277 (2014)CrossRef
4.
Zurück zum Zitat Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recogn. 2, 2169–2178 (2006) Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recogn. 2, 2169–2178 (2006)
5.
Zurück zum Zitat Yang, J., Yu, K., Gong, Y., et al.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1794–1801 (2009) Yang, J., Yu, K., Gong, Y., et al.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1794–1801 (2009)
6.
Zurück zum Zitat Yan, Y., Zhang, Y.: Discriminant projection embedding for face and palmprint recognition. Neurocomputing 17, 3534–3543 (2008)CrossRef Yan, Y., Zhang, Y.: Discriminant projection embedding for face and palmprint recognition. Neurocomputing 17, 3534–3543 (2008)CrossRef
7.
Zurück zum Zitat Zhou, J., Jin, Z., Yang, J.: Multiscale saliency detection using principle component analysis. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–6. IEEE press, Brisbane (2012) Zhou, J., Jin, Z., Yang, J.: Multiscale saliency detection using principle component analysis. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1–6. IEEE press, Brisbane (2012)
8.
Zurück zum Zitat Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-Inter science, Hoboken, NJ (2000)MATH Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-Inter science, Hoboken, NJ (2000)MATH
9.
Zurück zum Zitat Hou, C., Nie, F., Zhang, C., et al.: Learning an orthogonal and smooth subspace for image classification. Signal Process. Lett. 16(4), 303–306 (2009). IEEE pressCrossRef Hou, C., Nie, F., Zhang, C., et al.: Learning an orthogonal and smooth subspace for image classification. Signal Process. Lett. 16(4), 303–306 (2009). IEEE pressCrossRef
10.
Zurück zum Zitat Lee, H., Battle, A., Raina, R., et al.: Efficient sparse coding algorithms. In: Advances in neural information processing systems, pp.801–808 (2006) Lee, H., Battle, A., Raina, R., et al.: Efficient sparse coding algorithms. In: Advances in neural information processing systems, pp.801–808 (2006)
11.
Zurück zum Zitat Liu, F., Liu, X.: Locality enhanced spectral embedding and spatially smooth spectral regression for face recognition. In: Information and Automation, pp.299–303. IEEE press, Shenyang (2012) Liu, F., Liu, X.: Locality enhanced spectral embedding and spatially smooth spectral regression for face recognition. In: Information and Automation, pp.299–303. IEEE press, Shenyang (2012)
12.
Zurück zum Zitat Ye, J.: Characterization of a family of algorithms for generalized discriminant analysis on undersampled problems. J. Mach. Learn. Res. 6, 483–502 (2005)MathSciNetMATH Ye, J.: Characterization of a family of algorithms for generalized discriminant analysis on undersampled problems. J. Mach. Learn. Res. 6, 483–502 (2005)MathSciNetMATH
13.
Zurück zum Zitat Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. Comput. Vis. Pattern Recogn. 2, 994–1000 (2005) Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. Comput. Vis. Pattern Recogn. 2, 994–1000 (2005)
14.
Zurück zum Zitat Wang, J., Yang, J., Yu, K., et al.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp.3360–3367 (2010) Wang, J., Yang, J., Yu, K., et al.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp.3360–3367 (2010)
Metadaten
Titel
Orthogonal and Smooth Subspace Based on Sparse Coding for Image Classification
verfasst von
Fushuang Dai
Yao Zhao
Dongxia Chang
Chunyu Lin
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-24078-7_5

Neuer Inhalt