Skip to main content
Erschienen in: Multimedia Systems 2/2015

01.03.2015 | Special Issue Paper

A new discriminative coding method for image classification

verfasst von: Xiaoshan Yang, Tianzhu Zhang, Changsheng Xu

Erschienen in: Multimedia Systems | Ausgabe 2/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The bag-of-words (BOW) based methods are widely used in image classification. However, huge number of visual information is omitted inevitably in the quantization step of the BOW. Recently, NBNN and its improved methods like Local NBNN were proposed to solve this problem. Nevertheless, these methods do not perform better than the state-of-the-art BOW based methods. In this paper, based on the advantages of BOW and Local NBNN, we introduce a novel locality discriminative coding (LDC) method. We convert each low level local feature, such as SIFT, into code vector using the Local Feature-to-Class distance other than by k-means quantization. After coding, sum-pooling combined with SPM is used to construct a single feature representation vector for each image. Extensive experimental results on several challenging benchmark datasets show that our LDC method outperforms six state-of-the-art image classification methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bao, B.K., Li, T., Yan, S.: Hidden-concept driven multilabel image annotation and label ranking. IEEE IEEE Trans Multimed. 14(1), 199–210 (2012)CrossRef Bao, B.K., Li, T., Yan, S.: Hidden-concept driven multilabel image annotation and label ranking. IEEE IEEE Trans Multimed. 14(1), 199–210 (2012)CrossRef
2.
Zurück zum Zitat Bao, B.K., Zhu, G., Shen, J., Yan, S.: Robust image analysis with sparse representation on quantized visual features. IEEE Trans Image Process. 22(3), 860–871 (2013)CrossRefMathSciNet Bao, B.K., Zhu, G., Shen, J., Yan, S.: Robust image analysis with sparse representation on quantized visual features. IEEE Trans Image Process. 22(3), 860–871 (2013)CrossRefMathSciNet
3.
Zurück zum Zitat Behmo, R., Marcombes, P., Dalalyan, A., Prinet, V.: Towards optimal naive bayes nearest neighbor. In: Proceedings of the 11th European Conference on Computer Vision, pp. 171–184 (2010) Behmo, R., Marcombes, P., Dalalyan, A., Prinet, V.: Towards optimal naive bayes nearest neighbor. In: Proceedings of the 11th European Conference on Computer Vision, pp. 171–184 (2010)
4.
Zurück zum Zitat Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008) Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
5.
Zurück zum Zitat Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp. 1–22 (2004) Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp. 1–22 (2004)
6.
Zurück zum Zitat Duda, R.O., Hart, P.E., Stork, D.G.: Pattern classification, 2nd edn. Wiley Interscience (2000) Duda, R.O., Hart, P.E., Stork, D.G.: Pattern classification, 2nd edn. Wiley Interscience (2000)
7.
Zurück zum Zitat Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: Liblinear: A library for large linear classification. J Mach Learn Res. 9, 1871–1874 (2008)MATH Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: Liblinear: A library for large linear classification. J Mach Learn Res. 9, 1871–1874 (2008)MATH
8.
Zurück zum Zitat Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: workshop on generative-model based vision, CVPR (2004) Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. In: workshop on generative-model based vision, CVPR (2004)
9.
Zurück zum Zitat van Gemert, J., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.M.: Kernel codebooks for scene categorization. In: European Conference on Computer Vision, vol. 3, pp. 696–709 (2008) van Gemert, J., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.M.: Kernel codebooks for scene categorization. In: European Conference on Computer Vision, vol. 3, pp. 696–709 (2008)
10.
Zurück zum Zitat Huang, Y., Huang, K., Yu, Y., Tan, T.: Salient coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1753–1760 (2011) Huang, Y., Huang, K., Yu, Y., Tan, T.: Salient coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1753–1760 (2011)
11.
Zurück zum Zitat Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2169–2178 (2006) Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2169–2178 (2006)
12.
Zurück zum Zitat Li, L.J., Li, F.F.: What, where and who? classifying events by scene and object recognition. In: IEEE International Conference on Computer Vision, pp. 1–8 (2007) Li, L.J., Li, F.F.: What, where and who? classifying events by scene and object recognition. In: IEEE International Conference on Computer Vision, pp. 1–8 (2007)
13.
Zurück zum Zitat Liu, L., Wang, L., Liu, X.: In defense of soft-assignment coding. In: IEEE International Conference on Computer Vision, pp. 2486–2493 (2011) Liu, L., Wang, L., Liu, X.: In defense of soft-assignment coding. In: IEEE International Conference on Computer Vision, pp. 2486–2493 (2011)
14.
Zurück zum Zitat Liu, S., Feng, J., Song, Z., Zhang, T., Lu, H., Xu, C., Yan, S.: Hi, magic closet, tell me what to wear! In: Proceedings of the ACM International Conference on Multimedia, pp. 619–628 (2012) Liu, S., Feng, J., Song, Z., Zhang, T., Lu, H., Xu, C., Yan, S.: Hi, magic closet, tell me what to wear! In: Proceedings of the ACM International Conference on Multimedia, pp. 619–628 (2012)
15.
Zurück zum Zitat McCann, S., Lowe, D.G.: Local naive bayes nearest neighbor for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3650–3656 (2012) McCann, S., Lowe, D.G.: Local naive bayes nearest neighbor for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3650–3656 (2012)
16.
Zurück zum Zitat Nilsback, M.E., Zisserman, A.: A visual vocabulary for flower classification. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 1447–1454 (2006) Nilsback, M.E., Zisserman, A.: A visual vocabulary for flower classification. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 1447–1454 (2006)
17.
Zurück zum Zitat Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: IEEE International Conference on Computer Vision, vol. 2, pp. 1470-1477 (2003) Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: IEEE International Conference on Computer Vision, vol. 2, pp. 1470-1477 (2003)
18.
Zurück zum Zitat Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The nbnn kernel. In: IEEE International Conference on Computer Vision, pp. 1824–1831 (2011) Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The nbnn kernel. In: IEEE International Conference on Computer Vision, pp. 1824–1831 (2011)
19.
Zurück zum Zitat Wang, J., Yang, J., Yu, K., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3360–3367 (2010) Wang, J., Yang, J., Yu, K., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3360–3367 (2010)
20.
Zurück zum Zitat Yang, J., Yu, K., Gong, Y., Huang, T.S.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1794–1801 (2009) Yang, J., Yu, K., Gong, Y., Huang, T.S.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1794–1801 (2009)
21.
Zurück zum Zitat Yang, X., Zhang, T., Xu, C.: Locality discriminative coding for image classification. In: International Conference on Internet Multimedia Computing and Service, pp. 52–55 (2013) Yang, X., Zhang, T., Xu, C.: Locality discriminative coding for image classification. In: International Conference on Internet Multimedia Computing and Service, pp. 52–55 (2013)
22.
Zurück zum Zitat Yao, B., Fei-Fei, L.: Grouplet: A structured image representation for recognizing human and object interactions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 9–16 (2010) Yao, B., Fei-Fei, L.: Grouplet: A structured image representation for recognizing human and object interactions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 9–16 (2010)
23.
Zurück zum Zitat Yu, J., Liu, D., Tao, D., Seah, H.S.: On combining multiple features for cartoon character retrieval and clip synthesis. IEEE Trans Syst, Man Cybern, Part B. 42(5), 1413–1427 (2012)CrossRef Yu, J., Liu, D., Tao, D., Seah, H.S.: On combining multiple features for cartoon character retrieval and clip synthesis. IEEE Trans Syst, Man Cybern, Part B. 42(5), 1413–1427 (2012)CrossRef
24.
Zurück zum Zitat Yu, J., Tao, D., Rui, Y., Cheng, J.: Exploiting click constraints and multi-view features for image re-ranking. IEEE Trans Multimed. 16, 159–168 (2013)CrossRef Yu, J., Tao, D., Rui, Y., Cheng, J.: Exploiting click constraints and multi-view features for image re-ranking. IEEE Trans Multimed. 16, 159–168 (2013)CrossRef
25.
Zurück zum Zitat Yu, J., Tao, D., Rui, Y., Cheng, J.: Pairwise constraints based multiview features fusion for scene classification. Pattern Recogn. 46(2), 483–496 (2013)CrossRefMATH Yu, J., Tao, D., Rui, Y., Cheng, J.: Pairwise constraints based multiview features fusion for scene classification. Pattern Recogn. 46(2), 483–496 (2013)CrossRefMATH
26.
Zurück zum Zitat Yu, J., Tao, D., Wang, M.: Adaptive hypergraph learning and its application in image classification. IEEE Trans Image Process. 21(7), 3262–3272 (2012)CrossRefMathSciNet Yu, J., Tao, D., Wang, M.: Adaptive hypergraph learning and its application in image classification. IEEE Trans Image Process. 21(7), 3262–3272 (2012)CrossRefMathSciNet
27.
Zurück zum Zitat Yu, J., Wang, M., Tao, D.: Semisupervised multiview distance metric learning for cartoon synthesis. IEEE Trans Image Process. 21(11), 4636–4648 (2012)CrossRefMathSciNet Yu, J., Wang, M., Tao, D.: Semisupervised multiview distance metric learning for cartoon synthesis. IEEE Trans Image Process. 21(11), 4636–4648 (2012)CrossRefMathSciNet
28.
Zurück zum Zitat Zhang, T., Ghanem, B., Liu, S., Ahuja, N.: Low-rank sparse learning for robust visual tracking. In: European Conference on Computer Vision, pp. 470–484 (2012) Zhang, T., Ghanem, B., Liu, S., Ahuja, N.: Low-rank sparse learning for robust visual tracking. In: European Conference on Computer Vision, pp. 470–484 (2012)
29.
Zurück zum Zitat Zhang, T., Ghanem, B., Liu, S., Ahuja, N.: Robust visual tracking via multi-task sparse learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2042–2049 (2012) Zhang, T., Ghanem, B., Liu, S., Ahuja, N.: Robust visual tracking via multi-task sparse learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2042–2049 (2012)
30.
Zurück zum Zitat Zhang, T., Ghanem, B., Liu, S., Xu, C., Ahuja, N.: Low-rank sparse coding for image classification. In: IEEE International Conference on Computer Vision, pp. 281–288 (2013) Zhang, T., Ghanem, B., Liu, S., Xu, C., Ahuja, N.: Low-rank sparse coding for image classification. In: IEEE International Conference on Computer Vision, pp. 281–288 (2013)
31.
Zurück zum Zitat Zhang, T., Liu, S., Xu, C., Lu, H.: Mining semantic context information for intelligent video surveillance of traffic scenes. IEEE Trans Ind Inform. 9(1), 149–160 (2013)CrossRefMathSciNet Zhang, T., Liu, S., Xu, C., Lu, H.: Mining semantic context information for intelligent video surveillance of traffic scenes. IEEE Trans Ind Inform. 9(1), 149–160 (2013)CrossRefMathSciNet
32.
Zurück zum Zitat Zhang, T., Xu, C., Zhu, G., Liu, S., Lu, H.: A generic framework for video annotation via semi-supervised learning. IEEE Trans Multimed. 14(4), 1206–1219 (2012)CrossRef Zhang, T., Xu, C., Zhu, G., Liu, S., Lu, H.: A generic framework for video annotation via semi-supervised learning. IEEE Trans Multimed. 14(4), 1206–1219 (2012)CrossRef
Metadaten
Titel
A new discriminative coding method for image classification
verfasst von
Xiaoshan Yang
Tianzhu Zhang
Changsheng Xu
Publikationsdatum
01.03.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
Multimedia Systems / Ausgabe 2/2015
Print ISSN: 0942-4962
Elektronische ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-014-0376-y

Weitere Artikel der Ausgabe 2/2015

Multimedia Systems 2/2015 Zur Ausgabe

Neuer Inhalt