Skip to main content
Log in

Image classification using label constrained sparse coding

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Sparse coding has been widely used for feature encoding in recent years. However, the encoded parameters’ similarity is ignored with sparse coding. Besides, the label information from which class the local feature is extracted is also ignored. To solve this problem, in this paper, we propose a novel feature encoding method called label constrained sparse coding (LCSC) for visual representation. The visual similarities between local features are jointly considered with the corresponding label information of local features. This is achieved by combining the label constraints with the encoding of local features. In this way, we can ensure that similar local features with the same label are encoded with similar parameters. Local features with different labels are encoded with dissimilar parameters to increase the discriminative power of encoded parameters. Besides, instead of optimizing for the coding parameter of each local feature separately, we jointly encode the local features within one sub-region in the spatial pyramid way to combine the spatial and contextual information of local features. We apply this label constrained sparse coding technique for classification tasks on several public image datasets to evaluate its effectiveness. The experimental results shows the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Agarwal J, Bedi SS (2015) Implementation of hybrid image fusion technique for feature enhancement in medical diagnosis. Human-centric Computing and Information Sciences 5(1):1–3

    Article  Google Scholar 

  2. Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(24):509–521

    Article  Google Scholar 

  3. Benlamri R, Zhang X (2014) Context-aware recommender for mobile learners. Human-centric Computing and Information Sciences 4(1):1–12

    Article  Google Scholar 

  4. Boiman O, Shechtman E, Irani M (2008) In defense of nearest-neighbor based image classification. In: CVPR

  5. Datta R, Joshi D, Li J, Wang J (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60

    Article  Google Scholar 

  6. Fan B, Kong Q, Trzcinski T, Wang Z, Pan C, Fua P (2014) Receptive fields selection for binary feature description. IEEE Trans Image Process 23(6):2583–2595

    Article  MathSciNet  Google Scholar 

  7. Gao S, Tsang I, Chia L (2013) Laplacian sparse coding, hypergraph laplacian sparse coding, and applications. IEEE Trans Pattern Anal Mach Intell 35(1):92–104

    Article  Google Scholar 

  8. Gehler P, Nowozin S (2009) On feature combination for multiclass object classification. In: ICCV

  9. Gemert J, Veenman C, Smeulders A, Geusebroek J (2010) Visual word ambiguity. IEEE Trans Pattern Anal Mach Intell 32(7):1271–1283

    Article  Google Scholar 

  10. Ghimire D, Lee J (2014) Extreme learning machine ensemble using bagging for facial expression recognition. J Inf Process Syst 10(3):443–458

    Article  Google Scholar 

  11. Griffin G, Holub A, Perona P (2006) The caltech 256 dataset. Caltech Technical Report

  12. Harchaoui Z, Bach F (2007) Image classification with segmentation graph kernels. In: CVPR, pp 1–8

  13. Ho YS (2013) Challenging technical issues of 3D video processing. J Convergence 4(1):1–6

    Article  Google Scholar 

  14. Huang Y, Huang K, Yu Y, Tan T (2011) Salient coding for image classification. In: Proceedings of CVPR, pp 1753–1760

  15. Inoue N, Shinoda K (2013) Neighbor-to-neighbor search for fast coding of feature vectors. In: ICCV, pp 1233–1240

  16. Jegou H, Zisserman A (2014) Triangulation embedding and democratic aggregation for image search. In: CVPR

  17. Ji R, Yao H, Tian Q, Xu P, Sun X, Liu X (2012) Context-aware semi-local feature detector. ACM Trans Intell Syst Technol 3(3):44

    Article  Google Scholar 

  18. Kim J, Byun J, Jeong H (2013) Cloud AEHS: advanced learning system using user preferences. J Convergence 4(3):31–36

    Google Scholar 

  19. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2169–2178

  20. Lee H, Battle A, Raina R, Ng A (2007) Efficient sparse coding algorithms. In: Proceedings of the neural information processing systems

  21. Lee SH, Jung KH, Kang DW, Jung KH (2014) Pixel-based Fusion Algorithm for Multi-Focused Image by Comparison and Filtering of SML map. J Convergence 5(4):1–4

    Article  Google Scholar 

  22. LeCun Y, Kavukcuoglu K, Farabet C (2010) Convolutional networks and applications in vision. In: Proceedings of the International symposium on circuits systtem, pp 253–256

  23. Li L-J, Fei-Fei L (2007) What, where and who? Clasifying events by scene and object recognition. In: ICCV

  24. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vision 60:91–110

    Article  Google Scholar 

  25. Manh HT, Lee G (2013) Small object segmentation based on visual saliency in natural images. J Inf Process Syst 9(4):592–601

    Article  Google Scholar 

  26. Moosmann F, Nowak E, Jurie F (2008) Randomized clustering forests for image classification. IEEE Trans Pattern Anal Mach Intell 30(9):1632–1646

    Article  Google Scholar 

  27. Patil PB, Kokare MB (2013) Interactive semantic image retrieval. J Inf Process Syst 9(3):349–264

    Article  Google Scholar 

  28. Perronnin F (2008) Universal and adapted vocabularies for generic visual categorization. IEEE Trans Pattern Anal Mach Intell 30(7):1243–1256

    Article  Google Scholar 

  29. Ramirez I, Sprechmann P, Sapiro G (2010) Classification and clustering via dictionary learning with structured incoherence and shared features. In: Proceedings of CVPR

  30. Rueda A, Gonzalez F, Romero E (2014) Extracting salient brain patterns for imaging-based classification of neurodegenerative diseases. IEEE Trans Med Imaging 33(6):1262–1274

    Article  Google Scholar 

  31. Russakovsky O, Lin Y, Yu K, Li F (2012) Object-centric spatial pooling for image classification. In: ECCV, pp 1–15

  32. Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: Proceedings of IEEE International conference on computer vision, pp 1470–1477

  33. Song Z, Chen Q, Huang Z, Hua Y, Yan S (2011) Contextualizing object detection and classification. In: Proceedings of CVPR, pp 1585–1592

  34. Vipparthi S, Nagar S (2014) Color directional local quinary patterns for content based indexing and retrieval. Human-centric Computing and Information Sciences 4(1):1–6

    Article  Google Scholar 

  35. Wang C, Yan S, Zhang L, Zhang H (2009) Multi-label sparse coding for automatic image annotation. In: CVPR

  36. Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: Proceedings of CVPR

  37. Wu J, Rehg J (2009) Beyond the Euclidean distance: creating effective visual codebooks using the histogram intersection kernel. In: ICCV

  38. Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1794–1801

  39. Yuan X, Yan S (2010) Visual classification with multi-task joint sparse representation. In: CVPR

  40. Zhang C, Liu J, Liang C, Xue Z, Pang J, Huang Q (2014) Image classification by non-negative sparse coding, correlation constrained low-rank and sparse decomposition. Comput Vis Image Underst 123:14–22

    Article  Google Scholar 

  41. Zhang C, Liu J, Liang C, Huang Q, Tian Q (2013) Image classification using Harr-like transformation of local features with coding residuals. Signal Process 93(8):2111–2118

    Article  Google Scholar 

  42. Zhang C, Liu J, Tian Q, Liang C, Huang Q (2013) Beyond visual features: a weak semantic image representation using exemplar classifiers for classification. Neurocomputing 120:318–324

    Article  Google Scholar 

  43. Zhang S, Tian Q, Hua G, Huang Q, Gao W (2011) Generating descriptive visual words and visual phrases for large-scale image applications. IEEE Trans Image Process 20(9):2664–2677

    Article  MathSciNet  Google Scholar 

  44. Zhang C, Wang S, Huang Q, Liu J, Liang C, Tian Q (2013) Image classification using spatial pyramid robust sparse coding. Pattern Recogn Lett 34(9):1046–1052

    Article  Google Scholar 

  45. Zeiler M, Krishnan D, Taylor G, Fergus R (2010) Deconvolutional networks. In: Proceedings CVPR, pp 1–8

  46. Zhou N, Fan J (2012) Learning inter-related visual dictionary for object recognition. In: Proceedings of CVPR

Download references

Acknowledgment

This work is supported by ”TwelfthFive Year Plan” National Science and technology support program (No.2012BAD29B01-2), National Natural Science Foundation of China (NO. 61402023), the open funding project of State Key Laboratory of Virtual Reality Technology and Systems at Beihang University (No. BUAA-VR-14KF-04), Collaborative Innovation Centre for State-owned Assets Administration of Beijing Technology and Business University (No. GZ20131102).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ruijun Liu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, R., Chen, Y., Zhu, X. et al. Image classification using label constrained sparse coding. Multimed Tools Appl 75, 15619–15633 (2016). https://doi.org/10.1007/s11042-015-2626-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2626-1

Keywords

Navigation