nach oben

Erschienen in:

2017 | OriginalPaper | Buchkapitel

3. Improved Soft Assignment Coding for Image Classification

verfasst von : Qingfeng Liu, Chengjun Liu

Erschienen in: Recent Advances in Intelligent Image Search and Video Retrieval

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Feature coding plays an important role in image classification, and the soft-assignment coding (SAC) method is popular in many practical applications due to its conceptual simplicity and computational efficiency. The SAC method, however, fails to achieve the optimal image classification performance when compared with the recently developed coding methods, such as the sparse coding and the locality-constrained linear coding methods. This chapter first analyzes the SAC method from the perspective of kernel density estimation, and then presents an improved soft-assignment coding (ISAC) method that enhances the image classification performance of the SAC method and keeps its simplicity and efficiency. Specifically, the ISAC method introduces two enhancements, namely, the thresholding normalized visual word plausibility (TNVWP) and the power transformation method. These improvements are further shown to establish the connection between the proposed ISAC method and the Vector of Locally Aggregated Descriptors (VLAD) coding method. Experiments on four representative datasets (the UIUC sports event dataset, the scene 15 dataset, the Caltech 101 dataset, and the Caltech 256 dataset) show that the proposed ISAC method achieves competitive results to and even better results than some popular image classification methods without sacrificing much computational efficiency.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Learning and Recognition Methods for Image Search and Video Retrieval

Nächstes Kapitel Inheritable Color Space (InCS) and Generalized InCS Framework with Applications to Kinship Verification

Huang, Y., Wu, Z., Wang, L., Tan, T.: Feature coding in image classification: A comprehensive study. IEEE Trans. Pattern Anal. Mach. Intell. pp. 493–506 (2014)

Boureau, Y., Bach, F., LeCun, Y., Ponce, J.: Learning mid-level features for recognition. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR’10) (2010)

Boureau, Y., Ponce, J., LeCun, Y.: A theoretical analysis of feature pooling in vision algorithms. In: Proceedings of the International Conference on Machine learning (ICML’ 10) (2010)

Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2169–2178 (2006)

van Gemert, J.C., Veenman, C.J., Smeulders, A.W.M., Geusebroek, J.M.: Visual word ambiguity. IEEE Trans. Pattern Anal. Mach. Intell. 32(7), 1271–1283 (2010)CrossRef

Gemert, J.C., Geusebroek, J.M., Veenman, C.J., Smeulders, A.W.: Kernel codebooks for scene categorization. In: European Conference on Computer Vision, pp. 696–709 (2008)

Liu, L., Wang, L., Liu, X.: In defense of soft-assignment coding. In: Proceedings of the 2011 International Conference on Computer Vision, pp. 2486–2493 (2011)

Yang, J., Yu, K., Gong, Y., Huang, T.S.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1794–1801 (2009)

Wang, J., Yang, J., Yu, K., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3360–3367 (2010)

10.

Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples. In: IEEE Proceedings of the Workshop on Generative-Model Based Vision, CVPR (2004)

11.

Griffin, G., Holub, A., Perona, P.: Caltech-256 object category dataset. Technical report 7694. California Institute of Technology (2007). http://authors.library.caltech.edu/7694

12.

Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3304–3311 (2010)

13.

Jegou, H., Perronnin, F., Douze, M., Sanchez, J., Perez, P., Schmid, C.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2012)CrossRef

14.

Snchez, J., Perronnin, F., Mensink, T., Verbeek, J.J.: Image classification with the fisher vector: Theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)MathSciNetCrossRefMATH

15.

Arandjelović, R., Zisserman, A.: All about VLAD. In: IEEE Conference on Computer Vision and Pattern Recognition (2013)

16.

Bosch, A., Zisserman, A., Munoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Trans. Pattern Anal. Mach. Intell. 30(4) (2008)

17.

Perina, A., Cristani, M., Castellani, U., Murino, V., Jojic, N.: Free energy score spaces: Using generative information in discriminative classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 34(7), 1249–1262 (2012)CrossRef

18.

Jebara, T., Kondor, R., Howard, A.: Probability product kernels. J. Mach. Learn. Res. 5, 819–844 (2004)MathSciNetMATH

19.

jia Li, L., fei Li, F.: What, where and who? Classifying event by scene and object recognition. In: IEEE International Conference on Computer Vision (2007)

20.

Gao, S., Tsang, I.W.H., Chia, L.T.: Laplacian sparse coding, hypergraph laplacian sparse coding, and applications. IEEE Trans. Pattern Anal. Mach. Intell. 35(1), 92–104 (2013)CrossRef

21.

Bo, L., Ren, X., Fox, D.: Hierarchical matching pursuit for image classification: Architecture and fast algorithms. In: Advances in Neural Information Processing Systems (2011)

22.

Zhou, X., Yu, K., Zhang, T., Huang, T.S.: Image classification using super-vector coding of local image descriptors. In: Proceedings of the 11th European Conference on Computer Vision: Part V, pp. 141–154 (2010)

23.

Wand, M.P., Marron, J.S., Ruppert, D.: Transformations in density estimation. J. Am. Stat. Assoc. 86(414), 343–353 (1991)MathSciNetCrossRefMATH

24.

Simonyan, K., Parkhi, O.M., Vedaldi, A., Zisserman, A.: Fisher vector faces in the wild. In: British Machine Vision Conference (BMVC) (2013)

25.

Jaakkola, T., Haussler, D.: Exploiting generative models in discriminative classifiers. In: Advances in Neural Information Processing Systems, pp. 487–493 (1998)

26.

Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: Liblinear: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)MATH

27.

Li, L.J., Su, H., Xing, E.P., Li, F.F.: Object bank: A high-level image representation for scene classification and semantic feature sparsification. In: Advances in Neural Information Processing Systems, pp. 1378–1386 (2010)

28.

Niu, Z., Hua, G., Gao, X., Tian, Q.: Context aware topic model for scene recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2743–2750 (2012)

29.

Zhang, H., Berg, A.C., Maire, M., Malik, J.: Svm-knn: Discriminative nearest neighbor classification for visual category recognition. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2126–2136 (2006)

30.

Jain, P., Kulis, B., Grauman, K.: Fast image search for learned metrics. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)

31.

Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)

Titel: Improved Soft Assignment Coding for Image Classification
verfasst von: Qingfeng Liu
Chengjun Liu
Verlag: Springer International Publishing
Buch: Recent Advances in Intelligent Image Search and Video Retrieval
Print ISBN: 978-3-319-52080-3

Electronic ISBN: 978-3-319-52081-0

Copyright-Jahr: 2017
DOI: https://doi.org/10.1007/978-3-319-52081-0_3

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner