Skip to main content
Top
Published in: Neural Processing Letters 2/2017

23-08-2016

Multi-view Discriminant Dictionary Learning via Learning View-specific and Shared Structured Dictionaries for Image Classification

Authors: Fei Wu, Xiao-Yuan Jing, Dong Yue

Published in: Neural Processing Letters | Issue 2/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recently, multi-view dictionary learning technique has attracted lots of research interest. Although some multi-view dictionary learning methods have been addressed, there exists much room for improvement. How to explore and utilize both the diversity and the useful correlation information of different views with dictionaries has not been well studied. In this paper, we propose a novel multi-view dictionary learning approach named multi-view discriminant dictionary learning via learning view-specific and shared structured dictionaries (MDVSD), which aims to learn a structured dictionary shared by all views and multiple view-specific structured dictionaries with each corresponding to a specific view. The shared dictionary is combined with each view-specific dictionary to represent data of the specific view. MDVSD makes the view-specific dictionaries corresponding to different views uncorrelated for effectively exploring the diversity of different views. Furthermore, we introduce structural uncorrelation into shared dictionary learning procedure, such that the useful correlation information of different views can be effectively exploited. Dictionary-atoms in shared and view-specific dictionaries have correspondence to class labels so that the learned dictionaries have favorable discriminant ability and the obtained reconstruction error is discriminative. Three widely used datasets are employed as test data. Experimental results demonstrate the effectiveness of the proposed approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Memisevic R (2012) On multi-view feature learning. In: International conference on machine learning (ICML), pp 161–168 Memisevic R (2012) On multi-view feature learning. In: International conference on machine learning (ICML), pp 161–168
2.
go back to reference Kumar A, Daumé H (2011) A co-training approach for multi-view spectral clustering. In: International conference on machine learning (ICML), pp 393–400 Kumar A, Daumé H (2011) A co-training approach for multi-view spectral clustering. In: International conference on machine learning (ICML), pp 393–400
3.
go back to reference Kumar A, Rai P, Daumé H (2011) Co-regularized multi-view spectral clustering. In: Advances in neural information processing systems (NIPS), pp 1413–1421 Kumar A, Rai P, Daumé H (2011) Co-regularized multi-view spectral clustering. In: Advances in neural information processing systems (NIPS), pp 1413–1421
4.
5.
go back to reference Kloft M, Brefeld U, Sonnenburg S, Zien A (2011) Lp-norm multiple kernel learning. J Mach Learn Res 12:953–997MathSciNetMATH Kloft M, Brefeld U, Sonnenburg S, Zien A (2011) Lp-norm multiple kernel learning. J Mach Learn Res 12:953–997MathSciNetMATH
6.
go back to reference Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038CrossRef Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038CrossRef
8.
go back to reference Gao L, Qi L, Chen E, Guan L (2012) Discriminative multiple canonical correlation analysis for multi-feature information fusion. In: IEEE international symposium on multimedia, pp 36–43 Gao L, Qi L, Chen E, Guan L (2012) Discriminative multiple canonical correlation analysis for multi-feature information fusion. In: IEEE international symposium on multimedia, pp 36–43
9.
go back to reference Shen X, Sun Q (2015) Orthogonal multiset canonical correlation analysis based on fractional-order and its application in multiple feature extraction and recognition. Neural Process Lett 42(2):301–316CrossRef Shen X, Sun Q (2015) Orthogonal multiset canonical correlation analysis based on fractional-order and its application in multiple feature extraction and recognition. Neural Process Lett 42(2):301–316CrossRef
10.
go back to reference Yuan YH, Sun QS, Ge HW (2014) Fractional-order embedding canonical correlation analysis and its applications to multi-view dimensionality reduction and recognition. Pattern Recogn 47(3):1411–1424CrossRefMATH Yuan YH, Sun QS, Ge HW (2014) Fractional-order embedding canonical correlation analysis and its applications to multi-view dimensionality reduction and recognition. Pattern Recogn 47(3):1411–1424CrossRefMATH
11.
go back to reference Li YO, Adali T, Wang W, Calhoun VD (2009) Joint blind source separation by multiset canonical correlation analysis. IEEE Trans Signal Process 57(10):3918–3929MathSciNetCrossRef Li YO, Adali T, Wang W, Calhoun VD (2009) Joint blind source separation by multiset canonical correlation analysis. IEEE Trans Signal Process 57(10):3918–3929MathSciNetCrossRef
12.
go back to reference Jing X, Hu R, Zhu Y, Wu S, Liang C, Yang J (2014). Intra-view and inter-view supervised correlation analysis for multi-view feature learning. In: AAAI conference on artificial intelligence (AAAI), pp 1882–1889 Jing X, Hu R, Zhu Y, Wu S, Liang C, Yang J (2014). Intra-view and inter-view supervised correlation analysis for multi-view feature learning. In: AAAI conference on artificial intelligence (AAAI), pp 1882–1889
13.
go back to reference Sharma A, Kumar A, Daume H, Jacobs DW (2012) Generalized multiview analysis: a discriminative latent space. In IEEE conference on computer vision and pattern recognition (CVPR), pp 2160–2167 Sharma A, Kumar A, Daume H, Jacobs DW (2012) Generalized multiview analysis: a discriminative latent space. In IEEE conference on computer vision and pattern recognition (CVPR), pp 2160–2167
14.
go back to reference Diethe T, Hardoon DR, Shawe-Taylor J (2008) Multiview fisher discriminant analysis. In: NIPS workshop on learning from multiple sources Diethe T, Hardoon DR, Shawe-Taylor J (2008) Multiview fisher discriminant analysis. In: NIPS workshop on learning from multiple sources
15.
go back to reference Kan M, Shan S, Zhang H, Lao S, Chen X (2012) Multi-view discriminant analysis. In European conference on computer vision (ECCV), pp 808–821 Kan M, Shan S, Zhang H, Lao S, Chen X (2012) Multi-view discriminant analysis. In European conference on computer vision (ECCV), pp 808–821
16.
go back to reference Sun S, Xie X, Yang M (2015) Multiview uncorrelated discriminant analysis. IEEE Trans Cybern (in press) Sun S, Xie X, Yang M (2015) Multiview uncorrelated discriminant analysis. IEEE Trans Cybern (in press)
17.
go back to reference Shekhar S, Patel VM, Nasrabadi NM, Chellappa R (2014) Joint sparse representation for robust multimodal biometrics recognition. IEEE Trans Pattern Anal Mach Intell 36(1):113–126CrossRef Shekhar S, Patel VM, Nasrabadi NM, Chellappa R (2014) Joint sparse representation for robust multimodal biometrics recognition. IEEE Trans Pattern Anal Mach Intell 36(1):113–126CrossRef
18.
go back to reference Jia Y, Salzmann M, Darrell T (2010) Factorized latent spaces with structured sparsity. In: Advances in neural information processing systems (NIPS), pp 982–990 Jia Y, Salzmann M, Darrell T (2010) Factorized latent spaces with structured sparsity. In: Advances in neural information processing systems (NIPS), pp 982–990
19.
go back to reference Zheng S, Xie B, Huang K, Tao D (2011) Multi-view pedestrian recognition using shared dictionary learning with group sparsity. In: International conference on neural information processing (ICONIP), pp 629–638 Zheng S, Xie B, Huang K, Tao D (2011) Multi-view pedestrian recognition using shared dictionary learning with group sparsity. In: International conference on neural information processing (ICONIP), pp 629–638
20.
go back to reference Zheng J, Jiang Z, Phillips PJ, Chellappa R (2012) Cross-view action recognition via a transferable dictionary pair. In: British machine vision conference (BMVC) Zheng J, Jiang Z, Phillips PJ, Chellappa R (2012) Cross-view action recognition via a transferable dictionary pair. In: British machine vision conference (BMVC)
21.
go back to reference Gao Z, Zhang H, Xu GP, Xue YB, Hauptmann AG (2014) Multi-view discriminative and structured dictionary learning with group sparsity for human action recognition. Signal Process 112:83–97CrossRef Gao Z, Zhang H, Xu GP, Xue YB, Hauptmann AG (2014) Multi-view discriminative and structured dictionary learning with group sparsity for human action recognition. Signal Process 112:83–97CrossRef
22.
go back to reference Zhuang Y, Wang Y, Wu F, Zhang Y, Lu W (2013) Supervised coupled dictionary learning with group structures for multi-modal retrieval. In AAAI conference on artificial intelligence (AAAI), pp 1070–1076 Zhuang Y, Wang Y, Wu F, Zhang Y, Lu W (2013) Supervised coupled dictionary learning with group structures for multi-modal retrieval. In AAAI conference on artificial intelligence (AAAI), pp 1070–1076
23.
go back to reference Patel VM, Gopalan R, Li R, Chellappa R (2015) Visual domain adaptation: a survey of recent advances. IEEE Signal Process Mag 32(3):53–69CrossRef Patel VM, Gopalan R, Li R, Chellappa R (2015) Visual domain adaptation: a survey of recent advances. IEEE Signal Process Mag 32(3):53–69CrossRef
24.
go back to reference Shekhar S, Patel VM, Nguyen HV, Chellappa R (2013) Generalized domain-adaptive dictionaries. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 361–368 Shekhar S, Patel VM, Nguyen HV, Chellappa R (2013) Generalized domain-adaptive dictionaries. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 361–368
25.
go back to reference Zhang H, Nasrabadi NM, Zhang Y, Huang TS (2012) Joint dynamic sparse representation for multi-view face recognition. Pattern Recogn 45(4):1290–1298CrossRef Zhang H, Nasrabadi NM, Zhang Y, Huang TS (2012) Joint dynamic sparse representation for multi-view face recognition. Pattern Recogn 45(4):1290–1298CrossRef
26.
go back to reference Zheng J, Jiang Z (2013) Learning view-invariant sparse representations for cross-view action recognition. In: IEEE conference on computer vision (ICCV), pp 3176–3183 Zheng J, Jiang Z (2013) Learning view-invariant sparse representations for cross-view action recognition. In: IEEE conference on computer vision (ICCV), pp 3176–3183
27.
go back to reference Shi Y, Gao Y, Yang Y, Zhang Y, Wang D (2013) Multi-modal sparse representation-based classification for lung needle biopsy images. IEEE Trans Biomed Eng 60(10):2675–2685CrossRef Shi Y, Gao Y, Yang Y, Zhang Y, Wang D (2013) Multi-modal sparse representation-based classification for lung needle biopsy images. IEEE Trans Biomed Eng 60(10):2675–2685CrossRef
28.
go back to reference Jing X, Hu R, Wu F, Chen X, Liu Q, Yao Y (2014) Uncorrelated multi-view discrimination dictionary learning for recognition. In: AAAI conference on artificial intelligence (AAAI), pp 2787–2795 Jing X, Hu R, Wu F, Chen X, Liu Q, Yao Y (2014) Uncorrelated multi-view discrimination dictionary learning for recognition. In: AAAI conference on artificial intelligence (AAAI), pp 2787–2795
29.
go back to reference Gao S, Tsang IW, Ma Y (2014) Learning category-specific dictionary and shared dictionary for fine-grained image categorization. IEEE Trans Image Process 23(2):623–634MathSciNetCrossRef Gao S, Tsang IW, Ma Y (2014) Learning category-specific dictionary and shared dictionary for fine-grained image categorization. IEEE Trans Image Process 23(2):623–634MathSciNetCrossRef
30.
go back to reference Wang D, Kong S (2014) A classification-oriented dictionary learning model: explicitly learning the particularity and commonality across categories. Pattern Recogn 47(2):885–898CrossRefMATH Wang D, Kong S (2014) A classification-oriented dictionary learning model: explicitly learning the particularity and commonality across categories. Pattern Recogn 47(2):885–898CrossRefMATH
31.
go back to reference Cai D, He X, Han J, Zhang HJ (2006) Orthogonal laplacianfaces for face recognition. IEEE Trans Image Process 15(11):3608–3614CrossRef Cai D, He X, Han J, Zhang HJ (2006) Orthogonal laplacianfaces for face recognition. IEEE Trans Image Process 15(11):3608–3614CrossRef
32.
go back to reference Murase H, Nayar SK (1995) Visual learning and recognition of 3-D objects from appearance. Int J Comput Vis 14(1):5–24CrossRef Murase H, Nayar SK (1995) Visual learning and recognition of 3-D objects from appearance. Int J Comput Vis 14(1):5–24CrossRef
33.
go back to reference LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
34.
go back to reference Chen CF, Wei CP, Wang YC (2012) Low-rank matrix recovery with structural incoherence for robust face recognition. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2618–2625 Chen CF, Wei CP, Wang YC (2012) Low-rank matrix recovery with structural incoherence for robust face recognition. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2618–2625
35.
go back to reference Rosasco L, Verri A, Santoro M, Mosci S, Villa S (2009) Iterative projection methods for structured sparsity regularization. MIT Technical Reports, MIT-CSAIL-TR-2009-050, CBCL-282, Massachusetts Institute of Technology Rosasco L, Verri A, Santoro M, Mosci S, Villa S (2009) Iterative projection methods for structured sparsity regularization. MIT Technical Reports, MIT-CSAIL-TR-2009-050, CBCL-282, Massachusetts Institute of Technology
36.
go back to reference Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef
37.
go back to reference Turk M, Pentland A (1991) Eigenfaces for recognition. J Cognit Neurosci 3(1):71–86CrossRef Turk M, Pentland A (1991) Eigenfaces for recognition. J Cognit Neurosci 3(1):71–86CrossRef
38.
go back to reference Grigorescu SE, Petkov N, Kruizinga P (2002) Comparison of texture features based on Gabor filters. IEEE Trans Image Process 11(10):1160–1167MathSciNetCrossRef Grigorescu SE, Petkov N, Kruizinga P (2002) Comparison of texture features based on Gabor filters. IEEE Trans Image Process 11(10):1160–1167MathSciNetCrossRef
39.
go back to reference Fukunaga K, Koontz WL (1970) Application of the Karhunen–Loeve expansion to feature selection and ordering. IEEE Trans Comput 19(4):311–318CrossRefMATH Fukunaga K, Koontz WL (1970) Application of the Karhunen–Loeve expansion to feature selection and ordering. IEEE Trans Comput 19(4):311–318CrossRefMATH
40.
go back to reference Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041CrossRefMATH Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041CrossRefMATH
41.
go back to reference Draper BA, Yambor WS, Beveridge JR (2002) Analyzing PCA-based face recognition algorithms: eigenvector selection and distance measures. In: IEEE workshop empirical evaluation methods in computer vision, pp 1–15 Draper BA, Yambor WS, Beveridge JR (2002) Analyzing PCA-based face recognition algorithms: eigenvector selection and distance measures. In: IEEE workshop empirical evaluation methods in computer vision, pp 1–15
Metadata
Title
Multi-view Discriminant Dictionary Learning via Learning View-specific and Shared Structured Dictionaries for Image Classification
Authors
Fei Wu
Xiao-Yuan Jing
Dong Yue
Publication date
23-08-2016
Publisher
Springer US
Published in
Neural Processing Letters / Issue 2/2017
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-016-9545-7

Other articles of this Issue 2/2017

Neural Processing Letters 2/2017 Go to the issue