Skip to main content
Top
Published in: Neural Processing Letters 2/2019

20-08-2018

Discriminative Feature Learning via Sparse Autoencoders with Label Consistency Constraints

Authors: Cong Hu, Xiao-Jun Wu, Zhen-Qiu Shu

Published in: Neural Processing Letters | Issue 2/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Autoencoders have been successfully used to build deep hierarchical models of data. However, a deep architecture usually needs further supervised fine-tuning to obtain better discriminative capacity. To improve the discriminative capacity of deep hierarchical features, this paper proposes a new deterministic autoencoder, trained by a label consistency constraints algorithm that injects discriminative information to the network. We introduce the center loss as label consistency constraints to learn the hidden features of data and add it to the Sparse AutoEncoder to form a new autoencoder, namely Label Consistency Constrained Sparse AutoEncoders (LCCSAE). Specifically, the center loss learns the center of each class, and simultaneously penalizes the distances between the features and their corresponding class centers. In the end, autoencoders are stacked to form a deep architecture of LCCSAE for image classification tasks. To validate the effectiveness of LCCSAE, we compare it with other autoencoders in terms of the deeply learned features and the subsequent classification tasks on MNIST and CIFAR-bw datasets. Experimental results demonstrate the superiority of LCCSAE over other methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Xu J, Xiang L, Liu Q, Gilmore H, Wu J, Tang J, Madabhushi A (2016) Stacked sparse autoencoder (SSAE) for nuclei detection on breast cancer histopathology images. IEEE Trans Med Imaging 35(1):119–130CrossRef Xu J, Xiang L, Liu Q, Gilmore H, Wu J, Tang J, Madabhushi A (2016) Stacked sparse autoencoder (SSAE) for nuclei detection on breast cancer histopathology images. IEEE Trans Med Imaging 35(1):119–130CrossRef
3.
go back to reference Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 1794–1801 Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 1794–1801
4.
go back to reference Chen Y, Luo W, Yang J (2015) Facial landmark detection via pose-induced auto-encoder networks. In: IEEE international conference on image processing (ICIP), pp 2115–2119 Chen Y, Luo W, Yang J (2015) Facial landmark detection via pose-induced auto-encoder networks. In: IEEE international conference on image processing (ICIP), pp 2115–2119
5.
go back to reference Luo W, Yang J, Xu W, Fu T (2015) Locality-constrained sparse auto-encoder for image classification. IEEE Signal Process Lett 22(8):1070–1073CrossRef Luo W, Yang J, Xu W, Fu T (2015) Locality-constrained sparse auto-encoder for image classification. IEEE Signal Process Lett 22(8):1070–1073CrossRef
6.
go back to reference Yu J, Rui Y, Tang YY et al (2014) High-order distance-based multiview stochastic learning in image classification. IEEE Trans Cybern 44(12):2431–2442CrossRef Yu J, Rui Y, Tang YY et al (2014) High-order distance-based multiview stochastic learning in image classification. IEEE Trans Cybern 44(12):2431–2442CrossRef
7.
go back to reference Erhan D, Szegedy C, Toshev A, Anguelov D (2014) Scalable object detection using deep neural networks. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2147–2154 Erhan D, Szegedy C, Toshev A, Anguelov D (2014) Scalable object detection using deep neural networks. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2147–2154
8.
go back to reference Li LJ, Su H, Fei-Fei L, Xing EP (2010) Object bank: a high-level image representation for scene classification and semantic feature sparsification. In: Advances in neural information processing systems, pp 1378–1386 Li LJ, Su H, Fei-Fei L, Xing EP (2010) Object bank: a high-level image representation for scene classification and semantic feature sparsification. In: Advances in neural information processing systems, pp 1378–1386
9.
go back to reference Zuo Z, Wang G, Shuai B, Zhao L, Yang Q (2015) Exemplar based deep discriminative and shareable feature learning for scene image classification. Pattern Recognit 48(10):3004–3015CrossRef Zuo Z, Wang G, Shuai B, Zhao L, Yang Q (2015) Exemplar based deep discriminative and shareable feature learning for scene image classification. Pattern Recognit 48(10):3004–3015CrossRef
10.
go back to reference Feng Z, Huber P, Kittler J, Christmas W, Wu X (2015) Random cascaded-regression copse for robust facial landmark detection. IEEE Signal Process Lett 1(22):76–80CrossRef Feng Z, Huber P, Kittler J, Christmas W, Wu X (2015) Random cascaded-regression copse for robust facial landmark detection. IEEE Signal Process Lett 1(22):76–80CrossRef
11.
go back to reference Shu Z, Zhao C, Huang P (2015) Local regularization concept factorization and its semi-supervised extension for image representation. Neurocomputing 158:1–12CrossRef Shu Z, Zhao C, Huang P (2015) Local regularization concept factorization and its semi-supervised extension for image representation. Neurocomputing 158:1–12CrossRef
12.
go back to reference Turk M, Pentland A (1991) Eigenfaces for recognition. J Cognit Neurosci 3:71–86CrossRef Turk M, Pentland A (1991) Eigenfaces for recognition. J Cognit Neurosci 3:71–86CrossRef
13.
go back to reference Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720CrossRef
14.
go back to reference He X, Ji M, Bao H (2009) Graph embedding with constraints. Int Joint Conf Artif Intell (IJCAI) 9:1065–1070 He X, Ji M, Bao H (2009) Graph embedding with constraints. Int Joint Conf Artif Intell (IJCAI) 9:1065–1070
15.
go back to reference Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227CrossRef
16.
go back to reference Song X, Feng Z, Hu G, Kittler J, Christmas W, Wu X (2016) Dictionary integration using 3D morphable face models for pose-invariant collaborative-representation-based classification. ArXiv preprint arXiv:1611.00284 Song X, Feng Z, Hu G, Kittler J, Christmas W, Wu X (2016) Dictionary integration using 3D morphable face models for pose-invariant collaborative-representation-based classification. ArXiv preprint arXiv:​1611.​00284
17.
go back to reference Song X, Feng Z, Hu G, Wu X (2017) Half-face dictionary integration for representation-based classification. IEEE Trans Cybern 47(1):142–152CrossRef Song X, Feng Z, Hu G, Wu X (2017) Half-face dictionary integration for representation-based classification. IEEE Trans Cybern 47(1):142–152CrossRef
18.
go back to reference Lu X, Wang Y, Yuan Y (2013) Graph-regularized low-rank representation for destriping of hyperspectral images. IEEE Trans Geosci Remote Sens 51(7):4009–4018CrossRef Lu X, Wang Y, Yuan Y (2013) Graph-regularized low-rank representation for destriping of hyperspectral images. IEEE Trans Geosci Remote Sens 51(7):4009–4018CrossRef
19.
go back to reference Feng Z, Kittler J, Christmas W, Wu X, Pfeiffer S (2012) Automatic face annotation by multilinear AAM with missing values. In: 2012 21st international conference on pattern recognition (ICPR), pp 2586–2589 Feng Z, Kittler J, Christmas W, Wu X, Pfeiffer S (2012) Automatic face annotation by multilinear AAM with missing values. In: 2012 21st international conference on pattern recognition (ICPR), pp 2586–2589
20.
21.
go back to reference Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788CrossRef Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788CrossRef
22.
go back to reference Shu Z, Wu X, Fan H, Huang P, Wu D, Hu C, Ye F (2017) Parameter-less auto-weighted multiple graph regularized nonnegative matrix factorization for data representation. Knowl Based Syst 131:1–194CrossRef Shu Z, Wu X, Fan H, Huang P, Wu D, Hu C, Ye F (2017) Parameter-less auto-weighted multiple graph regularized nonnegative matrix factorization for data representation. Knowl Based Syst 131:1–194CrossRef
23.
go back to reference Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507MathSciNetCrossRef Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507MathSciNetCrossRef
24.
go back to reference Erhan D, Bengio Y, Courville A, Manzagol PA, Vincent P, Bengio S (2010) Why does unsupervised pre-training help deep learning? J Mach Learn Res 11:625–660MathSciNetMATH Erhan D, Bengio Y, Courville A, Manzagol PA, Vincent P, Bengio S (2010) Why does unsupervised pre-training help deep learning? J Mach Learn Res 11:625–660MathSciNetMATH
25.
go back to reference Poultney C, Chopra S, Cun YL (2006) Efficient learning of sparse representations with an energy-based model. In: Advances in neural information processing systems, pp 1137–1144 Poultney C, Chopra S, Cun YL (2006) Efficient learning of sparse representations with an energy-based model. In: Advances in neural information processing systems, pp 1137–1144
26.
go back to reference Glorot X, Bordes A, Bengio Y (2011) Domain adaptation for large-scale sentiment classification: a deep learning approach. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 513–520 Glorot X, Bordes A, Bengio Y (2011) Domain adaptation for large-scale sentiment classification: a deep learning approach. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 513–520
27.
go back to reference Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on machine learning, pp 1096–1103 Vincent P, Larochelle H, Bengio Y, Manzagol PA (2008) Extracting and composing robust features with denoising autoencoders. In: Proceedings of the 25th international conference on machine learning, pp 1096–1103
28.
go back to reference Hosseini-Asl E, Zurada JM, Nasraoui O (2016) Deep learning of part-based representation of data using sparse autoencoders with nonnegativity constraints. IEEE Trans Neural Netw Learn Syst 27:2486–2498CrossRef Hosseini-Asl E, Zurada JM, Nasraoui O (2016) Deep learning of part-based representation of data using sparse autoencoders with nonnegativity constraints. IEEE Trans Neural Netw Learn Syst 27:2486–2498CrossRef
29.
go back to reference Hu C, Wu XJ (2016) Autoencoders with drop strategy. In: The 8th international conference on advances in brain inspired cognitive systems (BICS), pp 80–89 Hu C, Wu XJ (2016) Autoencoders with drop strategy. In: The 8th international conference on advances in brain inspired cognitive systems (BICS), pp 80–89
30.
go back to reference Bengio Y, Lamblin P, Popovici D, Larochelle H (2007) Greedy layer-wise training of deep networks. In: Advances in neural information processing systems, 19(153) Bengio Y, Lamblin P, Popovici D, Larochelle H (2007) Greedy layer-wise training of deep networks. In: Advances in neural information processing systems, 19(153)
31.
go back to reference Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR (2012) Improving neural networks by preventing co-adaptation of feature detectors. ArXiv preprint arXiv:1207.0580 Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR (2012) Improving neural networks by preventing co-adaptation of feature detectors. ArXiv preprint arXiv:​1207.​0580
32.
go back to reference Bourlard H, Kamp Y (1988) Auto-association by multilayer perceptrons and singular value decomposition. Biol Cybern 59(4–5):291–294MathSciNetCrossRef Bourlard H, Kamp Y (1988) Auto-association by multilayer perceptrons and singular value decomposition. Biol Cybern 59(4–5):291–294MathSciNetCrossRef
33.
go back to reference Riedmiller M (1994) Advanced supervised learning in multi-layer perceptrons-from backpropagation to adaptive learning algorithms. Comput Stand Interfaces 16(3):265–278CrossRef Riedmiller M (1994) Advanced supervised learning in multi-layer perceptrons-from backpropagation to adaptive learning algorithms. Comput Stand Interfaces 16(3):265–278CrossRef
34.
go back to reference Wen Y, Zhang K, Li Z, Qiao Y (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision, pp 499–515 Wen Y, Zhang K, Li Z, Qiao Y (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision, pp 499–515
35.
go back to reference LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324CrossRef
36.
go back to reference Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images Vol 1, No 4. Technical report, University of Toronto, p 7 Krizhevsky A, Hinton G (2009) Learning multiple layers of features from tiny images Vol 1, No 4. Technical report, University of Toronto, p 7
Metadata
Title
Discriminative Feature Learning via Sparse Autoencoders with Label Consistency Constraints
Authors
Cong Hu
Xiao-Jun Wu
Zhen-Qiu Shu
Publication date
20-08-2018
Publisher
Springer US
Published in
Neural Processing Letters / Issue 2/2019
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-018-9898-1

Other articles of this Issue 2/2019

Neural Processing Letters 2/2019 Go to the issue