nach oben

Neural Processing Letters

Erschienen in:

27.07.2018

Discriminative Autoencoder for Feature Extraction: Application to Character Recognition

verfasst von: Anupriya Gogna, Angshul Majumdar

Erschienen in: Neural Processing Letters | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Conventionally, autoencoders are unsupervised representation learning tools. In this work, we propose a novel discriminative autoencoder. Use of supervised discriminative learning ensures that the learned representation is robust to variations commonly encountered in image datasets. Using the basic discriminating autoencoder as a unit, we build a stacked architecture aimed at extracting relevant representation from the training data. The efficiency of our feature extraction algorithm ensures a high classification accuracy with even simple classification schemes like KNN (K-nearest neighbor). We demonstrate the superiority of our model for representation learning by conducting experiments on standard datasets for character/image recognition and subsequent comparison with existing supervised deep architectures like class sparse stacked autoencoder and discriminative deep belief network.

Vorheriger Artikel Contractive Slab and Spike Convolutional Deep Belief Network

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Qian Y, Ye M, Zhou J (2013) Hyperspectral image classification based on structured sparse logistic regression and three-dimensional wavelet texture features. IEEE Trans Geosci Remote Sens 51(4):2276–2291CrossRef

Vigdor B, Lerner B (2006) Accurate and fast off and online fuzzy ARTMAP-based image classification with application to genetic abnormality diagnosis. IEEE Trans Neural Netw 17(5):1288–1300CrossRef

Tao H, Hou C, Nie F, Jiao Y, Yi D (2016) Effective discriminative feature selection with nontrivial solution. IEEE Trans Neural Netw Learn Syst 27(4):796–808MathSciNetCrossRef

Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 1. IEEE, pp 886–893

Cheung W, Hamarneh G (2007) N-sift: N-dimensional scale invariant feature transform for matching medical images. In: 2007 4th IEEE international symposium on biomedical imaging: from nano to macro. IEEE, pp 720–723

Ahonen T, Matas J, He C, Pietikäinen M (2009) Rotation invariant image description with local binary pattern histogram fourier features. In: Scandinavian conference on image analysis. Springer, Berlin, pp 61–70

Gunturk BK, Batur AU, Altunbasak Y, Hayes MH, Mersereau RM (2003) Eigenface-domain super-resolution for face recognition. IEEE Trans Image Process 12(5):597–606CrossRef

Jing X-Y, Wong H-S, Zhang D (2006) Face recognition based on 2D Fisherface approach. Pattern Recogn 39(4):707–710CrossRefMATH

Zhang B, Fu M, Yan H (1998) Handwritten digit recognition by a mixture of local principal component analysis. Proc Neural Process Lett 8(3):241–252CrossRef

10.

Maria Joao, Amaro Joao, Falcao Gabriel, Alexandre Luís A (2016) Stacked autoencoders using low-power accelerated architectures for object recognition in autonomous systems. Neural Process Lett 43(2):445–458CrossRef

11.

Mohamed A-R, Dahl GE, Hinton G (2012) Acoustic modeling using deep belief networks. IEEE Trans Audio Speech Lang Process 20(1):14–22CrossRef

12.

Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127CrossRefMATH

13.

Hinton GE, Osindero S, Teh Y-W (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554MathSciNetCrossRefMATH

14.

Yu D, Deng L (2011) Deep learning and its applications to signal and information processing [exploratory dsp]. IEEE Signal Process Mag 28(1):145–154CrossRef

15.

Zhou S, Chen Q, Wang X (2013) Convolutional deep networks for visual data classification. Neural Process Lett 38(1):17–27CrossRef

16.

Hinton GE (2002) Training products of experts by minimizing contrastive divergence. Neural Comput 14(8):1771–1800CrossRefMATH

17.

Bengio Y, Lamblin P, Popovici D, Larochelle H (2007) Greedy layer-wise training of deep networks. Adv Neural Inf Process Syst 19:153

18.

Abbas HM (2004) Analysis and pruning of nonlinear auto-association networks. IEEE Proc Vis Image Signal Process 151(1):44–50CrossRef

19.

Bourlard H, Kamp Y (1988) Auto-association by multilayer perceptrons and singular value decomposition. Biol Cybern 59(4–5):291–294MathSciNetCrossRefMATH

20.

Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol P-A (2010) Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11:3371–3408MathSciNetMATH

21.

Olshausen BA (1996) Emergence of simple-cell receptive field properties by learning a sparse code for natural images. Nature 381(6583):607–609CrossRef

22.

Längkvist M, Loutfi A (2012) Learning representations with a dynamic objective sparse autoencoder. In: Neural information processing systems

23.

Lemme A, Reinhart RF, Steil JJ (2012) Online learning and generalization of parts-based image representations by non-negative sparse autoencoders. Neural Netw 33:194–203CrossRef

24.

Chen M, Weinberger KQ, Sha F, Bengio Y (2014) Marginalized denoising auto-encoders for nonlinear representations. In: ICML, pp 1476–1484

25.

Razakarivony S, Jurie F (2014) Discriminative autoencoders for small targets detection. In: IAPR international conference on pattern recognition, pp 3528–3533

26.

Wang J, Gao X (2015) Max–min distance nonnegative matrix factorization. Neural Netw 61:75–84CrossRefMATH

27.

Zhang Q, Li B (2010) Discriminative K-SVD for dictionary learning in face recognition. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 2691–2698

28.

Jiang Z, Lin Z, Davis LS (2013) Label consistent K-SVD: learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664CrossRef

29.

Larochelle H, Bengio Y (2008) Classification using discriminative restricted Boltzmann machines. In: Proceedings of the 25th international conference on machine learning. ACM, pp 536–543

30.

Goldstein T, Osher S (2009) The split Bregman method for L1-regularized problems. SIAM J Imaging Sci 2(2):323–343MathSciNetCrossRefMATH

31.

http://www.iro.umontreal.ca/~lisa/twiki/bin/view.cgi/Public/DeepVsShallowComparisonICML2007

32.

http://www.cad.zju.edu.cn/home/dengcai/Data/MLData.html

33.

http://www.isical.ac.in/~ujjwal/download/database.html

34.

Lawson CL, Hanson RJ (1995) Solving least squares problems, vol 15. SIAM, PhiladelphiaCrossRefMATH

35.

Ng A (2011) Sparse autoencoder. CS294A lecture notes 72:1–19

36.

Majumdar A, Vatsa M, Singh R (2017) Face recognition via class sparsity based supervised encoding. IEEE Trans Pattern Anal Mach Intell 39(6):1273–1280CrossRef

37.

Liu Y, Zhoub S, Chen Q (2011) Discriminative deep belief networks for visual data classification. Pattern Recogn 44(10–11):2287–2296CrossRefMATH

Titel: Discriminative Autoencoder for Feature Extraction: Application to Character Recognition
verfasst von: Anupriya Gogna
Angshul Majumdar
Publikationsdatum: 27.07.2018
Verlag: Springer US
Erschienen in: Neural Processing Letters / Ausgabe 3/2019
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI: https://doi.org/10.1007/s11063-018-9894-5

Neuer Inhalt

Bildnachweise

Smart-Manufacturing Dashboard Banner/© AdobeStock_583269095, VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Leads Kundenakquise/© Andrey Popov / stock.adobe.com, Schiffschraube/© Angelika Bentin | stock.adobe.com, Rudergelenkwelle/© Weicon GmbH & Co. KG, Digitalisierung im Marketing/© Fotolia/alphaspirit, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Sustainibility Finance/© Robert Kneschke / stock.adobe.com / Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 3/2019

Binary Filter for Fast Vessel Pattern Extraction

Alignment Based Kernel Selection for Multi-Label Learning

Biomedical Data Analysis Based on Multi-view Intact Space Learning with Geodesic Similarity Preserving

Weighted Pseudo Almost Periodic Solutions for Cellular Neural Networks with Multi-proportional Delays

An Improved Method for Semantic Image Inpainting with GANs: Progressive Inpainting

Structural Reweight Sparse Subspace Clustering

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.