Skip to main content
Erschienen in: Neural Processing Letters 2/2019

09.05.2018

Feature Extraction Based on Support Vector Data Description

verfasst von: Li Zhang, Xingning Lu

Erschienen in: Neural Processing Letters | Ausgabe 2/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Motivated by the improvement of performance and reduction of complexity, feature extraction is referred to one manner of dimensionality reduction. This paper presents a new feature extraction method based on support vector data description (FE-SVDD). First, the proposed method establishes hyper-sphere models for each category of the given data using support vector data description. Second, FE-SVDD calculates the distances between data points and the centers of the hyper-spheres. Finally, the ratios of the distances to the radii of the hyper-spheres are treated as new extracted features. Experimental results on different data sets indicate that FE-SVDD can speed up the procedure of feature extraction and extract the distinctive information of original data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Armstrong SA, Staunton JE, Silverman LB, Pieters R, den Boer ML, Minden MD, Sallan SE, Lander ES, Golub TR, Korsmeyer SJ (2002) MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nat Genet 30(1):41–47CrossRef Armstrong SA, Staunton JE, Silverman LB, Pieters R, den Boer ML, Minden MD, Sallan SE, Lander ES, Golub TR, Korsmeyer SJ (2002) MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nat Genet 30(1):41–47CrossRef
2.
Zurück zum Zitat Bhattacharjee A, Richards WG, Staunton J, Li C, Monti S, Vasa P, Ladd C, Beheshti J, Bueno R, Gillette M, Loda M, Weber G, Mark EJ, Lander ES, Wong W, Johnson BE, Golub TR, Sugarbaker DJ, Meyerson M (2001) Classification of human lung carcinomas by mrna expression profiling reveals distinct adenocarcinoma subclasses. Proc Nat Acad Sci 98(24):13,790–13,795CrossRef Bhattacharjee A, Richards WG, Staunton J, Li C, Monti S, Vasa P, Ladd C, Beheshti J, Bueno R, Gillette M, Loda M, Weber G, Mark EJ, Lander ES, Wong W, Johnson BE, Golub TR, Sugarbaker DJ, Meyerson M (2001) Classification of human lung carcinomas by mrna expression profiling reveals distinct adenocarcinoma subclasses. Proc Nat Acad Sci 98(24):13,790–13,795CrossRef
3.
Zurück zum Zitat Burges CJ (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2:121–167CrossRef Burges CJ (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2:121–167CrossRef
4.
Zurück zum Zitat Cao LJ, Chua KS, Chong WK, Lee HP, Gu QM (2003) A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine. Neurocomputing 55(1–2):321–336 Cao LJ, Chua KS, Chong WK, Lee HP, Gu QM (2003) A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine. Neurocomputing 55(1–2):321–336
5.
Zurück zum Zitat Chen HH, Tiho P, Yao X (2009) Predictive ensemble pruning by expectation propagation. IEEE Trans Knowl Data Eng 21(7):999–1013CrossRef Chen HH, Tiho P, Yao X (2009) Predictive ensemble pruning by expectation propagation. IEEE Trans Knowl Data Eng 21(7):999–1013CrossRef
6.
Zurück zum Zitat Daelemans W, Goethals B, Morik K (eds) (2008) Machine learning and knowledge discovery in databases, European conference, ECML/PKDD 2008, Antwerp, Belgium, Sept 15–19, 2008, Proceedings, Part II, Lecture Notes in Computer Science, vol 5212, Springer Daelemans W, Goethals B, Morik K (eds) (2008) Machine learning and knowledge discovery in databases, European conference, ECML/PKDD 2008, Antwerp, Belgium, Sept 15–19, 2008, Proceedings, Part II, Lecture Notes in Computer Science, vol 5212, Springer
7.
Zurück zum Zitat Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH
8.
Zurück zum Zitat Diaf A, Boufama B, Benlamri R (2013) Non-parametric fisher’s discriminant analysis with kernels for data classification. Pattern Recogn Lett 34(5):552–558CrossRef Diaf A, Boufama B, Benlamri R (2013) Non-parametric fisher’s discriminant analysis with kernels for data classification. Pattern Recogn Lett 34(5):552–558CrossRef
10.
Zurück zum Zitat Eklund PW, Hoang A (2006) A comparative study of public domain supervised classifier performance on the UCI database. Aust J Intell Inf Process Syst 9(1):1–39 Eklund PW, Hoang A (2006) A comparative study of public domain supervised classifier performance on the UCI database. Aust J Intell Inf Process Syst 9(1):1–39
11.
Zurück zum Zitat Elisseeff IGA (2006) Feature extraction. Springer, Berlin Elisseeff IGA (2006) Feature extraction. Springer, Berlin
12.
Zurück zum Zitat Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701CrossRefMATH Friedman M (1937) The use of ranks to avoid the assumption of normality implicit in the analysis of variance. J Am Stat Assoc 32(200):675–701CrossRefMATH
13.
Zurück zum Zitat Geller SC, Gregg JP, Hagerman P, Rocke DM (2003) Transformation and normalization of oligonucleotide microarray data. Bioinformatics 19(14):1817–1823CrossRef Geller SC, Gregg JP, Hagerman P, Rocke DM (2003) Transformation and normalization of oligonucleotide microarray data. Bioinformatics 19(14):1817–1823CrossRef
14.
Zurück zum Zitat Gheyas IA, Smith LS (2010) Feature subset selection in large dimensionality domains. Pattern Recogn 43(1):5–12CrossRefMATH Gheyas IA, Smith LS (2010) Feature subset selection in large dimensionality domains. Pattern Recogn 43(1):5–12CrossRefMATH
15.
Zurück zum Zitat Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182MATH Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182MATH
16.
Zurück zum Zitat Hagan MT, Demuth HB, Beale MH, De J (1996) Neural network design. PWS Publishing Company, Boston Hagan MT, Demuth HB, Beale MH, De J (1996) Neural network design. PWS Publishing Company, Boston
17.
Zurück zum Zitat Huesken D, Lange J, Mickanin C, Weiler J, Asselbergs F, Warner J, Meloon B, Engel S, Rosenberg A, Cohen D, Labow M, Reinhardt M, Natt F, Hall J (2005) Design of a genome-wide siRNA library using an artificial neural network. Nat Biotechnol 23(8):995–1001CrossRef Huesken D, Lange J, Mickanin C, Weiler J, Asselbergs F, Warner J, Meloon B, Engel S, Rosenberg A, Cohen D, Labow M, Reinhardt M, Natt F, Hall J (2005) Design of a genome-wide siRNA library using an artificial neural network. Nat Biotechnol 23(8):995–1001CrossRef
18.
Zurück zum Zitat Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1–2):273–324CrossRefMATH Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1–2):273–324CrossRefMATH
19.
Zurück zum Zitat Lashkia GV, Anthony L (2004) Relevant, irredundant feature selection and noisy example elimination. IEEE Trans Syst Man Cybern Part B 34(2):888–897CrossRef Lashkia GV, Anthony L (2004) Relevant, irredundant feature selection and noisy example elimination. IEEE Trans Syst Man Cybern Part B 34(2):888–897CrossRef
20.
Zurück zum Zitat Lee D, Lee J (2007) Domain described support vector classifier for multi-classification problems. Pattern Recogn 40(1):41–51CrossRefMATH Lee D, Lee J (2007) Domain described support vector classifier for multi-classification problems. Pattern Recogn 40(1):41–51CrossRefMATH
21.
Zurück zum Zitat Liu B, Xiao YS, Yu PS, Hao ZF, Cao LB (2014) An efficient orientation distance-based discriminative feature extraction method for multi-classification. Knowl Inf Syst 39(2):409–433CrossRef Liu B, Xiao YS, Yu PS, Hao ZF, Cao LB (2014) An efficient orientation distance-based discriminative feature extraction method for multi-classification. Knowl Inf Syst 39(2):409–433CrossRef
22.
Zurück zum Zitat Liu WF, Zhang HM, Tao DP, Wang YJ, Lu K (2016) Large-scale paralleled sparse principal component analysis. Multimed Tools Appl 75(3):1481–1493CrossRef Liu WF, Zhang HM, Tao DP, Wang YJ, Lu K (2016) Large-scale paralleled sparse principal component analysis. Multimed Tools Appl 75(3):1481–1493CrossRef
23.
Zurück zum Zitat Liu Y, Lita LV, Niculescu RS, Bai K, Mitra P, Giles CL (2008) Real-time data pre-processing technique for efficient feature extraction in large scale datasets, pp 981–990 Liu Y, Lita LV, Niculescu RS, Bai K, Mitra P, Giles CL (2008) Real-time data pre-processing technique for efficient feature extraction in large scale datasets, pp 981–990
24.
Zurück zum Zitat Liu Z, Hsiao W, Cantarel BL, Drábek EF, Fraser-Liggett C (2011) Sparse distance-based learning for simultaneous multiclass classification and feature selection of metagenomic data. Bioinformatics 27(23):3242–3249CrossRef Liu Z, Hsiao W, Cantarel BL, Drábek EF, Fraser-Liggett C (2011) Sparse distance-based learning for simultaneous multiclass classification and feature selection of metagenomic data. Bioinformatics 27(23):3242–3249CrossRef
25.
Zurück zum Zitat López-Rubio E, Muñoz-Pérez J, Gómez-Ruiz JA (2003) Principal components analysis competitive learning. In: Artificial neural nets problem solving methods, 7th international work-conference on artificial and natural neural networks, IWANN2003, Maó, Menorca, Spain, June 3–6, 2003 Proceedings, Part I, pp 318–325 López-Rubio E, Muñoz-Pérez J, Gómez-Ruiz JA (2003) Principal components analysis competitive learning. In: Artificial neural nets problem solving methods, 7th international work-conference on artificial and natural neural networks, IWANN2003, Maó, Menorca, Spain, June 3–6, 2003 Proceedings, Part I, pp 318–325
26.
Zurück zum Zitat Maldonado S, Weber R, Basak J (2011) Simultaneous feature selection and classification using kernel-penalized support vector machines. Inf Sci 181(1):115–128CrossRef Maldonado S, Weber R, Basak J (2011) Simultaneous feature selection and classification using kernel-penalized support vector machines. Inf Sci 181(1):115–128CrossRef
27.
Zurück zum Zitat Pauwels EJ, Ambekar O (2011) One class classification for anomaly detection: support vector data description revisited. In: Advances in data mining. Applications and theoretical aspects—11th industrial conference, ICDM 2011, New York, NY, USA, Aug 30–Sept 3, 2011. Proceedings, pp 25–39 Pauwels EJ, Ambekar O (2011) One class classification for anomaly detection: support vector data description revisited. In: Advances in data mining. Applications and theoretical aspects—11th industrial conference, ICDM 2011, New York, NY, USA, Aug 30–Sept 3, 2011. Proceedings, pp 25–39
28.
Zurück zum Zitat Shao L, Liu L, Li X (2014) Feature learning for image classification via multiobjective genetic programming. IEEE Trans Neural Netw Learn Syst 25(7):1359–1371CrossRef Shao L, Liu L, Li X (2014) Feature learning for image classification via multiobjective genetic programming. IEEE Trans Neural Netw Learn Syst 25(7):1359–1371CrossRef
29.
Zurück zum Zitat Tao DC, Tang XO, Li XL, Wu XD (2006) Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Trans Pattern Anal Mach Intell 28(7):1088–1099CrossRef Tao DC, Tang XO, Li XL, Wu XD (2006) Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval. IEEE Trans Pattern Anal Mach Intell 28(7):1088–1099CrossRef
30.
Zurück zum Zitat Tao DC, Li XL, Wu XD, Maybank SJ (2009) Geometric mean for subspace selection. IEEE Trans Pattern Anal Mach Intell 31(2):260–274CrossRef Tao DC, Li XL, Wu XD, Maybank SJ (2009) Geometric mean for subspace selection. IEEE Trans Pattern Anal Mach Intell 31(2):260–274CrossRef
31.
32.
Zurück zum Zitat Xue B, Zhang M, Browne WN (2013) Particle swarm optimization for feature selection in classification: a multi-objective approach. IEEE Trans Cybern 43(6):1656–1671CrossRef Xue B, Zhang M, Browne WN (2013) Particle swarm optimization for feature selection in classification: a multi-objective approach. IEEE Trans Cybern 43(6):1656–1671CrossRef
33.
Zurück zum Zitat Yang JB, Ong CJ (2012) An effective feature selection method via mutual information estimation. IEEE Trans Syst Man Cybern Part B 42(6):1550–1559CrossRef Yang JB, Ong CJ (2012) An effective feature selection method via mutual information estimation. IEEE Trans Syst Man Cybern Part B 42(6):1550–1559CrossRef
34.
Zurück zum Zitat Zhang L, Lu XN, Wang BJ, He SP (2015) Similarity learning based on multiple support vector data description. In: 2015 international joint conference on neural networks, IJCNN 2015, Killarney, Ireland, July 12–17, 2015, pp 1–7 Zhang L, Lu XN, Wang BJ, He SP (2015) Similarity learning based on multiple support vector data description. In: 2015 international joint conference on neural networks, IJCNN 2015, Killarney, Ireland, July 12–17, 2015, pp 1–7
35.
Zurück zum Zitat Zhu Z, Ong YS, Dash M (2007) Markov blanket-embedded genetic algorithm for gene selection. Pattern Recogn 40(11):3236–3248CrossRefMATH Zhu Z, Ong YS, Dash M (2007) Markov blanket-embedded genetic algorithm for gene selection. Pattern Recogn 40(11):3236–3248CrossRefMATH
Metadaten
Titel
Feature Extraction Based on Support Vector Data Description
verfasst von
Li Zhang
Xingning Lu
Publikationsdatum
09.05.2018
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 2/2019
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-018-9838-0

Weitere Artikel der Ausgabe 2/2019

Neural Processing Letters 2/2019 Zur Ausgabe

Neuer Inhalt