Skip to main content
Top
Published in: Pattern Analysis and Applications 1/2011

01-02-2011 | Theoretical Advances

A new hybrid method for gene selection

Authors: Ruichu Cai, Zhifeng Hao, Xiaowei Yang, Han Huang

Published in: Pattern Analysis and Applications | Issue 1/2011

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Gene selection is a significant preprocessing of the discriminant analysis of microarray data. The classical gene selection methods can be classified into three categories: the filters, the wrappers and the embedded methods. In this paper, a novel hybrid gene selection method (HGSM) is proposed by exploring both the mutual information criterion (filters) and leave-one-out-error criterion (wrappers) under the framework of an improved ant algorithm. Extensive experiments are conducted on three benchmark datasets and the results confirm the effectiveness and efficiency of HGSM.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Golub TR et al (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439):531–537CrossRef Golub TR et al (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439):531–537CrossRef
2.
go back to reference Nutt CL et al (2003) Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Res 63(7):1602–1607 Nutt CL et al (2003) Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Res 63(7):1602–1607
3.
go back to reference Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1–2):273–324MATHCrossRef Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97(1–2):273–324MATHCrossRef
4.
go back to reference Cherkassky VS, Mulier F (1998) Learning from data: concepts, theory, and methods. Wiley, New York Cherkassky VS, Mulier F (1998) Learning from data: concepts, theory, and methods. Wiley, New York
5.
go back to reference Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182MATHCrossRef Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182MATHCrossRef
6.
go back to reference Chang CF, Wai KM, Patterton HG (2004) Calculating the statistical significance of physical clusters of co-regulated genes in the genome: the role of chromatin in domain-wide gene regulation. Nucleic Acids Res 32(5):1798–1807CrossRef Chang CF, Wai KM, Patterton HG (2004) Calculating the statistical significance of physical clusters of co-regulated genes in the genome: the role of chromatin in domain-wide gene regulation. Nucleic Acids Res 32(5):1798–1807CrossRef
7.
go back to reference Reiner A, Yekutieli D, Benjamini Y (2003) Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 19(3):368–375CrossRef Reiner A, Yekutieli D, Benjamini Y (2003) Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 19(3):368–375CrossRef
8.
go back to reference Tang EK, Suganthan PN, Yao X (2006) Gene selection algorithms for microarray data-based on least squares support vector machine. BMC Bioinforma 7:85CrossRef Tang EK, Suganthan PN, Yao X (2006) Gene selection algorithms for microarray data-based on least squares support vector machine. BMC Bioinforma 7:85CrossRef
9.
go back to reference Zhou X, Mao KZ (2005) LS Bound based gene selection for DNA microarray data. Bioinformatics 21(8):1559–1564CrossRef Zhou X, Mao KZ (2005) LS Bound based gene selection for DNA microarray data. Bioinformatics 21(8):1559–1564CrossRef
10.
go back to reference Chuang LY et al (2008) Improved binary PSO for feature selection using gene expression data. Elsevier, pp 29–37 Chuang LY et al (2008) Improved binary PSO for feature selection using gene expression data. Elsevier, pp 29–37
11.
go back to reference Ooi CH, Tan P (2003) Genetic algorithms applied to multi-class prediction for the analysis of gene expression data. Bioinformatics 19(1):37–44CrossRef Ooi CH, Tan P (2003) Genetic algorithms applied to multi-class prediction for the analysis of gene expression data. Bioinformatics 19(1):37–44CrossRef
12.
go back to reference Guyon I et al (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46(1–3):389–422MATHCrossRef Guyon I et al (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46(1–3):389–422MATHCrossRef
13.
go back to reference Furlanello C et al (2003) Entropy-based gene ranking without selection bias for the predictive classification of microarray data. BMC Bioinforma 4:54CrossRef Furlanello C et al (2003) Entropy-based gene ranking without selection bias for the predictive classification of microarray data. BMC Bioinforma 4:54CrossRef
14.
go back to reference Ding YY, Wilkins D (2006) Improving the performance of SVM-RFE to select genes in microarray data. BMC Bioinforma 7(Suppl 2):S12CrossRef Ding YY, Wilkins D (2006) Improving the performance of SVM-RFE to select genes in microarray data. BMC Bioinforma 7(Suppl 2):S12CrossRef
15.
go back to reference Robbins KR et al (2007) The ant colony algorithm for feature selection in high-dimension gene expression data for disease classification. Math Med Biol 24(4):413–426MATHCrossRef Robbins KR et al (2007) The ant colony algorithm for feature selection in high-dimension gene expression data for disease classification. Math Med Biol 24(4):413–426MATHCrossRef
16.
go back to reference Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley
17.
go back to reference Dorigo M, Gambardella LM (1997) Ant colony system: a cooperative learning approach to the traveling salesman problem. IEEE Trans Evol Comput 1(1):53–66CrossRef Dorigo M, Gambardella LM (1997) Ant colony system: a cooperative learning approach to the traveling salesman problem. IEEE Trans Evol Comput 1(1):53–66CrossRef
18.
go back to reference Allinen M et al (2004) Molecular characterization of the tumor microenvironment in breast cancer. Cancer Cell 6(1):17–32CrossRef Allinen M et al (2004) Molecular characterization of the tumor microenvironment in breast cancer. Cancer Cell 6(1):17–32CrossRef
19.
go back to reference Alon U et al (1999) Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci USA 96(12):6745–6750CrossRef Alon U et al (1999) Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci USA 96(12):6745–6750CrossRef
20.
go back to reference Dudoit S, Fridlyand J, Speed TP (2002) Comparison of discrimination methods for the classification of tumors using gene expression data. J Am Stat Assoc 97(457):77–87MATHCrossRefMathSciNet Dudoit S, Fridlyand J, Speed TP (2002) Comparison of discrimination methods for the classification of tumors using gene expression data. J Am Stat Assoc 97(457):77–87MATHCrossRefMathSciNet
21.
go back to reference Ambroise C, McLachlan GJ (2002) Selection bias in gene extraction on the basis of microarray gene-expression data. Proc Natl Acad Sci USA 99(10):6562–6566MATHCrossRef Ambroise C, McLachlan GJ (2002) Selection bias in gene extraction on the basis of microarray gene-expression data. Proc Natl Acad Sci USA 99(10):6562–6566MATHCrossRef
23.
go back to reference Kelly L, Clark J, Gilliland DG (2002) Comprehensive genotypic analysis of leukemia: clinical and therapeutic implications. Curr Opin Oncol 14(1):10–18CrossRef Kelly L, Clark J, Gilliland DG (2002) Comprehensive genotypic analysis of leukemia: clinical and therapeutic implications. Curr Opin Oncol 14(1):10–18CrossRef
24.
go back to reference LeBien TW, McCormack RT (1989) The common acute lymphoblastic leukemia antigen (CD10)—emancipation from a functional enigma. Blood 73(3):625–635 LeBien TW, McCormack RT (1989) The common acute lymphoblastic leukemia antigen (CD10)—emancipation from a functional enigma. Blood 73(3):625–635
25.
go back to reference Raaijmakers M (2007) ATP-binding-cassette transporters in hematopoietic stem cells and their utility as therapeutical targets in acute and chronic myeloid leukemia. Leukemia 21(10):2094–2102CrossRef Raaijmakers M (2007) ATP-binding-cassette transporters in hematopoietic stem cells and their utility as therapeutical targets in acute and chronic myeloid leukemia. Leukemia 21(10):2094–2102CrossRef
26.
go back to reference Wong ETL et al (1999) Changes in chromatin organization at the neutrophil elastase locus associated with myeloid cell differentiation. Blood 94(11):3730 Wong ETL et al (1999) Changes in chromatin organization at the neutrophil elastase locus associated with myeloid cell differentiation. Blood 94(11):3730
27.
go back to reference Secchiero P et al (2005) Potential pathogenetic implications of cyclooxygenase-2 overexpression in B chronic lymphoid leukemia cells. Am J Pathol 167(6):1599–1607 Secchiero P et al (2005) Potential pathogenetic implications of cyclooxygenase-2 overexpression in B chronic lymphoid leukemia cells. Am J Pathol 167(6):1599–1607
28.
go back to reference Debernardi S et al (2003) Genome-wide analysis of acute myeloid leukemia with normal karyotype reveals a unique pattern of homeobox gene expression distinct from those with translocation-mediated fusion events. Genes Chromosom Cancer 37(2):149–158CrossRef Debernardi S et al (2003) Genome-wide analysis of acute myeloid leukemia with normal karyotype reveals a unique pattern of homeobox gene expression distinct from those with translocation-mediated fusion events. Genes Chromosom Cancer 37(2):149–158CrossRef
Metadata
Title
A new hybrid method for gene selection
Authors
Ruichu Cai
Zhifeng Hao
Xiaowei Yang
Han Huang
Publication date
01-02-2011
Publisher
Springer-Verlag
Published in
Pattern Analysis and Applications / Issue 1/2011
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-010-0180-z

Other articles of this Issue 1/2011

Pattern Analysis and Applications 1/2011 Go to the issue

Premium Partner