Skip to main content
Erschienen in: Neural Computing and Applications 3-4/2014

01.09.2014 | Review

ELM-based gene expression classification with misclassification cost

verfasst von: Hui-juan Lu, En-hui Zheng, Yi Lu, Xiao-ping Ma, Jin-yong Liu

Erschienen in: Neural Computing and Applications | Ausgabe 3-4/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Cost-sensitive learning is a crucial problem in machine learning research. Traditional classification problem assumes that the misclassification for each category has the same cost, and the target of learning algorithm is to minimize the expected error rate. In cost-sensitive learning, costs of misclassification for samples of different categories are not the same; the target of algorithm is to minimize the sum of misclassification cost. Cost-sensitive learning can meet the actual demand of real-life classification problems, such as medical diagnosis, financial projections, and so on. Due to fast learning speed and perfect performance, extreme learning machine (ELM) has become one of the best classification algorithms, while voting based on extreme learning machine (V-ELM) makes classification results more accurate and stable. However, V-ELM and some other versions of ELM are all based on the assumption that all misclassifications have same cost. Therefore, they cannot solve cost-sensitive problems well. To overcome the drawback of ELMs mentioned above, an algorithm called cost-sensitive ELM (CS-ELM) is proposed by introducing misclassification cost of each sample into V-ELM. Experimental results on gene expression data show that CS-ELM is effective in reducing misclassification cost.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Han J, Kamber M (2011) Data mining concepts and techniques. Morgan Kanfnann, San Francisco CA Han J, Kamber M (2011) Data mining concepts and techniques. Morgan Kanfnann, San Francisco CA
2.
Zurück zum Zitat Zheng EH, Li P, Song ZH (2006) SVM-based Cost Sensitive Mining. Information and Control 3:294299 Zheng EH, Li P, Song ZH (2006) SVM-based Cost Sensitive Mining. Information and Control 3:294299
3.
Zurück zum Zitat Chow CK (1970) On optimum recognition error and reject tradeoff. IEEE Trans Inf Theory 1:41–46 Chow CK (1970) On optimum recognition error and reject tradeoff. IEEE Trans Inf Theory 1:41–46
4.
Zurück zum Zitat Foggia P, Sansone C, Tortorella F et al (1999) Mulit-classification: reject criteria for the bayesian combiner. Pattern Recognit 32:1435–1447 Foggia P, Sansone C, Tortorella F et al (1999) Mulit-classification: reject criteria for the bayesian combiner. Pattern Recognit 32:1435–1447
5.
Zurück zum Zitat Stefano CD, Sansone C, Vento M (2008) To reject or not to reject: that is the question-answer in case of neural classifiers. IEEE Trans SMC 30(1):84–94 Stefano CD, Sansone C, Vento M (2008) To reject or not to reject: that is the question-answer in case of neural classifiers. IEEE Trans SMC 30(1):84–94
6.
Zurück zum Zitat Landgrebe T, Taxdmj CW, Paclik P et al (2006) The interaction between classification and reject performance for distance-based reject-option classifiers. Pattern Recognit Lett 8:908–917 Landgrebe T, Taxdmj CW, Paclik P et al (2006) The interaction between classification and reject performance for distance-based reject-option classifiers. Pattern Recognit Lett 8:908–917
7.
Zurück zum Zitat Zheng EH et al. (2009) SVM-BASED credit card fraud detection with reject cost and class-dependent error cost. Proceedings of the PAKDD’ 09 Workshop: data mining when classes are imbalanced and errors have cost. Rangsit Campus: Thammasat Printing House, 2009: 50–58 Zheng EH et al. (2009) SVM-BASED credit card fraud detection with reject cost and class-dependent error cost. Proceedings of the PAKDD’ 09 Workshop: data mining when classes are imbalanced and errors have cost. Rangsit Campus: Thammasat Printing House, 2009: 50–58
8.
Zurück zum Zitat Zheng EH (2010) Cost sensitive data mining based on support vector machines: theories and applications. Control and Decision 25(2):191–195 Zheng EH (2010) Cost sensitive data mining based on support vector machines: theories and applications. Control and Decision 25(2):191–195
9.
Zurück zum Zitat Kubat M, Holte R, Matwin S (1997) Learning when negative examples abound. Machine learning: ECML-97, Lecture Notes in Artificial Intelligence. Springer. 1224:146–153 Kubat M, Holte R, Matwin S (1997) Learning when negative examples abound. Machine learning: ECML-97, Lecture Notes in Artificial Intelligence. Springer. 1224:146–153
10.
Zurück zum Zitat Chan PK, Stolfo SJ (2001) Toward scalable learning with non-uniform class and cost distributions: a case study in credit card fraud detection. Proceedings of the 4th international conference on knowledge discovery and data mining. 164–168 Chan PK, Stolfo SJ (2001) Toward scalable learning with non-uniform class and cost distributions: a case study in credit card fraud detection. Proceedings of the 4th international conference on knowledge discovery and data mining. 164–168
11.
Zurück zum Zitat Weiss GM, Provost F (2003) Learning when training data are costly: the effect of class distribution on tree induction. J Artif Intell Res 19:315–354MATH Weiss GM, Provost F (2003) Learning when training data are costly: the effect of class distribution on tree induction. J Artif Intell Res 19:315–354MATH
12.
Zurück zum Zitat Japkowicz N (2000) The class imbalance problem: significance and strategies. Proceedings of the 2000 international conference on artificial intelligence: special track on inductive learning, Las Vegas Japkowicz N (2000) The class imbalance problem: significance and strategies. Proceedings of the 2000 international conference on artificial intelligence: special track on inductive learning, Las Vegas
13.
Zurück zum Zitat Elkan C (2001) The foundation of cost-sensitive learning. Proceedings of the 17th international joint conference on artificial intelligence, Washington. 239–246 Elkan C (2001) The foundation of cost-sensitive learning. Proceedings of the 17th international joint conference on artificial intelligence, Washington. 239–246
14.
Zurück zum Zitat Zhou ZH, Liu XY (2006) Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans Knowl Data Eng 18(1):63–77 Zhou ZH, Liu XY (2006) Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans Knowl Data Eng 18(1):63–77
15.
Zurück zum Zitat Liu XY, Wu J, Zhou ZH (2009) Exploratory under-sampling for class-imbalance learning. IEEE Trans Syst Man Cybern-Part B: Cyber 39(2):539–550 Liu XY, Wu J, Zhou ZH (2009) Exploratory under-sampling for class-imbalance learning. IEEE Trans Syst Man Cybern-Part B: Cyber 39(2):539–550
16.
Zurück zum Zitat Ling C, Li C (1998) Data mining for direct marketing problems and solutions. Proceedings of the 4th international conference on knowledge discovery and data mining, New York. 73–79 Ling C, Li C (1998) Data mining for direct marketing problems and solutions. Proceedings of the 4th international conference on knowledge discovery and data mining, New York. 73–79
17.
Zurück zum Zitat Drummond C, Holte R (1998) Exploiting the cost in sensitivity of decision tree splitting criteria. Proceedings of the 17th international conference on machine learning. Stanford University. 73–79 Drummond C, Holte R (1998) Exploiting the cost in sensitivity of decision tree splitting criteria. Proceedings of the 17th international conference on machine learning. Stanford University. 73–79
18.
Zurück zum Zitat Fan W, Stolfo S, Zhang J, Chan P (1999) Adacost: misclassification cost-sensitive boosting. Proceedings of the 16th international conference on machine learning. Bled, Slovenia. 97–105 Fan W, Stolfo S, Zhang J, Chan P (1999) Adacost: misclassification cost-sensitive boosting. Proceedings of the 16th international conference on machine learning. Bled, Slovenia. 97–105
19.
Zurück zum Zitat Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139MATHMathSciNet Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55(1):119–139MATHMathSciNet
20.
Zurück zum Zitat Xiao JH, Pan KQ, Wu JP, Yang SZ (2001) A study on SVM for fault diagnosis. J Vib Measurement Diagn 21(4):258–262 Xiao JH, Pan KQ, Wu JP, Yang SZ (2001) A study on SVM for fault diagnosis. J Vib Measurement Diagn 21(4):258–262
21.
Zurück zum Zitat Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501 Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
22.
Zurück zum Zitat Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. Proceedings of international joint conference on neural networks. 25–29 Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. Proceedings of international joint conference on neural networks. 25–29
23.
Zurück zum Zitat Cao JW, Lin ZP, Huang GB, Liu N (2011) Voting based extreme learning machine. Inf Sci 185(1):66–77MathSciNet Cao JW, Lin ZP, Huang GB, Liu N (2011) Voting based extreme learning machine. Inf Sci 185(1):66–77MathSciNet
Metadaten
Titel
ELM-based gene expression classification with misclassification cost
verfasst von
Hui-juan Lu
En-hui Zheng
Yi Lu
Xiao-ping Ma
Jin-yong Liu
Publikationsdatum
01.09.2014
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 3-4/2014
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-013-1512-x

Weitere Artikel der Ausgabe 3-4/2014

Neural Computing and Applications 3-4/2014 Zur Ausgabe