Skip to main content
Erschienen in: Soft Computing 23/2019

18.02.2019 | Methodologies and Application

A hybrid approach using rough set theory and hypergraph for feature selection on high-dimensional medical datasets

verfasst von: M. R. Gauthama Raman, Somu Nivethitha, Krithivasan Kannan, V. S. Shankar Sriram

Erschienen in: Soft Computing | Ausgabe 23/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

‘Curse of Dimensionality’—massive generation of high-dimensional medical datasets from various biomedical applications hardens the data analytic process for precise medical diagnosis. The design of an efficient feature selection technique for finding the optimal feature subset can be devised as a prominent solution to the above-said challenge. Further, it also improves the accuracy and minimizes the computational complexity of the learning model. The state-of-the-art feature selection techniques based on heuristic and statistical functions suffer from significant challenges in terms of classification accuracy, time complexity, etc. Hence, this paper presents Rough Set Theory and Hypergraph (RSHGT)-based feature selection technique to identify the optimal feature subset for accurate medical diagnosis. Experimental validations using six medical datasets from the Kent Ridge Biomedical dataset repository prove the efficiency of RSHGT in terms of reduct size, accuracy, precision, recall, and time complexity.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abraham A, Falc R, Bello R (2009) Rough set theory: a true landmark in data analysis. Springer, BerlinCrossRef Abraham A, Falc R, Bello R (2009) Rough set theory: a true landmark in data analysis. Springer, BerlinCrossRef
Zurück zum Zitat Alba E, Garcia-Nieto J, Jourdan L, Talbi E-G (2007) Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: IEEE congress on evolutionary computation. IEEE, pp 284–290 Alba E, Garcia-Nieto J, Jourdan L, Talbi E-G (2007) Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: IEEE congress on evolutionary computation. IEEE, pp 284–290
Zurück zum Zitat Berge C (1973) Graphs and hypergraphs. North-Holland Publishing Co., AmsterdamMATH Berge C (1973) Graphs and hypergraphs. North-Holland Publishing Co., AmsterdamMATH
Zurück zum Zitat Dharmarajan R, Kannan K (2013) On minimal transversals in simple hypergraphs. Int J Comput Appl Math 7:117–123 Dharmarajan R, Kannan K (2013) On minimal transversals in simple hypergraphs. Int J Comput Appl Math 7:117–123
Zurück zum Zitat Eiter T, Gottlob G (1995) Identifying the minimal transversals of a hypergraph and related problems. SIAM J Comput 24:1278–1304MathSciNetCrossRef Eiter T, Gottlob G (1995) Identifying the minimal transversals of a hypergraph and related problems. SIAM J Comput 24:1278–1304MathSciNetCrossRef
Zurück zum Zitat Hu, Xiaohua, Nick Cercone JH, Hu X, Cercone N, Han J (1994) An attribute-oriented rough set approach for knowledge discovery in databases. In: Ziarko WP (ed) Rough sets, fuzzy sets and knowledge discovery. Springer, London, pp 90–99CrossRef Hu, Xiaohua, Nick Cercone JH, Hu X, Cercone N, Han J (1994) An attribute-oriented rough set approach for knowledge discovery in databases. In: Ziarko WP (ed) Rough sets, fuzzy sets and knowledge discovery. Springer, London, pp 90–99CrossRef
Zurück zum Zitat Hu K, Diao L, Lu Y, Shi C (2000) A heuristic optimal reduct algorithm. In: International conference on intelligent data engineering and automated learning: data mining, financial engineering, and intelligent agents, pp 89–99 Hu K, Diao L, Lu Y, Shi C (2000) A heuristic optimal reduct algorithm. In: International conference on intelligent data engineering and automated learning: data mining, financial engineering, and intelligent agents, pp 89–99
Zurück zum Zitat Hu K, Lu Y, Shi C (2003) Feature ranking in rough sets. AI Commun 16:41–50MATH Hu K, Lu Y, Shi C (2003) Feature ranking in rough sets. AI Commun 16:41–50MATH
Zurück zum Zitat Huerta E, Duval B, Hao J (2008) Gene selection for microarray data by a LDA-based genetic algorithm. In: IAPR international conference on pattern recognition in bioinformatics. Springer, Berlin, Heidelberg, pp 250–261 Huerta E, Duval B, Hao J (2008) Gene selection for microarray data by a LDA-based genetic algorithm. In: IAPR international conference on pattern recognition in bioinformatics. Springer, Berlin, Heidelberg, pp 250–261
Zurück zum Zitat Inbarani H, Azar A, Jothi G (2014) Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis. Comput methods programs 113:175–185CrossRef Inbarani H, Azar A, Jothi G (2014) Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis. Comput methods programs 113:175–185CrossRef
Zurück zum Zitat Inbarani H, Bagyamathi M, Azar A (2015a) A novel hybrid feature selection method based on rough set and improved harmony search. Neural Comput Appl 26(8):1859–1880CrossRef Inbarani H, Bagyamathi M, Azar A (2015a) A novel hybrid feature selection method based on rough set and improved harmony search. Neural Comput Appl 26(8):1859–1880CrossRef
Zurück zum Zitat Kavvadias D, Stavropoulos E (2005) An efficient algorithm for the transversal hypergraph generation. J Graph Algorithms Appl 9:239–264MathSciNetCrossRef Kavvadias D, Stavropoulos E (2005) An efficient algorithm for the transversal hypergraph generation. J Graph Algorithms Appl 9:239–264MathSciNetCrossRef
Zurück zum Zitat Moteghaed NY, Maghooli K, Pirhadi S, Garshasbi M (2015) Biomarker discovery based on hybrid optimization algorithm and artificial neural networks on microarray data for cancer classification. J Med Signals Sens 5:88–96CrossRef Moteghaed NY, Maghooli K, Pirhadi S, Garshasbi M (2015) Biomarker discovery based on hybrid optimization algorithm and artificial neural networks on microarray data for cancer classification. J Med Signals Sens 5:88–96CrossRef
Zurück zum Zitat Øhrn A, Komorowski J (1997) Rosetta–a rough set toolkit for analysis of data. In: Third international joint conference on information sciences, pp 403–407 Øhrn A, Komorowski J (1997) Rosetta–a rough set toolkit for analysis of data. In: Third international joint conference on information sciences, pp 403–407
Zurück zum Zitat Pawlak Z (1998) Rough set theory and its applications to data analysis. Cybern Syst 29:661–688CrossRef Pawlak Z (1998) Rough set theory and its applications to data analysis. Cybern Syst 29:661–688CrossRef
Zurück zum Zitat Sánchez-Maroño N, Alonso-Betanzos A (2007) Filter methods for feature selection–a comparative study. In: International conference on intelligent data engineering and automated learning. Springer, Berlin, Heidelberg, pp 178–187 Sánchez-Maroño N, Alonso-Betanzos A (2007) Filter methods for feature selection–a comparative study. In: International conference on intelligent data engineering and automated learning. Springer, Berlin, Heidelberg, pp 178–187
Zurück zum Zitat Wang X, Gotoh O (2009) Microarray-based cancer prediction using soft computing approach. 7:123–139 Wang X, Gotoh O (2009) Microarray-based cancer prediction using soft computing approach. 7:123–139
Zurück zum Zitat Wang G, Yu H, Yang D (2002) Decision table reduction based on conditional information entropy. Chinese J Comput Ed 25:759–766MathSciNet Wang G, Yu H, Yang D (2002) Decision table reduction based on conditional information entropy. Chinese J Comput Ed 25:759–766MathSciNet
Zurück zum Zitat Witten I, Frank E, Hall M, Pal C (2016) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann Witten I, Frank E, Hall M, Pal C (2016) Data mining: practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann
Zurück zum Zitat Wroblewski J (1995) Finding minimal reducts using genetic algorithms. In: Proccedings of the second annual join conference on infromation science, pp 186–189 Wroblewski J (1995) Finding minimal reducts using genetic algorithms. In: Proccedings of the second annual join conference on infromation science, pp 186–189
Metadaten
Titel
A hybrid approach using rough set theory and hypergraph for feature selection on high-dimensional medical datasets
verfasst von
M. R. Gauthama Raman
Somu Nivethitha
Krithivasan Kannan
V. S. Shankar Sriram
Publikationsdatum
18.02.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 23/2019
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-03818-6

Weitere Artikel der Ausgabe 23/2019

Soft Computing 23/2019 Zur Ausgabe

Premium Partner