Skip to main content

2019 | OriginalPaper | Buchkapitel

Novel Semantic Discretization Technique for Type-2 Diabetes Classification Model

verfasst von : Omprakash Chandrakar, Jatinderkumar R. Saini, Dharmendra G. Bhatti

Erschienen in: Innovations in Computer Science and Engineering

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Semantic discretization, which is relatively a new concept, can be viewed as the discretization technique that uses the semantics of the data along with its value. The semantics of the data refer to the domain knowledge inherent in the data. The semantics of data is derived from the data value itself. Objective and context of the study also contribute significantly to identifying semantic of the data. Since no explicit ontology is associated with the data in semantic discretization, identifying, interpreting, and exploiting, the semantics of the data is a challenging task. This paper presents a novel algorithm for semantic discretization, in which machine learning techniques such as classification and association rule mining is used to derive semantic knowledge, which is further used for discretization. To show the effectiveness of the proposed semantic discretization algorithm, we applied it on diabetes dataset. Experimental results show 2–15% improvement in classification accuracy on semantically discretized dataset in comparison to the original and statistically discretized dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Yang YW, Wu GI, Maimon X, Oded Rokach L Book section, discretization methods, data mining and knowledge discovery handbook, 2005, Springer US, Boston, MA @ 978-0-387-25465-4 Yang YW, Wu GI, Maimon X, Oded Rokach L Book section, discretization methods, data mining and knowledge discovery handbook, 2005, Springer US, Boston, MA @ 978-0-387-25465-4
2.
Zurück zum Zitat Chandrakar O, Saini JR (2017) Knowledge based semantic discretization using data mining techniques. Int J Adv Intell Parad Chandrakar O, Saini JR (2017) Knowledge based semantic discretization using data mining techniques. Int J Adv Intell Parad
3.
Zurück zum Zitat Liu H, Hussain F, Tan CL, Dash M (2002) Discretization: an enabling technique. Data Min Knowl Disc 6(4):393–423MathSciNetCrossRef Liu H, Hussain F, Tan CL, Dash M (2002) Discretization: an enabling technique. Data Min Knowl Disc 6(4):393–423MathSciNetCrossRef
4.
Zurück zum Zitat Dougherty J, Kohavi R, Sahami M (1995) Supervised and unsupervised discretization of continuous features. In: Proceedings of the twelfth international conference on machine learning (ICML), 1995, pp 194–202 Dougherty J, Kohavi R, Sahami M (1995) Supervised and unsupervised discretization of continuous features. In: Proceedings of the twelfth international conference on machine learning (ICML), 1995, pp 194–202
5.
Zurück zum Zitat Yang Y, Webb GI, Wu X (2010) Discretization methods. In: Data mining and knowledge discovery handbook, pp 101–116 Yang Y, Webb GI, Wu X (2010) Discretization methods. In: Data mining and knowledge discovery handbook, pp 101–116
6.
Zurück zum Zitat Li R-P, Wang Z-O (2002) An entropy-based discretization method for classification rules with inconsistency checking. In: Proceedings of the first international conference on machine learning and cybernetics (ICMLC), pp 243–246 Li R-P, Wang Z-O (2002) An entropy-based discretization method for classification rules with inconsistency checking. In: Proceedings of the first international conference on machine learning and cybernetics (ICMLC), pp 243–246
7.
Zurück zum Zitat Yang Y, Webb GI (2009) Discretization for naive-bayes learning: managing discretization bias and variance. Mach Learn 74(1):39–74CrossRef Yang Y, Webb GI (2009) Discretization for naive-bayes learning: managing discretization bias and variance. Mach Learn 74(1):39–74CrossRef
8.
Zurück zum Zitat Bay SD (2001) Multivariate discretization for set mining. Knowl Inf Syst 3:491–512CrossRef Bay SD (2001) Multivariate discretization for set mining. Knowl Inf Syst 3:491–512CrossRef
9.
Zurück zum Zitat Cerquides J, Lopez R (1997) Proposal and empirical comparison of a parallelizable distance-based discretization method. In: III international conference on knowledge discovery and data mining (KDDM97). Newport Beach, California, USA, pp 139–142 Cerquides J, Lopez R (1997) Proposal and empirical comparison of a parallelizable distance-based discretization method. In: III international conference on knowledge discovery and data mining (KDDM97). Newport Beach, California, USA, pp 139–142
10.
Zurück zum Zitat Steck H, Jaakkola T (2004) Predictive discretization during model selection. In: XXVI symposium in pattern recognition (DAGM04). Lecture notes in computer science 3175, Springer, Tbingen, Germany, pp 1–8 Steck H, Jaakkola T (2004) Predictive discretization during model selection. In: XXVI symposium in pattern recognition (DAGM04). Lecture notes in computer science 3175, Springer, Tbingen, Germany, pp 1–8
11.
Zurück zum Zitat Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc. Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers Inc.
12.
Zurück zum Zitat Au W-H, Chan KCC, Wong AKC (2006) A fuzzy approach to partitioning continuous attributes for classification. IEEE Trans Knowl Data Eng 18(5):715–719CrossRef Au W-H, Chan KCC, Wong AKC (2006) A fuzzy approach to partitioning continuous attributes for classification. IEEE Trans Knowl Data Eng 18(5):715–719CrossRef
13.
Zurück zum Zitat Kerber R (1992) ChiMerge: discretization of numeric attributes. X national conference on artificial intelligence American association (AAAI92). USA, pp 123–128 Kerber R (1992) ChiMerge: discretization of numeric attributes. X national conference on artificial intelligence American association (AAAI92). USA, pp 123–128
14.
Zurück zum Zitat Chandrakar O, Saini JR Development of Indian weighted diabetic risk score (IWDRS) using machine learning techniques for type-2 diabetes. In: COMPUTE ‘16 proceedings of the 9th annual ACM India conference. ACM New York, NY, USA, pp 125–128. ©2016, ISBN: 978-1-4503-4808-9. https://doi.org/10.1145/2998476.2998497 Chandrakar O, Saini JR Development of Indian weighted diabetic risk score (IWDRS) using machine learning techniques for type-2 diabetes. In: COMPUTE ‘16 proceedings of the 9th annual ACM India conference. ACM New York, NY, USA, pp 125–128. ©2016, ISBN: 978-1-4503-4808-9. https://​doi.​org/​10.​1145/​2998476.​2998497
15.
Zurück zum Zitat Bouckaert RR, Frank E, Hall M, Kirkby R, Reutemann P, Seewald A, Scuse D (2016) WEKA manual for version 3-8-1. University of Waikato, Hamilton, New Zealand Bouckaert RR, Frank E, Hall M, Kirkby R, Reutemann P, Seewald A, Scuse D (2016) WEKA manual for version 3-8-1. University of Waikato, Hamilton, New Zealand
16.
Zurück zum Zitat Chandrakar O, Saini JR Questionnaire for deriving diabetic risk score for Indian population. Accepted for presentation and publication at international conference on artificial intelligence in health care, ICAIHC-2016 Chandrakar O, Saini JR Questionnaire for deriving diabetic risk score for Indian population. Accepted for presentation and publication at international conference on artificial intelligence in health care, ICAIHC-2016
Metadaten
Titel
Novel Semantic Discretization Technique for Type-2 Diabetes Classification Model
verfasst von
Omprakash Chandrakar
Jatinderkumar R. Saini
Dharmendra G. Bhatti
Copyright-Jahr
2019
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-7082-3_17

Neuer Inhalt