Skip to main content
Erschienen in: Soft Computing 15/2021

28.10.2020 | Foundations

A nonparametric copula-based decision tree for two random variables using MIC as a classification index

verfasst von: Y. A. Khan, Q. S. Shan, Q. Liu, S. Z. Abbas

Erschienen in: Soft Computing | Ausgabe 15/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The copula is well-known for learning scale-free measures of dependence among variables and has invited much interest in recent years. At the very coronary heart of the copula, the concept is the well-known theorem of Sklar. It states that any multivariate distribution function can be disintegrated into the marginal distributions and a copula, which comprises the reliance between variables. On the other hand, the decision tree is a renowned nonparametric dominant modeling approach used for both regression and labeling problems. A decision tree represents a tree-structured classification of the data into surprising instructions for simplicity and prediction reason. In this paper, we are going to appraise with novel nonparametric copula-based decision tree organization using a measure of dependence: maximal information coefficient as classification index for two related variables which best classify the data concerning looking at the factors, but additionally ranked the factors in line with their inferences. Additionally, we pre-test the splitting criteria value to anticipate growing branches of the decision tree at each infant node. For example, we followed our proposed method to credit card records for Taiwan and coronary heart disease records of Pakistan and acquired the desirable outcomes. As a result, the anticipated method of initiating two-variable decision trees is tested using constructive tools for classification, prediction and reconnecting critical factors in statistics, finance, fitness sciences, machine learning, and many other associated fields.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Aitkenhead MJ (2008) A co-evolving decision tree classification method. Expert SystAppl 34(1):18–25CrossRef Aitkenhead MJ (2008) A co-evolving decision tree classification method. Expert SystAppl 34(1):18–25CrossRef
Zurück zum Zitat Alsagheer RHA, Alharan AFH, Al-Haboobi ASA (2017) Popular decision tree algorithms of data mining techniques: a review. Int J ComputSci Mobile Comput IJCSMC 6(6):133–142 Alsagheer RHA, Alharan AFH, Al-Haboobi ASA (2017) Popular decision tree algorithms of data mining techniques: a review. Int J ComputSci Mobile Comput IJCSMC 6(6):133–142
Zurück zum Zitat Balakrishnan S, Madigan D (2006) Decision trees for functional variables. In: 6th international conference on data mining (ICDM'06), pp 798–802 Balakrishnan S, Madigan D (2006) Decision trees for functional variables. In: 6th international conference on data mining (ICDM'06), pp 798–802
Zurück zum Zitat Chen SX, Huang TM (2007) Nonparametric estimation of copula functions for dependence modelling. Can J Stat 35(2):145–159MathSciNetCrossRef Chen SX, Huang TM (2007) Nonparametric estimation of copula functions for dependence modelling. Can J Stat 35(2):145–159MathSciNetCrossRef
Zurück zum Zitat Cherubini U, Luciano E, Vecchiato W (2004) Copula methods in finance. Wiley finance series. Wiley, LondonCrossRef Cherubini U, Luciano E, Vecchiato W (2004) Copula methods in finance. Wiley finance series. Wiley, LondonCrossRef
Zurück zum Zitat Elidan G (2012) Copula network classifiers. In: Proceedings of the 15th international conference on artificial intelligence and statistics, PMLR, vol 22, pp 346–354 Elidan G (2012) Copula network classifiers. In: Proceedings of the 15th international conference on artificial intelligence and statistics, PMLR, vol 22, pp 346–354
Zurück zum Zitat Elidan G (2013) Copulas in machine learning. In: Jaworski P, Durante F, Hardle WK (eds) Copulae in mathematical and quantitative finance, volume 213 of lecture notes in statistics. Springer, Berlin, pp 39–60 Elidan G (2013) Copulas in machine learning. In: Jaworski P, Durante F, Hardle WK (eds) Copulae in mathematical and quantitative finance, volume 213 of lecture notes in statistics. Springer, Berlin, pp 39–60
Zurück zum Zitat Geenens G, Charpentier A, Paindaveine D (2017) Probit transformation for nonparametric kernel estimation of the copula density. Bernoulli 23(3):1848–1873MathSciNetCrossRef Geenens G, Charpentier A, Paindaveine D (2017) Probit transformation for nonparametric kernel estimation of the copula density. Bernoulli 23(3):1848–1873MathSciNetCrossRef
Zurück zum Zitat Gijbels I, Mielniczuk J (1990) Estimating the density of a copula function. Commun Stat Theory Methods 19(2):445–464MathSciNetCrossRef Gijbels I, Mielniczuk J (1990) Estimating the density of a copula function. Commun Stat Theory Methods 19(2):445–464MathSciNetCrossRef
Zurück zum Zitat Hastie T, Tibshirani R, Friedman JH (2009) The elements of statistical learning: data mining, Inference and Prediction. Springer, New YorkCrossRef Hastie T, Tibshirani R, Friedman JH (2009) The elements of statistical learning: data mining, Inference and Prediction. Springer, New YorkCrossRef
Zurück zum Zitat Kinney JB, Gurinder SA (2014) Equitability, mutual information, and the maximal information coefficient. PNAS 111(9):3354–3359MathSciNetCrossRef Kinney JB, Gurinder SA (2014) Equitability, mutual information, and the maximal information coefficient. PNAS 111(9):3354–3359MathSciNetCrossRef
Zurück zum Zitat Kraskov A, Stogbauer H, Grassberger P (2004) Estimating mutual information. Phys Rev E Stat Nonlin Soft Matter Phys 69(6 Pt 2):066138MathSciNetCrossRef Kraskov A, Stogbauer H, Grassberger P (2004) Estimating mutual information. Phys Rev E Stat Nonlin Soft Matter Phys 69(6 Pt 2):066138MathSciNetCrossRef
Zurück zum Zitat Nelsen RB (1997) An introduction to copulas. Springer, New YorkMATH Nelsen RB (1997) An introduction to copulas. Springer, New YorkMATH
Zurück zum Zitat Ozdemir O, Allen TG, Choi S, Wimalajeewa T, Varshney PK (2018) Copula based classifier fusion under statistical dependence. IEEE Trans Pattern Anal Mach Intell 40(11):2740–2748CrossRef Ozdemir O, Allen TG, Choi S, Wimalajeewa T, Varshney PK (2018) Copula based classifier fusion under statistical dependence. IEEE Trans Pattern Anal Mach Intell 40(11):2740–2748CrossRef
Zurück zum Zitat Patel BN, Prajapati SG, Lakharia KI (2012) Efficient classification of data using decision tree. BunfInt J Data Min 2(1):6–12 Patel BN, Prajapati SG, Lakharia KI (2012) Efficient classification of data using decision tree. BunfInt J Data Min 2(1):6–12
Zurück zum Zitat Reshef DN et al (2011) Detecting novel associations in large data sets. Science 334(6062):1518–1524CrossRef Reshef DN et al (2011) Detecting novel associations in large data sets. Science 334(6062):1518–1524CrossRef
Zurück zum Zitat Reshef DN, Reshef Y, Mitzenmacher M, Sabeti P (2013) Equitability analysis of the maximal information coefficients with comparisons. arXiv:1301.6314v1 [cs. L.G.] Reshef DN, Reshef Y, Mitzenmacher M, Sabeti P (2013) Equitability analysis of the maximal information coefficients with comparisons. arXiv:​1301.​6314v1 [cs. L.G.]
Zurück zum Zitat Simon N, Tibshirani R (2011) Comment on "Detecting novel associations in large data sets" by Reshef et al. Science. arXiv:1401.7645 Simon N, Tibshirani R (2011) Comment on "Detecting novel associations in large data sets" by Reshef et al. Science. arXiv:​1401.​7645
Zurück zum Zitat Sklar A (1959) Fonctions de Répartition à n Dimensions et Leurs Marges. Université Paris 8 Sklar A (1959) Fonctions de Répartition à n Dimensions et Leurs Marges. Université Paris 8
Zurück zum Zitat Wang LM, Li XL, Cao CH, Yuan SM (2006) Combining decision tree and naïve Bayes for classification. Knowl Based Syst 19(7):511–515CrossRef Wang LM, Li XL, Cao CH, Yuan SM (2006) Combining decision tree and naïve Bayes for classification. Knowl Based Syst 19(7):511–515CrossRef
Zurück zum Zitat Yeh IC, Lien CH (2009) The comparisons of data mining techniques for the predictive accuracy of the probability of default of credit card clients. Expert SystAppl 36(2):2473–2480CrossRef Yeh IC, Lien CH (2009) The comparisons of data mining techniques for the predictive accuracy of the probability of default of credit card clients. Expert SystAppl 36(2):2473–2480CrossRef
Metadaten
Titel
A nonparametric copula-based decision tree for two random variables using MIC as a classification index
verfasst von
Y. A. Khan
Q. S. Shan
Q. Liu
S. Z. Abbas
Publikationsdatum
28.10.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 15/2021
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-020-05399-1

Weitere Artikel der Ausgabe 15/2021

Soft Computing 15/2021 Zur Ausgabe

Soft computing in decision making and in modeling in economics

An improved multi-criteria emergency decision-making method in environmental disasters

Foundation, algebraic, and analytical methods in soft computing

Quantum-like Gaussian mixture model

Premium Partner