Skip to main content
Erschienen in: Data Mining and Knowledge Discovery 3/2017

05.12.2016

Visualizing the behavior and some symmetry properties of Bayesian confirmation measures

verfasst von: Emilio Celotto

Erschienen in: Data Mining and Knowledge Discovery | Ausgabe 3/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Bayesian confirmation measures, a special class of interestingness measures, are functions usually adopted in ranking inductive rules generated by data mining methods such as association rule mining, decision trees, rough sets. Till now a plethora of measures have been defined in many different ways. Identifying and effectively distinguishing among them is a difficult task. In this paper we propose a unified visual approach aimed at comparing and classifying a large subset of Bayesian confirmation measures (those satisfying the initial and final probability dependence condition). We first reduce the set of variables in their analytical expression to only two, thus allowing to draw their contour lines on the plane. We observe that two dimensional contour lines plots represent a sort of fingerprints of the confirmation measures and, therefore, this geometric visualization can be used as an effective tool in order to investigate properties and behavior of the measures. We highlight the potential of this approach not only to study known measures but also in order to invent new measures satisfying given required characteristics. We finally define, following the geometry of the plots, a new set of symmetry properties of confirmation measures and describe geometrically four classical symmetries.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
IFPD stands for initial and final probability dependence (Crupi et al. 2010)
 
2
ROC stands for receiver operating characteristics (see e.g. Fawcett 2006)
 
3
Conjunction effects are “situations in which two hypotheses H1 and H2 are both confirmed by some evidence E, and in which the conjunction \(H1\wedge H2\) is even more highly confirmed” (Atkinson 2012).
 
4
Even if in the general definition of confirmation space, i.e. the domain of a Bayesian confirmation measure c(xy), we stated that it corresponds to the open square (0,1)x(0,1) in many cases it can be extended to the square (0,1]x(0,1) by including also the points of the line x=1, i.e. \(P(H{\vert }E)=1\).
 
5
Flach (2003) and Fürnkranz and Flach (2003a) discovered rather similar geometric symmetries of evaluation metrics on the ROC space but mentioned them without an explicit definition.
 
6
The logarithm has been used in order to meet Bayesian confirmation measures sign condition.
 
Literatur
Zurück zum Zitat Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data (SIGMOD ’93). ACM, New York, pp 207–216 Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data (SIGMOD ’93). ACM, New York, pp 207–216
Zurück zum Zitat Atkinson D (2012) Confirmation and justification. A commentary on Shogenji’s measure. Synthese 184(1):49–61MathSciNetCrossRef Atkinson D (2012) Confirmation and justification. A commentary on Shogenji’s measure. Synthese 184(1):49–61MathSciNetCrossRef
Zurück zum Zitat Błaszczyński J, Greco S, Matarazzo B, Słowiński R, Szeląg M (2012) jMAF-dominance-based rough set data analysis framework, rough sets and intelligent systems—Professor Zdzisław Pawlak in memoriam., Intelligent systems reference librarySpringer, Berlin Błaszczyński J, Greco S, Matarazzo B, Słowiński R, Szeląg M (2012) jMAF-dominance-based rough set data analysis framework, rough sets and intelligent systems—Professor Zdzisław Pawlak in memoriam., Intelligent systems reference librarySpringer, Berlin
Zurück zum Zitat Błaszczyński J, Słowiński R, Szeląg M (2011) Sequential covering rule induction algorithm for variable consistency rough set approaches. Inform Sci 181(5):987–1002MathSciNetCrossRef Błaszczyński J, Słowiński R, Szeląg M (2011) Sequential covering rule induction algorithm for variable consistency rough set approaches. Inform Sci 181(5):987–1002MathSciNetCrossRef
Zurück zum Zitat Carnap R (1950) Logical foundations of probability. University of Chicago Press, ChicagoMATH Carnap R (1950) Logical foundations of probability. University of Chicago Press, ChicagoMATH
Zurück zum Zitat Celotto E, Ellero A, Ferretti P (2015) Conveying tourist ratings into an overall destination evaluation. Proc Soc Behav Sci 188:35–41CrossRef Celotto E, Ellero A, Ferretti P (2015) Conveying tourist ratings into an overall destination evaluation. Proc Soc Behav Sci 188:35–41CrossRef
Zurück zum Zitat Crupi V, Festa R, Buttasi C (2010) Towards a grammar of Bayesian confirmation. In: Suárez M, Dorato M, Rédei M (eds) Epistemology and methodology of science. Springer, Dordrecht, pp 73–93 Crupi V, Festa R, Buttasi C (2010) Towards a grammar of Bayesian confirmation. In: Suárez M, Dorato M, Rédei M (eds) Epistemology and methodology of science. Springer, Dordrecht, pp 73–93
Zurück zum Zitat Crupi V, Tentori K (2014) State of the field: measuring information and confirmation. Stud Hist Philos Sci 47(2014):81–90CrossRef Crupi V, Tentori K (2014) State of the field: measuring information and confirmation. Stud Hist Philos Sci 47(2014):81–90CrossRef
Zurück zum Zitat Crupi V, Tentori K, Gonzalez M (2007) On Bayesian measures of evidential support: theoretical and empirical issues. Philos Sci 74(2):229–252MathSciNetCrossRef Crupi V, Tentori K, Gonzalez M (2007) On Bayesian measures of evidential support: theoretical and empirical issues. Philos Sci 74(2):229–252MathSciNetCrossRef
Zurück zum Zitat Eells E, Fitelson B (2002) Symmetries and asymmetries in evidential support. Philos Stud 107(2):129–142CrossRef Eells E, Fitelson B (2002) Symmetries and asymmetries in evidential support. Philos Stud 107(2):129–142CrossRef
Zurück zum Zitat Finch H-A (1999) Confirming power of observations metricized for decisions among hypotheses. Philos Sci 27, pp 293–207 (part I), pp 391–404 (part II) Finch H-A (1999) Confirming power of observations metricized for decisions among hypotheses. Philos Sci 27, pp 293–207 (part I), pp 391–404 (part II)
Zurück zum Zitat Fitelson B (2001) Studies in Bayesian confirmation theory. Ph.D. Thesis, University of Wisconsin, Madison Fitelson B (2001) Studies in Bayesian confirmation theory. Ph.D. Thesis, University of Wisconsin, Madison
Zurück zum Zitat Flach P-A (2003) The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proceedings of the 20th International conference on machine learning (ICML’03), AAAI Press, pp 194–201 Flach P-A (2003) The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proceedings of the 20th International conference on machine learning (ICML’03), AAAI Press, pp 194–201
Zurück zum Zitat Fürnkranz J (1999) Separate-and-conquer rule learning. Artif Intell Rev 13(1):3–54CrossRef Fürnkranz J (1999) Separate-and-conquer rule learning. Artif Intell Rev 13(1):3–54CrossRef
Zurück zum Zitat Fürnkranz J (2005) From local to global patterns: evaluation issues in rule learning algorithms. In: Proceedings of the 2004 international conference on Local Pattern Detection (LPD’04), Springer, Berlin, pp 20–38 Fürnkranz J (2005) From local to global patterns: evaluation issues in rule learning algorithms. In: Proceedings of the 2004 international conference on Local Pattern Detection (LPD’04), Springer, Berlin, pp 20–38
Zurück zum Zitat Fürnkranz J, Flach P-A (2003a) An analysis of rule evaluation metrics. In: Proceedings of the 20th international conference on machine learning (ICML’03), AAAI Press, Washington, DC, pp 202–209 Fürnkranz J, Flach P-A (2003a) An analysis of rule evaluation metrics. In: Proceedings of the 20th international conference on machine learning (ICML’03), AAAI Press, Washington, DC, pp 202–209
Zurück zum Zitat Fürnkranz J, Flach P-A (2003b) An analysis of rule learning heuristics. Department of Computer Science, University of Bristol, CSTR-03-002, February 2003 Fürnkranz J, Flach P-A (2003b) An analysis of rule learning heuristics. Department of Computer Science, University of Bristol, CSTR-03-002, February 2003
Zurück zum Zitat Fürnkranz J, Flach P-A (2005) ROC ‘n’ rule learning—towards a better understanding of covering algorithms. Mach Learn 58(1):39–77CrossRef Fürnkranz J, Flach P-A (2005) ROC ‘n’ rule learning—towards a better understanding of covering algorithms. Mach Learn 58(1):39–77CrossRef
Zurück zum Zitat Geng L, Hamilton H-J (2006) Interestingness measures for data mining: a survey. ACM Comput Surv 38(3):1–32CrossRef Geng L, Hamilton H-J (2006) Interestingness measures for data mining: a survey. ACM Comput Surv 38(3):1–32CrossRef
Zurück zum Zitat Glass D-H (2013) Confirmation measures of association rule interestingness. Knowl Based Syst 44:65–77CrossRef Glass D-H (2013) Confirmation measures of association rule interestingness. Knowl Based Syst 44:65–77CrossRef
Zurück zum Zitat Good I-J (1950) Probability and the weighing of evidence. Hafners, New YorkMATH Good I-J (1950) Probability and the weighing of evidence. Hafners, New YorkMATH
Zurück zum Zitat Good I-J (1985) Weight of Evidence: A Brief Survey, Bayesian Statistics 2, In: J. M. Bernardo, M. H. DeGroot, D. V. Lindley and A. F. M. Smith (eds). Proceedings of the valencia international meetings on Bayesian statistics, , Elsevier Science Publishers B.V, Auckland, pp 249–270 Good I-J (1985) Weight of Evidence: A Brief Survey, Bayesian Statistics 2, In: J. M. Bernardo, M. H. DeGroot, D. V. Lindley and A. F. M. Smith (eds). Proceedings of the valencia international meetings on Bayesian statistics, , Elsevier Science Publishers B.V, Auckland, pp 249–270
Zurück zum Zitat Greco S, Matarazzo B, Słowiński R (2008) Parameterized rough set model using rough membership and Bayesian confirmation measures. Int J Approx Reason 49(2):285–300MathSciNetCrossRef Greco S, Matarazzo B, Słowiński R (2008) Parameterized rough set model using rough membership and Bayesian confirmation measures. Int J Approx Reason 49(2):285–300MathSciNetCrossRef
Zurück zum Zitat Greco S, Matarazzo B, Slowinski R, Stefanowski J (2001) An algorithm for induction of decision rules consistent with dominance principle, In: Revised papers from the second international conference on rough sets and current trends in computing (RSCTC ’00), Springer, pp 304–313 Greco S, Matarazzo B, Slowinski R, Stefanowski J (2001) An algorithm for induction of decision rules consistent with dominance principle, In: Revised papers from the second international conference on rough sets and current trends in computing (RSCTC ’00), Springer, pp 304–313
Zurück zum Zitat Greco S, Pawlak Z, Słowiński R (2004) Can Bayesian confirmation measures be useful for rough set decision rules? Eng Appl Artif Intell 17(4):345–361CrossRef Greco S, Pawlak Z, Słowiński R (2004) Can Bayesian confirmation measures be useful for rough set decision rules? Eng Appl Artif Intell 17(4):345–361CrossRef
Zurück zum Zitat Greco S, Słowiński R, Szczęch I (2012) Properties of rule interestingness measures and alternative approaches to normalization of measures. Inform Sci 216:1–16MathSciNetCrossRef Greco S, Słowiński R, Szczęch I (2012) Properties of rule interestingness measures and alternative approaches to normalization of measures. Inform Sci 216:1–16MathSciNetCrossRef
Zurück zum Zitat Kemeny J, Oppenheim P (1952) Degrees of factual support. Philos Sci 19:307–324CrossRef Kemeny J, Oppenheim P (1952) Degrees of factual support. Philos Sci 19:307–324CrossRef
Zurück zum Zitat Keynes J-M (1921) A treatise on probability. Macmillan, LondonMATH Keynes J-M (1921) A treatise on probability. Macmillan, LondonMATH
Zurück zum Zitat Lavrač N, Flach P-A, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming (ILP ’99), Springer, pp 174–185 Lavrač N, Flach P-A, Zupan B (1999) Rule evaluation measures: a unifying view. In: Proceedings of the 9th international workshop on inductive logic programming (ILP ’99), Springer, pp 174–185
Zurück zum Zitat Lenca P, Meyer P, Vaillant B, Lallich S (2008) On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur J Oper Res 184(2):610–626CrossRef Lenca P, Meyer P, Vaillant B, Lallich S (2008) On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur J Oper Res 184(2):610–626CrossRef
Zurück zum Zitat Mortimer H (1988) The logic of induction. Prentice Hall, paramusMATH Mortimer H (1988) The logic of induction. Prentice Hall, paramusMATH
Zurück zum Zitat Nozick R (1981) Philosophical explanations. Clarendon Press, Oxford Nozick R (1981) Philosophical explanations. Clarendon Press, Oxford
Zurück zum Zitat Pawlak Z (1991) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, BostonCrossRef Pawlak Z (1991) Rough sets: theoretical aspects of reasoning about data. Kluwer Academic Publishers, BostonCrossRef
Zurück zum Zitat Popper K-R (1959) The logic of scientific discovery. Hutchinson, LondonMATH Popper K-R (1959) The logic of scientific discovery. Hutchinson, LondonMATH
Zurück zum Zitat Quinlan J-R (1986) Induction of decision trees. Mach Learn 1(1):81–106 Quinlan J-R (1986) Induction of decision trees. Mach Learn 1(1):81–106
Zurück zum Zitat Susmaga R, Szczęch I (2013a) The Property of \(\chi ^{2}_{\{01\}}\)-Concordance for Bayesian confirmation measures. In: Proceedings of the 10th international conference modelling decisions for artificial intelligence (MDAI 2013), LNCS, Springer, vol 8234, pp 226–236 Susmaga R, Szczęch I (2013a) The Property of \(\chi ^{2}_{\{01\}}\)-Concordance for Bayesian confirmation measures. In: Proceedings of the 10th international conference modelling decisions for artificial intelligence (MDAI 2013), LNCS, Springer, vol 8234, pp 226–236
Zurück zum Zitat Susmaga R, Szczęch I (2013) Visualization of interestingness measures. In: Proceedings of the 6th language & technology conference: human language technologies as a challenge for computer science and linguistics. Fundacja UAM, Poznań, pp 95–99 Susmaga R, Szczęch I (2013) Visualization of interestingness measures. In: Proceedings of the 6th language & technology conference: human language technologies as a challenge for computer science and linguistics. Fundacja UAM, Poznań, pp 95–99
Zurück zum Zitat Tan P-N, Kumar V, Srivastava J (2004) Selecting the right objective measure for association analysis. Inform Syst 29(4):293–313CrossRef Tan P-N, Kumar V, Srivastava J (2004) Selecting the right objective measure for association analysis. Inform Syst 29(4):293–313CrossRef
Zurück zum Zitat Tentori K, Crupi V, Bonini N, Osherson D (2007) Comparison of confirmation measures. Cognition 103:107–119CrossRef Tentori K, Crupi V, Bonini N, Osherson D (2007) Comparison of confirmation measures. Cognition 103:107–119CrossRef
Zurück zum Zitat Tew C, Giraud-Carrier C, Tanner K, Burton S (2014) Behavior-based clustering and analysis of interestingness measures for association rule mining. Data Min Knowl Discov 28(4):1004–1045MathSciNetCrossRef Tew C, Giraud-Carrier C, Tanner K, Burton S (2014) Behavior-based clustering and analysis of interestingness measures for association rule mining. Data Min Knowl Discov 28(4):1004–1045MathSciNetCrossRef
Zurück zum Zitat Todhunter I (1865) A history of mathematical theory of probability from the time of Pascal to that of Laplace. Macmillan, London reprinted, (1949) 1965. Chelsea Publishing Company, New York Todhunter I (1865) A history of mathematical theory of probability from the time of Pascal to that of Laplace. Macmillan, London reprinted, (1949) 1965. Chelsea Publishing Company, New York
Zurück zum Zitat Vilalta R, Oblinger D (2000) A quantification of distance-bias between evaluation metrics in classification. In: Proceedings of the 17th international conference on machine learning (ICML-00) Stanford. Morgan Kaufmann, pp 1087–1094 Vilalta R, Oblinger D (2000) A quantification of distance-bias between evaluation metrics in classification. In: Proceedings of the 17th international conference on machine learning (ICML-00) Stanford. Morgan Kaufmann, pp 1087–1094
Zurück zum Zitat Yao Y-Y, Zhong N (1999) An analysis of quantitative measures associated with rules, In: Proceedings of the third Pacific-Asia conference on methodologies for knowledge discovery and data mining (PAKDD ’99), Springer, pp 479–488 Yao Y-Y, Zhong N (1999) An analysis of quantitative measures associated with rules, In: Proceedings of the third Pacific-Asia conference on methodologies for knowledge discovery and data mining (PAKDD ’99), Springer, pp 479–488
Zurück zum Zitat Yule G-U (1900) On the association of attributes in statistics: with illustrations from the material of the Childhood Society, & c. Philos Trans R Soc Lond A 194:257–319CrossRef Yule G-U (1900) On the association of attributes in statistics: with illustrations from the material of the Childhood Society, & c. Philos Trans R Soc Lond A 194:257–319CrossRef
Zurück zum Zitat Yule G-U (1912) On the methods of measuring the association between two attributes. J R Stat Soc 75:579–652CrossRef Yule G-U (1912) On the methods of measuring the association between two attributes. J R Stat Soc 75:579–652CrossRef
Zurück zum Zitat Zhou B, Yao Y-Y (2014) Feature selection based on confirmation-theoretic rough sets, In: Proceedings of the 9th international conference on rough sets and current trends in computing (RSCTC 2014), Lecture Notes in Computer Science, vol 8536, pp 181–188 Zhou B, Yao Y-Y (2014) Feature selection based on confirmation-theoretic rough sets, In: Proceedings of the 9th international conference on rough sets and current trends in computing (RSCTC 2014), Lecture Notes in Computer Science, vol 8536, pp 181–188
Metadaten
Titel
Visualizing the behavior and some symmetry properties of Bayesian confirmation measures
verfasst von
Emilio Celotto
Publikationsdatum
05.12.2016
Verlag
Springer US
Erschienen in
Data Mining and Knowledge Discovery / Ausgabe 3/2017
Print ISSN: 1384-5810
Elektronische ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-016-0487-5

Weitere Artikel der Ausgabe 3/2017

Data Mining and Knowledge Discovery 3/2017 Zur Ausgabe

Premium Partner