Skip to main content
Top
Published in: Soft Computing 5/2013

01-05-2013 | Methodologies and Application

Possibilistic classifiers for numerical data

Authors: Myriam Bounhas, Khaled Mellouli, Henri Prade, Mathieu Serrurier

Published in: Soft Computing | Issue 5/2013

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Naive Bayesian Classifiers, which rely on independence hypotheses, together with a normality assumption to estimate densities for numerical data, are known for their simplicity and their effectiveness. However, estimating densities, even under the normality assumption, may be problematic in case of poor data. In such a situation, possibility distributions may provide a more faithful representation of these data. Naive Possibilistic Classifiers (NPC), based on possibility theory, have been recently proposed as a counterpart of Bayesian classifiers to deal with classification tasks. There are only few works that treat possibilistic classification and most of existing NPC deal only with categorical attributes. This work focuses on the estimation of possibility distributions for continuous data. In this paper we investigate two kinds of possibilistic classifiers. The first one is derived from classical or flexible Bayesian classifiers by applying a probability–possibility transformation to Gaussian distributions, which introduces some further tolerance in the description of classes. The second one is based on a direct interpretation of data in possibilistic formats that exploit an idea of proximity between data values in different ways, which provides a less constrained representation of them. We show that possibilistic classifiers have a better capability to detect new instances for which the classification is ambiguous than Bayesian classifiers, where probabilities may be poorly estimated and illusorily precise. Moreover, we propose, in this case, an hybrid possibilistic classification approach based on a nearest-neighbour heuristics to improve the accuracy of the proposed possibilistic classifiers when the available information is insufficient to choose between classes. Possibilistic classifiers are compared with classical or flexible Bayesian classifiers on a collection of benchmarks databases. The experiments reported show the interest of possibilistic classifiers. In particular, flexible possibilistic classifiers perform well for data agreeing with the normality assumption, while proximity-based possibilistic classifiers outperform others in the other cases. The hybrid possibilistic classification exhibits a good ability for improving accuracy.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
go back to reference Ben Amor N, Mellouli K, Benferhat S, Dubois D, Prade H (2002) A theoretical framework for possibilistic independence in a weakly ordered setting. Int J Uncertain Fuzziness Knowledge-Based Syst 10:117–155MathSciNetMATH Ben Amor N, Mellouli K, Benferhat S, Dubois D, Prade H (2002) A theoretical framework for possibilistic independence in a weakly ordered setting. Int J Uncertain Fuzziness Knowledge-Based Syst 10:117–155MathSciNetMATH
go back to reference Ben Amor N, Benferhat S, Elouedi Z (2004) Qualitative classification and evaluation in possibilistic decision trees. In: FUZZ-IEEE’04, vol 1, pp 653–657 Ben Amor N, Benferhat S, Elouedi Z (2004) Qualitative classification and evaluation in possibilistic decision trees. In: FUZZ-IEEE’04, vol 1, pp 653–657
go back to reference Benferhat S, Tabia K (2008) An efficient algorithm for naive possibilistic classifiers with uncertain inputs. In: Proceedings of 2nd international conference on scalable uncertainty management (SUM’08). LNAI, vol 5291. Springer, Berlin, pp 63–77 Benferhat S, Tabia K (2008) An efficient algorithm for naive possibilistic classifiers with uncertain inputs. In: Proceedings of 2nd international conference on scalable uncertainty management (SUM’08). LNAI, vol 5291. Springer, Berlin, pp 63–77
go back to reference Beringer J, Hüllermeier E (2008) Case-based learning in a bipolar possibilistic framework. Int J Intell Syst 23:1119–1134MATHCrossRef Beringer J, Hüllermeier E (2008) Case-based learning in a bipolar possibilistic framework. Int J Intell Syst 23:1119–1134MATHCrossRef
go back to reference Bishop CM (1996) Neural networks for pattern recognition. Oxford University Press, New York Bishop CM (1996) Neural networks for pattern recognition. Oxford University Press, New York
go back to reference Bishop CM (1999) Latent variable models. In: Learning in graphical models, pp 371–403 Bishop CM (1999) Latent variable models. In: Learning in graphical models, pp 371–403
go back to reference Borgelt C, Gebhardt J (1999) A naïve bayes style possibilistic classifier. In: Proceedings of 7th European congress on intelligent techniques and soft computing, pp 556–565 Borgelt C, Gebhardt J (1999) A naïve bayes style possibilistic classifier. In: Proceedings of 7th European congress on intelligent techniques and soft computing, pp 556–565
go back to reference Borgelt C, Kruse R (1988) Efficient maximum projection of database-induced multivariate possibility distributions. In: Proceedings of 7th IEEE international conference on fuzzy systems, pp 663–668 Borgelt C, Kruse R (1988) Efficient maximum projection of database-induced multivariate possibility distributions. In: Proceedings of 7th IEEE international conference on fuzzy systems, pp 663–668
go back to reference Bounhas M, Mellouli K (2010) A possibilistic classification approach to handle continuous data. In: Proceedings of the eighth ACS/IEEE international conference on computer systems and applications (AICCSA-10), pp 1–8 Bounhas M, Mellouli K (2010) A possibilistic classification approach to handle continuous data. In: Proceedings of the eighth ACS/IEEE international conference on computer systems and applications (AICCSA-10), pp 1–8
go back to reference Bounhas M, Mellouli K, Prade H, Serrurier M (2010) From bayesian classifiers to possibilistic classifiers for numerical data. In: Proceedings of the fourth international conference on scalable uncertainty management, pp 112–125 Bounhas M, Mellouli K, Prade H, Serrurier M (2010) From bayesian classifiers to possibilistic classifiers for numerical data. In: Proceedings of the fourth international conference on scalable uncertainty management, pp 112–125
go back to reference Bounhas M, Prade H, Serrurier M, Mellouli K (2011) Possibilistic classifiers for uncertain numerical data. In: Proceedings of 11th European conference on symbolic and quantitative approaches to reasoning with uncertainty (ECSQARU’11), Belfast, UK, June 29–July 1. LNCS, vol 6717. Springer, Berlin, pp 434–446 Bounhas M, Prade H, Serrurier M, Mellouli K (2011) Possibilistic classifiers for uncertain numerical data. In: Proceedings of 11th European conference on symbolic and quantitative approaches to reasoning with uncertainty (ECSQARU’11), Belfast, UK, June 29–July 1. LNCS, vol 6717. Springer, Berlin, pp 434–446
go back to reference Cheng J, Greiner R (1999) Comparing bayesian network classifiers. In: Proceedings of the 15th conference on uncertainty in artificial intelligence, pp 101–107 Cheng J, Greiner R (1999) Comparing bayesian network classifiers. In: Proceedings of the 15th conference on uncertainty in artificial intelligence, pp 101–107
go back to reference Cover TM, Hart PE (1967) Nearest neighbour pattern classification. IEEE Trans Inf Theory 13:21–27MATHCrossRef Cover TM, Hart PE (1967) Nearest neighbour pattern classification. IEEE Trans Inf Theory 13:21–27MATHCrossRef
go back to reference De Cooman G (1997) Possibility theory. Part I: measure- and integral-theoretic ground- work. Part II: conditional possibility; Part III: possibilistic independence. Int J Gen Syst 25:291–371 De Cooman G (1997) Possibility theory. Part I: measure- and integral-theoretic ground- work. Part II: conditional possibility; Part III: possibilistic independence. Int J Gen Syst 25:291–371
go back to reference Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30MathSciNetMATH
go back to reference Denton A, Perrizo W (2004) A kernel-based semi-naive Bayesian classifier using p-trees. In: Proceedings of the 4th SIAM international conference on data mining Denton A, Perrizo W (2004) A kernel-based semi-naive Bayesian classifier using p-trees. In: Proceedings of the 4th SIAM international conference on data mining
go back to reference Devroye L (1983) The equivalence of weak, strong, and complete convergence in l1 for kernel density estimates. Ann Stat 11:896–904 Devroye L (1983) The equivalence of weak, strong, and complete convergence in l1 for kernel density estimates. Ann Stat 11:896–904
go back to reference Domingos P, Pazzani M (2002) Beyond independence: conditions for the optimality of the simple bayesian classifier. Mach Learn 29:102–130 Domingos P, Pazzani M (2002) Beyond independence: conditions for the optimality of the simple bayesian classifier. Mach Learn 29:102–130
go back to reference Dubois D (2006) Possibility theory and statistical reasoning. Comput Stat Data Anal 51:47–69MATHCrossRef Dubois D (2006) Possibility theory and statistical reasoning. Comput Stat Data Anal 51:47–69MATHCrossRef
go back to reference Dubois D, Prade H (1988) Possibility theory: an approach to computerized processing of uncertainty Dubois D, Prade H (1988) Possibility theory: an approach to computerized processing of uncertainty
go back to reference Dubois D, Prade H (1990) Aggregation of possibility measures. In: Multiperson decision making using fuzzy sets and possibility theory, pp 55–63 Dubois D, Prade H (1990) Aggregation of possibility measures. In: Multiperson decision making using fuzzy sets and possibility theory, pp 55–63
go back to reference Dubois D, Prade H (1990) The logical view of conditioning and its application to possibility and evidence theories. Int J Approx Reason 4:23–46MathSciNetMATHCrossRef Dubois D, Prade H (1990) The logical view of conditioning and its application to possibility and evidence theories. Int J Approx Reason 4:23–46MathSciNetMATHCrossRef
go back to reference Dubois D, Prade H (1993) On data summarization with fuzzy sets. In: Proceedings of the 5th international fuzzy systems association. World Congress (IFSA’93) Dubois D, Prade H (1993) On data summarization with fuzzy sets. In: Proceedings of the 5th international fuzzy systems association. World Congress (IFSA’93)
go back to reference Dubois D, Prade H (1998) Possibility theory: qualitative and quantitative aspects. In: Gabbay D, Smets P (eds) Handbook on defeasible reasoning and uncertainty management systems, vol 1, pp 169–226 Dubois D, Prade H (1998) Possibility theory: qualitative and quantitative aspects. In: Gabbay D, Smets P (eds) Handbook on defeasible reasoning and uncertainty management systems, vol 1, pp 169–226
go back to reference Dubois D, Prade H (2000) An overview of ordinal and numerical approaches to causal diagnostic problem solving. In: Gabbay DM, Kruse R (eds) Abductive reasoning and learning, handbooks of defeasible reasoning and uncertainty management systems, drums handbooks, vol 4, pp 231–280 Dubois D, Prade H (2000) An overview of ordinal and numerical approaches to causal diagnostic problem solving. In: Gabbay DM, Kruse R (eds) Abductive reasoning and learning, handbooks of defeasible reasoning and uncertainty management systems, drums handbooks, vol 4, pp 231–280
go back to reference Dubois D, Prade H (2009) Formal representations of uncertainty. In: Bouyssou D, Dubois D, Pirlot M, Prade H (eds) Decision-making—concepts and methods, pp 85–156 Dubois D, Prade H (2009) Formal representations of uncertainty. In: Bouyssou D, Dubois D, Pirlot M, Prade H (eds) Decision-making—concepts and methods, pp 85–156
go back to reference Dubois D, Prade H, Sandri S (1993) On possibility/probability transformations. Fuzzy Logic, pp 103–112 Dubois D, Prade H, Sandri S (1993) On possibility/probability transformations. Fuzzy Logic, pp 103–112
go back to reference Dubois D, Laurent F, Gilles M, Prade H (2004) Probability-possibility transformations, triangular fuzzy sets, and probabilistic inequalities. Reliable Comput 10:273–297MATHCrossRef Dubois D, Laurent F, Gilles M, Prade H (2004) Probability-possibility transformations, triangular fuzzy sets, and probabilistic inequalities. Reliable Comput 10:273–297MATHCrossRef
go back to reference Figueiredo M, Leitao JMN (1999) On fitting mixture models. In: Energy minimization methods in computer vision and pattern recognition, vol 1654, pp 732–749 Figueiredo M, Leitao JMN (1999) On fitting mixture models. In: Energy minimization methods in computer vision and pattern recognition, vol 1654, pp 732–749
go back to reference Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29:131–161MATHCrossRef Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29:131–161MATHCrossRef
go back to reference Geiger D, Heckerman D. (1994) Learning gaussian networks. Technical report, Microsoft Research, Advanced Technology Division Geiger D, Heckerman D. (1994) Learning gaussian networks. Technical report, Microsoft Research, Advanced Technology Division
go back to reference Grossman D, Dominigos P (2004) Learning Bayesian maximizing conditional likelihood. In: Proceedings on machine learning, pp 46–57 Grossman D, Dominigos P (2004) Learning Bayesian maximizing conditional likelihood. In: Proceedings on machine learning, pp 46–57
go back to reference Haouari B, Ben Amor N, Elouadi Z, Mellouli K (2009) Naive possibilistic network classifiers. Fuzzy Sets Syst 160(22):3224–3238MATHCrossRef Haouari B, Ben Amor N, Elouadi Z, Mellouli K (2009) Naive possibilistic network classifiers. Fuzzy Sets Syst 160(22):3224–3238MATHCrossRef
go back to reference Hüllermeier E (2003) Possibilistic instance-based learning. Artif Intell 148(1–2):335–383MATHCrossRef Hüllermeier E (2003) Possibilistic instance-based learning. Artif Intell 148(1–2):335–383MATHCrossRef
go back to reference Hüllermeier E (2005) Fuzzy methods in machine learning and data mining: status and prospects. Fuzzy Sets Syst 156(3):387–406CrossRef Hüllermeier E (2005) Fuzzy methods in machine learning and data mining: status and prospects. Fuzzy Sets Syst 156(3):387–406CrossRef
go back to reference Jenhani I, Ben Amor N, Elouedi Z (2008) Decision trees as possibilistic classifiers. Int J Approx Reason 48(3):784–807CrossRef Jenhani I, Ben Amor N, Elouedi Z (2008) Decision trees as possibilistic classifiers. Int J Approx Reason 48(3):784–807CrossRef
go back to reference John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the 11th conference on uncertainty in artificial intelligence John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the 11th conference on uncertainty in artificial intelligence
go back to reference Kononenko I (1991) Semi-naive bayesian classifier. In: Proceedings of the European working session on machine learning, pp 206–219 Kononenko I (1991) Semi-naive bayesian classifier. In: Proceedings of the European working session on machine learning, pp 206–219
go back to reference Kotsiantis SB (2007) Supervised machine learning: a review of classification techniques. Informatica 31:249–268MathSciNetMATH Kotsiantis SB (2007) Supervised machine learning: a review of classification techniques. Informatica 31:249–268MathSciNetMATH
go back to reference Langley P, Sage S (1994) Induction of selective bayesian classifiers. In: Proceedings of 10th conference on uncertainty in artificial intelligence (UAI-94), pp 399–406 Langley P, Sage S (1994) Induction of selective bayesian classifiers. In: Proceedings of 10th conference on uncertainty in artificial intelligence (UAI-94), pp 399–406
go back to reference Langley P, Iba W, Thompson K (1992) An analysis of bayesian classifiers. In: Proceedings of AAAI-92, vol 7, pp 223–228 Langley P, Iba W, Thompson K (1992) An analysis of bayesian classifiers. In: Proceedings of AAAI-92, vol 7, pp 223–228
go back to reference McLachlan GJ, Peel D (2000) Finite mixture models. Probability and mathematical statistics. Wiley, New York McLachlan GJ, Peel D (2000) Finite mixture models. Probability and mathematical statistics. Wiley, New York
go back to reference Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmman, San Francisco Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmman, San Francisco
go back to reference Pérez A, Larraoaga P, Inza I (2009) Bayesian classifiers based on kernel density estimation: flexible classifiers. Int J Approx Reason 50:341–362MATHCrossRef Pérez A, Larraoaga P, Inza I (2009) Bayesian classifiers based on kernel density estimation: flexible classifiers. Int J Approx Reason 50:341–362MATHCrossRef
go back to reference Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106 Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106
go back to reference Sahami M (1996) Learning limited dependence bayesian classifiers. In: Proceedings of the 2nd international conference on knowledge discovery and data mining, pp 335–338 Sahami M (1996) Learning limited dependence bayesian classifiers. In: Proceedings of the 2nd international conference on knowledge discovery and data mining, pp 335–338
go back to reference Shafer G (1976) A mathematical theory of evidence. Princeton University Press, Princeton Shafer G (1976) A mathematical theory of evidence. Princeton University Press, Princeton
go back to reference Strauss O, Comby F, Aldon MJ (2000) Rough histograms for robust statistics. In: Proceedings of international conference on pattern recognition (ICPR’00), vol II, Barcelona. IEEE Computer Society, pp 2684–2687 Strauss O, Comby F, Aldon MJ (2000) Rough histograms for robust statistics. In: Proceedings of international conference on pattern recognition (ICPR’00), vol II, Barcelona. IEEE Computer Society, pp 2684–2687
go back to reference Sudkamp T (2000) Similarity as a foundation for possibility. In: Proceedings of 9th IEEE international conference on fuzzy systems, San Antonio, pp 735–740 Sudkamp T (2000) Similarity as a foundation for possibility. In: Proceedings of 9th IEEE international conference on fuzzy systems, San Antonio, pp 735–740
go back to reference Yamada K (2001) Probability-possibility transformation based on evidence theory. In: Joint 9th IFSA World Congress and 20th NAFIPS international conference 2001, pp 70–75 Yamada K (2001) Probability-possibility transformation based on evidence theory. In: Joint 9th IFSA World Congress and 20th NAFIPS international conference 2001, pp 70–75
go back to reference Yang Y, Webb GI (2003) Discretization for naive-bayes learning: managing discretization bias and variance. Technical Report 2003-131 Yang Y, Webb GI (2003) Discretization for naive-bayes learning: managing discretization bias and variance. Technical Report 2003-131
go back to reference Zhang H (2004) The optimality of naive bayes. In: Proceedings of 17th international FLAIRS conference (FLAIRS2004) Zhang H (2004) The optimality of naive bayes. In: Proceedings of 17th international FLAIRS conference (FLAIRS2004)
Metadata
Title
Possibilistic classifiers for numerical data
Authors
Myriam Bounhas
Khaled Mellouli
Henri Prade
Mathieu Serrurier
Publication date
01-05-2013
Publisher
Springer-Verlag
Published in
Soft Computing / Issue 5/2013
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-012-0947-9

Other articles of this Issue 5/2013

Soft Computing 5/2013 Go to the issue

Premium Partner