Abstract
We describe an experimental study of pruning methods for decision tree classifiers when the goal is minimizing loss rather than error. In addition to two common methods for error minimization, CART's cost-complexity pruning and C4.5's error-based pruning, we study the extension of cost-complexity pruning to loss and a pruning variant based on the Laplace correction. We perform an empirical comparison of these methods and evaluate them with respect to loss. We found that applying the Laplace correction to estimate the probability distributions at the leaves was beneficial to all pruning methods. Unlike in error minimization, and somewhat surprisingly, performing no pruning led to results on par with the other methods under our evaluation criteria. The main advantage of pruning was the reduction in decision tree size, sometimes by a factor of ten. No method dominated the others on all datasets; even within a single domain, different pruning mechanisms were better for different loss matrices.
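To make the abstract's two key ingredients concrete, the following minimal sketch combines Laplace-corrected class-probability estimates at a leaf, p_k = (n_k + 1)/(N + K) for counts n_k summing to N over K classes, with label selection that minimizes expected loss under a misclassification-loss matrix. This is an illustrative assumption, not the authors' MLC++ implementation; the counts, loss matrix, and function names below are hypothetical.

```python
import numpy as np

def laplace_probs(counts):
    """Laplace-corrected class probabilities at a leaf:
    p_k = (n_k + 1) / (N + K), where N is the number of training
    examples reaching the leaf and K is the number of classes."""
    counts = np.asarray(counts, dtype=float)
    return (counts + 1.0) / (counts.sum() + len(counts))

def min_loss_label(counts, loss):
    """Pick the leaf label minimizing expected loss.
    loss[i][j] is the cost of predicting class j when the true
    class is i; the expected loss of predicting j is
    sum_i p_i * loss[i][j]."""
    p = laplace_probs(counts)
    expected = p @ np.asarray(loss, dtype=float)  # one entry per candidate label
    return int(np.argmin(expected)), expected

# Example: a leaf with 8 negatives and 2 positives, where a false
# negative costs 5x a false positive (hypothetical loss matrix).
counts = [8, 2]                  # [negatives, positives]
loss = [[0.0, 1.0],              # true class 0: predicting 1 costs 1
        [5.0, 0.0]]              # true class 1: predicting 0 costs 5
label, exp_loss = min_loss_label(counts, loss)
print(label, exp_loss)           # label 1, despite majority class 0
```

Here the error-minimizing label would be the majority class 0, but the loss matrix flips the decision to class 1. The Laplace correction matters most at small or pure leaves, where raw frequency estimates would otherwise assign zero probability to a costly class.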
References
Bradford, J. P., Kunz, C., Kohavi, R., Brunk, C. & Brodley, C. E. (1998), Pruning decision trees with misclassification costs (long). http://robotics.stanford.edu/~ronnyk/prune-long.ps.gz.
Breiman, L., Friedman, J. H., Olshen, R. A. & Stone, C. J. (1984), Classification and Regression Trees, Wadsworth International Group.
Cestnik, B. (1990), Estimating probabilities: A crucial task in machine learning, in L. C. Aiello, ed., ‘Proceedings of the Ninth European Conference on Artificial Intelligence', pp. 147–149.
Draper, B. A., Brodley, C. E. & Utgoff, P. E. (1994), ‘Goal-directed classification using linear machine decision trees', IEEE Transactions on Pattern Analysis and Machine Intelligence 16(9), 888–893.
Good, I. J. (1965), The Estimation of Probabilities: An Essay on Modern Bayesian Methods, M.I.T. Press.
Kohavi, R., Sommerfield, D. & Dougherty, J. (1996), Data mining using MLC++: A machine learning library in C++, in ‘Tools with Artificial Intelligence', IEEE Computer Society Press, pp. 234–245. http://www.sgi.com/Technology/mlc.
Merz, C. J. & Murphy, P. M. (1997), UCI repository of machine learning databases. http://www.ics.uci.edu/~mlearn/MLRepository.html.
Oates, T. & Jensen, D. (1997), The effects of training set size on decision tree complexity, in D. Fisher, ed., ‘Machine Learning: Proceedings of the Fourteenth International Conference', Morgan Kaufmann, pp. 254–262.
Pazzani, M., Merz, C., Murphy, P., Ali, K., Hume, T. & Brunk, C. (1994), Reducing misclassification costs, in ‘Machine Learning: Proceedings of the Eleventh International Conference', Morgan Kaufmann.
Quinlan, J. R. (1993), C4.5: Programs for Machine Learning, Morgan Kaufmann, San Mateo, California.
Turney, P. (1997), Cost-sensitive learning. http://ai.iit.nrc.ca/bibliographies/cost-sensitive.html.
About this paper
Cite this paper
Bradford, J.P., Kunz, C., Kohavi, R., Brunk, C., Brodley, C.E. (1998). Pruning decision trees with misclassification costs. In: Nédellec, C., Rouveirol, C. (eds) Machine Learning: ECML-98. ECML 1998. Lecture Notes in Computer Science, vol 1398. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026682
DOI: https://doi.org/10.1007/BFb0026682
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64417-0
Online ISBN: 978-3-540-69781-7