Skip to main content
Top
Published in:
Cover of the book

2000 | OriginalPaper | Chapter

A New Sampling Strategy for Building Decision Trees from Large Databases

Authors : J. H. Chauchat, R. Rakotomalala

Published in: Data Analysis, Classification, and Related Methods

Publisher: Springer Berlin Heidelberg

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

We propose a fast and efficient sampling strategy to build decision trees from a very large database, even when there are many numerical attributes which must be discretized at each step. Successive samples are used, one on each tree node. Applying the method to a simulated database (virtually infinite size) confirms that when the database is large and contains many numerical attributes, our strategy of fast sampling on each node (with sample size about n = 300 or 500) speeds up the mining process while maintaining the accuracy of the classifier.

Metadata
Title
A New Sampling Strategy for Building Decision Trees from Large Databases
Authors
J. H. Chauchat
R. Rakotomalala
Copyright Year
2000
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-59789-3_32