Skip to main content
Erschienen in:
Buchtitelbild

2000 | OriginalPaper | Buchkapitel

A New Sampling Strategy for Building Decision Trees from Large Databases

verfasst von : J. H. Chauchat, R. Rakotomalala

Erschienen in: Data Analysis, Classification, and Related Methods

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

We propose a fast and efficient sampling strategy to build decision trees from a very large database, even when there are many numerical attributes which must be discretized at each step. Successive samples are used, one on each tree node. Applying the method to a simulated database (virtually infinite size) confirms that when the database is large and contains many numerical attributes, our strategy of fast sampling on each node (with sample size about n = 300 or 500) speeds up the mining process while maintaining the accuracy of the classifier.

Metadaten
Titel
A New Sampling Strategy for Building Decision Trees from Large Databases
verfasst von
J. H. Chauchat
R. Rakotomalala
Copyright-Jahr
2000
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-59789-3_32