Skip to main content

2015 | OriginalPaper | Buchkapitel

Comparative Analysis of Decision Tree Algorithms: ID3, C4.5 and Random Forest

verfasst von : Shiju Sathyadevan, Remya R. Nair

Erschienen in: Computational Intelligence in Data Mining - Volume 1

Verlag: Springer India

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

To analyze the raw data manually and find the correct information from it is a tough process. But Data mining technique automatically detect the relevant patterns or information from the raw data, using the data mining algorithms. In Data mining algorithms, Decision trees are the best and commonly used approach for representing the data. Using these Decision trees, data can be represented as a most visualizing form. Many different decision tree algorithms are used for the data mining technique. Each algorithm gives a unique decision tree from the input data. This paper focus on the comparison of different decision tree algorithms for data analysis.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Rokach, L., Maimon, O.: Decision trees. In Maimon, O.Z., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook vol. 6, pp. 165–192. Springer, Heidelberg (2005) Rokach, L., Maimon, O.: Decision trees. In Maimon, O.Z., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook vol. 6, pp. 165–192. Springer, Heidelberg (2005)
2.
Zurück zum Zitat Peng, X., Guo, H., Pang, J.: Performance analysis between different decision trees for uncertain data. In: Proceedings of the 2012 International Conference on Computer Science and Service System, CSSS ’12, pp. 574–577. Washington, DC (2012) Peng, X., Guo, H., Pang, J.: Performance analysis between different decision trees for uncertain data. In: Proceedings of the 2012 International Conference on Computer Science and Service System, CSSS ’12, pp. 574–577. Washington, DC (2012)
3.
Zurück zum Zitat Jin, C., De-lin, L., Fen-xiang, M.: An improved ID3 decision tree algorithm. In: Proceedings of 2009 4th International Conference on Computer Science and Education, vol. 1 (2009) Jin, C., De-lin, L., Fen-xiang, M.: An improved ID3 decision tree algorithm. In: Proceedings of 2009 4th International Conference on Computer Science and Education, vol. 1 (2009)
4.
Zurück zum Zitat Wang, M., Chai, R.: Improved classification attribute selection scheme for decision for decision tree. Comp. Eng. Appl. 3, 127–129 (2010) Wang, M., Chai, R.: Improved classification attribute selection scheme for decision for decision tree. Comp. Eng. Appl. 3, 127–129 (2010)
5.
Zurück zum Zitat Guan, C., Zeng, X.: An improved ID3 based on weighted modified information gain. In: Seventh International Conference on Computational Intelligence and Security, vol. 1, pp. 1283–1285. IEEE Computer Society, Washington, DC (2011) Guan, C., Zeng, X.: An improved ID3 based on weighted modified information gain. In: Seventh International Conference on Computational Intelligence and Security, vol. 1, pp. 1283–1285. IEEE Computer Society, Washington, DC (2011)
6.
Zurück zum Zitat Quinlan, J.R.: C4.5 program for machine learning, vol. 16, pp. 21–30. Morgan Kaufmann Publishers Inc., San Francisco (1993) Quinlan, J.R.: C4.5 program for machine learning, vol. 16, pp. 21–30. Morgan Kaufmann Publishers Inc., San Francisco (1993)
7.
Zurück zum Zitat Quinlan, J.R.: Improved use of continuous attributes inc 4.5. J. Artif. Intell. Res. 4, 77–90 (1996)MATH Quinlan, J.R.: Improved use of continuous attributes inc 4.5. J. Artif. Intell. Res. 4, 77–90 (1996)MATH
8.
Zurück zum Zitat de Vries, A.P., Roelleke, T.: Relevance information: a loss of entropy but a gain for idf? In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 282–289 (2005) de Vries, A.P., Roelleke, T.: Relevance information: a loss of entropy but a gain for idf? In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 282–289 (2005)
9.
Zurück zum Zitat Ruggieri, S.: Efficient C4.5. IEEE Trans. Knowl. Data Eng. 2, 438–444 (2002) Ruggieri, S.: Efficient C4.5. IEEE Trans. Knowl. Data Eng. 2, 438–444 (2002)
10.
Zurück zum Zitat Kotsiantis, S.: A hybrid decision tree classifier. J. Intell. Fuzzy Syst. 1, 327–336 (2014) Kotsiantis, S.: A hybrid decision tree classifier. J. Intell. Fuzzy Syst. 1, 327–336 (2014)
11.
Zurück zum Zitat Fern, A., Givan, R.: Online ensemble learning: an empirical study. Mach. Learn 53, 279–286 (2000) (Morgan Kaufmann Publishers Inc., San Francisco) Fern, A., Givan, R.: Online ensemble learning: an empirical study. Mach. Learn 53, 279–286 (2000) (Morgan Kaufmann Publishers Inc., San Francisco)
12.
Zurück zum Zitat Quinlan, J.R.: C4.5: Programs for machine learning, vol. 16, pp. 235–240. Morgan Kaufmann Publishers Inc., San Francisco, CA (1993) Quinlan, J.R.: C4.5: Programs for machine learning, vol. 16, pp. 235–240. Morgan Kaufmann Publishers Inc., San Francisco, CA (1993)
13.
Zurück zum Zitat Bache, K., Lichman, M.: UCI machine learning repository. Digit. Libr. 1 (2013) Bache, K., Lichman, M.: UCI machine learning repository. Digit. Libr. 1 (2013)
14.
Zurück zum Zitat Abele, J., Mantas, C.J.: Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring. Expert Syst. Appl. Int. J. 41, 3825–3830. Pergamon Press Inc., USA (2014) Abele, J., Mantas, C.J.: Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring. Expert Syst. Appl. Int. J. 41, 3825–3830. Pergamon Press Inc., USA (2014)
15.
Zurück zum Zitat Olson, D.L., Delen, D., Meng, Y.: Comparative analysis of data mining methods for bankruptcy prediction. Decis. Support Syst. 2, 464–473 (2012) Olson, D.L., Delen, D., Meng, Y.: Comparative analysis of data mining methods for bankruptcy prediction. Decis. Support Syst. 2, 464–473 (2012)
16.
Zurück zum Zitat Grzymala-Busse, J.W.: Rule induction. In: Maimon, O.Z., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, vol. 6, pp. 277–294. Springer, Heidelberg (2005) Grzymala-Busse, J.W.: Rule induction. In: Maimon, O.Z., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, vol. 6, pp. 277–294. Springer, Heidelberg (2005)
17.
Zurück zum Zitat Maimon, O., Rokach, L.: The Data Mining and Knowledge Discovery Handbook, vol. 2, pp. 1–17. Springer, Heidelerg (2005) Maimon, O., Rokach, L.: The Data Mining and Knowledge Discovery Handbook, vol. 2, pp. 1–17. Springer, Heidelerg (2005)
18.
Zurück zum Zitat Podgorelec, V., Kokol, P., Stiglic, B., Rozman, I.: Decision trees: an overview and their use in medicine. J. Med. Syst. 5, 445–463 (2002) Podgorelec, V., Kokol, P., Stiglic, B., Rozman, I.: Decision trees: an overview and their use in medicine. J. Med. Syst. 5, 445–463 (2002)
19.
Zurück zum Zitat Povalej, P., Kokol, P.: End user friendly data mining with decision trees: a reality or a wish? In: Proceedings of the 2007 Annual Conference on International Conference on Computer Engineering and Applications, vol. 3, pp. 35–40. Stevens Point, Wisconsin (2007) Povalej, P., Kokol, P.: End user friendly data mining with decision trees: a reality or a wish? In: Proceedings of the 2007 Annual Conference on International Conference on Computer Engineering and Applications, vol. 3, pp. 35–40. Stevens Point, Wisconsin (2007)
Metadaten
Titel
Comparative Analysis of Decision Tree Algorithms: ID3, C4.5 and Random Forest
verfasst von
Shiju Sathyadevan
Remya R. Nair
Copyright-Jahr
2015
Verlag
Springer India
DOI
https://doi.org/10.1007/978-81-322-2205-7_51

Premium Partner