skip to main content
10.1145/956750.956825acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Empirical comparisons of various voting methods in bagging

Published:24 August 2003Publication History

ABSTRACT

Finding effective methods for developing an ensemble of models has been an active research area of large-scale data mining in recent years. Models learned from data are often subject to some degree of uncertainty, for a variety of resoans. In classification, ensembles of models provide a useful means of averaging out error introduced by individual classifiers, hence reducing the generalization error of prediction.The plurality voting method is often chosen for bagging, because of its simplicity of implementation. However, the plurality approach to model reconciliation is ad-hoc. There are many other voting methods to choose from, including the anti-plurality method, the plurality method with elimination, the Borda count method, and Condorcet's method of pairwise comparisons. Any of these could lead to a better method for reconciliation.In this paper, we analyze the use of these voting methods in model reconciliation. We present empirical results comparing performance of these voting methods when applied in bagging. These results include some surprises, and among other things suggest that (1) plurality is not always the best voting method; (2) the number of classes can affect the performance of voting methods; and (3) the degree of dataset noise can affect the performance of voting methods. While it is premature to make final judgments about specific voting methods, the results of this work raise interesting questions, and they open the door to the application of voting theory in classification theory.

References

  1. K. Arrow. Social Choice and individual values (2nd Edition). Wiley, 1963.Google ScholarGoogle Scholar
  2. C. Blake and C. Merz. UCI repository of machine learning databases, 1998.Google ScholarGoogle Scholar
  3. L. Breiman, J. H. Friedan, R. A. Olshen, and C. J. Stone. Classification and regression trees. Wadsworth, 1984.Google ScholarGoogle Scholar
  4. L. Brieman. Bagging predictors. Technical Report 421, Department of Statistics, University of California at Berkeley, CA 94720, Department of Statistics, University of California at Berkeley, CA 94720, 1994.Google ScholarGoogle Scholar
  5. M.-J. Condorcet. Éssai sur l'application de l'analyse à la probabilité des décisions rendus à la pluralité des voix. Paris, 1785.Google ScholarGoogle Scholar
  6. J. de Boroda. Mémoire sur les élections au scrutin, historie de l'académie royale des sciences. Paris. 1781.Google ScholarGoogle Scholar
  7. P. C. Fishburn. Discrete mathematics in voting and group choice. SIAM J. ALG. DISC. METH., 5(2):pp. 263--275, June 1984.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Y. Freund and R. E. Schapire. A short introduction to boosting. Journal of Japanese Society for Artificial Intelligence, 14(5):pp. 771--780, September 1999.Google ScholarGoogle Scholar
  9. R. Gentleman. R project for statistical computing, 1997.Google ScholarGoogle Scholar

Index Terms

  1. Empirical comparisons of various voting methods in bagging

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2003
      736 pages
      ISBN:1581137370
      DOI:10.1145/956750

      Copyright © 2003 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 August 2003

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      KDD '03 Paper Acceptance Rate46of298submissions,15%Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader