skip to main content
research-article
Free Access

A few useful things to know about machine learning

Published:01 October 2012Publication History
Skip Abstract Section

Abstract

Tapping into the "folk knowledge" needed to advance machine learning applications.

References

  1. Bauer, E. and Kohavi, R. An empirical comparison of voting classification algorithms: Bagging, boosting and variants. Machine Learning 36 (1999), 105--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bengio, Y. Learning deep architectures for AI. Foundations and Trends in Machine Learning 2, 1 (2009), 1--127. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Benjamini, Y. and Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B, 57 (1995), 289--300.Google ScholarGoogle ScholarCross RefCross Ref
  4. Bernardo, J.M. and Smith, A.F.M. Bayesian Theory. Wiley, NY, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  5. Blumer, A., Ehrenfeucht, A., Haussler, D. and Warmuth, M.K. Occam's razor. Information Processing Letters 24 (1987), 377--380.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cohen, W.W. Grammatically biased learning: Learning logic programs using an explicit antecedent description language. Artificial Intelligence 68 (1994), 303--366.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Domingos, P. The role of Occam's razor in knowledge discovery. Data Mining and Knowledge Discovery 3 (1999), 409--425. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Domingos, P. Bayesian averaging of classifiers and the overfitting problem. In Proceedings of the 17 th International Conference on Machine Learning (Stanford, CA, 2000), Morgan Kaufmann, San Mateo, CA, 223--230. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Domingos, P. A unified bias-variance decomposition and its applications. In Proceedings of the 17 th International Conference on Machine Learning (Stanford, CA, 2000), Morgan Kaufmann, San Mateo, CA, 231--238. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Domingos, P. and Pazzani, M. On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning 29 (1997), 103--130. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hulten, G. and Domingos, P. Mining complex models from arbitrarily large databases in constant time. In Proceedings of the 8 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Edmonton, Canada, 2002). ACM Press, NY, 525--531. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Kibler, D. and Langley, P. Machine learning as an experimental science. In Proceedings of the 3 rd European Working Session on Learning (London, UK, 1988). Pitman.Google ScholarGoogle Scholar
  13. Klockars, A.J. and Sax, G. Multiple Comparisons. Sage, Beverly Hills, CA, 1986.Google ScholarGoogle Scholar
  14. Kohavi, R., Longbotham, R., Sommerfield, D. and Henne, R. Controlled experiments on the Web: Survey and practical guide. Data Mining and Knowledge Discovery 18 (2009), 140--181. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C. and Byers, A. Big data: The next frontier for innovation, competition, and productivity. Technical report, McKinsey Global Institute, 2011.Google ScholarGoogle Scholar
  16. Mitchell, T.M. Machine Learning. McGraw-Hill, NY, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ng, A.Y. Preventing "overfitting" of cross-validation data. In Proceedings of the 14 th International Conference on Machine Learning (Nashville, TN, 1997). Morgan Kaufmann, San Mateo, CA, 245--253. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Pearl, J. On the connection between the complexity and credibility of inferred models. International Journal of General Systems 4 (1978), 255--264.Google ScholarGoogle ScholarCross RefCross Ref
  19. Pearl, J. Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge, UK, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Quinlan, J.R. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA, 1993.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Richardson, M. and P. Domingos. Markov logic networks. Machine Learning 62 (2006), 107--136. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Tenenbaum, J., Silva, V. and Langford, J. A global geometric framework for nonlinear dimensionality reduction. Science 290 (2000), 2319--2323.Google ScholarGoogle ScholarCross RefCross Ref
  23. Vapnik, V.N. The Nature of Statistical Learning Theory. Springer, NY, 1995. Google ScholarGoogle ScholarCross RefCross Ref
  24. Witten, I., Frank, E. and Hall, M. Data Mining: Practical Machine Learning Tools and Techniques, 3rd Edition. Morgan Kaufmann, San Mateo, CA, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Wolpert, D. The lack of a priori distinctions between learning algorithms. Neural Computation 8 (1996), 1341--1390. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A few useful things to know about machine learning

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Communications of the ACM
        Communications of the ACM  Volume 55, Issue 10
        October 2012
        101 pages
        ISSN:0001-0782
        EISSN:1557-7317
        DOI:10.1145/2347736
        Issue’s Table of Contents

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 October 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Popular
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format