Skip to main content

2015 | OriginalPaper | Buchkapitel

8. Supervised Neural Networks and Ensemble Methods

verfasst von : Francesco Camastra, Alessandro Vinciarelli

Erschienen in: Machine Learning for Audio, Image and Video Analysis

Verlag: Springer London

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

What the reader should know to understand this chapter \(\bullet \) Fundamentals of machine learning (Chap. 4). \(\bullet \) Statistics (Appendix A).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
The result can be obtained using the Stone-Weierstrass theorem [21] or the Hahn-Banach theorem [9].
 
2
The generalized linear discriminant functions can actually lead to nonlinear surfaces, but still they cannot approximate any possible decision boundary. See Sect. 8.5 for more details.
 
3
At the moment of writing this book, software and documentation can be downloaded at the following URL: www.torch.ch.
 
4
At the time this book is being written the package is available at the following URL: http://www.cis.hut.fi/research/som-research/nnrc-programs.shtml.
 
5
The expression majority vote means that the output of \(F_{\varSigma }(\mathbf {x})\) is the most frequent output of the single ensemble classifiers \(f_i(\mathbf {x})\).
 
6
Examples of unstable classifiers are the decision trees classifiers, which are not discussed in the book.
 
Literatur
1.
Zurück zum Zitat C. Bishop. Neural Networks for Pattern Recognition. Oxford University Press, 1996. C. Bishop. Neural Networks for Pattern Recognition. Oxford University Press, 1996.
2.
Zurück zum Zitat H. Bourlard and Y. Kamp. Auto-association by Multi-Layer Perceptron and Singular Value Decomposition. Biological Cybernetics, 59:291–294, 1988. H. Bourlard and Y. Kamp. Auto-association by Multi-Layer Perceptron and Singular Value Decomposition. Biological Cybernetics, 59:291–294, 1988.
3.
Zurück zum Zitat L. Breiman. Bagging predictors. Machine Learning, 24(2):123–140, 1996. L. Breiman. Bagging predictors. Machine Learning, 24(2):123–140, 1996.
4.
Zurück zum Zitat L. Breiman. Arcing classifiers. The Annals of Statistics, 26(3):801–824, 1998. L. Breiman. Arcing classifiers. The Annals of Statistics, 26(3):801–824, 1998.
5.
Zurück zum Zitat F. Camastra and A. Vinciarelli. Cursive character recognition by learning vector quantization. Pattern Recognition Letters, 22(6–7):625–629, 2001. F. Camastra and A. Vinciarelli. Cursive character recognition by learning vector quantization. Pattern Recognition Letters, 22(6–7):625–629, 2001.
6.
Zurück zum Zitat K.J. Cherkauer. Human expert-level performance on a scientific image analysis task by a system using combined Artificial Neural Networks. In Working Notes of the AAAI Workshop on Integrating Multiple Learned Models, pages 15–21, 1996. K.J. Cherkauer. Human expert-level performance on a scientific image analysis task by a system using combined Artificial Neural Networks. In Working Notes of the AAAI Workshop on Integrating Multiple Learned Models, pages 15–21, 1996.
7.
Zurück zum Zitat R. Collobert, S. Bengio, and J. Mariethoz. Torch: a modular machine learning software library. Technical Report IDIAP-RR-02-46, IDIAP Research Institute, 2002. R. Collobert, S. Bengio, and J. Mariethoz. Torch: a modular machine learning software library. Technical Report IDIAP-RR-02-46, IDIAP Research Institute, 2002.
8.
Zurück zum Zitat K. Crammer, R. Gilad-Bachrach, A. Navot, and N. Tishby. Margin analysis of the LVQ algorithm. In Advances in Neural Information Processing Systems, volume 14, pages 109–114, 2002. K. Crammer, R. Gilad-Bachrach, A. Navot, and N. Tishby. Margin analysis of the LVQ algorithm. In Advances in Neural Information Processing Systems, volume 14, pages 109–114, 2002.
9.
Zurück zum Zitat G. Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2:303–314, 1989. G. Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2:303–314, 1989.
10.
Zurück zum Zitat T. Dietterich. Ensemble methods in machine learning. In Proceedings of \(1^{st}\) International Workshop on Multiple Classifier Systems, pages 1–15, 2000. T. Dietterich. Ensemble methods in machine learning. In Proceedings of \(1^{st}\) International Workshop on Multiple Classifier Systems, pages 1–15, 2000.
11.
Zurück zum Zitat T.G. Dietterich. Ensemble learning. In M. Arbib, editor, The handbook of brain theory and neural networks. MIT Press, 2002. T.G. Dietterich. Ensemble learning. In M. Arbib, editor, The handbook of brain theory and neural networks. MIT Press, 2002.
12.
Zurück zum Zitat T.G. Dietterich and G. Bakiri. Solving multiclass learning problems via error-correcting output codes. Journal of Artificial Intelligence Research, 2:263–286, 1995. T.G. Dietterich and G. Bakiri. Solving multiclass learning problems via error-correcting output codes. Journal of Artificial Intelligence Research, 2:263–286, 1995.
13.
Zurück zum Zitat R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. John Wiley, 2001. R. O. Duda, P. E. Hart, and D. G. Stork. Pattern Classification. John Wiley, 2001.
14.
Zurück zum Zitat B. Efron and R. J. Tibshirani. An Introduction to the Bootstrap. Chapman and Hall, 1993. B. Efron and R. J. Tibshirani. An Introduction to the Bootstrap. Chapman and Hall, 1993.
15.
Zurück zum Zitat Y. Freund and R.E. Schapire. Experiments with a new boosting algorithm. In International Conference in Machine Learning, pages 138–146, 1996. Y. Freund and R.E. Schapire. Experiments with a new boosting algorithm. In International Conference in Machine Learning, pages 138–146, 1996.
16.
Zurück zum Zitat Y. Freund and R.E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119–139, 1997. Y. Freund and R.E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119–139, 1997.
17.
Zurück zum Zitat B. Hammer and T. Villmann. Generalized relevance learning vector quantization. Neural Networks, 15(8–9):1059–1068, 2002. B. Hammer and T. Villmann. Generalized relevance learning vector quantization. Neural Networks, 15(8–9):1059–1068, 2002.
18.
Zurück zum Zitat S. Haykin. Neural Networks: a comprehensive foundation. Prentice-Hall, 1998. S. Haykin. Neural Networks: a comprehensive foundation. Prentice-Hall, 1998.
19.
Zurück zum Zitat R. Hecht-Nielsen, editor. Neurocomputing. Addison-Wesley, 1990. R. Hecht-Nielsen, editor. Neurocomputing. Addison-Wesley, 1990.
20.
Zurück zum Zitat J. Hertz, A. Krogh, and R.G. Palmer, editors. Introduction to the Theory of Neural Computation. Addison-Wesley, 1991. J. Hertz, A. Krogh, and R.G. Palmer, editors. Introduction to the Theory of Neural Computation. Addison-Wesley, 1991.
21.
Zurück zum Zitat K. Hornik, M. Stinchcombe, and H. White. Multi-Layer feedforward networks are universal approximators. Neural Networks, 2(5):359–366, 1989. K. Hornik, M. Stinchcombe, and H. White. Multi-Layer feedforward networks are universal approximators. Neural Networks, 2(5):359–366, 1989.
22.
Zurück zum Zitat A.K. Jain, J. Mao, and K.M. Mohiuddin. Artificial neural networks: a tutorial. IEEE Computer, pages 31–44, 1996. A.K. Jain, J. Mao, and K.M. Mohiuddin. Artificial neural networks: a tutorial. IEEE Computer, pages 31–44, 1996.
23.
Zurück zum Zitat J. Kittler, M. Hatef, R.P.W. Duin, and J. Matas. On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(3):226–239, 1998. J. Kittler, M. Hatef, R.P.W. Duin, and J. Matas. On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(3):226–239, 1998.
24.
Zurück zum Zitat T. Kohonen. Self-Organizing Maps. Springer-Verlag, 1997. T. Kohonen. Self-Organizing Maps. Springer-Verlag, 1997.
25.
Zurück zum Zitat T. Kohonen, J. Hynninen, J. Kangas, J. Laaksonen, and K. Torkkola. Lvq\_pak: the Learning Vector Quantization program package. Technical Report A30, Helsinki University of Technology - Laboratory of Computer and Information Science, 1996. T. Kohonen, J. Hynninen, J. Kangas, J. Laaksonen, and K. Torkkola. Lvq\_pak: the Learning Vector Quantization program package. Technical Report A30, Helsinki University of Technology - Laboratory of Computer and Information Science, 1996.
26.
Zurück zum Zitat L. Kuncheva. Combining Pattern Classifiers. Wiley-Interscience, 2004. L. Kuncheva. Combining Pattern Classifiers. Wiley-Interscience, 2004.
27.
Zurück zum Zitat J.L. McClelland, G.E. Hinton, and D.E. Rumelhart. A general framework for parallel distributed processing. In J.L. McClelland and Rumelhart, editors, Parallel Distributed Processing, volume Vol. 1: Foundations, pages 45–76. MIT Press, 1986. J.L. McClelland, G.E. Hinton, and D.E. Rumelhart. A general framework for parallel distributed processing. In J.L. McClelland and Rumelhart, editors, Parallel Distributed Processing, volume Vol. 1: Foundations, pages 45–76. MIT Press, 1986.
28.
Zurück zum Zitat J.L. McClelland, D.E. Rumelhart, and G.E. Hinton. The appeal of a parallel distributed processing. In J.L. McClelland and Rumelhart, editors, Parallel Distributed Processing, volume Vol. 1: Foundations, pages 3–44. MIT Press, 1986. J.L. McClelland, D.E. Rumelhart, and G.E. Hinton. The appeal of a parallel distributed processing. In J.L. McClelland and Rumelhart, editors, Parallel Distributed Processing, volume Vol. 1: Foundations, pages 3–44. MIT Press, 1986.
29.
Zurück zum Zitat W.S. McCulloch and W. Pitts. A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics, 9:127–147, 1943. W.S. McCulloch and W. Pitts. A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics, 9:127–147, 1943.
30.
Zurück zum Zitat E. McDermott and S. Katagiri. Prototype-based minimum classification error/generalized probabilistic descent training for various speech units. Computer Speech and Languages, 8(4):351–368, 1994. E. McDermott and S. Katagiri. Prototype-based minimum classification error/generalized probabilistic descent training for various speech units. Computer Speech and Languages, 8(4):351–368, 1994.
31.
Zurück zum Zitat D.A. Medler. A brief history of connectionism. Neural Computing Surveys, 1:61–101, 1998. D.A. Medler. A brief history of connectionism. Neural Computing Surveys, 1:61–101, 1998.
32.
Zurück zum Zitat M.L. Minsky and S.A. Papert. Perceptrons. MIT Press, 1969. M.L. Minsky and S.A. Papert. Perceptrons. MIT Press, 1969.
33.
Zurück zum Zitat T. Mitchell. Machine Learning. McGraw-Hill, 1997. T. Mitchell. Machine Learning. McGraw-Hill, 1997.
34.
Zurück zum Zitat J. Nolte. The human brain: an introduction to its functional anatomy. Mosby, 2002. J. Nolte. The human brain: an introduction to its functional anatomy. Mosby, 2002.
35.
Zurück zum Zitat M. Oja, S. Karski, and T. Kohonen. Bibliography of self-organizing map papers: 1998–2001 addendum. Neural Computing Surveys, 3:1–156, 2002. M. Oja, S. Karski, and T. Kohonen. Bibliography of self-organizing map papers: 1998–2001 addendum. Neural Computing Surveys, 3:1–156, 2002.
36.
Zurück zum Zitat D. Opitz and R. Maclin. Popular ensemble methods: an empirical study. Journal of Artificial Intelligence Research, 11:169–198, 1999. D. Opitz and R. Maclin. Popular ensemble methods: an empirical study. Journal of Artificial Intelligence Research, 11:169–198, 1999.
37.
Zurück zum Zitat B. Parmanto, P.W. Munro, and H.R. Doyle. Improving committee diagnosis with resampling techniques. In Advances in Neural Information Processing Systems, volume 8, pages 882–888, 1996. B. Parmanto, P.W. Munro, and H.R. Doyle. Improving committee diagnosis with resampling techniques. In Advances in Neural Information Processing Systems, volume 8, pages 882–888, 1996.
38.
Zurück zum Zitat M.P. Perrone and L.N. Cooper. When networks disagree: ensemble methods for hybrid neural networks. In R.J. Mammone, editor, Artificial Neural Networks for speech and vision, pages 126–142. Chapman & Hall, 1993. M.P. Perrone and L.N. Cooper. When networks disagree: ensemble methods for hybrid neural networks. In R.J. Mammone, editor, Artificial Neural Networks for speech and vision, pages 126–142. Chapman & Hall, 1993.
39.
Zurück zum Zitat F. Rosenblatt. Principles of neurodynamics: perceptrons and the theory of brain mechanisms. Spartan, 1961. F. Rosenblatt. Principles of neurodynamics: perceptrons and the theory of brain mechanisms. Spartan, 1961.
40.
Zurück zum Zitat D.E. Rumelhart, G.E. Hinton, and R.J. Williams. Learning internal representations by error propagation. In J.L. McClelland and Rumelhart, editors, Parallel Distributed Processing, volume Vol. 1: Foundations, pages 318–362. MIT Press, 1986. D.E. Rumelhart, G.E. Hinton, and R.J. Williams. Learning internal representations by error propagation. In J.L. McClelland and Rumelhart, editors, Parallel Distributed Processing, volume Vol. 1: Foundations, pages 318–362. MIT Press, 1986.
41.
Zurück zum Zitat D.E. Rumelhart, G.E. Hinton, and R.J. Williams. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119–139, 1997. D.E. Rumelhart, G.E. Hinton, and R.J. Williams. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119–139, 1997.
42.
Zurück zum Zitat D.E. Rumelhart and J.L. McClelland, editors. Parallel Distributed Processing. MIT Press, 1986. D.E. Rumelhart and J.L. McClelland, editors. Parallel Distributed Processing. MIT Press, 1986.
43.
Zurück zum Zitat A. S. Sato and K. Yamada. Generalized learning vector quantization. In Advances in Neural Information Processing Systems, volume 7, pages 423–429, 1995. A. S. Sato and K. Yamada. Generalized learning vector quantization. In Advances in Neural Information Processing Systems, volume 7, pages 423–429, 1995.
44.
Zurück zum Zitat R. E. Schapire. The strength of weak learnability. Machine Learning, 5(2):197–227, 1990. R. E. Schapire. The strength of weak learnability. Machine Learning, 5(2):197–227, 1990.
45.
Zurück zum Zitat K. Tumer and J. Ghosh. Error correlation and error reduction in ensemble classifiers. Connection Science, 8(3–4):385–404, 1996. K. Tumer and J. Ghosh. Error correlation and error reduction in ensemble classifiers. Connection Science, 8(3–4):385–404, 1996.
46.
Zurück zum Zitat P. J. Werbos. Beyond regression: New tools for prediction and analysis in the behavioral sciences. Technical report, Harvard University, Ph.D. Dissertation, 1974. P. J. Werbos. Beyond regression: New tools for prediction and analysis in the behavioral sciences. Technical report, Harvard University, Ph.D. Dissertation, 1974.
47.
Zurück zum Zitat B. Widrow and M.E. Hoff. Adaptive switching circuits. In Convention Record of the Institute of Radio Engineers, Western Electronic Show and Convention, pages 96–104. Institute for Radio Engineers, 1960. B. Widrow and M.E. Hoff. Adaptive switching circuits. In Convention Record of the Institute of Radio Engineers, Western Electronic Show and Convention, pages 96–104. Institute for Radio Engineers, 1960.
Metadaten
Titel
Supervised Neural Networks and Ensemble Methods
verfasst von
Francesco Camastra
Alessandro Vinciarelli
Copyright-Jahr
2015
Verlag
Springer London
DOI
https://doi.org/10.1007/978-1-4471-6735-8_8

Premium Partner