Skip to main content

2017 | OriginalPaper | Buchkapitel

4. Model Complexity and Selection

verfasst von : Daniel Durstewitz

Erschienen in: Advanced Data Analysis in Neuroscience

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In Chap. 2 the bias-variance tradeoff was introduced and approaches to regulate model complexity by some parameter λ—but how to choose it? Here is a fundamental issue in statistical model fitting or parameter estimation: We usually only have available a comparatively small sample from a much larger population, but we really want to make statements about the population as a whole. Now, if we choose a sufficiently flexible model, e.g., a local or spline regression model with many parameters, we may always achieve a perfect fit to the training data, as we already saw in Chap. 2 (see Fig. 2.​5). The problem with this is that it might not say much about the true underlying population anymore as we may have mainly fitted noise—we have overfit the data, and consequently our model would generalize poorly to sets of new observations not used for fitting. As a note on the side, it is not only the nominal number of parameters relevant for this but also the functional form or flexibility of our model and constraints put on the parameters. For instance, of course we cannot accurately capture a nonlinear functional relationship with a (globally) linear model, regardless of how many parameters. Or, as noted before, in basis expansions and kernel approaches, the effective number of parameters may be much smaller as the variables are constrained by their functional relationships. This chapter, especially the following discussion and Sects. 4.1–4.4, largely develops along the exposition in Hastie et al. (2009; but see also the brief discussion in Bishop, 2006, from a slightly different angle).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Akaike, H.: Information theory and an extension of the maximum likelihood principle. In: Proceedings of the Second International Symposium on Information Theory, Budapest, pp. 267–281 (1973) Akaike, H.: Information theory and an extension of the maximum likelihood principle. In: Proceedings of the Second International Symposium on Information Theory, Budapest, pp. 267–281 (1973)
Zurück zum Zitat Allefeld, C., Haynes, J.D.: Searchlight-based multi-voxel pattern analysis of fMRI by cross-validated MANOVA. Neuroimage. 89, 345–357 (2014)CrossRef Allefeld, C., Haynes, J.D.: Searchlight-based multi-voxel pattern analysis of fMRI by cross-validated MANOVA. Neuroimage. 89, 345–357 (2014)CrossRef
Zurück zum Zitat Balaguer-Ballester, E., Lapish, C.C., Seamans, J.K., Daniel Durstewitz, D.: Attractor dynamics of cortical populations during memory-guided decision-making. PLoS Comput. Biol. 7, e1002057 (2011)CrossRef Balaguer-Ballester, E., Lapish, C.C., Seamans, J.K., Daniel Durstewitz, D.: Attractor dynamics of cortical populations during memory-guided decision-making. PLoS Comput. Biol. 7, e1002057 (2011)CrossRef
Zurück zum Zitat Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)MATH Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)MATH
Zurück zum Zitat Brusco, M.J., Stanley, D.: Exact and approximate algorithms for variable selection in linear discriminant analysis. Comput. Stat. Data Anal. 55, 123–131 (2011)MathSciNetCrossRefMATH Brusco, M.J., Stanley, D.: Exact and approximate algorithms for variable selection in linear discriminant analysis. Comput. Stat. Data Anal. 55, 123–131 (2011)MathSciNetCrossRefMATH
Zurück zum Zitat Demanuele, C., Bähner, F., Plichta, M.M., Kirsch, P., Tost, H., Meyer-Lindenberg, A., Durstewitz, D.: A statistical approach for segregating cognitive task stages from multivariate fMRI BOLD time series. Front. Human Neurosci. 9, 537 (2015a)CrossRef Demanuele, C., Bähner, F., Plichta, M.M., Kirsch, P., Tost, H., Meyer-Lindenberg, A., Durstewitz, D.: A statistical approach for segregating cognitive task stages from multivariate fMRI BOLD time series. Front. Human Neurosci. 9, 537 (2015a)CrossRef
Zurück zum Zitat Demanuele, C., Kirsch, P., Esslinger, C., Zink, M., Meyer-Lindenberg, A., Durstewitz, D.: Area-specific information processing in prefrontal cortex during a probabilistic inference task: a multivariate fMRI BOLD time series analysis. PLoS One. 10, e0135424 (2015b)CrossRef Demanuele, C., Kirsch, P., Esslinger, C., Zink, M., Meyer-Lindenberg, A., Durstewitz, D.: Area-specific information processing in prefrontal cortex during a probabilistic inference task: a multivariate fMRI BOLD time series analysis. PLoS One. 10, e0135424 (2015b)CrossRef
Zurück zum Zitat Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley, New York (1973)MATH Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley, New York (1973)MATH
Zurück zum Zitat Durstewitz, D., Vittoz, N.M., Floresco, S.B., Seamans, J.K.: Abrupt transitions between prefrontal neural ensemble states accompany behavioral transitions during rule learning. Neuron. 66, 438–448 (2010)CrossRef Durstewitz, D., Vittoz, N.M., Floresco, S.B., Seamans, J.K.: Abrupt transitions between prefrontal neural ensemble states accompany behavioral transitions during rule learning. Neuron. 66, 438–448 (2010)CrossRef
Zurück zum Zitat Efron, B.: Estimating the error rate of a prediction rule: some improvements on cross-validation. J. Am. Stat. Assoc. 78, 316–331 (1983)CrossRefMATH Efron, B.: Estimating the error rate of a prediction rule: some improvements on cross-validation. J. Am. Stat. Assoc. 78, 316–331 (1983)CrossRefMATH
Zurück zum Zitat Efron, B., Tibshirani, R.: Improvements on cross-validation: the 632+ bootstrap: method. J. Am. Stat. Assoc. 92, 548–560 (1997)MathSciNetMATH Efron, B., Tibshirani, R.: Improvements on cross-validation: the 632+ bootstrap: method. J. Am. Stat. Assoc. 92, 548–560 (1997)MathSciNetMATH
Zurück zum Zitat Fahrmeir, L., Tutz, G.: Multivariate Statistical Modelling Based on Generalized Linear Models. Springer, New York (2010)MATH Fahrmeir, L., Tutz, G.: Multivariate Statistical Modelling Based on Generalized Linear Models. Springer, New York (2010)MATH
Zurück zum Zitat Ferraty, F., van Keilegom, I., Vieu, P.: On the validity of the bootstrap in non-parametric functional regression. Scand. J. Stat. 37, 286–306 (2010a)MathSciNetCrossRefMATH Ferraty, F., van Keilegom, I., Vieu, P.: On the validity of the bootstrap in non-parametric functional regression. Scand. J. Stat. 37, 286–306 (2010a)MathSciNetCrossRefMATH
Zurück zum Zitat Ferraty, F., Hall, P., Vieu, P.: Most-predictive design points for functional data predictors. Biometrika. 97(4), 807–824 (2010b)MathSciNetCrossRefMATH Ferraty, F., Hall, P., Vieu, P.: Most-predictive design points for functional data predictors. Biometrika. 97(4), 807–824 (2010b)MathSciNetCrossRefMATH
Zurück zum Zitat Friedman, J.H.: On bias, variance, 0/1—loss, and the curse-of-dimensionality. Data Mining Knowl. Discov. 1, 55–77 (1997)CrossRef Friedman, J.H.: On bias, variance, 0/1—loss, and the curse-of-dimensionality. Data Mining Knowl. Discov. 1, 55–77 (1997)CrossRef
Zurück zum Zitat Friston, K.J., Harrison, L., Penny, W.: Dynamic causal modelling. Neuroimage. 19, 1273–1302 (2003)CrossRef Friston, K.J., Harrison, L., Penny, W.: Dynamic causal modelling. Neuroimage. 19, 1273–1302 (2003)CrossRef
Zurück zum Zitat Garg, G., Prasad, G., Coyle, D.: Gaussian Mixture Model-based noise reduction in resting state fMRI data. J. Neurosci. Methods. 215(1), 71–77 (2013)CrossRef Garg, G., Prasad, G., Coyle, D.: Gaussian Mixture Model-based noise reduction in resting state fMRI data. J. Neurosci. Methods. 215(1), 71–77 (2013)CrossRef
Zurück zum Zitat Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning (Vol. 2, No. 1) Springer, New York (2009) Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning (Vol. 2, No. 1) Springer, New York (2009)
Zurück zum Zitat Khamassi, M., Quilodran, R., Enel, P., Dominey, P.F., Procyk, E.: Behavioral regulation and the modulation of information coding in the lateral prefrontal and cingulate cortex. Cereb. Cortex. 25(9), 3197–3218 (2014)CrossRef Khamassi, M., Quilodran, R., Enel, P., Dominey, P.F., Procyk, E.: Behavioral regulation and the modulation of information coding in the lateral prefrontal and cingulate cortex. Cereb. Cortex. 25(9), 3197–3218 (2014)CrossRef
Zurück zum Zitat Knuth, K.H., Habeck, M., Malakar, N.K., Mubeen, A.M., Placek, B.: Bayesian evidence and model selection. Dig. Signal Process. 47, 50–67 (2015)MathSciNetCrossRef Knuth, K.H., Habeck, M., Malakar, N.K., Mubeen, A.M., Placek, B.: Bayesian evidence and model selection. Dig. Signal Process. 47, 50–67 (2015)MathSciNetCrossRef
Zurück zum Zitat Lapish, C.C., Durstewitz, D., Chandler, L.J., Seamans, J.K.: Successful choice behavior is associated with distinct and coherent network states in anterior cingulate cortex. Proc. Natl. Acad. Sci. U S A. 105, 11963–11968 (2008)CrossRef Lapish, C.C., Durstewitz, D., Chandler, L.J., Seamans, J.K.: Successful choice behavior is associated with distinct and coherent network states in anterior cingulate cortex. Proc. Natl. Acad. Sci. U S A. 105, 11963–11968 (2008)CrossRef
Zurück zum Zitat Penny, W.D.: Comparing dynamic causal models using AIC, BIC and free energy. Neuroimage. 59, 319–330 (2012)CrossRef Penny, W.D.: Comparing dynamic causal models using AIC, BIC and free energy. Neuroimage. 59, 319–330 (2012)CrossRef
Zurück zum Zitat Penny, W.D., Mattout, J., Trujillo-Barreto, N.: Chapter 35: Bayesian model selection and averaging. In: Friston, K., Ashburner, J., Kiebel, S., Nichols, T., Penny, W. (eds.) Statistical Parametric Mapping: The Analysis of Functional Brain Images. Elsevier, London (2006) Penny, W.D., Mattout, J., Trujillo-Barreto, N.: Chapter 35: Bayesian model selection and averaging. In: Friston, K., Ashburner, J., Kiebel, S., Nichols, T., Penny, W. (eds.) Statistical Parametric Mapping: The Analysis of Functional Brain Images. Elsevier, London (2006)
Zurück zum Zitat Stephan, K.E., Penny, W.D., Daunizeau, J., Moran, R.J., Friston, K.J.: Bayesian model selection for group studies. Neuroimage. 46, 1004–1017 (2009)CrossRef Stephan, K.E., Penny, W.D., Daunizeau, J., Moran, R.J., Friston, K.J.: Bayesian model selection for group studies. Neuroimage. 46, 1004–1017 (2009)CrossRef
Zurück zum Zitat Stone, M.: Cross-Validatory Choice and Assessment of Statistical Predictions. J. R. Stat. Soc. Ser. B. 36, 111–147 (1974)MathSciNetMATH Stone, M.: Cross-Validatory Choice and Assessment of Statistical Predictions. J. R. Stat. Soc. Ser. B. 36, 111–147 (1974)MathSciNetMATH
Zurück zum Zitat Vincent, T., Badillo, S., Risser, L., Chaari, L., Bakhous, C., Forbes, F., Ciuciu, P.: Flexible multivariate hemodynamics fMRI data analyses and simulations with PyHRF. Front. Neurosci. 8, 67 (2014)CrossRef Vincent, T., Badillo, S., Risser, L., Chaari, L., Bakhous, C., Forbes, F., Ciuciu, P.: Flexible multivariate hemodynamics fMRI data analyses and simulations with PyHRF. Front. Neurosci. 8, 67 (2014)CrossRef
Zurück zum Zitat Watanabe, T.: Disease prediction based on functional connectomes using a scalable and spatially-informed support vector machine. Neuroimage. 96, 183–202 (2014)CrossRef Watanabe, T.: Disease prediction based on functional connectomes using a scalable and spatially-informed support vector machine. Neuroimage. 96, 183–202 (2014)CrossRef
Zurück zum Zitat Witten, D.M., Tibshirani, R.: Covariance-regularized regression and classification for high dimensional problems. J. R. Stat. Soc. Ser. B (Statistical Methodology). 71, 615–636 (2009)MathSciNetCrossRefMATH Witten, D.M., Tibshirani, R.: Covariance-regularized regression and classification for high dimensional problems. J. R. Stat. Soc. Ser. B (Statistical Methodology). 71, 615–636 (2009)MathSciNetCrossRefMATH
Zurück zum Zitat Witten, D.M., Tibshirani, R.: Penalized classification using Fisher’s linear discriminant. J. R. Stat. Soc. Ser. B. 73, 753–772 (2011a)MathSciNetCrossRefMATH Witten, D.M., Tibshirani, R.: Penalized classification using Fisher’s linear discriminant. J. R. Stat. Soc. Ser. B. 73, 753–772 (2011a)MathSciNetCrossRefMATH
Zurück zum Zitat Young, G., Householder, A.S.: Discussion of a set of points in terms of their mutual distances. Psychometrika. 3, 19–22 (1938)CrossRefMATH Young, G., Householder, A.S.: Discussion of a set of points in terms of their mutual distances. Psychometrika. 3, 19–22 (1938)CrossRefMATH
Metadaten
Titel
Model Complexity and Selection
verfasst von
Daniel Durstewitz
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-59976-2_4