Abstract
Many applied researchers of limited dependent variable models found it disadvantageous that a widely accepted Pseudo-R2 does not exist for this type of estimation. The paper provides guidance for researchers in choosing a Pseudo-R2 in the binary probit case. The starting point is that R2 is best understood in the ordinary least squares (OLS) case with continuous data, which is chosen as the reference situation. It is considered which Pseudo-R2 is best able to mimic the OLS-R2. The results are surprisingly clear: a measure suggested by McKelvey-Zavoina performs the best under our criterion. However, in the more likely case of low Pseudo-R2's, a normalization of a measure proposed by Aldrich-Nelson which we suggest is almost as good as the McKelvey-Zavoina, and is in general easier to calculate. We also show that if the underlying R2 is predicted using cubic regressions given the Pseudo-R2, all measures perform much better.
Similar content being viewed by others
References
Aldrich, J.H. and Nelson, F.D. (1984).Linear Probability, Logit, and Probit Models. Beverly Hills: Sage University Press.
Amemiya, T. (1981). Qualitative Response Models: A Survey.Journal of Economic Literature 19: 1483–1536.
Cragg, J.G. and Uhler, R. (1970). The Demand for Automobiles.Canadian Journal of Economics 3: 386–406.
Dhrymes, P.J. (1986). Limited Dependent Variables. In Z. Griliches and M.D. Intriligator (eds.),Handbook of Econometrics, III, Amsterdam: North-Holland.
Efron, B. (1978). Regression and ANOVA with Zero-One Data: Measures of Residual Variation.Journal of American Statistical Association 73: 113–121.
Fomby, T.B., Hill, R.C. and Johnson, S.R. (1984).Advanced Econometric Methods. New York: Springer-Verlag.
Goldberger, A.S. (1973). Correlations Between Binary Choices and Probabilistic Predictions.Journal of the American Statistical Association 68: 84.
Greene, W.H. (1990).Econometric Analysis New York: Macmillan Publishing Company.
Hauser, J.R. (1977). Testing the Accuracy, Usefulness and Significance of Probabilistic Choice Models: An Information Theoretic Approach.Operations Research 26: 406–421.
Judge, G.G., Griffiths, W.E., Hill, R.C., Lütkepohl, H. and Lee, T.-C. (1985).The Theory and Practice of Econometrics New York: John Wiley and Sons.
Lave, C.A. (1970). The Demand for Urban Mass Transportation.Review of Economics and Statistics 37: 320–323.
Maddala, G.S. (1983).Limited-Dependent and Qualitative Variables in Econometrics New York: Cambridge University Press.
McFadden, D. (1973). Conditional Logit Analysis of Qualitative Choice Behavior. In P. Zarembka (ed.),Frontiers in Econometrics. New York: Academic Press: 105–142.
McKelvey, R. and Zavoina, W. (1975). A Statistical Model for the Analysis of Ordinal Level Dependent Variables.Journal of Mathematical Sociology 4: 103–120.
Morrison, D.G. (1972). Upper Bounds for Correlation Between Binary Outcomes and Probabilistic Predictions.Journal of the American Statistical Association 67: 68–70.
Neter, J. and Maynes, E.S. (1970). Correlation Coefficient with a 0,1 Dependent Variable.Journal of the American Statistical Association 65: 501–509.
Veall, M.R. and Zimmermann, K.F. (1990). Evaluating Pseudo-R2's for Binary Probit Models.Discussion Paper No. 90-14, University of Munich.
Zimmermann, K.F. (1986). Multivariate Micro Prediction Analysis of Qualitative Models. In M.J. Beckmann, K.-W. Gaede, K. Ritter and H. Schneeweiss (eds.),Methods of Operations Research 53. Königstein-Taunus: Verlag Anton Hain: 689–696.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Veall, M.R., Zimmermann, K.F. Evaluating Pseudo-R2's for binary probit models. Qual Quant 28, 151–164 (1994). https://doi.org/10.1007/BF01102759
Issue Date:
DOI: https://doi.org/10.1007/BF01102759