Elsevier

Journal of Econometrics

Volume 157, Issue 2, August 2010, Pages 191-204
Journal of Econometrics

On the asymptotic optimality of the LIML estimator with possibly many instruments

https://doi.org/10.1016/j.jeconom.2009.12.001Get rights and content

Abstract

We consider the estimation of the coefficients of a linear structural equation in a simultaneous equation system when there are many instrumental variables. We derive some asymptotic properties of the limited information maximum likelihood (LIML) estimator when the number of instruments is large; some of these results are new as well as old, and we relate them to results in some recent studies. We have found that the variance of the limiting distribution of the LIML estimator and its modifications often attain the asymptotic lower bound when the number of instruments is large and the disturbance terms are not necessarily normally distributed, that is, for the micro-econometric models of some cases recently called many instruments and many weak instruments.

Introduction

Over the past three decades there has been increasing interest and research on the estimation of a single structural equation in a system of simultaneous equations when the number of instruments (the number of exogenous variables excluded from the structural equation), say K2, is large relative to the sample size, say n. The relevance of such models is due to the collection of large data sets and the development of computational equipment capable of analysis of such data sets. One empirical example of this kind often cited in econometric literature is Angrist and Krueger (1991); there has been some discussion by Bound et al. (1995) since then. Asymptotic distributions of estimators and test criteria are developed on the basis that both K2 and n. These asymptotic distributions are used as approximations to the distributions of the estimators and criteria when K2 and n are large.

Bekker (1994) has written “To my knowledge a first mention of such a parameter sequence was made, with respect to the linear functional relationship model, in Anderson (1976, p. 34). This work was extended to simultaneous equations by Kunitomo, 1980, Kunitomo, 1982 and Morimune (1983), who gave asymptotic expansions for the case of a single explanatory endogenous variable”. Following Bekker there have been many studies of the behavior of estimators of the coefficients of a single equation when K2 and n are large.

The main purpose of the present paper is to show that one estimator, the Limited Information Maximum Likelihood (LIML) estimator, has some optimum properties when K2 and n are large. As background we state and derive some asymptotic distributions of the LIML and Two-Stage Least Squares (TSLS) estimators as K2 and n. Some of these results are improvements on Kunitomo, 1981, Kunitomo, 1982, Morimune (1983) and Bekker (1994), Chao and Swanson (2005), van Hasselt (2006), Hansen et al. (2008), and they are presented in a unified notation.

In addition to the LIML and TSLS estimators there are other instrumental variables (IV) methods. See Anderson et al. (1982) on the earlier studies of the finite sample properties, for instance. Several semiparametric estimation methods have been developed including the generalized method of moments (GMM) estimation and the empirical likelihood (EL) method. (See Hayashi, 2000 for instance.) However, it has been recently recognized that the classical methods have some advantages in microeconometric situations with many instruments.

In this paper we shall give the results on the asymptotic properties of the LIML estimator when the number of instruments is large and we develop the large-K2 asymptotic theory or the many instruments asymptotics including the so-called case of many weak instruments. The TSLS and the GMM estimators are badly biased and they lose even consistency in some of these situations. Our results on the asymptotic properties and optimality of the LIML estimator and its variants give new interpretations of the numerical information of the finite sample properties and some guidance on the use of alternative estimation methods in simultaneous equations and micro-econometric models with many weak instruments. There is a growing amount of literature on the problem of many instruments in econometric models. We shall try to relate our results to some recent studies, including Donald and Newey (2001), Hahn (2002), Stock and Yogo (2005), Chao and Swanson, 2005, Chao and Swanson, 2006, van Hasselt (2006), van der Ploeg and Bekker (unpublished), Bekker and van der Ploeg (2005), Chioda and Jansson (unpublished), Hansen et al. (2008), and Anderson et al. (forthcoming).

In Section 2 we state the formulation of a linear structural model and the alternative estimation methods of unknown parameters with possibly many instruments. In Section 3 we develop the large-K2 asymptotics (or many instruments asymptotics) and give some results on the asymptotic normality of the LIML estimator when n and K2 are large. Then we shall present some results on the asymptotic optimality of the LIML estimator in the sense that it attains the lower bound of the asymptotic variance in a class of consistent estimators with many instruments under reasonable assumptions. Also we discuss a more general formulation of the models and the related problems. In Section 4 we show that the asymptotic results in Section 3 agree with the finite sample properties of estimators. Then brief concluding remarks will be given in Section 5. The proof of our theorems will be given in Section 6.

Section snippets

Alternative estimation methods in structural equation models with possibly many instruments

We first consider the estimation problem of a structural equation in the classical linear simultaneous equations framework.1 Let a single linear structural equation in an econometric model be y1i=β2y2i+γ1z1i+ui(i=1,,n), where y1i and y2i are a scalar and a vector of G2

Asymptotic normality of the LIML estimator

We state the limiting distribution of the LIML estimator under a set of alternative assumptions when K2n and Π2n can depend on n and n. We first consider the case when (I)K2nnc(0c<1),(II)1nΠ22(n)A22.1Π22(n)pΦ22.1, where Φ22.1 is a nonsingular constant matrix.

Condition (I) implies that the number of coefficient parameters is proportional to the number of observations. Because we want to estimate the covariance matrix of vi(n)(i=1,,n), we want c<1. Then (I) implies qn as n. Condition

Discussion of asymptotic properties and finite sample properties

It is important to investigate the finite sample properties of estimators partly because they are not necessarily similar to their asymptotic properties. One simple example would be the fact that the exact moments of some estimators do not necessarily exist. (In that case it is meaningless to compare the exact MSEs of alternative estimators and their Monte Carlo analogues.) Although we discuss the asymptotic properties of the LIML estimator, we need to investigate their relevance for practical

Concluding remarks

In this paper, we have discussed the asymptotic optimality when the number of instruments is large in a structural equation of the simultaneous equations system. Although the limited information maximum likelihood (LIML) estimator and the two stage-least squares (TSLS) estimator are asymptotically equivalent in the standard large sample theory, they are asymptotically quite different in the large-K2 asymptotics with many instruments or many weak instruments. In some recent microeconometric

Proof of theorems

In this section we give the proofs of Theorems and the mathematical derivation in Section 3.

Proof of Theorem 1

Substitution of (2.2) into (2.4) yields G=(ΠnZ+V)Z2.1A22.11Z2.1(ZΠn+V)=Π2nA22.1Π2n+VZ2.1A22.11Z2.1V+Π2nZ2.1V+VZ2.1Π2n. Then G[Π2nA22.1Π2n+K2nΩ]=Π2nZ2.1V+VZ2.1Π2n+[VZ2.1A22.11Z2.1VK2nΩ]. Condition (II) implies that as n1nΠ2nZ2.1VpO, and 1n[VZ2.1A22.11Z2.1VK2nΩ]pO. Then as n,1nGpG0=[β2IG2]Φ22.1(β2,IG2)+cΩ and 1qnHpΩ. Then βˆLIpβ and λnpc as n.

Define G1,H1,λ1n, and b1

References (37)

  • T.W. Anderson et al.

    The asymptotic properties of estimates of the parameters of a single equation in a complete system of stochastic equation

    Annals of Mathematical Statistics

    (1950)
  • J.D. Angrist et al.

    Does compulsory school attendance affect schooling and earnings

    Quarterly Journal of Economics

    (1991)
  • P.A. Bekker

    Alternative approximations to the distributions of instrumental variables estimators

    Econometrica

    (1994)
  • P.A. Bekker et al.

    Instrumental variables estimation based on group data

    Statistica Neerlandica

    (2005)
  • J. Bound et al.

    Problems with instrumental variables estimation when the correlation between the instruments and the endogenous explanatory variables is weak

    Journal of the American Statistical Association

    (1995)
  • Chao, J., Swanson, N., 2004. asymptotic distributions of JIVE in a heteroscedastic IV regression with many instruments,...
  • J. Chao et al.

    Consistent estimation with a large number of weak instruments

    Econometrica

    (2005)
  • J. Chao et al.

    Asymptotic normality of single-equation estimators for the case with a large number of instruments

  • Cited by (44)

    • Gender equality and central bank independence

      2023, Economic Analysis and Policy
    • Second-order refinements for t-ratios with many instruments

      2023, Journal of Econometrics
      Citation Excerpt :

      For example, the two stage least squares (TSLS) estimator tends to have large bias. Although the limited information maximum likelihood (LIML) estimator is less biased, its distribution is often more dispersed than the limiting distribution based on the conventional asymptotics (see, e.g., Anderson et al., 1982, 2010, 2011). We impose the following assumptions on the reduced form.

    • Robust estimation with many instruments

      2020, Journal of Econometrics
    • Minimum distance approach to inference with many instruments

      2018, Journal of Econometrics
      Citation Excerpt :

      Hahn (2002), Chamberlain (2007), Chioda and Jansson (2009), and Moreira (2009) focus on optimal inference with many instruments when the errors are normal and homoscedastic, and my optimality results build on theirs. Papers by Hansen et al. (2008), Anderson et al. (2010) and van Hasselt (2010) relax the normality assumption. Hausman et al. (2012), Chao et al. (2012), Chao et al. (2014) and Bekker and Crudu (2015) also allow for heteroscedasticity.

    View all citing articles on Scopus

    AKM09-11-23-2. This is a revised first part of Discussion Paper CIRJE-F-321 under the title “A New Light from Old Wisdoms: Alternative Estimation Methods of Simultaneous Equations and Microeconometric Models” (Graduate School of Economics, University of Tokyo, February 2005) which was presented at the Econometric Society World Congress 2005 at London (August 2005). We thank Yoichi Arai, the co-editor and the referees of this journal for some comments to the earlier versions.

    View full text