Skip to main content
Log in

Attrition bias in the Capitalia panel

  • Original papers
  • Published:
International Review of Economics Aims and scope Submit manuscript

Abstract

The Capitalia survey of manufacturing firms in Italy represents an important source of panel data on Italian firms. Panel attrition, however, represents a potential obstacle to such use of the sample. In this paper, sample entry and exit behaviour are studied, and a test for attrition bias is carried out in order to evaluate the potential for using panels constructed from the Capitalia survey. The analysis reveals the presence of distorting panel attrition effects in simple models of firm performance. In addition, the paper discusses both the implications of attrition bias for estimates and also briefly considers possible solutions. Finally, some suggestions are made as to how to reduce the impact of attrition bias through the provision of additional information on the nature of the attrition process available to the surveying institution at the moment of data collection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. Originally, the survey was undertaken under the aegis of Mediocredito Centrale. Mediocredito Centrale is a bank specialising in international finance and industrial finance. In 2002, Mediocredito Centrale was taken over by the Capitalia group—now the third largest banking group in Italy.

  2. Progetto di Ricerca di Interesse Nazionale.

  3. Details of the sample surveys can be found in Mediocredito Centrale (1994) and Capitalia (2002) as well as on: http://www.capitalia.it/pages/studi02b.htm.

  4. Although if one considers the size of the sample in relation to either employment or output, the figure is closer to 10% of the manufacturing total as is pointed out in the report on the eighth wave (Capitalia 2002).

  5. In practice, the first Capitalia sample considered here—the fifth wave covering the years 1989–1991—was based on the Chambers of Commerce database of small and medium sized enterprises constructed by CERVED, a company specialising in the management of business databases.

  6. Possibly the figure of over 100% for the 6th wave is due to the growth of firms between the date of the census (1991) and the date of the survey (1994). Another possibility is that in this wave, firms are included in both questionnaire and balance sheet datasets but are not identified as the same firm in each case.

  7. This essentially follows the treatment of sample selection problems in most standard texts dealing with micro-econometrics. See, for example, Cameron and Trivedi (2005), Greene (2003), or indeed the seminal treatment in Maddala (1983).

  8. For simplicity, we consider the case in which the outcome variable, Y, and the survival indicator, S, are determined by variables observed in the previous wave. In practice, it may be desirable in both cases to include variables from the current wave. In particular, in the case of the survival equation, (2), if S is dependent on choices made by the surveying institution at the time of the current wave, these should be included in the specification of Z. For further discussion of this point, see, for example, Rendtel (2002).

  9. Specifically, given the distributional assumptions above, E t |S t * > 0)=ρευσελ where ρευ is the correlation coefficient between ε and υ and λ is the inverse mills ratio. The estimation procedure then simply involves the inclusion of estimated values for E t |S t * > 0) as an additional explanatory variable in Eq. (1).

  10. This is one of the approaches considered by Heckman and Holtz (1989) in their comparison of methods for the evaluation of labour market programmes and is the one used by Bagella et al. (2004) in their analysis of the impact of investment and export subsidies on firm performance using the Capitalia sample. Another possible approach in these circumstances would be through the use of matching models (Rosenbaum and Rubin 1983). See Heckman et al. (1999) for an extensive review of approaches to the evaluation problem as it concerns labour market programmes.

  11. See, for example, the excellent analysis of Battistin et al. (2001). These authors go rather further inasmuch as they recognise that financial support for firms will improve the survival probability of inefficient as well as efficient firms. In most cases the stated aim of such policies is, or certainly should be, to help efficient firms overcome problems with liquidity constraints rather than prolong the survival artificially of inefficient ones. Their analysis is essentially concerned with how one can evaluate such policies in these terms.

  12. The precise methodology for the matching procedure varied between waves according to the available indicators. Specifically, firms were matched on the basis of a unique identifier included in the data (fifth and sixth waves), four indicators of labour costs and value added over time (sixth and seventh waves) and by social security number (seventh and eighth waves).

  13. Given the very high rate of panel attrition, it did not appear useful to construct panels with more than two waves. Appendix includes full descriptive statistics, however, it may be noted here that full information on all four waves is only available for 132 firms.

  14. However the share of firms located in the North decreases in the eight wave.

  15. LIQUIDITY is defined as net working capital /total current liabilities. In the fifth wave, since a variable indicating the net working capital was not available, we tried to construct this variable from the available information using other variables (e.g. short credits, stores in hand, etc.); however, the large number of missing values for the resulting variable led us not to use the liquidity index in the context of the fifth wave.

  16. In previous estimates the set of explanatory variables included also the investments made by the firms in the last 3 years; however, we did not find any significant evidence.

  17. See notes in Table 4 for a description of the performance indicators.

  18. In the fourth column, the results concern the probability that respondents included in the sixth wave were already present in the fifth wave; in column 5, the results concern the probability that respondents included in the seventh wave were already in the panel in the sixth wave; column 6 concerns waves seventh and eighth.

  19. Consequently Eqs. (5) and (6) are probit models.

  20. That is, the variable STATE INCENTIVES, as previously defined.

  21. That is, EXPORTING FIRM as previously defined.

  22. The interested reader is referred to Verbeek and Nijman (1992) for a more formal treatment of the problem. Note, however, that the test is based on cross-sections not panels so that the form of the equation estimated is slightly different from (1).

  23. More if one accepts the rather broad definition of statistical significance proposed here. Using 20% as a cut-off point for statistical significance is indeed a broader criterion than is conventional; however, we feel that the important concern here is with type II errors. Obviously, lowering the critical level of statistical significance increases the chances of rejecting the null hypothesis of no panel attrition bias when in fact it is true (type I error). By the same token, however, a lower cut-off point reduces the chances of committing a type II error, i.e. of rejecting the hypothesis of panel attrition bias when it is in fact present.

  24. For example, assuming selection on unobservables, this might be achieved by estimating a bivariate probit model of sample selection and panel attrition such as that employed by O’Higgins (1994).

References

  • Bagella M, Becchetti L, Londono Bedoya DA (2004) Investment and export subsidies in Italy: who gets them and what is their impact on investment and efficiency. Riv Politica Econ 3–4:61–92

    Google Scholar 

  • Battistin E, Gavosto A, Rettore E (2001) Why do subsidised firms survive longer? An evaluation of a program promoting youth entrepreneurship in Italy. In: Lechner M, Pfeiffer F (eds) Econometric evaluation of labour market policies, ZEW economic studies, vol. 13, Springer, Heidelberg, pp 153–181

    Google Scholar 

  • Cameron AC, Trivedi PK (2005) Microeconometrics: methods and applications. Cambridge University Press, Cambridge

    Google Scholar 

  • Capitalia (2002) Indagine sulle imprese manifatturiere: ottavo rapporto sull’industria e sulla politica industriale. Capitalia, Roma

    Google Scholar 

  • Dennis M J, Rick L (2003) Effects of panel attrition on survey estimates. Paper presented at the 58th annual American association for public opinion research conference, Nashville, TN, 15–18 May 2003

  • Dionne G, Gagné R, Vanasse C (1998) Inferring technological parameters from incomplete panel data. J Econ 87:303–327

    Google Scholar 

  • Fitzgerald J, Gottschalk P, Moffitt R (1998) An analysis of sample attrition in panel data: the Michigan panel study of income dynamics. J Hum Resour 33:251–299

    Article  Google Scholar 

  • Greene W (2003) Econometric analysis, 5th edn. Prentice Hall, Englewood Cliffs

    Google Scholar 

  • Hausman JA, Wise DA (1979) Attrition bias in experimental and panel data: the Gary income maintenance experiment. Econometrica 47:455–473

    Article  Google Scholar 

  • Heckman JJ (1979) Sample selection bias as specification error. Econometrica 47:153–161

    Article  Google Scholar 

  • Heckman JJ, Holtz J (1989) Choosing among alternative methods of estimating the impact of social programs: the case of manpower training. J Am Stat Assoc 84:862–874

    Article  Google Scholar 

  • Heckman JJ, Lalonde RJ, Smith JA (1999) The economics and econometrics of active labor market programs. In: Aschenfelter O, Card D (eds) Handbook of labor economics. vol. IIIA. Elsevier, Amsterdam, pp 1865–2097

    Chapter  Google Scholar 

  • Liao TF (2006) A method for analyzing categorical data with panel attrition. Methodology of Longitudinal Surveys Conference, University of Essex, 12–14 July 2006

  • Little RJA, Rubin DB (1987) Statistical analysis and missing data. Wiley, New York

    Google Scholar 

  • Kyriazidou E (1997) Estimation of a panel data sample selection model. Econometrica 65:1335–1364

    Article  Google Scholar 

  • Maddala GS (1983) Limited dependent and qualitative variables in econometrics. Cambridge University Press, Cambridge

    Google Scholar 

  • Mediocredito Centrale (1994) Indagine sulle imprese manifatturiere: quinto rapporto sull’industria italiana e sulla politica industriale. Il Sole 24 Ore, Milano

    Google Scholar 

  • O’Higgins N (1994) Employment and sample selection bias. Oxf Econ Pap 46:605–628

    Google Scholar 

  • Rendtel U (2002) Attrition in household panels: a survey. CHINTEX, Working paper no. 4, University of Frankfurt

  • Rosenbaum P, Rubin D (1983) The central role of the propensity score in observational studies for causal effects. Biometrika 70:41–55

    Article  Google Scholar 

  • Vella F, Verbeek M (1999) Two-step estimation of panel data models with censored endogenous variables and selection bias. J Econ 90:239–263

    Google Scholar 

  • Verbeek M, Nijman T (1992) Testing for selectivity bias in panel data models. Int Econ Rev 33:681–703

    Article  Google Scholar 

Download references

Acknowledgments

Research for this paper was carried out as part of the PRIN 2003 research project “Metodi e applicazioni per la valutazione delle politiche del lavoro e di aiuto alle imprese” for which financial support is gratefully acknowledged. Thanks are due to Maria Guerra who undertook the matching procedure on the data and to Sergio Lugaresi, Attilio Pasetto and Tony Riti from Capitalia, Eric Battistin, Bruno Contini, Sergio Destefanis, Steve Pudney, Ugo Trivellato and an anonymous referee for their helpful comments on a previous version of the paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Annamaria Nese.

Appendix : Descriptive statistics

Appendix : Descriptive statistics

Table 8, 9, 10

Table 8 Summary statistics fifth wave-cross section and panel data
Table 9 Summary statistics sixth wave-cross section and panel data
Table 10 Summary statistics seventh wave-cross section and panel data

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nese, A., O’Higgins, N. Attrition bias in the Capitalia panel. Int. Rev. Econ. 54, 383–403 (2007). https://doi.org/10.1007/s12232-007-0021-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12232-007-0021-6

Keywords

JEL Classifications

Navigation