Skip to main content
Log in

Computing location depth and regression depth in higher dimensions

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

The location depth (Tukey 1975) of a point θ relative to a p-dimensional data set Z of size n is defined as the smallest number of data points in a closed halfspace with boundary through θ. For bivariate data, it can be computed in O(nlogn) time (Rousseeuw and Ruts 1996). In this paper we construct an exact algorithm to compute the location depth in three dimensions in O(n2logn) time. We also give an approximate algorithm to compute the location depth in p dimensions in O(mp3+mpn) time, where m is the number of p-subsets used.

Recently, Rousseeuw and Hubert (1996) defined the depth of a regression fit. The depth of a hyperplane with coefficients (θ1,...,θp) is the smallest number of residuals that need to change sign to make (θ1,...,θp) a nonfit. For bivariate data (p=2) this depth can be computed in O(nlogn) time as well. We construct an algorithm to compute the regression depth of a plane relative to a three-dimensional data set in O(n2logn) time, and another that deals with p=4 in O(n3logn) time. For data sets with large n and/or p we propose an approximate algorithm that computes the depth of a regression fit in O(mp3+mpn+mnlogn) time. For all of these algorithms, actual implementations are made available.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Chambers, J. M. and Hastie, T. J. (1991) Statistical Models in S. Wadsworth, Belmont CA.

    Google Scholar 

  • Daudin, J. J., Duby, C. and Trecourt, P. (1988) Stability of principal components analysis studied by the bootstrap method. Statistics, 19, 241–258.

    Google Scholar 

  • Donoho, D. L. and Gasko, M. (1992) Breakdown properties of location estimates based on halfspace depth and projected outlyingness. The Annals of Statistics, 20, 1803–1827.

    Google Scholar 

  • Eddy, W. F. (1985) Ordering of multivariate data. In Computer Science and Statistics: Proceedings of the 16th Symposium on the Interface (L. Billard, ed.), North-Holland, Amsterdam, pp. 25–30.

    Google Scholar 

  • Liu, R. Y. (1995) Control charts for multivariate processes. Journal of the American Statistical Association, 90, 1380–1387.

    Google Scholar 

  • Liu, R. Y. and Singh, K. (1993) A quality index based on data depth and multivariate rank tests. Journal of the American Statistical Association, 88, 252–260.

    Google Scholar 

  • Liu, R. Y. and Singh, K. (1997) Notions of limiting P-values based on data depth and bootstrap. Journal of the American Statistical Association, 92, 266–277.

    Google Scholar 

  • Massé, J. and Theodorescu, R. (1994) Halfplane trimming for bivariate distributions. Journal of Multivariate Analysis, 48, 188–202.

    Google Scholar 

  • Rousseeuw, P. J. and Hubert, M. (1996) Regression depth. Submitted.

  • Rousseeuw, P. J. and Ruts, I. (1996) Algorithm AS 307: bivariate location depth. Applied Statistics (JRSS-C), 45, 516–526.

    Google Scholar 

  • Rousseeuw, P. J. and Ruts, I. (1998) Constructing the bivariate Tukey median. Statistica Sinica, to appear.

  • Ruiz, A. (1992) Estimation robuste d'une matrice de dispersion et projections révélatrices, Ph.D. Thesis, University of Toulouse, France.

    Google Scholar 

  • Ruts, I. and Rousseeuw, P. J. (1996) Computing depth contours of bivariate point clouds. Computational Statistics and Data Analysis, 23, 153–168.

    Google Scholar 

  • Tukey, J. W. (1975) Mathematics and the picturing of data. Proceedings of the International Congress of Mathematicians, Vancouver, 2, 523–531.

    Google Scholar 

  • Yeh, A. B. and Singh, K. (1997) Balanced confidence regions based on Tukey's depth and the bootstrap. Journal of the Royal Statistical Society Series B, 59, 639–652.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rousseeuw, P.J., Struyf, A. Computing location depth and regression depth in higher dimensions. Statistics and Computing 8, 193–203 (1998). https://doi.org/10.1023/A:1008945009397

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008945009397

Navigation