Skip to main content
Erschienen in:
Buchtitelbild

2013 | OriginalPaper | Buchkapitel

Size and Power of Multivariate Outlier Detection Rules

verfasst von : Andrea Cerioli, Marco Riani, Francesca Torti

Erschienen in: Algorithms from and for Nature and Life

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Multivariate outliers are usually identified by means of robust distances. A statistically principled method for accurate outlier detection requires both availability of a good approximation to the finite-sample distribution of the robust distances and correction for the multiplicity implied by repeated testing of all the observations for outlyingness. These principles are not always met by the currently available methods. The goal of this paper is thus to provide data analysts with useful information about the practical behaviour of some popular competing techniques. Our conclusion is that the additional information provided by a data-driven level of trimming is an important bonus which ensures an often considerable gain in power.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
The Authors are grateful to Dr. Spyros Arsenis and Dr. Domenico Perrotta for pointing out this historical reference.
 
2
In the RRCOV packege of the R software this option is called eff.shape
 
Literatur
Zurück zum Zitat Atkinson, A. C., & Riani, M. (2000). Robust diagnostic regression analysis. New-York: Springer.MATHCrossRef Atkinson, A. C., & Riani, M. (2000). Robust diagnostic regression analysis. New-York: Springer.MATHCrossRef
Zurück zum Zitat Atkinson, A. C., Riani, M., & Cerioli, A. (2004). Exploring multivariate data with the forward search. New York: Springer.MATHCrossRef Atkinson, A. C., Riani, M., & Cerioli, A. (2004). Exploring multivariate data with the forward search. New York: Springer.MATHCrossRef
Zurück zum Zitat Cerioli, A. (2010). Multivariate outlier detection with high-breakdown estimators. Journal of the American Statistical Association, 105, 147–156.MathSciNetCrossRef Cerioli, A. (2010). Multivariate outlier detection with high-breakdown estimators. Journal of the American Statistical Association, 105, 147–156.MathSciNetCrossRef
Zurück zum Zitat Cerioli, A., & Farcomeni, A. (2011). Error rates for multivariate outlier detection. Computational Statistics and Data Analysis, 55, 544–553.MathSciNetMATHCrossRef Cerioli, A., & Farcomeni, A. (2011). Error rates for multivariate outlier detection. Computational Statistics and Data Analysis, 55, 544–553.MathSciNetMATHCrossRef
Zurück zum Zitat Cerioli, A., Riani, M., & Atkinson, A. C. (2009). Controlling the size of multivariate outlier tests with the MCD estimator of scatter. Statistics and Computing, 19, 341–353MathSciNetCrossRef Cerioli, A., Riani, M., & Atkinson, A. C. (2009). Controlling the size of multivariate outlier tests with the MCD estimator of scatter. Statistics and Computing, 19, 341–353MathSciNetCrossRef
Zurück zum Zitat Cerioli, A., Atkinson, A. C., & Riani, M. (2011a). Some perspectives on multivariate outlier detection. In S. Ingrassia, R. Rocci, & M. Vichi (Eds.), New perspectives in statistical modeling and data analysis (pp. 231–238). Berlin/Heidelberg: Springer.CrossRef Cerioli, A., Atkinson, A. C., & Riani, M. (2011a). Some perspectives on multivariate outlier detection. In S. Ingrassia, R. Rocci, & M. Vichi (Eds.), New perspectives in statistical modeling and data analysis (pp. 231–238). Berlin/Heidelberg: Springer.CrossRef
Zurück zum Zitat Cerioli, A., Riani, M., & Torti, F. (2011b). Accurate and powerful multivariate outlier detection. 58th congress of ISI, Dublin. Cerioli, A., Riani, M., & Torti, F. (2011b). Accurate and powerful multivariate outlier detection. 58th congress of ISI, Dublin.
Zurück zum Zitat Hadi, A. S., Rahmatullah Imon, A. H. M., & Werner, M. (2009). Detection of outliers. WIREs Computational Statistics, 1, 57–70.CrossRef Hadi, A. S., Rahmatullah Imon, A. H. M., & Werner, M. (2009). Detection of outliers. WIREs Computational Statistics, 1, 57–70.CrossRef
Zurück zum Zitat Hubert, M., Rousseeuw, P. J., & Van aelst, S. (2008). High-breakdown robust multivariate methods. Statistical Science, 23, 92–119. Hubert, M., Rousseeuw, P. J., & Van aelst, S. (2008). High-breakdown robust multivariate methods. Statistical Science, 23, 92–119.
Zurück zum Zitat Maronna, R. A., Martin, D. G., & Yohai, V. J. (2006). Robust statistics. New York: Wiley.MATHCrossRef Maronna, R. A., Martin, D. G., & Yohai, V. J. (2006). Robust statistics. New York: Wiley.MATHCrossRef
Zurück zum Zitat Morgenthaler, S. (2006). A survey of robust statistics. Statistical Methods and Applications, 15, 271–293 (Erratum 16, 171–172). Morgenthaler, S. (2006). A survey of robust statistics. Statistical Methods and Applications, 15, 271–293 (Erratum 16, 171–172).
Zurück zum Zitat Perrotta, D., Riani, M., & Torti, F. (2009). New robust dynamic plots for regression mixture detection. Advances in Data Analysis and Classification, 3, 263–279.MathSciNetCrossRef Perrotta, D., Riani, M., & Torti, F. (2009). New robust dynamic plots for regression mixture detection. Advances in Data Analysis and Classification, 3, 263–279.MathSciNetCrossRef
Zurück zum Zitat Pison, G., Van aelst, S., & Willems, G. (2002). Small sample corrections for LTS and MCD. Metrika, 55, 111–123. Pison, G., Van aelst, S., & Willems, G. (2002). Small sample corrections for LTS and MCD. Metrika, 55, 111–123.
Zurück zum Zitat Riani, M., Atkinson, A. C., & Cerioli, A. (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society B, 71, 447–466.MathSciNetMATHCrossRef Riani, M., Atkinson, A. C., & Cerioli, A. (2009). Finding an unknown number of multivariate outliers. Journal of the Royal Statistical Society B, 71, 447–466.MathSciNetMATHCrossRef
Zurück zum Zitat Riani, M., Torti, F., & Zani, S. (2011). Outliers and robustness for ordinal data. In R. S. Kennet & S. Salini (Eds.), Modern analysis of customer satisfaction surveys: with applications using R. Chichester: Wiley. Riani, M., Torti, F., & Zani, S. (2011). Outliers and robustness for ordinal data. In R. S. Kennet & S. Salini (Eds.), Modern analysis of customer satisfaction surveys: with applications using R. Chichester: Wiley.
Zurück zum Zitat Riani, M., Cerioli, A., & Torti, F. (2012). A new look at consistency factors and efficiency of robust scale estimators. Submitted. Riani, M., Cerioli, A., & Torti, F. (2012). A new look at consistency factors and efficiency of robust scale estimators. Submitted.
Zurück zum Zitat Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York: Wiley.MATHCrossRef Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York: Wiley.MATHCrossRef
Zurück zum Zitat Rousseeuw, P. J. & Van Driessen, K. (1999). A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41, 212–223.CrossRef Rousseeuw, P. J. & Van Driessen, K. (1999). A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41, 212–223.CrossRef
Zurück zum Zitat Salibian-barrera, M., Van Aelst, S., & Willems, G. (2006). Principal components analysis based on multivariate mm estimators with fast and robust bootstrap. Journal of the American Statistical Association, 101, 1198–1211.MathSciNetMATHCrossRef Salibian-barrera, M., Van Aelst, S., & Willems, G. (2006). Principal components analysis based on multivariate mm estimators with fast and robust bootstrap. Journal of the American Statistical Association, 101, 1198–1211.MathSciNetMATHCrossRef
Zurück zum Zitat Todorov, V., & Filzmoser, P. (2009). An object-oriented framework for robust multivariate analysis. Journal of Statistical Software, 32, 1–47. Todorov, V., & Filzmoser, P. (2009). An object-oriented framework for robust multivariate analysis. Journal of Statistical Software, 32, 1–47.
Metadaten
Titel
Size and Power of Multivariate Outlier Detection Rules
verfasst von
Andrea Cerioli
Marco Riani
Francesca Torti
Copyright-Jahr
2013
DOI
https://doi.org/10.1007/978-3-319-00035-0_1