Skip to main content
Top

2017 | OriginalPaper | Chapter

6. Distance and Density Based Approaches

Authors : Kishan G. Mehrotra, Chilukuri K. Mohan, HuaMing Huang

Published in: Anomaly Detection Principles and Algorithms

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In Chap. 3, we discussed distance based approaches for anomaly detection; however there the focus was to illustrate how distances can be measured and minor perturbation in proposed distance can change the outcome; illustrated by simple examples. In this chapter we consider anomaly detection techniques that depend on the distances and densities. The densities can be global or local to the point of concern.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The change in variance is also closely related to the change in the information content, e.g., the number of bits needed to describe \({\mathcal {D}} \setminus \{p\}\) vs. \(\mathcal {D}\).
 
2
Fan et al. [41] consider the closeness of a point based on each dimension separately. Thus, in their implementation, an observation p is close to another observation q if the difference between p and q is less than r in any dimension. The algorithm is easier to implement using this definition of closeness since it avoids inter-observation distance computations.
 
3
In the Syracuse region, a daily commuting distance characterized by a driving time of 25 min would be considered excessive; that same travel time would be considered to be low in the Los Angeles region. The definition of what constitutes “excessive driving time” must hence be a function of the distribution of driving times within the region of interest, rather than a constant over all regions.
 
Literature
7.
go back to reference F. Angiulli, C. Pizzuti, “Fast outlier detection in high dimensional spaces,” in Principles of Data Mining and Knowledge Discovery (Springer, New York, 2002), pp. 15–27CrossRefMATH F. Angiulli, C. Pizzuti, “Fast outlier detection in high dimensional spaces,” in Principles of Data Mining and Knowledge Discovery (Springer, New York, 2002), pp. 15–27CrossRefMATH
16.
go back to reference M.M. Breunig, H.-P. Kriegel, R.T. Ng, J. Sander, “LOF: Identifying density-based local outliers,” in Proceedings of the ACM SIGMOD International Conference on Management of Data (ACM, New York, 2000), pp. 93–104 M.M. Breunig, H.-P. Kriegel, R.T. Ng, J. Sander, “LOF: Identifying density-based local outliers,” in Proceedings of the ACM SIGMOD International Conference on Management of Data (ACM, New York, 2000), pp. 93–104
22.
go back to reference W. Chauvenet, A Manual of Spherical and Practical Astronomy, V. II. 1863. Reprint of 1891. 5th edn (Dover, New York, NY, 1960) W. Chauvenet, A Manual of Spherical and Practical Astronomy, V. II. 1863. Reprint of 1891. 5th edn (Dover, New York, NY, 1960)
31.
go back to reference F. Dellaert, “The expectation maximization algorithm” (2002) F. Dellaert, “The expectation maximization algorithm” (2002)
32.
go back to reference A.P. Dempster, N.M. Laird, D.B. Rubin, “Maximum likelihood from incomplete data via the em algorithm.” J. R. Stat. Soc. Ser. B (Methodological), 1–38 (1977) A.P. Dempster, N.M. Laird, D.B. Rubin, “Maximum likelihood from incomplete data via the em algorithm.” J. R. Stat. Soc. Ser. B (Methodological), 1–38 (1977)
38.
go back to reference A. Fabrizio, P. Clara, “Outlier mining in large high-dimensional data sets.” IEEE Trans. Knowledge Data Eng. 17(2), 203–215 (2005)CrossRef A. Fabrizio, P. Clara, “Outlier mining in large high-dimensional data sets.” IEEE Trans. Knowledge Data Eng. 17(2), 203–215 (2005)CrossRef
39.
go back to reference A. Fabrizio, B. Stefano, P. Clara, “Distance-based detection and prediction of outliers.” IEEE Trans. Knowledge Data Eng. 18(2), 145–160 (2006)CrossRef A. Fabrizio, B. Stefano, P. Clara, “Distance-based detection and prediction of outliers.” IEEE Trans. Knowledge Data Eng. 18(2), 145–160 (2006)CrossRef
41.
go back to reference H. Fan, O.R. Zaïane, A. Foss, J. Wu, “Resolution-based outlier factor: detecting the top-n most outlying data points in engineering data.” Knowledge Inform. Syst. 19(1), 31–51 (2009)CrossRef H. Fan, O.R. Zaïane, A. Foss, J. Wu, “Resolution-based outlier factor: detecting the top-n most outlying data points in engineering data.” Knowledge Inform. Syst. 19(1), 31–51 (2009)CrossRef
47.
go back to reference Z. Gao, “Application of cluster-based local outlier factor algorithm in anti-money laundering,” in International Conference on Management and Service Science, 2009. MASS’09 (IEEE, Washington, DC, 2009), pp. 1–4 Z. Gao, “Application of cluster-based local outlier factor algorithm in anti-money laundering,” in International Conference on Management and Service Science, 2009. MASS’09 (IEEE, Washington, DC, 2009), pp. 1–4
48.
go back to reference B. Gould, “On peirce’s criterion for the rejection of doubtful observations, with tables for facilitating its application.” Astronomical J. IV, 83 (1855) B. Gould, “On peirce’s criterion for the rejection of doubtful observations, with tables for facilitating its application.” Astronomical J. IV, 83 (1855)
49.
go back to reference F.E. Grubbs, “Procedures for detecting outlying observations in samples.” Technometrics 11(1), 1–21 (1969)CrossRef F.E. Grubbs, “Procedures for detecting outlying observations in samples.” Technometrics 11(1), 1–21 (1969)CrossRef
65.
go back to reference W. Jin, A.K.H. Tung, J. Han, W. Wang, “Ranking outliers using symmetric neighborhood relationship,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 577–593, 2006 W. Jin, A.K.H. Tung, J. Han, W. Wang, “Ranking outliers using symmetric neighborhood relationship,” in Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 577–593, 2006
75.
go back to reference E.M. Knorr, R.T. Ng, “A unified notion of outliers: Properties and computation,” in KDD, pp. 219–222, 1997 E.M. Knorr, R.T. Ng, “A unified notion of outliers: Properties and computation,” in KDD, pp. 219–222, 1997
76.
go back to reference E.M. Knorr, R.T. Ng, “Algorithms for mining distance-based outliers in large datasets,” in Proceeding VLDB ’98 Proceedings of the 24rd International Conference on Very Large Data Bases, pp. 392–403, 1998 E.M. Knorr, R.T. Ng, “Algorithms for mining distance-based outliers in large datasets,” in Proceeding VLDB ’98 Proceedings of the 24rd International Conference on Very Large Data Bases, pp. 392–403, 1998
92.
go back to reference S. Papadimitriou, H. Kitagawa, P.B. Gibbons, C. Faloutsos, “Loci: Fast outlier detection using the local correlation integral,” in Proceedings. 19th International Conference on Data Engineering, 2003 (IEEE, Washington, DC, 2003), pp. 315–326 S. Papadimitriou, H. Kitagawa, P.B. Gibbons, C. Faloutsos, “Loci: Fast outlier detection using the local correlation integral,” in Proceedings. 19th International Conference on Data Engineering, 2003 (IEEE, Washington, DC, 2003), pp. 315–326
93.
go back to reference B. Peirce, “Criterion for the rejection of doubtful observations.” Astron. J. II 45 (1852) B. Peirce, “Criterion for the rejection of doubtful observations.” Astron. J. II 45 (1852)
98.
go back to reference S. Ramaswamy, R. Rastogi, K. Shim, “Efficient algorithms for mining outliers from large data sets,” in ACM SIGMOD Record, vol. 29(2) (ACM, New York, 2000), pp. 427–438 S. Ramaswamy, R. Rastogi, K. Shim, “Efficient algorithms for mining outliers from large data sets,” in ACM SIGMOD Record, vol. 29(2) (ACM, New York, 2000), pp. 427–438
102.
go back to reference M.H. Safar, C. Shahabi, “Multidimensional index structures.” Shape Analysis and Retrieval of Multimedia Objects (Springer, New York, 2003), pp. 63–77 M.H. Safar, C. Shahabi, “Multidimensional index structures.” Shape Analysis and Retrieval of Multimedia Objects (Springer, New York, 2003), pp. 63–77
107.
go back to reference J. Tang, Z. Chen, A.W. Fu, D.W. Cheung, “Capabilities of outlier detection schemes in large datasets, framework and methodologies.” Knowl. Inform. Syst. 11(1), 45–84 (2006)CrossRef J. Tang, Z. Chen, A.W. Fu, D.W. Cheung, “Capabilities of outlier detection schemes in large datasets, framework and methodologies.” Knowl. Inform. Syst. 11(1), 45–84 (2006)CrossRef
108.
go back to reference J. Tang, Z. Chen, A.W. chee Fu, D.W. Cheung, “Enhancing effectiveness of outlier detections for low density patterns,” in Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 535–548, 2002 J. Tang, Z. Chen, A.W. chee Fu, D.W. Cheung, “Enhancing effectiveness of outlier detections for low density patterns,” in Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 535–548, 2002
121.
go back to reference Y. Zhang, S. Yang, Y. Wang, “LDBOD: A novel local distribution based outlier detector.” Pattern Recognit. Lett. 29(7), 967–976 (2008)CrossRef Y. Zhang, S. Yang, Y. Wang, “LDBOD: A novel local distribution based outlier detector.” Pattern Recognit. Lett. 29(7), 967–976 (2008)CrossRef
Metadata
Title
Distance and Density Based Approaches
Authors
Kishan G. Mehrotra
Chilukuri K. Mohan
HuaMing Huang
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-67526-8_6

Premium Partner