Top

Published in:

2020 | OriginalPaper | Chapter

Improving Deep Unsupervised Anomaly Detection by Exploiting VAE Latent Space Distribution

Authors : Fabrizio Angiulli, Fabio Fassetti, Luca Ferragina

Published in: Discovery Science

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Anomaly detection methods exploiting autoencoders (AE) have shown good performances. Unfortunately, deep non-linear architectures are able to perform high dimensionality reduction while keeping reconstruction error low, thus worsening outlier detecting performances of AEs. To alleviate the above problem, recently some authors have proposed to exploit Variational autoencoders (VAE), which arise as a variant of standard AEs designed for generative purposes. The key idea of VAEs is take into account a regularization term constraining the organization of the latent space. However, VAEs share with standard AEs the problem that they generalize so well that they can also well reconstruct anomalies. In this work we argue that the approach of selecting the worst reconstructed examples as anomalies is too simplistic if a VAE architecture is employed. We show that outliers tend to lie in the sparsest regions of the combined latent/error space and propose a novel unsupervised anomaly detection algorithm, called VAEOut, that identifies outliers by performing density estimation in this augmented feature space. The proposed approach shows sensible improvements in terms of detection performances over the standard approach based on the reconstruction error.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter FairNN - Conjoint Learning of Fair Representations for Fair Decisions

next chapter Detecting Temporal Anomalies in Business Processes Using Distance-Based Methods

http://yann.lecun.com/exdb/mnist/.

https://github.com/zalandoresearch/fashion-mnist.

Aggarwal, C.C.: Outlier Analysis. Springer, New York (2017). https://doi.org/10.1007/978-1-4614-6396-2CrossRefMATH

An, J., Cho, S.: Variational autoencoder based anomaly detection using reconstruction probability. Technical report, 3, SNU Data Mining Center (2015)

Angiulli, F.: Concentration free outlier detection. In: Ceci, M., Hollmén, J., Todorovski, L., Vens, C., Džeroski, S. (eds.) ECML PKDD 2017. LNCS (LNAI), vol. 10534, pp. 3–19. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-71249-9_1CrossRef

Angiulli, F.: On the behavior of intrinsically high-dimensional spaces: distances, direct and reverse nearest neighbors, and hubness. J. Mach. Learn. Res. 18, 170:1–170:60 (2018)

Angiulli, F.: CFOF: a concentration free measure for anomaly detection. ACM Trans. Knowl. Disc. Data (TKDD) 14(1), 4:1–4:53 (2020)

Angiulli, F., Basta, S., Pizzuti, C.: Distance-based detection and prediction of outliers. IEEE Trans. Knowl. Data Eng. 2(18), 145–160 (2006)CrossRef

Angiulli, F., Fassetti, F.: DOLPHIN: an efficient algorithm for mining distance-based outliers in very large datasets. ACM Trans. Knowl. Disc. Data (TKDD) 3(1), Article 4 (2009)

Angiulli, F., Pizzuti, C.: Fast outlier detection in large high-dimensional data sets. In: Proceedings of International Conference on Principles of Data Mining and Knowledge Discovery (PKDD), pp. 15–26 (2002)

Angiulli, F., Pizzuti, C.: Outlier mining in large high-dimensional data sets. IEEE Trans. Knowl. Data Eng. 2(17), 203–215 (2005)CrossRef

10.

Barnett, V., Lewis, T.: Outliers in Statistical Data. Wiley, Hoboken (1994)MATH

11.

Breunig, M.M., Kriegel, H., Ng, R., Sander, J.: LOF: identifying density-based local outliers. In: Proceedings of International Conference on Management of Data (SIGMOD) (2000)

12.

Chalapathy, R., Chawla, S.: Deep learning for anomaly detection: a survey (2019)

13.

Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. 41(3), 1–58 (2009)CrossRef

14.

Davies, L., Gather, U.: The identification of multiple outliers. J. Am. Stat. Assoc. 88, 782–792 (1993)MathSciNetCrossRef

15.

Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)MATH

16.

Hautamäki, V., Kärkkäinen, I., Fränti, P.: Outlier detection using k-nearest neighbour graph. In: International Conference on Pattern Recognition (ICPR), Cambridge, UK, 23–26 August 2004, pp. 430–433 (2004)

17.

Hawkins, S., He, H., Williams, G., Baxter, R.: Outlier detection using replicator neural networks. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2002. LNCS, vol. 2454, pp. 170–180. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-46145-0_17CrossRef

18.

Hecht-Nielsen, R.: Replicator neural networks for universal optimal source coding. Science 269(5232), 1860–1863 (1995)CrossRef

19.

Higgins, I., et al.: \(\beta \)-vae: learning basic visual concepts with a constrained variational framework. In: International Conference on Learning Representations (ICLR) (2017)

20.

Jin, W., Tung, A., Han, J.: Mining top-n local outliers in large databases. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) (2001)

21.

Kawachi, Y., Koizumi, Y., Harada, N.: Complementary set variational autoencoder for supervised anomaly detection. In: IEEE ICASSP, pp. 2366–2370 (2018)

22.

Kingma, D.P., Welling, M.: Auto-encoding variational bayes (2013)

23.

Knorr, E., Ng, R., Tucakov, V.: Distance-based outlier: algorithms and applications. VLDB J. 8(3–4), 237–253 (2000)CrossRef

24.

Kramer, M.A.: Nonlinear principal component analysis using autoassociative neural networks. AIChE J. 37(2), 233–243 (1991)CrossRef

25.

Kriegel, H.P., Schubert, M., Zimek, A.: Angle-based outlier detection in high-dimensional data. In: Proceedings of International Conference on Knowledge Discovery and Data Mining (KDD), pp. 444–452 (2008)

26.

Liu, F., Ting, K., Zhou, Z.H.: Isolation-based anomaly detection. ACM Trans. Knowl. Disc. Data (TKDD) 6(1), 1–39 (2012)CrossRef

27.

Radovanović, M., Nanopoulos, A., Ivanović, M.: Reverse nearest neighbors in unsupervised distance-based outlier detection. IEEE Trans. Knowl. Data Eng. 27(5), 1369–1382 (2015)CrossRef

28.

Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)CrossRef

29.

Sun, J., Wang, X., Xiong, N., Shao, J.: Learning sparse representation with variational auto-encoder for anomaly detection. IEEE Access 6, 33353–33361 (2018)CrossRef

30.

Tax, D.M.J., Duin, R.P.W.: Support vector data description. Mach. Learn. 54(1), 45–66 (2004)CrossRef

31.

Wiewel, F., Yang, B.: Continual learning for anomaly detection with variational autoencoder. In: IEEE ICASSP, pp. 3837–3841 (2019)

Title: Improving Deep Unsupervised Anomaly Detection by Exploiting VAE Latent Space Distribution
Authors: Fabrizio Angiulli
Fabio Fassetti
Luca Ferragina
Publisher: Springer International Publishing
Book: Discovery Science
Print ISBN: 978-3-030-61526-0

Electronic ISBN: 978-3-030-61527-7

Copyright Year: 2020
DOI: https://doi.org/10.1007/978-3-030-61527-7_39

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partner