2020 | OriginalPaper | Chapter

Improving Deep Unsupervised Anomaly Detection by Exploiting VAE Latent Space Distribution

Authors: Fabrizio Angiulli, Fabio Fassetti, Luca Ferragina

Published in: Discovery Science

Publisher: Springer International Publishing

Abstract

Anomaly detection methods exploiting autoencoders (AE) have shown good performance. Unfortunately, deep non-linear architectures are able to perform strong dimensionality reduction while keeping the reconstruction error low, which degrades the outlier detection performance of AEs. To alleviate this problem, some authors have recently proposed to exploit variational autoencoders (VAE), which arise as a variant of standard AEs designed for generative purposes. The key idea of VAEs is to take into account a regularization term constraining the organization of the latent space. However, VAEs share with standard AEs the problem that they generalize so well that they can also reconstruct anomalies well. In this work we argue that the approach of selecting the worst reconstructed examples as anomalies is too simplistic if a VAE architecture is employed. We show that outliers tend to lie in the sparsest regions of the combined latent/error space and propose a novel unsupervised anomaly detection algorithm, called VAEOut, that identifies outliers by performing density estimation in this augmented feature space. The proposed approach shows appreciable improvements in detection performance over the standard approach based on the reconstruction error.
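
The idea of scoring points by how sparse their neighbourhood is in the combined latent/error space can be illustrated with a minimal sketch. This is not the VAEOut algorithm itself: the abstract does not say which density estimator is used, so the mean distance to the k nearest neighbours stands in for it here as an assumption. The function name `augmented_space_scores`, the parameter `k`, the column standardisation, and the synthetic data are all hypothetical, and the latent codes and reconstruction errors are assumed to have been produced already by a trained VAE.

```python
import numpy as np


def augmented_space_scores(latent, recon_error, k=5):
    """Score points by sparsity in the combined latent/error space.

    latent      : (n, d) latent codes, assumed to come from a trained VAE encoder
    recon_error : (n,) per-sample reconstruction errors
    k           : number of neighbours (hypothetical parameter, not from the abstract)
    """
    latent = np.asarray(latent, dtype=float)
    recon_error = np.asarray(recon_error, dtype=float).reshape(-1, 1)
    # Augmented feature space: latent coordinates plus the reconstruction error.
    features = np.hstack([latent, recon_error])
    # Standardise each column so that no single coordinate dominates distances
    # (an assumption made for this sketch).
    std = features.std(axis=0)
    std[std == 0] = 1.0
    features = (features - features.mean(axis=0)) / std
    # Pairwise Euclidean distances (O(n^2) memory, acceptable for a sketch).
    diff = features[:, None, :] - features[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Sort each row; column 0 holds the zero distance of a point to itself.
    dist.sort(axis=1)
    # Mean distance to the k nearest neighbours: larger means a sparser region.
    return dist[:, 1:k + 1].mean(axis=1)


# Synthetic stand-ins for VAE outputs (illustrative data only).
rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 2))
recon_error = rng.normal(loc=1.0, scale=0.1, size=200)
# One point is reconstructed as well as the rest but is isolated in latent space.
latent[0] = [8.0, 8.0]

scores = augmented_space_scores(latent, recon_error, k=5)
top = int(np.argmax(scores))
```

In this toy setting the isolated point has an unremarkable reconstruction error, so ranking by reconstruction error alone would not necessarily single it out, while the sparsity score in the augmented space does.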

Metadata
Title
Improving Deep Unsupervised Anomaly Detection by Exploiting VAE Latent Space Distribution
Authors
Fabrizio Angiulli
Fabio Fassetti
Luca Ferragina
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-61527-7_39