Skip to main content
Erschienen in: Neural Processing Letters 3/2023

01.09.2022

Two Outlier-Sensitive Measures for Semi-supervised Dynamic Ensemble Anomaly Detection Models

verfasst von: Shiyuan Fu, Xin Gao, Baofeng Li, Bing Xue, Xin Jia, Zijian Huang, Guangyao Zhang, Xu Huang

Erschienen in: Neural Processing Letters | Ausgabe 3/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Semi-supervised anomaly detection has received wide interest because of not requiring counterexamples during training. Existing competence measures for semi-supervised dynamic ensemble anomaly detection models do not consider the imbalance characteristic of training samples, which will result in serious overfitting on normal samples. This paper proposes two outlier-sensitive measures to estimate the competence of base classifiers for dynamic ensemble models. When a normal sample is correctly classified, both measures give a higher positive score to base classifiers with confidence closer to 0.5, which is different from the conventional idea that base classifiers with higher confidence should obtain higher scores. When a sample is misclassified, the Output-based Outlier-Sensitive measure calculates a negative score based on the confidence outputted by the base classifier, while the Cost-Sensitive-based Outlier-Sensitive measure gives a negative score based on the category of this sample. Multiple experiments are carried out on 30 datasets from public repositories under the unified framework proposed in this paper, and results show that dynamic ensemble models with our competence measures can outperform a number of typical ensemble models in terms of G-mean and F1, regardless of the pseudo outlier labeling methods and base classifier selection methods used in the model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Fußnoten
1
For other researchers can better reproduce our experimental result: During implementation, we find that one of the base one-class classifiers KNN_DD (see Sect. 4.2) from ’dd_tools’ cannot return a reasonable value on training samples. In such case, we use a conversion method offered by ’dd_tools’ itself, which can directly normalize outputs for validation samples and test instances, (only) on KNN_DD.
 
2
It should be noted that for experiments conducted in Sects. 4 and 5, the negative sign in Eq. 2 is discarded so that a higher EM score indicates that this base classifier is more competent. This small modification does not affect its performance but can reduce confusion.
 
Literatur
2.
Zurück zum Zitat Anbarasi MS, Ghaayathri S, Kamaleswari R, Abirami I (2011) Outlier detection for multidimensional medical data. Int J Comput Sci Inf Technol 2(1):512–516 Anbarasi MS, Ghaayathri S, Kamaleswari R, Abirami I (2011) Outlier detection for multidimensional medical data. Int J Comput Sci Inf Technol 2(1):512–516
6.
Zurück zum Zitat Cohen G, Sax H, Geissbuhler A (2008) Novelty detection using one-class Parzen density estimator. An application to surveillance of nosocomial infections. Stud Health Technol Inf 136:21–26 Cohen G, Sax H, Geissbuhler A (2008) Novelty detection using one-class Parzen density estimator. An application to surveillance of nosocomial infections. Stud Health Technol Inf 136:21–26
10.
Zurück zum Zitat Elkan C (2001) The foundations of cost-sensitive learning. In: Proceedings of the 17th international joint conference on Artificial intelligence, Lawrence Erlbaum Associates Ltd, pp 973–978 Elkan C (2001) The foundations of cost-sensitive learning. In: Proceedings of the 17th international joint conference on Artificial intelligence, Lawrence Erlbaum Associates Ltd, pp 973–978
Metadaten
Titel
Two Outlier-Sensitive Measures for Semi-supervised Dynamic Ensemble Anomaly Detection Models
verfasst von
Shiyuan Fu
Xin Gao
Baofeng Li
Bing Xue
Xin Jia
Zijian Huang
Guangyao Zhang
Xu Huang
Publikationsdatum
01.09.2022
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 3/2023
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-022-11017-y

Weitere Artikel der Ausgabe 3/2023

Neural Processing Letters 3/2023 Zur Ausgabe

Neuer Inhalt