Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 2/2021

09-08-2020 | Original Article

Using multiple classifier behavior to develop a dynamic outlier ensemble

Authors: Ping Yuan, Biao Wang, Zhizhong Mao

Published in: International Journal of Machine Learning and Cybernetics | Issue 2/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Outlier ensembles that use more base detectors recently become an attractive approach to solving problems of single detectors. However, existing outlier ensembles often assume that base detectors make independent errors, which is difficult to satisfy in practical applications. To this end, this paper proposes a dynamic outlier ensemble to loose this error independence assumption. In our method, it is desired that the most competent base detector(s) can be singled out by the dynamic selection mechanism for each test pattern. The usage of the concept of multiple classifier behavior (MCB) has two purposes. One is to generate artificial outlier examples used for competence estimates. This strategy is different from other methods since we do not make any assumption regarding the data distribution. On the other hand, MCB is used to refine validation sets initialized by the K-nearest neighbors (KNN) rule. It is desired that objects in the refined validation sets are more representative than those found by KNN. With the refined validation sets, competences of all base detectors will be estimated by a probabilistic method, before which we have transformed outputs of base detectors into a probabilistic form. Finally, a switching mechanism that determines whether one detector should be nominated to make the decision or a fusion method should be applied instead is proposed in order to achieve a robust detection result. We carry out experiments on 20 benchmark data sets to verify the effectiveness of our detection method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
1.
go back to reference Aggarwal CC, Sathe S (2015) Theoretical foundations and algorithms for outlier ensembles. ACM SIGKDD Explor Newsl 17(1):24–47CrossRef Aggarwal CC, Sathe S (2015) Theoretical foundations and algorithms for outlier ensembles. ACM SIGKDD Explor Newsl 17(1):24–47CrossRef
3.
go back to reference Ando S, Thanomphongphan T, Seki Y, Suzuki E (2015) Ensemble anomaly detection from multi-resolution trajectory features. Data Min Knowl Disc 29(1):39–83MathSciNetCrossRef Ando S, Thanomphongphan T, Seki Y, Suzuki E (2015) Ensemble anomaly detection from multi-resolution trajectory features. Data Min Knowl Disc 29(1):39–83MathSciNetCrossRef
4.
go back to reference Campos GO, Zimek A, Sander J, Campello RJGB, Micenková B, Schubert E, Houle ME (2016) On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min Knowl Disc 30(4):891–927MathSciNetCrossRef Campos GO, Zimek A, Sander J, Campello RJGB, Micenková B, Schubert E, Houle ME (2016) On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min Knowl Disc 30(4):891–927MathSciNetCrossRef
5.
go back to reference Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: a survey. ACM Comput Surv 41(3):1–58CrossRef Chandola V, Banerjee A, Kumar V (2009) Anomaly detection: a survey. ACM Comput Surv 41(3):1–58CrossRef
6.
go back to reference Christou IT, Gekas G, Kyrikou A (2012) A classifier ensemble approach to the TV-viewer profile adaptation problem. Int J Mach Learn Cybern 3(4):313–326CrossRef Christou IT, Gekas G, Kyrikou A (2012) A classifier ensemble approach to the TV-viewer profile adaptation problem. Int J Mach Learn Cybern 3(4):313–326CrossRef
7.
go back to reference Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7(1):1–30MathSciNetMATH Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7(1):1–30MathSciNetMATH
8.
go back to reference Fan W, Miller M, Miller M (2001) Using artificial anomalies to detect unknown and known network intrusions. In: Paper presented at the International Conference on Data Mining Fan W, Miller M, Miller M (2001) Using artificial anomalies to detect unknown and known network intrusions. In: Paper presented at the International Conference on Data Mining
9.
go back to reference Fernandez A, Garcia S, Jesus MJD, Herrera F (2008) A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets. Fuzzy Sets Syst 159(18):2378–2398MathSciNetCrossRef Fernandez A, Garcia S, Jesus MJD, Herrera F (2008) A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets. Fuzzy Sets Syst 159(18):2378–2398MathSciNetCrossRef
10.
go back to reference Gao J, Tan PN (2006) Converting output scores from outlier detection algorithms into probability estimates. In: Paper presented at the Sixth International Conference on Data Mining Gao J, Tan PN (2006) Converting output scores from outlier detection algorithms into probability estimates. In: Paper presented at the Sixth International Conference on Data Mining
11.
go back to reference Giacinto G, Roli F (1999) Methods for dynamic classifier selection. In: Paper presented at the Proceedings of 10th International Conference on Image Analysis and Processing Giacinto G, Roli F (1999) Methods for dynamic classifier selection. In: Paper presented at the Proceedings of 10th International Conference on Image Analysis and Processing
12.
go back to reference Giacinto G, Roli F (2001) Dynamic classifier selection based on multiple classifier behaviour. Pattern Recogn 34(9):1879–1881CrossRef Giacinto G, Roli F (2001) Dynamic classifier selection based on multiple classifier behaviour. Pattern Recogn 34(9):1879–1881CrossRef
13.
go back to reference Hempstalk K, Frank E, Witten IH (2008) One-class classification by combining density and class probability estimation. In: Paper presented at the Joint European Conference on Machine Learning and Knowledge Discovery in Databases Hempstalk K, Frank E, Witten IH (2008) One-class classification by combining density and class probability estimation. In: Paper presented at the Joint European Conference on Machine Learning and Knowledge Discovery in Databases
14.
go back to reference Huang J, Ling CX (2005) Using AUC and accuracy in evaluating learning algorithms. IEEE Trans Knowl Data Eng 17(3):299–310CrossRef Huang J, Ling CX (2005) Using AUC and accuracy in evaluating learning algorithms. IEEE Trans Knowl Data Eng 17(3):299–310CrossRef
15.
go back to reference Huang YS, Suen CY (1995) A method of combining multiple experts for the recognition of unconstrained handwritten numerals. Pattern Anal Mach Intell IEEE Trans 17(1):90–94CrossRef Huang YS, Suen CY (1995) A method of combining multiple experts for the recognition of unconstrained handwritten numerals. Pattern Anal Mach Intell IEEE Trans 17(1):90–94CrossRef
16.
go back to reference Iman RL, Davenport JM (1979) Approximations of the critical region of the Friedman statistic. Commun Stat 9(6):571–595CrossRef Iman RL, Davenport JM (1979) Approximations of the critical region of the Friedman statistic. Commun Stat 9(6):571–595CrossRef
17.
go back to reference Krawczyk B (2015) One-class classifier ensemble pruning and weighting with firefly algorithm. Neurocomputing 150(150):490–500CrossRef Krawczyk B (2015) One-class classifier ensemble pruning and weighting with firefly algorithm. Neurocomputing 150(150):490–500CrossRef
18.
go back to reference Krawczyk B (2016) Dynamic classifier selection for one-class classification. Knowl-Based Syst 107:43–53CrossRef Krawczyk B (2016) Dynamic classifier selection for one-class classification. Knowl-Based Syst 107:43–53CrossRef
19.
go back to reference Krawczyk B, Woźniak M, Cyganek B (2014) Clustering-based ensembles for one-class classification. Inf Sci 264(6):182–195MathSciNetCrossRef Krawczyk B, Woźniak M, Cyganek B (2014) Clustering-based ensembles for one-class classification. Inf Sci 264(6):182–195MathSciNetCrossRef
20.
go back to reference Kuncheva LI (2002) Switching between selection and fusion in combining classifiers: an experiment. IEEE Trans Syst Man Cybern Part B: Cybern 32(2):146–156CrossRef Kuncheva LI (2002) Switching between selection and fusion in combining classifiers: an experiment. IEEE Trans Syst Man Cybern Part B: Cybern 32(2):146–156CrossRef
21.
go back to reference Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51(2):181–207CrossRef Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51(2):181–207CrossRef
22.
go back to reference Oliveira DVR, Cavalcanti GDC, Sabourin R (2017) Online pruning of base classifiers for dynamic ensemble selection. Pattern Recogn 72:44–58CrossRef Oliveira DVR, Cavalcanti GDC, Sabourin R (2017) Online pruning of base classifiers for dynamic ensemble selection. Pattern Recogn 72:44–58CrossRef
23.
go back to reference Parhizkar E, Abadi M (2015) BeeOWA: a novel approach based on ABC algorithm and induced OWA operators for constructing one-class classifier ensembles. Neurocomputing 166:367–381CrossRef Parhizkar E, Abadi M (2015) BeeOWA: a novel approach based on ABC algorithm and induced OWA operators for constructing one-class classifier ensembles. Neurocomputing 166:367–381CrossRef
24.
go back to reference Rätsch G, Mika S, Schölkopf B, Müller K-R (2002) Constructing boosting algorithms from SVMs: An application to one-class classification. IEEE Trans Pattern Anal Mach Intell 9:1184–1199CrossRef Rätsch G, Mika S, Schölkopf B, Müller K-R (2002) Constructing boosting algorithms from SVMs: An application to one-class classification. IEEE Trans Pattern Anal Mach Intell 9:1184–1199CrossRef
25.
go back to reference Rayana S, Zhong W, Akoglu L (2016) Sequential ensemble learning for outlier detection: A bias-variance perspective. In: Paper presented at the 2016 IEEE 16th International Conference on Data Mining (ICDM) Rayana S, Zhong W, Akoglu L (2016) Sequential ensemble learning for outlier detection: A bias-variance perspective. In: Paper presented at the 2016 IEEE 16th International Conference on Data Mining (ICDM)
26.
go back to reference Salehi M, Zhang X, Bezdek JC, Leckie C (2016). Smart sampling: a novel unsupervised boosting approach for outlier detection. In: Paper presented at the Australasian Joint Conference on Artificial Intelligence Salehi M, Zhang X, Bezdek JC, Leckie C (2016). Smart sampling: a novel unsupervised boosting approach for outlier detection. In: Paper presented at the Australasian Joint Conference on Artificial Intelligence
27.
go back to reference Sun Y, Wong AKC, Kamel MS (2009) Classification of imbalanced data: a review. Int J Pattern Recognit Artif Intell 23(04):687–719CrossRef Sun Y, Wong AKC, Kamel MS (2009) Classification of imbalanced data: a review. Int J Pattern Recognit Artif Intell 23(04):687–719CrossRef
28.
go back to reference Tax DM, Duin RP (2001) Uniform object generation for optimizing one-class classifiers. J Mach Learn Res 2:155–173MATH Tax DM, Duin RP (2001) Uniform object generation for optimizing one-class classifiers. J Mach Learn Res 2:155–173MATH
29.
go back to reference Tax DMJ, Breukelen MV, Duin RPW, Kittler J (2000) Combining multiple classifiers by averaging or by multiplying? Pattern Recogn 33(9):1475–1485CrossRef Tax DMJ, Breukelen MV, Duin RPW, Kittler J (2000) Combining multiple classifiers by averaging or by multiplying? Pattern Recogn 33(9):1475–1485CrossRef
30.
go back to reference Tax DMJ, Duin RPW (2001) Combining One-Class Classifiers. In: Paper presented at the International Workshop on Multiple Classifier Systems Tax DMJ, Duin RPW (2001) Combining One-Class Classifiers. In: Paper presented at the International Workshop on Multiple Classifier Systems
31.
go back to reference Wang B, Mao Z (2017) One-class classifiers ensemble based anomaly detection scheme for process control systems. Trans Inst Measur Control 40(12):3466–3476CrossRef Wang B, Mao Z (2017) One-class classifiers ensemble based anomaly detection scheme for process control systems. Trans Inst Measur Control 40(12):3466–3476CrossRef
32.
go back to reference Wang B, Mao Z (2019) Outlier detection based on a dynamic ensemble model: applied to process monitoring. Inf Fusion 51:244–258CrossRef Wang B, Mao Z (2019) Outlier detection based on a dynamic ensemble model: applied to process monitoring. Inf Fusion 51:244–258CrossRef
33.
go back to reference Wang B, Mao Z, Huang K (2017) Detecting outliers in complex nonlinear systems controlled by predictive control strategy. Chaos Solitons Fract 103:588–595CrossRef Wang B, Mao Z, Huang K (2017) Detecting outliers in complex nonlinear systems controlled by predictive control strategy. Chaos Solitons Fract 103:588–595CrossRef
34.
go back to reference Zhang H, Gang L, Chow TWS, Wenyin L (2011) Textual and visual content-based anti-phishing: a Bayesian approach. IEEE Trans Neural Netw 22(10):1532–1546CrossRef Zhang H, Gang L, Chow TWS, Wenyin L (2011) Textual and visual content-based anti-phishing: a Bayesian approach. IEEE Trans Neural Netw 22(10):1532–1546CrossRef
35.
go back to reference Zhao Q-L, Jiang Y-H, Xu M (2009) A fast ensemble pruning algorithm based on pattern mining process. Data Min Knowl Disc 19(2):277–292MathSciNetCrossRef Zhao Q-L, Jiang Y-H, Xu M (2009) A fast ensemble pruning algorithm based on pattern mining process. Data Min Knowl Disc 19(2):277–292MathSciNetCrossRef
36.
go back to reference Zimek A, Campello RJ, Sander J (2014) Ensembles for unsupervised outlier detection: challenges and research questions a position paper. ACM SIGKDD Explor Newsl 15(1):11–22CrossRef Zimek A, Campello RJ, Sander J (2014) Ensembles for unsupervised outlier detection: challenges and research questions a position paper. ACM SIGKDD Explor Newsl 15(1):11–22CrossRef
Metadata
Title
Using multiple classifier behavior to develop a dynamic outlier ensemble
Authors
Ping Yuan
Biao Wang
Zhizhong Mao
Publication date
09-08-2020
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 2/2021
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-020-01183-7

Other articles of this Issue 2/2021

International Journal of Machine Learning and Cybernetics 2/2021 Go to the issue