Skip to main content
Erschienen in: Neural Processing Letters 3/2021

17.03.2021

Enhanced Non-parametric Sequence-based Learning Algorithm for Outlier Detection in the Internet of Things

verfasst von: Abel Efetobor Edje, Shaffie Muhammad Abd Latiff, Howe Weng Chan

Erschienen in: Neural Processing Letters | Ausgabe 3/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Although research on outlier detection methods has been an investigation area for long, few of those studies relate to an Internet of Things (IoT) domain. Several critical decisions taken on daily business operations depend on various data collected over time. Therefore, it is mandatory to guarantee its correctness, integrity, and accuracy before any further processing can commence. Outliers are often assumed to be Error by most algorithms in the past, which is always attributed to faulty sensors. Hence, this assumption has been investigated and results show that outliers can be classified into Error and Event types with the support of a Non-parametric sequence-based learning algorithm. The event type outlier is majorly caused by abnormality from sensor readings, which are very important and should not be ignored. However, the non-parametric sequence approach and other existing techniques still find it elusive to detect outliers in the global search space of a large dataset. Therefore, this paper proposes an Enhanced Non-parametric sequence learning algorithm based on Ensemble Clustering Techniques to detect Event and Error outliers in large datasets. Experiments are conducted on six different datasets from the UCL repository, except one collected from a laboratory testbed, to demonstrate the robustness and effectiveness of the proposed approach over the existing techniques. The results show a remarkable performance rate of 96.653% accuracy, 94.284% precision, and 98.112% for Error outlier detection. It also performs better in Event outlier detection with 87.611% accuracy, 71.141% precision and 85.755% specificity with 1291 s execution time.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Kamal S, Ramadan RA, El-Refai F (2016) Smart outlier detection of wireless sensor network. Electron Energies 29:383–393 Kamal S, Ramadan RA, El-Refai F (2016) Smart outlier detection of wireless sensor network. Electron Energies 29:383–393
3.
Zurück zum Zitat Deng X, Jiang P, Peng X, Mi C (2019) An intelligent outlier detection method with one class support tucker machine and genetic algorithm toward big sensor data in Internet of Things. IEEE Trans Ind Electron 66:4672–4683CrossRef Deng X, Jiang P, Peng X, Mi C (2019) An intelligent outlier detection method with one class support tucker machine and genetic algorithm toward big sensor data in Internet of Things. IEEE Trans Ind Electron 66:4672–4683CrossRef
4.
Zurück zum Zitat Nesa N, Ghosh T (2018) IndrajitBanejee: non-parametric sequence-based learning approach for outlier detection in IoT. Future Gener Comput Syst 82:412–421CrossRef Nesa N, Ghosh T (2018) IndrajitBanejee: non-parametric sequence-based learning approach for outlier detection in IoT. Future Gener Comput Syst 82:412–421CrossRef
5.
Zurück zum Zitat Erfani SM, Rajasegarar S, Karunasekera S, Leckie C (2016) High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning. Pattern Recogn 58:121–134CrossRef Erfani SM, Rajasegarar S, Karunasekera S, Leckie C (2016) High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning. Pattern Recogn 58:121–134CrossRef
6.
Zurück zum Zitat Lyu L, Jin J, Rajasegarar S, He X, Palaniswami M (2017) Fog-empoered anomaly detection in IoT using hyperellipsoidal clustering. IEEE Internet Things J 4:1174–1184CrossRef Lyu L, Jin J, Rajasegarar S, He X, Palaniswami M (2017) Fog-empoered anomaly detection in IoT using hyperellipsoidal clustering. IEEE Internet Things J 4:1174–1184CrossRef
7.
Zurück zum Zitat Zhang R, Ji P, Mylaraswamy D, Srivastava M, Zahedi S (2013) Cooperative sensor anomaly detection using global information. Tsinghua Sci Technol 18:209–219CrossRef Zhang R, Ji P, Mylaraswamy D, Srivastava M, Zahedi S (2013) Cooperative sensor anomaly detection using global information. Tsinghua Sci Technol 18:209–219CrossRef
8.
Zurück zum Zitat Yu T, Wang X, Shami A (2017) Recursive principal component analysis-based data outlier detection and sensor data aggregation in IoT systems. IEEE Internet Things J 4:2207–2216CrossRef Yu T, Wang X, Shami A (2017) Recursive principal component analysis-based data outlier detection and sensor data aggregation in IoT systems. IEEE Internet Things J 4:2207–2216CrossRef
9.
Zurück zum Zitat Kong X, Chang J, Niu M, Huang X, Wang J, Chang SI (2018) Research on real time feature extraction method for complex manufacturing big data. Int J Adv Manuf Technol 99:1101–1108CrossRef Kong X, Chang J, Niu M, Huang X, Wang J, Chang SI (2018) Research on real time feature extraction method for complex manufacturing big data. Int J Adv Manuf Technol 99:1101–1108CrossRef
10.
Zurück zum Zitat Yu J, Rui Y, Tao D (2014) Click prediction for web image reranking using multimodal sparse coding. IEEE Trans Image Process 23(5):2019–2032MathSciNetCrossRef Yu J, Rui Y, Tao D (2014) Click prediction for web image reranking using multimodal sparse coding. IEEE Trans Image Process 23(5):2019–2032MathSciNetCrossRef
11.
Zurück zum Zitat Yu J, Rui Y, Chen B (2014) Exploiting click constraints and multi-view features for image re-ranking. IEEE Trans Multimed 16(1):159–168CrossRef Yu J, Rui Y, Chen B (2014) Exploiting click constraints and multi-view features for image re-ranking. IEEE Trans Multimed 16(1):159–168CrossRef
12.
Zurück zum Zitat Yu J, Tan M, Zhang H, Tao D, Rui Y (2019) Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Trans Pattern Anal Mach Intell 1–14 (In Press) Yu J, Tan M, Zhang H, Tao D, Rui Y (2019) Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Trans Pattern Anal Mach Intell 1–14 (In Press)
13.
Zurück zum Zitat Razzak I, Zafar K, Imran M, Xu G (2020) Randomized nonlinear one-class support vector machines with bounded loss function to detect outliers for large scale IoT data. Future Gener Comput Syst 112:715–723CrossRef Razzak I, Zafar K, Imran M, Xu G (2020) Randomized nonlinear one-class support vector machines with bounded loss function to detect outliers for large scale IoT data. Future Gener Comput Syst 112:715–723CrossRef
14.
Zurück zum Zitat Simar L, Wilson PW (2015) Statistical approaches for non-parametric frontier models: a guided tour. Int Stat Rev 83:77–110MathSciNetCrossRef Simar L, Wilson PW (2015) Statistical approaches for non-parametric frontier models: a guided tour. Int Stat Rev 83:77–110MathSciNetCrossRef
15.
Zurück zum Zitat Shahid N, Naqvi IH, Qaisar SB (2015) Characteristics and classification of outlier detection techniques for wireless sensor networks in harsh environments: a survey. Artif Intell Rev 43:193–228CrossRef Shahid N, Naqvi IH, Qaisar SB (2015) Characteristics and classification of outlier detection techniques for wireless sensor networks in harsh environments: a survey. Artif Intell Rev 43:193–228CrossRef
16.
Zurück zum Zitat Zhang Y, Ren J, Liu J, Xu C, Guo H, Liu Y (2017) A survey on emerging computing paradigms for big data. Chin J Electron 26:1–12CrossRef Zhang Y, Ren J, Liu J, Xu C, Guo H, Liu Y (2017) A survey on emerging computing paradigms for big data. Chin J Electron 26:1–12CrossRef
17.
Zurück zum Zitat Thuc KX, Insoo K (2011) A collaborative event detection scheme using fuzzy logic in clustered wireless sensor networks. AEU Int J Electron Commun 65:485–488CrossRef Thuc KX, Insoo K (2011) A collaborative event detection scheme using fuzzy logic in clustered wireless sensor networks. AEU Int J Electron Commun 65:485–488CrossRef
18.
Zurück zum Zitat Ayadi A, Ghorbel O, Obeid AM, Abid M (2017) Outlier detection approaches for wireless sensor networks: a survey. Comput Netw 129:319–333CrossRef Ayadi A, Ghorbel O, Obeid AM, Abid M (2017) Outlier detection approaches for wireless sensor networks: a survey. Comput Netw 129:319–333CrossRef
19.
Zurück zum Zitat Fan H, Zaïane OR, Foss A, Wu J (2009) Resolution-based outlier factor: detecting the top-n most outlying data points in engineering data. Knowl Inf Syst 19:31–51CrossRef Fan H, Zaïane OR, Foss A, Wu J (2009) Resolution-based outlier factor: detecting the top-n most outlying data points in engineering data. Knowl Inf Syst 19:31–51CrossRef
20.
Zurück zum Zitat Tsai C-H, Chang C-L, Chen L (2003) Applying grey relational analysis to the vendor evaluation model. Int J Comput Internet Manag 11:45–53 Tsai C-H, Chang C-L, Chen L (2003) Applying grey relational analysis to the vendor evaluation model. Int J Comput Internet Manag 11:45–53
21.
Zurück zum Zitat Modi K, Oza B (2016) Outlier analysis approaches in data mining. Int J Innov Res Technol 3:6–12 Modi K, Oza B (2016) Outlier analysis approaches in data mining. Int J Innov Res Technol 3:6–12
22.
Zurück zum Zitat Zhao Z, Chen W, Wu X, Chen PC, Liu J (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Intell Transp Syst J 11:68–75CrossRef Zhao Z, Chen W, Wu X, Chen PC, Liu J (2017) LSTM network: a deep learning approach for short-term traffic forecast. IET Intell Transp Syst J 11:68–75CrossRef
23.
Zurück zum Zitat Ajitha P, Chandra E (2015) A survey on outliers detection in distributed data mining for big data. J Basic Appl Sci Res 5:31–38 Ajitha P, Chandra E (2015) A survey on outliers detection in distributed data mining for big data. J Basic Appl Sci Res 5:31–38
24.
Zurück zum Zitat Fan H, Zaïane OR, Foss A, Wu J (2006) A nonparametric outlier detection for effectively discovering Top-N outliers from engineering data. In: Pacific-Asia conference on knowledge discovery and data mining, pp 557–566 Fan H, Zaïane OR, Foss A, Wu J (2006) A nonparametric outlier detection for effectively discovering Top-N outliers from engineering data. In: Pacific-Asia conference on knowledge discovery and data mining, pp 557–566
25.
Zurück zum Zitat Shekhar S, Lu CT, Zhang P (2003) A unified approach to detecting spatial outliers. GeoInform J 7:139–166CrossRef Shekhar S, Lu CT, Zhang P (2003) A unified approach to detecting spatial outliers. GeoInform J 7:139–166CrossRef
26.
Zurück zum Zitat Bouguettaya A, Yu Q, Liu X, Zhou X, Song A (2015) Efficient agglomerative hierarchical clustering. Expert Syst Appl 42:2785–2797CrossRef Bouguettaya A, Yu Q, Liu X, Zhou X, Song A (2015) Efficient agglomerative hierarchical clustering. Expert Syst Appl 42:2785–2797CrossRef
27.
Zurück zum Zitat Manogaran G, Vijayakumar V, Varatharajan R, Kumar PM, Sundarasekar R, Hsu CH (2018) Machine learning based big data processing framework for cancer diagnosis using hidden Markov model and GM clustering. Wirel Personal Commun 102:2099–2116CrossRef Manogaran G, Vijayakumar V, Varatharajan R, Kumar PM, Sundarasekar R, Hsu CH (2018) Machine learning based big data processing framework for cancer diagnosis using hidden Markov model and GM clustering. Wirel Personal Commun 102:2099–2116CrossRef
28.
Zurück zum Zitat Yang MS, Lai CY, Lin CY (2012) A robust EM clustering algorithm for Gaussian mixture models. Pattern Recogn 45:3950–3961CrossRef Yang MS, Lai CY, Lin CY (2012) A robust EM clustering algorithm for Gaussian mixture models. Pattern Recogn 45:3950–3961CrossRef
29.
Zurück zum Zitat Wang Y, Chen W, Zhang J, Dong T, Shan G, Chi X (2011) Efficient volume exploration using the Gaussian mixture model. IEEE Trans Vis Comput Gr 17:1560–1573CrossRef Wang Y, Chen W, Zhang J, Dong T, Shan G, Chi X (2011) Efficient volume exploration using the Gaussian mixture model. IEEE Trans Vis Comput Gr 17:1560–1573CrossRef
32.
Zurück zum Zitat Tripathy A, Agrawal A, Rath SK (2016) Classification of sentiment reviews using n-gram machine learning approach. Expert Syst Appl 57:117–126CrossRef Tripathy A, Agrawal A, Rath SK (2016) Classification of sentiment reviews using n-gram machine learning approach. Expert Syst Appl 57:117–126CrossRef
33.
Zurück zum Zitat de Souza PS, dos Santos Marques W, Rossi FD, da Cunha Rodrigues G, Calheiros RN (2017) Calheiros performance and accuracy trade-off analysis of techniques for anomaly detection in IoT sensors. In: International conference on information networking, Da Nang, Vietnam, pp 486–491 de Souza PS, dos Santos Marques W, Rossi FD, da Cunha Rodrigues G, Calheiros RN (2017) Calheiros performance and accuracy trade-off analysis of techniques for anomaly detection in IoT sensors. In: International conference on information networking, Da Nang, Vietnam, pp 486–491
34.
Zurück zum Zitat Fonollosa J, Sheik S, Huerta R, Marco S (2015) Reservoir computing compensates slow response of chemosensor arrays exposed to fast varying gas concentrations in continuous monitoring. Sensors Actuators 215:618–629CrossRef Fonollosa J, Sheik S, Huerta R, Marco S (2015) Reservoir computing compensates slow response of chemosensor arrays exposed to fast varying gas concentrations in continuous monitoring. Sensors Actuators 215:618–629CrossRef
35.
Zurück zum Zitat Burgués J, Jiménez-Soto JM, Marco S (2018) Estimation of limit of detection in semiconductor gas sensors through linearized calibration models. Anal Chem Acta 1013:13–25CrossRef Burgués J, Jiménez-Soto JM, Marco S (2018) Estimation of limit of detection in semiconductor gas sensors through linearized calibration models. Anal Chem Acta 1013:13–25CrossRef
36.
Zurück zum Zitat Anguita D, Ghio A, Oneto L (2013) A public domain dataset for human activity recognition using smartphones. In: European symposium on artificial neural networks, computational intelligence and machine learning, Bruges Belgium, pp 437–442 Anguita D, Ghio A, Oneto L (2013) A public domain dataset for human activity recognition using smartphones. In: European symposium on artificial neural networks, computational intelligence and machine learning, Bruges Belgium, pp 437–442
37.
Zurück zum Zitat Raafat HM, Hossain MS, Essa E, Elmougy S, Tolba AS, Muhammad G, Ghoneim A (2017) Fog intelligence for real-time IoT sensor data abalytics. IEEE Access 5:24062–240069CrossRef Raafat HM, Hossain MS, Essa E, Elmougy S, Tolba AS, Muhammad G, Ghoneim A (2017) Fog intelligence for real-time IoT sensor data abalytics. IEEE Access 5:24062–240069CrossRef
38.
Zurück zum Zitat Aljawarneh SA, Vangipuram R (2020) GARUDA: Gaussian dissimilarity measure for feature representation and anomaly detection in Internet of Things. J Supercomput 76:4376–4413CrossRef Aljawarneh SA, Vangipuram R (2020) GARUDA: Gaussian dissimilarity measure for feature representation and anomaly detection in Internet of Things. J Supercomput 76:4376–4413CrossRef
39.
Zurück zum Zitat Luo T, Nagarajan SG (2018) Distributed anomaly detection using autoencoder neural networks in WNS for IoT. In: IEEE international conference on communications, Kansas City, MO, USA, pp 1–6 Luo T, Nagarajan SG (2018) Distributed anomaly detection using autoencoder neural networks in WNS for IoT. In: IEEE international conference on communications, Kansas City, MO, USA, pp 1–6
40.
Zurück zum Zitat Bandyopadhyay S, Ukil A, Puri C, Singh R, Bose T, Pal A (2016) SensIPro: smart sensor analytics for Internet of Things. In: IEEE symposium on computers and communications (ISCC), Messina Italy, pp 1–7 Bandyopadhyay S, Ukil A, Puri C, Singh R, Bose T, Pal A (2016) SensIPro: smart sensor analytics for Internet of Things. In: IEEE symposium on computers and communications (ISCC), Messina Italy, pp 1–7
41.
Zurück zum Zitat Sharma V, You I, Kumar R (2017) Isma: intelligent sensing model for anomalies detection in cross platform OSNs with a case study on IoT. IEEE Access 5:3284–3301CrossRef Sharma V, You I, Kumar R (2017) Isma: intelligent sensing model for anomalies detection in cross platform OSNs with a case study on IoT. IEEE Access 5:3284–3301CrossRef
Metadaten
Titel
Enhanced Non-parametric Sequence-based Learning Algorithm for Outlier Detection in the Internet of Things
verfasst von
Abel Efetobor Edje
Shaffie Muhammad Abd Latiff
Howe Weng Chan
Publikationsdatum
17.03.2021
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 3/2021
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-021-10473-2

Weitere Artikel der Ausgabe 3/2021

Neural Processing Letters 3/2021 Zur Ausgabe

Neuer Inhalt