Skip to main content
Top
Published in: Cluster Computing 1/2019

16-11-2017

A parameter based growing ensemble of self-organizing maps for outlier detection in healthcare

Authors: Samir Elmougy, M. Shamim Hossain, Ahmed S. Tolba, Mohammed F. Alhamid, Ghulam Muhammad

Published in: Cluster Computing | Special Issue 1/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Outlier detection is critical for many applications such as healthcare, health insurance, medical diagnosis, predictive analytics, pattern recognition, intrusion detection, anomaly or defect detection, video surveillance, credit card fraud detection and text mining. Outlier detection techniques could be statistics, distance- or model based. Techniques, which are based on a single method for outlier detection usually have weaknesses and strengths and are mostly unstable. Outlier detection ensembles harness the strengths of individual detectors and result in stable performance. This paper presents a new parameter based growing self-organizing maps ensemble (GSOME) for outlier detection in multivariate patterns. For outlier detection, the proposed GSOME transforms non-linear relationships between high dimensional patterns into a simple 1D geometric relationship. Whatever the pattern dimensionality is, it is mapped to a single point of a line. The dispersion of mapped points will be used to locate the outliers and measure the degree of outlyingness. Several experiments on both real and synthetic data sets show the promising performance of the proposed GSOME.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Christy, A., MeeraGandhi, G., Vaithyasubramanian, S.: Cluster based outlier detection algorithm for healthcare data. Procedia Comput. Sci. 50, 209–215 (2015)CrossRef Christy, A., MeeraGandhi, G., Vaithyasubramanian, S.: Cluster based outlier detection algorithm for healthcare data. Procedia Comput. Sci. 50, 209–215 (2015)CrossRef
2.
go back to reference Muhammad, G.: Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system. Clust. Comput. 18(2), 795–802 (2015)MathSciNetCrossRef Muhammad, G.: Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system. Clust. Comput. 18(2), 795–802 (2015)MathSciNetCrossRef
3.
go back to reference Vembandasamy, K., Karthikeyan, T.: Novel outlier detection in diabetics classification using data mining techniques. Int. J. Appl. Eng. Res. 11(2), 1400–1403 (2016) Vembandasamy, K., Karthikeyan, T.: Novel outlier detection in diabetics classification using data mining techniques. Int. J. Appl. Eng. Res. 11(2), 1400–1403 (2016)
4.
go back to reference Hu, L., et al.: Software defined healthcare networks. IEEE Wirel. Commun. 22(6), 67–75 (2015)CrossRef Hu, L., et al.: Software defined healthcare networks. IEEE Wirel. Commun. 22(6), 67–75 (2015)CrossRef
6.
go back to reference Hossain, M.S., Muhammad, G.: Cloud-assisted industrial internet of things (IIoT)—enabled framework for health monitoring. Comput. Netw. 101(2016), 192–202 (2016)CrossRef Hossain, M.S., Muhammad, G.: Cloud-assisted industrial internet of things (IIoT)—enabled framework for health monitoring. Comput. Netw. 101(2016), 192–202 (2016)CrossRef
7.
go back to reference Hossain, M.S., Muhammad, G.: Cloud-assisted speech and face recognition framework for health monitoring. Mob. Netw. Appl. 20(3), 391–399 (2015)CrossRef Hossain, M.S., Muhammad, G.: Cloud-assisted speech and face recognition framework for health monitoring. Mob. Netw. Appl. 20(3), 391–399 (2015)CrossRef
9.
go back to reference Hauskrecht, M., Batal, I., Hong, C., Nguyen, Q., Cooper, G.E., Visweswaran, S., Clermont, G.: Outlier-based detection of unusual patient-management actions. An ICU study. J. Biomed. Inform. 64, 211–221 (2017)CrossRef Hauskrecht, M., Batal, I., Hong, C., Nguyen, Q., Cooper, G.E., Visweswaran, S., Clermont, G.: Outlier-based detection of unusual patient-management actions. An ICU study. J. Biomed. Inform. 64, 211–221 (2017)CrossRef
10.
go back to reference Laurikkala, J., Juhola, M., Kentala, E.: Informal identification of outliers in medical data. In: Intelligent Data Analysis in Medicine and Pharmacology (IDAMAP-2000), A Workshop at the 14th European Conference on Artificial Intelligence (ECAI-2000), Berlin, Germany, August 20–25 (2000) Laurikkala, J., Juhola, M., Kentala, E.: Informal identification of outliers in medical data. In: Intelligent Data Analysis in Medicine and Pharmacology (IDAMAP-2000), A Workshop at the 14th European Conference on Artificial Intelligence (ECAI-2000), Berlin, Germany, August 20–25 (2000)
12.
go back to reference Ypma, R., Duin, P.W.: Novelty detection using self-organizing maps. In: Kasabov, N., Kozma, R., Ko, K., O’Shea, R., Coghill, G., Gedeon, T. (eds.) Progress in Connectionist-Based Information Systems, vol. 2, pp. 1322–1325. Springer, London (1997) Ypma, R., Duin, P.W.: Novelty detection using self-organizing maps. In: Kasabov, N., Kozma, R., Ko, K., O’Shea, R., Coghill, G., Gedeon, T. (eds.) Progress in Connectionist-Based Information Systems, vol. 2, pp. 1322–1325. Springer, London (1997)
13.
go back to reference Banerjee, A., Chandola, V., Lazarevic, A., Kumar, V., Srivastava, J.: Anomaly Detection: A Tutorial. In: SIAM Data Mining Conference, Atlanta, GA (2008) Banerjee, A., Chandola, V., Lazarevic, A., Kumar, V., Srivastava, J.: Anomaly Detection: A Tutorial. In: SIAM Data Mining Conference, Atlanta, GA (2008)
14.
go back to reference Song, X., Wu, M., Jermaine, C., Ranka, S.: Conditional anomaly detection. IEEE Trans. Knowl. Data Eng. 19(5), 631–645 (2007)CrossRef Song, X., Wu, M., Jermaine, C., Ranka, S.: Conditional anomaly detection. IEEE Trans. Knowl. Data Eng. 19(5), 631–645 (2007)CrossRef
16.
go back to reference TILDA, Textile defect image database. University of Freiburg, Germany (1996) TILDA, Textile defect image database. University of Freiburg, Germany (1996)
17.
go back to reference Geman, S., et al.: Neural networks and the bias/variance dilemma. Neural Comput. 4, 1–58 (1992)CrossRef Geman, S., et al.: Neural networks and the bias/variance dilemma. Neural Comput. 4, 1–58 (1992)CrossRef
18.
go back to reference Zhang, Y., Meratnia, N., Havinga, P.J.M.: Outlier Detection Techniques for Wireless Sensor Network: A Survey. University of Twente, Enschede (2008) Zhang, Y., Meratnia, N., Havinga, P.J.M.: Outlier Detection Techniques for Wireless Sensor Network: A Survey. University of Twente, Enschede (2008)
19.
go back to reference Ghaemi, R., Sulaiman, M.N., Ibrahim, I., Mustapha, N.: A Survey: Clustering Ensembles Techniques. World Academy of Science, Engineering and Technology, Singapore (2009) Ghaemi, R., Sulaiman, M.N., Ibrahim, I., Mustapha, N.: A Survey: Clustering Ensembles Techniques. World Academy of Science, Engineering and Technology, Singapore (2009)
20.
go back to reference Lazarevic, A., Kumar, V.: Feature bagging for outlier detection. In: KDD, pp. 157–166 (2005) Lazarevic, A., Kumar, V.: Feature bagging for outlier detection. In: KDD, pp. 157–166 (2005)
22.
go back to reference Hodge, V.J., Austin, J.A.: Survey of outlier detection methodologies. Artif. Intell. Rev. 22(2), 85–126 (2004)CrossRefMATH Hodge, V.J., Austin, J.A.: Survey of outlier detection methodologies. Artif. Intell. Rev. 22(2), 85–126 (2004)CrossRefMATH
23.
go back to reference Fausette, V.L.: Fundamentals of Neural Networks. Prentice Hall, Upper Saddle River (1993) Fausette, V.L.: Fundamentals of Neural Networks. Prentice Hall, Upper Saddle River (1993)
24.
go back to reference Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Jagadish, H.V., Mumick, I.S. (Eds.). Proceedings of the ACM SIGMOD International Conference on Management of Data, Montreal, Quebec, Canada, June 4-6, pp. 103–114. ACM Press, New York (1996) Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Jagadish, H.V., Mumick, I.S. (Eds.). Proceedings of the ACM SIGMOD International Conference on Management of Data, Montreal, Quebec, Canada, June 4-6, pp. 103–114. ACM Press, New York (1996)
25.
go back to reference Ester, M., Kriegel, H-P., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, Oregon, pp. 226–231 (1996) Ester, M., Kriegel, H-P., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Portland, Oregon, pp. 226–231 (1996)
26.
go back to reference Stolfo, S.J., Prodromidis, A.L., Tselepis, S., Lee, W., Fan, D.W., Chan, P.K.: JAM: Java agents for meta-learning over distributed databases. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 74–81 (1997) Stolfo, S.J., Prodromidis, A.L., Tselepis, S., Lee, W., Fan, D.W., Chan, P.K.: JAM: Java agents for meta-learning over distributed databases. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 74–81 (1997)
27.
go back to reference Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Wadsworth International Group, Belmont, CA (1984)MATH Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Wadsworth International Group, Belmont, CA (1984)MATH
28.
go back to reference Cohen, W.W.: Fast effective rule induction. In: International Conference on Machine Learning, pp. 115–123 (1995) Cohen, W.W.: Fast effective rule induction. In: International Conference on Machine Learning, pp. 115–123 (1995)
29.
go back to reference Knorr, E.M., Ng, R.T., Tucakov, V.: Distance-based outliers: algorithms and applications. VLDB J. 8, 237–253 (2000)CrossRef Knorr, E.M., Ng, R.T., Tucakov, V.: Distance-based outliers: algorithms and applications. VLDB J. 8, 237–253 (2000)CrossRef
30.
go back to reference Brodley, C.E., Friedl, M.A.: Identifying mislabeled training data. J. Artif. Intell. Res. 11, 131–167 (1999)CrossRefMATH Brodley, C.E., Friedl, M.A.: Identifying mislabeled training data. J. Artif. Intell. Res. 11, 131–167 (1999)CrossRefMATH
31.
go back to reference Saunders, R., Gero, J.S.: A curious design agent: a computational model of novelty-seeking behavior in design. In: Proceedings of the Sixth Conference on Computer Aided Architectural Design Research in Asia (CAADRIA2001), Sydney, pp. 725–738(2001a) Saunders, R., Gero, J.S.: A curious design agent: a computational model of novelty-seeking behavior in design. In: Proceedings of the Sixth Conference on Computer Aided Architectural Design Research in Asia (CAADRIA2001), Sydney, pp. 725–738(2001a)
32.
go back to reference Vesanto, J., Himberg, J., Siponen, M., Simula, O.: Enhancing SOM based data visualization. In: Proceedings of the 5th International Conference on Soft Computing and Information/Intelligent Systems. Methodologies for the Conception, Design and Application of Soft Computing, vol. 1, pp. 64–67. Singapore: World Scientific (1998) Vesanto, J., Himberg, J., Siponen, M., Simula, O.: Enhancing SOM based data visualization. In: Proceedings of the 5th International Conference on Soft Computing and Information/Intelligent Systems. Methodologies for the Conception, Design and Application of Soft Computing, vol. 1, pp. 64–67. Singapore: World Scientific (1998)
33.
go back to reference Graham, W., Rohan, B., Hongxing, H., Hawkins, S., Gu, L.: A comparative study of RNN for outlier detection in data mining. In: ICDM ’02 Proceedings of the 2002 IEEE International Conference on Data Mining IEEE Computer Society Washington, DC, USA (2002) Graham, W., Rohan, B., Hongxing, H., Hawkins, S., Gu, L.: A comparative study of RNN for outlier detection in data mining. In: ICDM ’02 Proceedings of the 2002 IEEE International Conference on Data Mining IEEE Computer Society Washington, DC, USA (2002)
34.
go back to reference Hawkins, S., Hongxing, H., Graham, W., Rohan, B., Baxter, A.: Outlier Detection Using Replicator Neural Networks, DaWaK, pp. 170–180. Springer, New York (2002) Hawkins, S., Hongxing, H., Graham, W., Rohan, B., Baxter, A.: Outlier Detection Using Replicator Neural Networks, DaWaK, pp. 170–180. Springer, New York (2002)
36.
go back to reference Jiawei, H., Micheline, K., Pei, P.: Data Mining: Concepts and Techniques, 3rd edn. Elsevier, New York (2010) Jiawei, H., Micheline, K., Pei, P.: Data Mining: Concepts and Techniques, 3rd edn. Elsevier, New York (2010)
37.
go back to reference Saunders, R., Gero, J.S.: Designing for interest and novelty: motivating design agents. In: Proceedings of CAAD Futures 2001, pp. 725–738. Eindhoven (2001) Saunders, R., Gero, J.S.: Designing for interest and novelty: motivating design agents. In: Proceedings of CAAD Futures 2001, pp. 725–738. Eindhoven (2001)
38.
go back to reference Marsland, S.: On-line novelty detection through self-organization, with application to inspection robotics. Ph.D. thesis, Faculty of Science and Engineering, University of Manchester, UK (2001) Marsland, S.: On-line novelty detection through self-organization, with application to inspection robotics. Ph.D. thesis, Faculty of Science and Engineering, University of Manchester, UK (2001)
39.
go back to reference Brown, G., Wyatt, J., Harris, R., Yao, X.: Diversity creation methods: a survey and categorization. J. Inf. Fusion 6(1), 5–20 (2005)CrossRef Brown, G., Wyatt, J., Harris, R., Yao, X.: Diversity creation methods: a survey and categorization. J. Inf. Fusion 6(1), 5–20 (2005)CrossRef
40.
go back to reference Kuncheva, L.I., Whitaker, C.J.: Measures of diversity in classifier ensembles. Mach. Learn. 51, 181–207 (2003)CrossRefMATH Kuncheva, L.I., Whitaker, C.J.: Measures of diversity in classifier ensembles. Mach. Learn. 51, 181–207 (2003)CrossRefMATH
41.
go back to reference Savdra, C., Salas, R., Moreno, S., Allende, H.: Fusion of self organizing maps. In: Prudhomme et al. (eds.) LNCS 4507, (2007); ISMIS, LNAI 4994 (2008) Savdra, C., Salas, R., Moreno, S., Allende, H.: Fusion of self organizing maps. In: Prudhomme et al. (eds.) LNCS 4507, (2007); ISMIS, LNAI 4994 (2008)
42.
go back to reference Vesanto, J., Himberg, J., Alhoniemi, E., Parhankangas, J.: Self-Organizing Map in Matlab: the SOM Toolbox. In: Proceedings of the Matlab DSP Conference, pp. 35–40. Espoo, Finland (1999) Vesanto, J., Himberg, J., Alhoniemi, E., Parhankangas, J.: Self-Organizing Map in Matlab: the SOM Toolbox. In: Proceedings of the Matlab DSP Conference, pp. 35–40. Espoo, Finland (1999)
43.
go back to reference Moglu, F., Alpaydin, E.: Combining multiple representations for pen-based handwritten digit recognition. Turk J. Electr. Eng. 9(1) (2001) Moglu, F., Alpaydin, E.: Combining multiple representations for pen-based handwritten digit recognition. Turk J. Electr. Eng. 9(1) (2001)
44.
go back to reference Xue, Z., Shang, Y., Feng, A.: Semi-supervised outlier detection based on fuzzy rough C-means clustering. Math Comput. Simul. 80(9) (2010) Xue, Z., Shang, Y., Feng, A.: Semi-supervised outlier detection based on fuzzy rough C-means clustering. Math Comput. Simul. 80(9) (2010)
45.
go back to reference Buizza, R., Palmer, T.N.: Impact of Ensemble Size on Ensemble Prediction, European Centre for Medium-Range Weather Forecasts, Reading, Berkshire, UK (1988) Buizza, R., Palmer, T.N.: Impact of Ensemble Size on Ensemble Prediction, European Centre for Medium-Range Weather Forecasts, Reading, Berkshire, UK (1988)
Metadata
Title
A parameter based growing ensemble of self-organizing maps for outlier detection in healthcare
Authors
Samir Elmougy
M. Shamim Hossain
Ahmed S. Tolba
Mohammed F. Alhamid
Ghulam Muhammad
Publication date
16-11-2017
Publisher
Springer US
Published in
Cluster Computing / Issue Special Issue 1/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-017-1327-0

Other articles of this Special Issue 1/2019

Cluster Computing 1/2019 Go to the issue

Premium Partner