Skip to main content

2018 | OriginalPaper | Buchkapitel

Supporting Breast Cancer Diagnosis with Multi-objective Genetic Algorithm for Outlier Detection

verfasst von : Agnieszka Duraj, Lukasz Chomatek

Erschienen in: Advanced Solutions in Diagnostics and Fault Tolerant Control

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Outlier detection in medical data covers a broad spectrum of medical research. In this paper, the authors propose a new approach to outlier detection based on the multi-objective genetic algorithm. In medical data, an outlier may be considered as a deviation which indicates the existence of cancerous cells in the breast. The paper presents the results of tests which were conducted on the set of medical data from the repository. The results of the study indicate that our method can be successfully applied to the medical problem in question.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alma, Ö.G., Serdar, K., Aybars, U.: Genetic algorithm based outlier detection using bayesian information criterion in multiple regression models having multicollinearity problems. Gazi Univ. J. Sci. 22(3), 141–148 (2009) Alma, Ö.G., Serdar, K., Aybars, U.: Genetic algorithm based outlier detection using bayesian information criterion in multiple regression models having multicollinearity problems. Gazi Univ. J. Sci. 22(3), 141–148 (2009)
2.
Zurück zum Zitat Andrews, D.F., Pregibon, D.: Finding the outliers that matter. J. R. Stat. Soc. Ser. B (Methodol.) 40, 85–93 (1978)MATH Andrews, D.F., Pregibon, D.: Finding the outliers that matter. J. R. Stat. Soc. Ser. B (Methodol.) 40, 85–93 (1978)MATH
3.
Zurück zum Zitat Barnett, V., Lewis, T.: Outliers in Statistical Data (Probability and Mathematical Statistics). Wiley, Chichester (1994)MATH Barnett, V., Lewis, T.: Outliers in Statistical Data (Probability and Mathematical Statistics). Wiley, Chichester (1994)MATH
4.
Zurück zum Zitat Chauhan, R., Kaur, H., Alam, M.A.: Data clustering method for discovering clusters in spatial cancer databases. Int. J. Comput. Appl. (0975–8887) 10, 24–28 (2010) Chauhan, R., Kaur, H., Alam, M.A.: Data clustering method for discovering clusters in spatial cancer databases. Int. J. Comput. Appl. (0975–8887) 10, 24–28 (2010)
5.
Zurück zum Zitat Cheang, M.C., van de Rijn, M., Nielsen, T.O.: Gene expression profiling of breast cancer. Annu. Rev. Pathmechdis. Mech. Dis. 3, 67–97 (2008)CrossRef Cheang, M.C., van de Rijn, M., Nielsen, T.O.: Gene expression profiling of breast cancer. Annu. Rev. Pathmechdis. Mech. Dis. 3, 67–97 (2008)CrossRef
6.
Zurück zum Zitat Cook, R.D.: Detection of influential observation in linear regression. Technometrics 19(1), 15–18 (1977)MathSciNetMATH Cook, R.D.: Detection of influential observation in linear regression. Technometrics 19(1), 15–18 (1977)MathSciNetMATH
7.
Zurück zum Zitat Corne, D.W., Jerram, N.R., Knowles, J.D., Oates, M.J.: Pesa-ii: Region-based selection in evolutionary multiobjective optimization. In: Proceedings of the 3rd Annual Conference on Genetic and Evolutionary Computation, pp. 283–290. Morgan Kaufmann Publishers Inc. (2001) Corne, D.W., Jerram, N.R., Knowles, J.D., Oates, M.J.: Pesa-ii: Region-based selection in evolutionary multiobjective optimization. In: Proceedings of the 3rd Annual Conference on Genetic and Evolutionary Computation, pp. 283–290. Morgan Kaufmann Publishers Inc. (2001)
8.
Zurück zum Zitat Crawford, K.D., Wainwright, R.L.: Applying genetic algorithms to outlier detection. In: ICGA, pp. 546–550 (1995) Crawford, K.D., Wainwright, R.L.: Applying genetic algorithms to outlier detection. In: ICGA, pp. 546–550 (1995)
9.
Zurück zum Zitat Cucina, D., di Salvatore, A., Protopapas, M.K.: Outliers detection in multivariate time series using genetic algorithms. Chemometr. Intell. Lab. Syst. 132, 103–110 (2014)CrossRef Cucina, D., di Salvatore, A., Protopapas, M.K.: Outliers detection in multivariate time series using genetic algorithms. Chemometr. Intell. Lab. Syst. 132, 103–110 (2014)CrossRef
10.
Zurück zum Zitat Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-ii. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)CrossRef Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-ii. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)CrossRef
11.
Zurück zum Zitat Delen, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)CrossRef Delen, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)CrossRef
12.
Zurück zum Zitat Duraj, A., Krawczyk, A.: Finding outliers for large medical datasets. Przeglad Elektrotechniczny 86, 188–191 (2010) Duraj, A., Krawczyk, A.: Finding outliers for large medical datasets. Przeglad Elektrotechniczny 86, 188–191 (2010)
13.
Zurück zum Zitat Duraj, A., Szczepaniak, P.S.: Information outliers and their detection. In: Burgin, M., Hofkirchner, W. (eds.) Information Studies and the Quest for Transdisciplinarity, vol. 9, Chap. 15, pp. 413–437. World Scientific Publishing Company (2017) Duraj, A., Szczepaniak, P.S.: Information outliers and their detection. In: Burgin, M., Hofkirchner, W. (eds.) Information Studies and the Quest for Transdisciplinarity, vol. 9, Chap. 15, pp. 413–437. World Scientific Publishing Company (2017)
14.
Zurück zum Zitat Duraj, A., Szczepaniak, P.S., Ochelska-Mierzejewska, J.: Detection of outlier information using linguistic summarization. In: Andreasen, T., et al. (eds.) Flexible Query Answering Systems 2015; Advances in Intelligent Systems and Computing 400, Proceedings of the 11th International Conference FQAS 2015, Cracow, Poland, pp. 101–113. Springer (2015) Duraj, A., Szczepaniak, P.S., Ochelska-Mierzejewska, J.: Detection of outlier information using linguistic summarization. In: Andreasen, T., et al. (eds.) Flexible Query Answering Systems 2015; Advances in Intelligent Systems and Computing 400, Proceedings of the 11th International Conference FQAS 2015, Cracow, Poland, pp. 101–113. Springer (2015)
15.
Zurück zum Zitat Durillo, J.J., Nebro, A.J.: jMetal: a java framework for multi-objective optimization. Adv. Eng. Softw. 42(10), 760–771 (2011)CrossRef Durillo, J.J., Nebro, A.J.: jMetal: a java framework for multi-objective optimization. Adv. Eng. Softw. 42(10), 760–771 (2011)CrossRef
16.
Zurück zum Zitat Durillo, J.J., Nebro, A.J., Alba, E.: The jMetal framework for multi-objective optimization: design and architecture. In: 2010 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2010) Durillo, J.J., Nebro, A.J., Alba, E.: The jMetal framework for multi-objective optimization: design and architecture. In: 2010 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2010)
18.
Zurück zum Zitat Hadka, D.: MOEA framework: a free and open source java framework for multiobjective optimization (2012) Hadka, D.: MOEA framework: a free and open source java framework for multiobjective optimization (2012)
19.
Zurück zum Zitat Hawkins, D.: Identification of Outliers. Monographs on Applied Probability and Statistics. Chapman and Hall, London (1980)CrossRefMATH Hawkins, D.: Identification of Outliers. Monographs on Applied Probability and Statistics. Chapman and Hall, London (1980)CrossRefMATH
20.
Zurück zum Zitat Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)CrossRef Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)CrossRef
22.
Zurück zum Zitat Lilford, R., Mohammed, M.A., Spiegelhalter, D., Thomson, R.: Use and misuse of process and outcome data in managing performance of acute medical care: avoiding institutional stigma. Lancet 363(9415), 1147–1154 (2004)CrossRef Lilford, R., Mohammed, M.A., Spiegelhalter, D., Thomson, R.: Use and misuse of process and outcome data in managing performance of acute medical care: avoiding institutional stigma. Lancet 363(9415), 1147–1154 (2004)CrossRef
23.
Zurück zum Zitat Nicolau, M., Levine, A.J., Carlsson, G.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc. Nat. Acad. Sci. 108(17), 7265–7270 (2011)CrossRef Nicolau, M., Levine, A.J., Carlsson, G.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc. Nat. Acad. Sci. 108(17), 7265–7270 (2011)CrossRef
24.
Zurück zum Zitat Pincus, S.M., Gladstone, I.M., Ehrenkranz, R.A.: A regularity statistic for medical data analysis. J. Clin. Monit. Comput. 7(4), 335–345 (1991)CrossRef Pincus, S.M., Gladstone, I.M., Ehrenkranz, R.A.: A regularity statistic for medical data analysis. J. Clin. Monit. Comput. 7(4), 335–345 (1991)CrossRef
26.
Zurück zum Zitat Shaari, F., Bakar, A.A., Hamdan, A.R.: A predictive analysis on medical data based on outlier detection method using non-reduct computation. In: International Conference on Advanced Data Mining and Applications, pp. 603–610. Springer (2009) Shaari, F., Bakar, A.A., Hamdan, A.R.: A predictive analysis on medical data based on outlier detection method using non-reduct computation. In: International Conference on Advanced Data Mining and Applications, pp. 603–610. Springer (2009)
27.
Zurück zum Zitat Sotiriou, C., Wirapati, P., Loi, S., Harris, A., Fox, S., Smeds, J., Nordgren, H., Farmer, P., Praz, V., Haibe-Kains, B., et al.: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J. Nat. Cancer Inst. 98(4), 262–272 (2006)CrossRef Sotiriou, C., Wirapati, P., Loi, S., Harris, A., Fox, S., Smeds, J., Nordgren, H., Farmer, P., Praz, V., Haibe-Kains, B., et al.: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J. Nat. Cancer Inst. 98(4), 262–272 (2006)CrossRef
28.
Zurück zum Zitat Stasiak, B., Tarasiuk, P., Michalska, I., Tomczyk, A., Szczepaniak, P.: Localization of demyelinating plaques in MRI using convolutional neural networks. In: In Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2017), vol. 2, pp. 55–64 (2017) Stasiak, B., Tarasiuk, P., Michalska, I., Tomczyk, A., Szczepaniak, P.: Localization of demyelinating plaques in MRI using convolutional neural networks. In: In Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2017), vol. 2, pp. 55–64 (2017)
29.
Zurück zum Zitat Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE’s Symposium on Electronic Imaging: Science and Technology, pp. 861–870. International Society for Optics and Photonics (1993) Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE’s Symposium on Electronic Imaging: Science and Technology, pp. 861–870. International Society for Optics and Photonics (1993)
30.
Zurück zum Zitat Taloba, A.I., Marghny, M., El-Aziz, R.M.A.: Outlier detection using improved genetic k-means. Int. J. Comput. Appl. 28(11), 33–36 (2014) Taloba, A.I., Marghny, M., El-Aziz, R.M.A.: Outlier detection using improved genetic k-means. Int. J. Comput. Appl. 28(11), 33–36 (2014)
31.
Zurück zum Zitat Thongkam, J., Xu, G., Zhang, Y., Huang, F.: Support vector machine for outlier detection in breast cancer survivability prediction. In: Asia-Pacific Web Conference, pp. 99–109. Springer (2008) Thongkam, J., Xu, G., Zhang, Y., Huang, F.: Support vector machine for outlier detection in breast cancer survivability prediction. In: Asia-Pacific Web Conference, pp. 99–109. Springer (2008)
32.
Zurück zum Zitat Tolvi, J.: Genetic algorithms for outlier detection and variable selection in linear regression models. Soft Comput. 8(8), 527–533 (2004)CrossRefMATH Tolvi, J.: Genetic algorithms for outlier detection and variable selection in linear regression models. Soft Comput. 8(8), 527–533 (2004)CrossRefMATH
34.
Zurück zum Zitat Tomczyk, A., Szczepaniak, P.S.: On the relationship between active contours and contextual classification. In: Computer Recognition Systems, pp. 303–310. Springer (2005) Tomczyk, A., Szczepaniak, P.S.: On the relationship between active contours and contextual classification. In: Computer Recognition Systems, pp. 303–310. Springer (2005)
35.
Zurück zum Zitat Wolberg, W.H., Street, W.N., Mangasarian, O.: Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 77(2–3), 163–171 (1994)CrossRef Wolberg, W.H., Street, W.N., Mangasarian, O.: Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 77(2–3), 163–171 (1994)CrossRef
37.
Zurück zum Zitat Xiong, X., Kim, Y., Baek, Y., Rhee, D.W., Kim, S.H.: Analysis of breast cancer using data mining & statistical techniques. In: Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, 2005 and First ACIS International Workshop on Self-assembling Wireless Networks, SNPD/SAWN 2005, pp. 82–87. IEEE (2005) Xiong, X., Kim, Y., Baek, Y., Rhee, D.W., Kim, S.H.: Analysis of breast cancer using data mining & statistical techniques. In: Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, 2005 and First ACIS International Workshop on Self-assembling Wireless Networks, SNPD/SAWN 2005, pp. 82–87. IEEE (2005)
38.
Zurück zum Zitat Yi, W., Fuyong, W.: Breast cancer diagnosis via support vector machines. In: Control Conference, CCC 2006, Chinese, pp. 1853–1856. IEEE (2006) Yi, W., Fuyong, W.: Breast cancer diagnosis via support vector machines. In: Control Conference, CCC 2006, Chinese, pp. 1853–1856. IEEE (2006)
39.
Zurück zum Zitat Zitzler, E., Laumanns, M., Thiele, L., et al.: SPEA2: Improving the strength pareto evolutionary algorithm (2001) Zitzler, E., Laumanns, M., Thiele, L., et al.: SPEA2: Improving the strength pareto evolutionary algorithm (2001)
Metadaten
Titel
Supporting Breast Cancer Diagnosis with Multi-objective Genetic Algorithm for Outlier Detection
verfasst von
Agnieszka Duraj
Lukasz Chomatek
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-64474-5_25