Skip to main content
Top

2020 | OriginalPaper | Chapter

K-Means Clustering for Features Arrangement in Metagenomic Data Visualization

Authors : Hai Thanh Nguyen, Toan Bao Tran, Huong Hoang Luong, Trung Phuoc Le, Nghi C. Tran, Quoc-Dinh Truong

Published in: Advances in Computational Collective Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Personalized medicine is one of the most concern of the scientists to propose successful treatments for diseases. This approach considers patients’ genetic make-up and attention to their preferences, beliefs, attitudes, knowledge and social context. Deep learning techniques hold important roles and obtain achievements in bioinformatics tasks. Metagenomic data analysis is very important to develop and evaluate methods and tools applying to Personalized medicine. Metagenomic data is usually characterized by high-dimensional spaces where humans meet difficulties to interpret data. Visualizing metagenomic data is crucial to provide insights in data which can help researchers to explore patterns in data. Moreover, these visualizations can be fetched into deep learning such as Convolutional Neural Networks to do prediction tasks. In this study, we propose a visualization method for metagenomic data where features are arranged in the visualization based on K-means clustering algorithms. We show by experiments on metagenomic datasets of three diseases (Colorectal Cancer, Obesity and Type 2 Diabetes) that the proposed approach not only provides a robust method for visualization where we can observe clusters in the images but also enables us to improve the performance in disease prediction with deep learning algorithms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Moscow, J.A., et al.: The evidence framework for precision cancer medicine. Nat. Rev. Clin. Oncol. 15(3), 183–192 (2017)CrossRef Moscow, J.A., et al.: The evidence framework for precision cancer medicine. Nat. Rev. Clin. Oncol. 15(3), 183–192 (2017)CrossRef
2.
go back to reference Chial, H.: DNA sequencing technologies key to the Human Genome Project. Nat. Educ. 1(1), 219 (2008) Chial, H.: DNA sequencing technologies key to the Human Genome Project. Nat. Educ. 1(1), 219 (2008)
3.
go back to reference Handelsman, J.: Metagenomics: application of genomics to uncultured microorganisms. Microbiol. Mol. Biol. 69(1), 195–195 (2005)CrossRef Handelsman, J.: Metagenomics: application of genomics to uncultured microorganisms. Microbiol. Mol. Biol. 69(1), 195–195 (2005)CrossRef
6.
go back to reference Nakamura, S., et al.: Direct metagenomic detection of viral pathogens in nasal and fecal specimens using an unbiased high-throughput sequencing approach. PLoS ONE 4(1), e4219 (2009)CrossRef Nakamura, S., et al.: Direct metagenomic detection of viral pathogens in nasal and fecal specimens using an unbiased high-throughput sequencing approach. PLoS ONE 4(1), e4219 (2009)CrossRef
7.
go back to reference Li, L., Delwart, E.: From orphan virus to pathogen: the path to the clinical lab. Curr. Opin. Virol. 1(4), 282–288 (2011)CrossRef Li, L., Delwart, E.: From orphan virus to pathogen: the path to the clinical lab. Curr. Opin. Virol. 1(4), 282–288 (2011)CrossRef
8.
go back to reference Udugama, B., et al.: Diagnosing COVID-19: the disease and tools for detection. ACS Nano 14(4), 3822–3835 (2020)CrossRef Udugama, B., et al.: Diagnosing COVID-19: the disease and tools for detection. ACS Nano 14(4), 3822–3835 (2020)CrossRef
11.
go back to reference Soueidan, H., Nikolski, M.: Machine learning for metagenomics: methods and tools. Metagenomics 1(1) (2017) Soueidan, H., Nikolski, M.: Machine learning for metagenomics: methods and tools. Metagenomics 1(1) (2017)
12.
go back to reference Patwardhan, A., Ray. S., Roy, A.: Molecular markers in phylogenetic studies-a review. J. Phylogenetics Evol. Biol. 02(02) (2014) Patwardhan, A., Ray. S., Roy, A.: Molecular markers in phylogenetic studies-a review. J. Phylogenetics Evol. Biol. 02(02) (2014)
14.
go back to reference Zhou, F., et al.: Bayesian biclustering for microbial metagenomic sequencing data via multinomial matrix factorization. arXiv:2005.08361 (2020) Zhou, F., et al.: Bayesian biclustering for microbial metagenomic sequencing data via multinomial matrix factorization. arXiv:​2005.​08361 (2020)
16.
go back to reference Nguyen, T.H., et al.: Disease prediction using synthetic image representations of metagenomic data and convolutional neural networks. In: IEEE-RIVF, pp 231–236. IEEE Xplore (2019). ISBN 978-1-5386-9313-1 Nguyen, T.H., et al.: Disease prediction using synthetic image representations of metagenomic data and convolutional neural networks. In: IEEE-RIVF, pp 231–236. IEEE Xplore (2019). ISBN 978-1-5386-9313-1
17.
go back to reference Alonso, J.B.: K-means vs mini batch k-means: a comparison (2013) Alonso, J.B.: K-means vs mini batch k-means: a comparison (2013)
18.
go back to reference Soni, R., James Mathai, K.: An innovative ‘cluster-then-predict’ approach for improved sentiment prediction. In: Choudhary, R.K., Mandal, J.K., Auluck, N., Nagarajaram, H.A. (eds.) Advanced Computing and Communication Technologies. AISC, vol. 452, pp. 131–140. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-1023-1_13CrossRef Soni, R., James Mathai, K.: An innovative ‘cluster-then-predict’ approach for improved sentiment prediction. In: Choudhary, R.K., Mandal, J.K., Auluck, N., Nagarajaram, H.A. (eds.) Advanced Computing and Communication Technologies. AISC, vol. 452, pp. 131–140. Springer, Singapore (2016). https://​doi.​org/​10.​1007/​978-981-10-1023-1_​13CrossRef
19.
go back to reference Liang, Q. et al.: DeepMicrobes: taxonomic classification for metagenomics with deep learning. NAR Genomics Bioinform. 2(1) (2020) Liang, Q. et al.: DeepMicrobes: taxonomic classification for metagenomics with deep learning. NAR Genomics Bioinform. 2(1) (2020)
Metadata
Title
K-Means Clustering for Features Arrangement in Metagenomic Data Visualization
Authors
Hai Thanh Nguyen
Toan Bao Tran
Huong Hoang Luong
Trung Phuoc Le
Nghi C. Tran
Quoc-Dinh Truong
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-63119-2_7

Premium Partner