2020 | OriginalPaper | Book Chapter

Predicting Cancer Patients’ Survival Using Random Forests

Authors: Camila Takemoto Bertolini, Saul de Castro Leite, Fernanda Nascimento Almeida

Published in: Advances in Bioinformatics and Computational Biology

Publisher: Springer International Publishing

Abstract

The growing amount of data available on the web, coupled with the demand for useful information, has sparked increasing interest in extracting knowledge from large information systems, especially biomedical ones. Health institutions operate in an environment that generates thousands of patient health records. Such databases can be a source of a wealth of information. For instance, they can be used to study factors that contribute to the incidence of a pathology and thereby determine patient profiles at the earliest stage of the disease. Such information can be extracted with the help of Machine Learning methods, which are capable of handling large amounts of data in order to make predictions. These methods offer an opportunity to translate new data into tangible information and thus allow earlier diagnosis and more precise treatment options. In order to understand the potential of these methods, we use a database that contains records of cancer patients, made publicly available by the Oncocentro Foundation of São Paulo. This database contains historical clinical information on cancer patients from the past 20 years. In this paper we present an initial investigation towards the goal of improving prognosis and therefore increasing the chances of survival among cancer patients. The Random Forest classification model was employed in our analysis; this model proves to be a suitable prediction tool for our purpose. Thus, we intend to present means that allow the design of predictive, preventive and personalized treatments, as well as assist in the decision-making process for managing the disease.
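To make the abstract's mention of a Random Forest classifier concrete, the following is a minimal sketch of how such a model might be trained and evaluated with scikit-learn. It is not the authors' pipeline: the data are synthetic (generated with `make_classification`, standing in for patient records with a binary outcome), and the feature count, split ratio, and hyperparameters are all illustrative assumptions rather than values taken from the paper.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for patient records (assumption: NOT the Oncocentro data).
# Each row is a hypothetical patient; y is a hypothetical binary outcome label.
X, y = make_classification(
    n_samples=500, n_features=10, n_informative=5, random_state=0
)

# Hold out 25% of the records for evaluation, preserving the class balance.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# An ensemble of 100 decision trees; each tree is fit on a bootstrap sample
# and considers a random subset of features at every split.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# Evaluate on the held-out records.
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)

# Impurity-based importances give a rough view of which features drive predictions.
importances = model.feature_importances_

print(f"held-out accuracy: {accuracy:.3f}")
```

On real clinical data, accuracy alone would likely be insufficient (for example, when outcome classes are imbalanced), so this sketch should be read only as an outline of the train, predict, and evaluate steps.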

Metadata
Title
Predicting Cancer Patients’ Survival Using Random Forests
Authors
Camila Takemoto Bertolini
Saul de Castro Leite
Fernanda Nascimento Almeida
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-46417-2_9