Skip to main content

2023 | OriginalPaper | Buchkapitel

Determining HPV Status in Patients with Oropharyngeal Cancer from 3D CT Images Using Radiomics: Effect of Sampling Methods

verfasst von : Kubra Sarac, Albert Guvenis

Erschienen in: Bioinformatics and Biomedical Engineering

Verlag: Springer Nature Switzerland

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Non-invasive detection of human papillomavirus (HPV) status is important for the treatment planning of patients with oropharyngeal cancer (OPC). In this work, three-dimensional (3D) head and neck computed tomography (CT) scans are utilized to identify HPV infection status in patients with OPC by applying radiomics and several resampling methods to handle highly imbalanced data. 1142 radiomic features were obtained from the segmented CT images of 238 patients. The features used were selected through correlation coefficient analysis, feature importance analysis, and backward elimination. The fifty most important features were chosen. Six different sampling methods, which are Synthetic Minority Oversampling Technique (SMOTE), Support Vector Machine Synthetic Minority Oversampling Technique (SVMSMOTE), Adaptive Synthetic Sampling Method (ADASYN), NearMiss, Condensed Nearest Neighbors (CNN), and Tomek’s Link, were performed on the training set for each of the positive and negative HPV classes. Two different machine learning (ML) algorithms, a Light Gradient Boosting Machine (LightGBM) and Extreme Gradient Boosting (XGBoost), were applied as predictive classification models. Model performances were assessed separately on 20% of the data. Oversampling methods displayed better performance than undersampling methods. The best performance was seen in the combination of SMOTE and XGBoost algorithms, which had an area under the curve (AUC) of 0.93 (95% CI: 82–99) and an accuracy of 90% (95% CI: 78–96). Our work demonstrated a reasonable accuracy in the forecast of HPV status using 3D imbalanced and small datasets. Further work is needed to test the algorithms on larger, balanced, and multi-institutional data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
13.
Zurück zum Zitat Zhinan, L.: Prediction of HPV status in oropharyngeal squamous cell carcinoma based on radiomics and machine learning algorithms: a multi-cohort study, pp. 1–16 (2022) Zhinan, L.: Prediction of HPV status in oropharyngeal squamous cell carcinoma based on radiomics and machine learning algorithms: a multi-cohort study, pp. 1–16 (2022)
25.
Zurück zum Zitat Yen, S.J., Lee, Y.S.: Under-sampling approaches for improving prediction of the minority class in an imbalanced dataset. In: Huang, D.S., Li, K., Irwin, G.W. (eds.) Intelligent Control and Automation. Lecture Notes in Control and Information Sciences, vol. 344. Springer, Heidelberg (2006). https://doi.org/10.1007/978-3-540-37256-1_89 Yen, S.J., Lee, Y.S.: Under-sampling approaches for improving prediction of the minority class in an imbalanced dataset. In: Huang, D.S., Li, K., Irwin, G.W. (eds.) Intelligent Control and Automation. Lecture Notes in Control and Information Sciences, vol. 344. Springer, Heidelberg (2006). https://​doi.​org/​10.​1007/​978-3-540-37256-1_​89
41.
Zurück zum Zitat Stancin, I., Jovic, A.: An overview and comparison of free Python libraries for data mining and big data analysis. In: 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2019 – Proceedings, pp. 977–982 (2019). https://doi.org/10.23919/MIPRO.2019.8757088 Stancin, I., Jovic, A.: An overview and comparison of free Python libraries for data mining and big data analysis. In: 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2019 – Proceedings, pp. 977–982 (2019). https://​doi.​org/​10.​23919/​MIPRO.​2019.​8757088
Metadaten
Titel
Determining HPV Status in Patients with Oropharyngeal Cancer from 3D CT Images Using Radiomics: Effect of Sampling Methods
verfasst von
Kubra Sarac
Albert Guvenis
Copyright-Jahr
2023
DOI
https://doi.org/10.1007/978-3-031-34960-7_3

Premium Partner