Skip to main content
Erschienen in: Network Modeling Analysis in Health Informatics and Bioinformatics 4/2013

01.12.2013 | Original Article

Data mining models for predicting oral cancer survivability

verfasst von: Neha Sharma, Hari Om

Erschienen in: Network Modeling Analysis in Health Informatics and Bioinformatics | Ausgabe 4/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, three predictive models are proposed to identify the most effective model for predicting the survival rate of oral cancer in patients who visit the ENT OPD. This study examined 1,024 patients who visited a tertiary care center during Jan 2004 and Dec 2009. The predictive models developed in this work are Single Tree, Decision Tree Forest and TreeBoost based on classification analysis. For all these models, it is observed that there is no misclassified row in any category and all cases have correctly been classified. The sensitivity and specificity of these models is 100 %. All the models display similar results and performance; however, as the TreeBoost model considers all 18 predictors for each split, it is marginally better than the Single Tree and Decision Tree Forest. The experimental results of probability calibration, threshold analysis and lift–gain are also slightly better in case of the TreeBoost model. Thus, the TreeBoost classification model is optimal for predicting survivability of oral cancer patients.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abual-Rub MS (2012) et al A hybrid harmony search algorithm for ab initio protein tertiary structure prediction. Netw Model Anal Health Inform Bioinform 1(3):69–85CrossRef Abual-Rub MS (2012) et al A hybrid harmony search algorithm for ab initio protein tertiary structure prediction. Netw Model Anal Health Inform Bioinform 1(3):69–85CrossRef
Zurück zum Zitat Agrawal M, Pandey S, Jain S, Maitin S (2012) Oral cancer awareness of the general public in Gorakhpur City, India. Asian Pac J Cancer Prev 13:5195–5199CrossRef Agrawal M, Pandey S, Jain S, Maitin S (2012) Oral cancer awareness of the general public in Gorakhpur City, India. Asian Pac J Cancer Prev 13:5195–5199CrossRef
Zurück zum Zitat Anuradha K, Sankaranarayanan K (2012) Identification of suspicious regions to detect oral cancers at an earlier stage–a literature survey. Int J Adv Eng Technol 3(1):84–91 Anuradha K, Sankaranarayanan K (2012) Identification of suspicious regions to detect oral cancers at an earlier stage–a literature survey. Int J Adv Eng Technol 3(1):84–91
Zurück zum Zitat Christopher C (2010) Encyclopaedia Britannica: definition of data mining. Retrieved 2010-12-09 Christopher C (2010) Encyclopaedia Britannica: definition of data mining. Retrieved 2010-12-09
Zurück zum Zitat Chuang L-Y, Wu K-C, Chang H-W, Yang C-H (2011) Support vector machine-based prediction for oral cancer using four snps in DNA repair genes. In: Proceedings of the international multiconference of engineers and computer scientists, March 16–18 2011 Chuang L-Y, Wu K-C, Chang H-W, Yang C-H (2011) Support vector machine-based prediction for oral cancer using four snps in DNA repair genes. In: Proceedings of the international multiconference of engineers and computer scientists, March 16–18 2011
Zurück zum Zitat Cunningham SJ, Holmes G (1999) Developing innovative applications in agriculture using data mining. In: Proceedings of the Southeast Asia regional computer confederation conference, 1999 Cunningham SJ, Holmes G (1999) Developing innovative applications in agriculture using data mining. In: Proceedings of the Southeast Asia regional computer confederation conference, 1999
Zurück zum Zitat Data Mining Curriculum (2006) ACM SIGKDD. 2006-04-30. Retrieved 2011-10-28 Data Mining Curriculum (2006) ACM SIGKDD. 2006-04-30. Retrieved 2011-10-28
Zurück zum Zitat Elango JK, Gangadharan P, Sumithra S, Kuriakose MA (2006) Trends of head and neck cancers in urban and rural India. Asian Pac J Cancer Prev 7(1):108–112 (view at Scopus) Elango JK, Gangadharan P, Sumithra S, Kuriakose MA (2006) Trends of head and neck cancers in urban and rural India. Asian Pac J Cancer Prev 7(1):108–112 (view at Scopus)
Zurück zum Zitat Fayyad UM, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery: an overview. In: Advances in knowledge discovery and data mining. AAAI Press, Menlo Park Fayyad UM, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery: an overview. In: Advances in knowledge discovery and data mining. AAAI Press, Menlo Park
Zurück zum Zitat Ferlay J, Shin HR, Bray F et al (2010) Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008. Int J Cancer 12:2893–2917CrossRef Ferlay J, Shin HR, Bray F et al (2010) Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008. Int J Cancer 12:2893–2917CrossRef
Zurück zum Zitat Gadewal NS, Zingde SM (2011) Database and interaction network of genes involved in oral cancer: Version II. Bioinformation 6(4):169–170CrossRef Gadewal NS, Zingde SM (2011) Database and interaction network of genes involved in oral cancer: Version II. Bioinformation 6(4):169–170CrossRef
Zurück zum Zitat Gupta MK, Misra K (2013) Modeling and simulation analysis of propyl-thiouracil (PTU), an anti-thyroid drug on thyroid peroxidase (TPO), thyroid stimulating hormone receptor (TSHR), and sodium iodide (NIS) symporter based on systems biology approach. Netw Model Anal Health Inform Bioinform 2(1):45–57MathSciNetCrossRef Gupta MK, Misra K (2013) Modeling and simulation analysis of propyl-thiouracil (PTU), an anti-thyroid drug on thyroid peroxidase (TPO), thyroid stimulating hormone receptor (TSHR), and sodium iodide (NIS) symporter based on systems biology approach. Netw Model Anal Health Inform Bioinform 2(1):45–57MathSciNetCrossRef
Zurück zum Zitat Gupta MK et al (2012) Prediction of miRNA in HIV-1 genome and its targets through artificial neural network: a bioinformatics approach. Netw Model Anal Health Inform Bioinform 1(4):141–151CrossRef Gupta MK et al (2012) Prediction of miRNA in HIV-1 genome and its targets through artificial neural network: a bioinformatics approach. Netw Model Anal Health Inform Bioinform 1(4):141–151CrossRef
Zurück zum Zitat Han J, Kamber M (2012) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann Han J, Kamber M (2012) Data mining: concepts and techniques, 3rd edn. Morgan Kaufmann
Zurück zum Zitat Jemal A, Thimas A, Murray T, Thun M (2002) Cancer statistics, 2002. CA Cancer J Clin 52:181–182CrossRef Jemal A, Thimas A, Murray T, Thun M (2002) Cancer statistics, 2002. CA Cancer J Clin 52:181–182CrossRef
Zurück zum Zitat Jemal A, Siegel R, Xu J, Ward E (2010) Cancer statistics. CA Cancer J Clin 60:277–300CrossRef Jemal A, Siegel R, Xu J, Ward E (2010) Cancer statistics. CA Cancer J Clin 60:277–300CrossRef
Zurück zum Zitat Kaladhar DSVGK, Chandana B, BharathKumar P (2011) Predicting cancer survivability using classification algorithms. Int J Res Rev Comput Sci (IJRRCS) 2(2):340–343 Kaladhar DSVGK, Chandana B, BharathKumar P (2011) Predicting cancer survivability using classification algorithms. Int J Res Rev Comput Sci (IJRRCS) 2(2):340–343
Zurück zum Zitat Kent S (1996) Diagnosis of oral cancer using genetic programming—a technical report, CSTR -96-14 Kent S (1996) Diagnosis of oral cancer using genetic programming—a technical report, CSTR -96-14
Zurück zum Zitat Khandekar PS, Bagdey PS, Tiwari RR (2006) Oral cancer and some epidemiological factors: a hospital based study. Indian J Community Med 31(3):157–159 Khandekar PS, Bagdey PS, Tiwari RR (2006) Oral cancer and some epidemiological factors: a hospital based study. Indian J Community Med 31(3):157–159
Zurück zum Zitat Manoharan N, Tyagi BB, Raina V (2010) Cancer incidences in rural Delhi—2004–2005. Asian Pac J Cancer Prev 11(1):73–78 (view at Scopus) Manoharan N, Tyagi BB, Raina V (2010) Cancer incidences in rural Delhi—2004–2005. Asian Pac J Cancer Prev 11(1):73–78 (view at Scopus)
Zurück zum Zitat Mehmed K (2003) Data mining: concepts, models, methods, and algorithms. Wiley, Chichester (ISBN 0-471-22852-4. OCLC 50055336) Mehmed K (2003) Data mining: concepts, models, methods, and algorithms. Wiley, Chichester (ISBN 0-471-22852-4. OCLC 50055336)
Zurück zum Zitat Milovic B, Milovic M (2012) Prediction and decision making in health care using data mining. Int J Public Health Sci 1(2):69–78 Milovic B, Milovic M (2012) Prediction and decision making in health care using data mining. Int J Public Health Sci 1(2):69–78
Zurück zum Zitat Neha S, Om H (2012) Framework for early detection and prevention of oral cancer using data mining. Int J Adv Eng Technol 4(2):302–310 Neha S, Om H (2012) Framework for early detection and prevention of oral cancer using data mining. Int J Adv Eng Technol 4(2):302–310
Zurück zum Zitat Sankaranarayanan R (1990) Oralcancer in India: an epidemiologic and clinical review. Oral Surg Oral Med Oral Pathol 69(3):325–330CrossRef Sankaranarayanan R (1990) Oralcancer in India: an epidemiologic and clinical review. Oral Surg Oral Med Oral Pathol 69(3):325–330CrossRef
Zurück zum Zitat Sankaranarayanan R, Ramadas K, Thomas G et al (2005) Effect of screening on oral cancer mortality in Kerala, India: a cluster-randomised controlled trial. Lancet 365(9475):1927–1933 (view at Publisher View at Google Scholar View at Scopus)CrossRef Sankaranarayanan R, Ramadas K, Thomas G et al (2005) Effect of screening on oral cancer mortality in Kerala, India: a cluster-randomised controlled trial. Lancet 365(9475):1927–1933 (view at Publisher View at Google Scholar View at Scopus)CrossRef
Zurück zum Zitat Scully C, Bagan JV, Hopper C, Epstein JB (2008) Oral Cancer: current and future diagnostics techniques—a review article. Am J Dent 21(4):199–209 Scully C, Bagan JV, Hopper C, Epstein JB (2008) Oral Cancer: current and future diagnostics techniques—a review article. Am J Dent 21(4):199–209
Zurück zum Zitat Seymour G (1993) Predictive inference: an introduction. Chapman & Hall, New York (ISBN 0-412-03471-9)MATH Seymour G (1993) Predictive inference: an introduction. Chapman & Hall, New York (ISBN 0-412-03471-9)MATH
Zurück zum Zitat Trevor H, Robert T, Jerome F (2009) The elements of statistical learning: data mining, inference, and prediction. Retrieved 2012-08-07 Trevor H, Robert T, Jerome F (2009) The elements of statistical learning: data mining, inference, and prediction. Retrieved 2012-08-07
Zurück zum Zitat Werning JW (2007) Oral cancer: diagnosis, management, and rehabilitation. Thieme, New York Werning JW (2007) Oral cancer: diagnosis, management, and rehabilitation. Thieme, New York
Zurück zum Zitat Woolgar JA, Scott J, Vaughan ED, Brown JS, West CR, Rogers S (1995) Survival, metastasis and recurrence of oral cancer in relation to pathological features. Ann R Coll Surg Engl 77:325–331 Woolgar JA, Scott J, Vaughan ED, Brown JS, West CR, Rogers S (1995) Survival, metastasis and recurrence of oral cancer in relation to pathological features. Ann R Coll Surg Engl 77:325–331
Metadaten
Titel
Data mining models for predicting oral cancer survivability
verfasst von
Neha Sharma
Hari Om
Publikationsdatum
01.12.2013
Verlag
Springer Vienna
Erschienen in
Network Modeling Analysis in Health Informatics and Bioinformatics / Ausgabe 4/2013
Print ISSN: 2192-6662
Elektronische ISSN: 2192-6670
DOI
https://doi.org/10.1007/s13721-013-0045-7

Weitere Artikel der Ausgabe 4/2013

Network Modeling Analysis in Health Informatics and Bioinformatics 4/2013 Zur Ausgabe