Skip to main content
Erschienen in: Journal of Classification 3/2020

23.12.2019 | Software Abstract

ROC and AUC with a Binary Predictor: a Potentially Misleading Metric

verfasst von: John Muschelli III

Erschienen in: Journal of Classification | Ausgabe 3/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In analysis of binary outcomes, the receiver operator characteristic (ROC) curve is heavily used to show the performance of a model or algorithm. The ROC curve is informative about the performance over a series of thresholds and can be summarized by the area under the curve (AUC), a single number. When a predictor is categorical, the ROC curve has one less than number of categories as potential thresholds; when the predictor is binary, there is only one threshold. As the AUC may be used in decision-making processes on determining the best model, it important to discuss how it agrees with the intuition from the ROC curve. We discuss how the interpolation of the curve between thresholds with binary predictors can largely change the AUC. Overall, we show using a linear interpolation from the ROC curve with binary predictors corresponds to the estimated AUC, which is most commonly done in software, which we believe can lead to misleading results. We compare R, Python, Stata, and SAS software implementations. We recommend using reporting the interpolation used and discuss the merit of using the step function interpolator, also referred to as the “pessimistic” approach by Fawcett (2006).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Bamber, D. (1975). The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. Journal of Mathematical Psychology, 12(4), 387–415.MathSciNetCrossRef Bamber, D. (1975). The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. Journal of Mathematical Psychology, 12(4), 387–415.MathSciNetCrossRef
Zurück zum Zitat Blumberg, D.M., De Moraes, C.G., Liebmann, J.M., Garg, R., Chen, C., Theventhiran, A., Hood, D.C. (2016). Technology and the glaucoma suspect. Investigative Ophthalmology & Visual Science, 57(9), OCT80–OCT85.CrossRef Blumberg, D.M., De Moraes, C.G., Liebmann, J.M., Garg, R., Chen, C., Theventhiran, A., Hood, D.C. (2016). Technology and the glaucoma suspect. Investigative Ophthalmology & Visual Science, 57(9), OCT80–OCT85.CrossRef
Zurück zum Zitat Budwega, J., Sprengerb, T., De Vere-Tyndall, A., Hagenkordd, A., Stippichd, C., Bergera, C.T. (2016). Factors associated with significant MRI findings in medical walk-in patients with acute headache. Swiss Medical Weekly, 146, w14349. Budwega, J., Sprengerb, T., De Vere-Tyndall, A., Hagenkordd, A., Stippichd, C., Bergera, C.T. (2016). Factors associated with significant MRI findings in medical walk-in patients with acute headache. Swiss Medical Weekly, 146, w14349.
Zurück zum Zitat DeLong, E.R, DeLong, D.M, Clarke-Pearson, D.L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics, 837–45. DeLong, E.R, DeLong, D.M, Clarke-Pearson, D.L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics, 837–45.
Zurück zum Zitat Glaveckaite, S., Valeviciene, N., Palionis, D., Skorniakov, V., Celutkiene, J., Tamosiunas, A., Uzdavinys, G., Laucevicius, A. (2011). Value of scar imaging and inotropic reserve combination for the prediction of segmental and global left ventricular functional recovery after revascularisation. Journal of Cardiovascular Magnetic Resonance, 13(1), 35.CrossRef Glaveckaite, S., Valeviciene, N., Palionis, D., Skorniakov, V., Celutkiene, J., Tamosiunas, A., Uzdavinys, G., Laucevicius, A. (2011). Value of scar imaging and inotropic reserve combination for the prediction of segmental and global left ventricular functional recovery after revascularisation. Journal of Cardiovascular Magnetic Resonance, 13(1), 35.CrossRef
Zurück zum Zitat Hanley, J.A, & McNeil, B.J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29–36.CrossRef Hanley, J.A, & McNeil, B.J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29–36.CrossRef
Zurück zum Zitat Hsu, Y.-C., & Lieli, R. (2014). Inference for ROC curves based on estimated predictive indices: a note on testing AUC = 0.5. Unpublished Manuscript. Hsu, Y.-C., & Lieli, R. (2014). Inference for ROC curves based on estimated predictive indices: a note on testing AUC = 0.5. Unpublished Manuscript.
Zurück zum Zitat Kushnir, V.A, Darmon, S.K, Barad, D.H, Gleicher, N. (2018). Degree of mosaicism in trophectoderm does not predict pregnancy potential: a corrected analysis of pregnancy outcomes following transfer of mosaic embryos. Reproductive Biology and Endocrinology, 16(1), 6.CrossRef Kushnir, V.A, Darmon, S.K, Barad, D.H, Gleicher, N. (2018). Degree of mosaicism in trophectoderm does not predict pregnancy potential: a corrected analysis of pregnancy outcomes following transfer of mosaic embryos. Reproductive Biology and Endocrinology, 16(1), 6.CrossRef
Zurück zum Zitat Mwipatayi, B.P, Sharma, S., Daneshmand, A., Thomas, S.D, Vijayan, V., Altaf, N., Garbowski, M., et al. (2016). Durability of the balloon-expandable covered versus bare-metal stents in the covered versus balloon expandable stent trial (COBEST) for the treatment of aortoiliac occlusive disease. Journal of Vascular Surgery, 64(1), 83–94.CrossRef Mwipatayi, B.P, Sharma, S., Daneshmand, A., Thomas, S.D, Vijayan, V., Altaf, N., Garbowski, M., et al. (2016). Durability of the balloon-expandable covered versus bare-metal stents in the covered versus balloon expandable stent trial (COBEST) for the treatment of aortoiliac occlusive disease. Journal of Vascular Surgery, 64(1), 83–94.CrossRef
Zurück zum Zitat Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., et al. (2011). Scikit-learn: machine learning in python. Journal of Machine Learning Research, 12, 2825–30.MathSciNetMATH Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., et al. (2011). Scikit-learn: machine learning in python. Journal of Machine Learning Research, 12, 2825–30.MathSciNetMATH
Zurück zum Zitat Pepe, M., Longton, G., Janes, H. (2009). Estimation and comparison of receiver operating characteristic curves. The Stata Journal, 9(1), 1.CrossRef Pepe, M., Longton, G., Janes, H. (2009). Estimation and comparison of receiver operating characteristic curves. The Stata Journal, 9(1), 1.CrossRef
Zurück zum Zitat Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F., Sanchez, J.-C., Müller, M. (2011). pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics, 12, 77.CrossRef Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F., Sanchez, J.-C., Müller, M. (2011). pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics, 12, 77.CrossRef
Zurück zum Zitat Saito, T., & Rehmsmeier, M. (2015). The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PloS One, 10(3), e0118432.CrossRef Saito, T., & Rehmsmeier, M. (2015). The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PloS One, 10(3), e0118432.CrossRef
Zurück zum Zitat SAS, S.A.S., & Version, S.T.A.T. (2017). 9.4 [Computer program]. Cary, NC:SAS Institute. SAS, S.A.S., & Version, S.T.A.T. (2017). 9.4 [Computer program]. Cary, NC:SAS Institute.
Zurück zum Zitat Shterev, I.D, Dunson, D.B, Chan, C., Sempowski, G.D. (2018). Bayesian multi-plate high-throughput screening of compounds. Scientific Reports, 8(1), 9551.CrossRef Shterev, I.D, Dunson, D.B, Chan, C., Sempowski, G.D. (2018). Bayesian multi-plate high-throughput screening of compounds. Scientific Reports, 8(1), 9551.CrossRef
Zurück zum Zitat Snarr, B.S, Liu, M.Y, Zuckerberg, J.C, Falkensammer, C.B, Nadaraj, S., Burstein, D., Ho, D., et al. (2017). The parasternal short-axis view improves diagnostic accuracy for inferior sinus venosus type of atrial septal defects by transthoracic echocardiography. Journal of the American Society of Echocardiography, 30(3), 209–15.CrossRef Snarr, B.S, Liu, M.Y, Zuckerberg, J.C, Falkensammer, C.B, Nadaraj, S., Burstein, D., Ho, D., et al. (2017). The parasternal short-axis view improves diagnostic accuracy for inferior sinus venosus type of atrial septal defects by transthoracic echocardiography. Journal of the American Society of Echocardiography, 30(3), 209–15.CrossRef
Zurück zum Zitat Stata, S. (2013). Release 13. Statistical software. StataCorp LP, College Station, TX. Stata, S. (2013). Release 13. Statistical software. StataCorp LP, College Station, TX.
Zurück zum Zitat Veltri, D., Kamath, U., Shehu, A. (2018). Deep learning improves antimicrobial peptide recognition. Bioinformatics, 1, 8. Veltri, D., Kamath, U., Shehu, A. (2018). Deep learning improves antimicrobial peptide recognition. Bioinformatics, 1, 8.
Zurück zum Zitat Xiong, X., Li, Q., Yang, W.-S., Wei, X., Hu, X., Wang, X.-C., Zhu, D., Li, R., Cao, D., Xie, P. (2018). Comparison of swirl sign and black hole sign in predicting early hematoma growth in patients with spontaneous intracerebral hemorrhage. Medical Science Monitor: International Medical Journal of Experimental and Clinical Research, 24, 567.CrossRef Xiong, X., Li, Q., Yang, W.-S., Wei, X., Hu, X., Wang, X.-C., Zhu, D., Li, R., Cao, D., Xie, P. (2018). Comparison of swirl sign and black hole sign in predicting early hematoma growth in patients with spontaneous intracerebral hemorrhage. Medical Science Monitor: International Medical Journal of Experimental and Clinical Research, 24, 567.CrossRef
Metadaten
Titel
ROC and AUC with a Binary Predictor: a Potentially Misleading Metric
verfasst von
John Muschelli III
Publikationsdatum
23.12.2019
Verlag
Springer US
Erschienen in
Journal of Classification / Ausgabe 3/2020
Print ISSN: 0176-4268
Elektronische ISSN: 1432-1343
DOI
https://doi.org/10.1007/s00357-019-09345-1

Weitere Artikel der Ausgabe 3/2020

Journal of Classification 3/2020 Zur Ausgabe

Premium Partner