
2018 | OriginalPaper | Book chapter

58. Selection of Transformations of Continuous Predictors in Logistic Regression

Authors: Michael Chang, Rohan J. Dalpatadu, Ashok K. Singh

Published in: Information Technology - New Generations

Publisher: Springer International Publishing


Abstract

Binary logistic regression is a machine learning tool for classification and discrimination that is widely used in business analytics and medical research. Transforming continuous predictors to improve the performance of a logistic regression model is common practice, but the statistical and data mining literature offers no systematic method for finding optimal transformations. This paper considers the problem of selecting transformations of continuous predictors to improve the performance of logistic regression models. The proposed method is based on the point-biserial correlation coefficient between the binary response and a continuous predictor. Several examples are presented to illustrate the proposed method.
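The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one plausible reading: compute the point-biserial correlation between the binary response and each transformed version of a predictor, then rank the transformations by absolute correlation. The candidate set (`CANDIDATES`), the function names, and the "largest absolute correlation wins" rule are assumptions made for illustration and are not taken from the chapter. The point-biserial coefficient itself is numerically identical to the Pearson correlation between a 0/1 variable and a continuous variable, which is what the code relies on.

```python
import numpy as np


def point_biserial(y, x):
    """Point-biserial correlation between binary y (0/1) and continuous x.

    It equals the Pearson correlation computed on the 0/1 coding of y.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    return float(np.corrcoef(y, x)[0, 1])


# Hypothetical candidate transformations (an assumption, not the chapter's list).
# All of them require strictly positive x.
CANDIDATES = {
    "identity": lambda x: x,
    "log": np.log,
    "sqrt": np.sqrt,
    "square": np.square,
    "reciprocal": lambda x: 1.0 / x,
}


def rank_transformations(y, x, candidates=CANDIDATES):
    """Return (name, |r_pb|) pairs sorted from strongest to weakest."""
    x = np.asarray(x, dtype=float)
    scores = {
        name: abs(point_biserial(y, f(x))) for name, f in candidates.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Simulated data: the log-odds are linear in log(x), so a log
    # transformation is the natural choice for this predictor.
    rng = np.random.default_rng(0)
    x = rng.lognormal(mean=0.0, sigma=1.0, size=2000)
    p = 1.0 / (1.0 + np.exp(-1.5 * np.log(x)))
    y = rng.binomial(1, p)
    for name, score in rank_transformations(y, x):
        print(f"{name:>10s}  |r_pb| = {score:.3f}")
```

A ranking of this kind only screens one predictor at a time; whether the chosen transformation actually improves the fitted logistic model would still need to be checked with the usual performance measures on held-out data.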


Metadata
Title
Selection of Transformations of Continuous Predictors in Logistic Regression
Authors
Michael Chang
Rohan J. Dalpatadu
Ashok K. Singh
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-77028-4_58
