Skip to main content
Erschienen in: Data Mining and Knowledge Discovery 4/2021

18.01.2021

Predictive modeling of infant mortality

verfasst von: Antonia Saravanou, Clemens Noelke, Nicholas Huntington, Dolores Acevedo-Garcia, Dimitrios Gunopulos

Erschienen in: Data Mining and Knowledge Discovery | Ausgabe 4/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The Infant Mortality Rate (IMR) is defined as the number of infants for every thousand infants that do not survive until their first birthday. IMR is an important metric not only because it provides information about infant births in an area, but it also measures the general societal health status. In the United States of America, the IMR is higher than many other developed countries, despite the high level of prosperity. It is important to note here that the U.S.A. exhibits strong and persistent inequalities in the IMR across different racial and ethnic groups (Kochanek et al. in Natl Vital Stat Rep 65(4):1–122, 2006). In this paper, we study predictive models in the problem of infant mortality. We implement traditional machine learning models and state-of-the-art neural network models with various combinations of features extracted from birth certificates. Those combinations include features that can be summed as socio-economic and ethical features related to the mother and the father of the infant and medical measurements during the pregnancy and the delivery. We approach the classification problem of infant mortality, whether an infant will survive until her first birthday or not, both as binary and multi-class based on the time of death. We focus on understanding and exploring the importance of features extracted from the birth certificates. For example, we test the performance of models trained on the general population to models trained in subsets of the population, e.g., for individual races. We show in our experimental evaluation comparisons between different predictive models (including those used by epidemiology researchers), various combinations of features, different distributions in the training set and features’ importance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abrevaya J (2002) The effects of demographics and maternal behavior on the distribution of birth outcomes. In: Economic applications of quantile regression Abrevaya J (2002) The effects of demographics and maternal behavior on the distribution of birth outcomes. In: Economic applications of quantile regression
Zurück zum Zitat Acevedo-Garcia D, Soobader M, Berkman L (2007) Low birthweight among U.S. hispanic/latino subgroups: the effect of maternal foreign-born status and education. Soc Sci Med 65(12):2503–2516CrossRef Acevedo-Garcia D, Soobader M, Berkman L (2007) Low birthweight among U.S. hispanic/latino subgroups: the effect of maternal foreign-born status and education. Soc Sci Med 65(12):2503–2516CrossRef
Zurück zum Zitat Acevedo-Garcia D, Soobader MJ, Berkman LF (2005) The differential effect of foreign-born status on low birth weight by race/ethnicity and education. Pediatrics 115(1):e20–e30CrossRef Acevedo-Garcia D, Soobader MJ, Berkman LF (2005) The differential effect of foreign-born status on low birth weight by race/ethnicity and education. Pediatrics 115(1):e20–e30CrossRef
Zurück zum Zitat Acevedo-Garcia D, Soobader MJ, Berkman LF (2007) Low birthweight among us hispanic/latino subgroups: the effect of maternal foreign-born status and education. Soc Sci Med 65(12):2503–2516CrossRef Acevedo-Garcia D, Soobader MJ, Berkman LF (2007) Low birthweight among us hispanic/latino subgroups: the effect of maternal foreign-born status and education. Soc Sci Med 65(12):2503–2516CrossRef
Zurück zum Zitat Almond D, Chay KY, Lee DS (2005) The costs of low birth weight. Q J Econ 120:1031–1083 Almond D, Chay KY, Lee DS (2005) The costs of low birth weight. Q J Econ 120:1031–1083
Zurück zum Zitat Callaghan WM, MacDorman MF, Rasmussen SA, Qin C, Lackritz EM (2006) The contribution of preterm birth to infant mortality rates in the united states. Pediatrics 118(4):1566–1573CrossRef Callaghan WM, MacDorman MF, Rasmussen SA, Qin C, Lackritz EM (2006) The contribution of preterm birth to infant mortality rates in the united states. Pediatrics 118(4):1566–1573CrossRef
Zurück zum Zitat Casey BM, McIntire DD, Leveno KJ (2001) The continuing value of the Apgar score for the assessment of newborn infants. New Engl J Med 344:467–471CrossRef Casey BM, McIntire DD, Leveno KJ (2001) The continuing value of the Apgar score for the assessment of newborn infants. New Engl J Med 344:467–471CrossRef
Zurück zum Zitat Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd SIGKDD 2016. ACM Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd SIGKDD 2016. ACM
Zurück zum Zitat Doyle JM, Echevarria S, Frisbie WP (2003) Race/ethnicity, Apgar and infant mortality. Springer, Berlin Doyle JM, Echevarria S, Frisbie WP (2003) Race/ethnicity, Apgar and infant mortality. Springer, Berlin
Zurück zum Zitat Finch BK (2003) Early origins of the gradient: the relationship between socioeconomic status and infant mortality in the united states. Demography 40(4):675–699CrossRef Finch BK (2003) Early origins of the gradient: the relationship between socioeconomic status and infant mortality in the united states. Demography 40(4):675–699CrossRef
Zurück zum Zitat Health (2006) United States, 2005: with chartbook on trends in the health of Americans. US Department of Health and Human Services, Washington Health (2006) United States, 2005: with chartbook on trends in the health of Americans. US Department of Health and Human Services, Washington
Zurück zum Zitat Hegyi T, Carbone T, Anwar M, Ostfeld B, Hiatt M, Koons A, Pinto-Martin J, Paneth N (1998) The Apgar score and its components in the preterm infant. Pediatrics 101(1 Pt 1):77–81CrossRef Hegyi T, Carbone T, Anwar M, Ostfeld B, Hiatt M, Koons A, Pinto-Martin J, Paneth N (1998) The Apgar score and its components in the preterm infant. Pediatrics 101(1 Pt 1):77–81CrossRef
Zurück zum Zitat Hessol NA, Fuentes-Afflick E (2005) Ethnic differences in neonatal and postneonatal mortality. Pediatrics 115(1):e44–e51CrossRef Hessol NA, Fuentes-Afflick E (2005) Ethnic differences in neonatal and postneonatal mortality. Pediatrics 115(1):e44–e51CrossRef
Zurück zum Zitat Hessol NA, Fuentes-Afflick E, Bacchetti P (1998) Risk of low birth weight infants among black and white parents. Elsevier, Amsterdam Hessol NA, Fuentes-Afflick E, Bacchetti P (1998) Risk of low birth weight infants among black and white parents. Elsevier, Amsterdam
Zurück zum Zitat Hummer RA, Biegler M, De Turk PB, Forbes D, Frisbie WP, Hong Y, Pullum SG (1999) Race/ethnicity, nativity, and infant mortality in the United States. Soc Forc 77:1083–1118CrossRef Hummer RA, Biegler M, De Turk PB, Forbes D, Frisbie WP, Hong Y, Pullum SG (1999) Race/ethnicity, nativity, and infant mortality in the United States. Soc Forc 77:1083–1118CrossRef
Zurück zum Zitat John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the eleventh conference on uncertainty in artificial intelligence, UAI’95 John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the eleventh conference on uncertainty in artificial intelligence, UAI’95
Zurück zum Zitat Kochanek KD, Murphy SL, Xu J, Tejada-Vera B (2006) Deaths: final data for 2014. Natl Vital Stat Rep 65(4):1–122 Kochanek KD, Murphy SL, Xu J, Tejada-Vera B (2006) Deaths: final data for 2014. Natl Vital Stat Rep 65(4):1–122
Zurück zum Zitat Ma S, Finch BK (2010) Birth outcome measures and infant mortality. Popul Res Policy Rev 29:865CrossRef Ma S, Finch BK (2010) Birth outcome measures and infant mortality. Popul Res Policy Rev 29:865CrossRef
Zurück zum Zitat Macinko J, Guanais FC, de Souza M (2006) Evaluation of the impact of the family health program on infant mortality in brazil, 1990–2002. J Epidemiol Commun Health 60(1):13–19CrossRef Macinko J, Guanais FC, de Souza M (2006) Evaluation of the impact of the family health program on infant mortality in brazil, 1990–2002. J Epidemiol Commun Health 60(1):13–19CrossRef
Zurück zum Zitat Mathews T, MacDorman MF (2007) Infant mortality statistics from the 2004 period linked birth/infant death data set. Natl Vital Stat Rep 55(14):1–32 Mathews T, MacDorman MF (2007) Infant mortality statistics from the 2004 period linked birth/infant death data set. Natl Vital Stat Rep 55(14):1–32
Zurück zum Zitat McCormick MC (1985) The contribution of low birth weight to infant mortality and childhood morbidity. N Engl J Med 312:82–90CrossRef McCormick MC (1985) The contribution of low birth weight to infant mortality and childhood morbidity. N Engl J Med 312:82–90CrossRef
Zurück zum Zitat Osypuk TL, Acevedo-Garcia D (2008) Are racial disparities in preterm birth larger in hypersegregated areas? Am J Epidemiol 167(11):1295–1304CrossRef Osypuk TL, Acevedo-Garcia D (2008) Are racial disparities in preterm birth larger in hypersegregated areas? Am J Epidemiol 167(11):1295–1304CrossRef
Zurück zum Zitat Osypuk TL, Acevedo-Garcia D (2008) Are racial disparities in preterm birth larger in hypersegregated areas? Am J Epidemiol 167(11):1295–304CrossRef Osypuk TL, Acevedo-Garcia D (2008) Are racial disparities in preterm birth larger in hypersegregated areas? Am J Epidemiol 167(11):1295–304CrossRef
Zurück zum Zitat Papile LA (2001) The apgar score in the 21st century. N Engl J Med 344(7):519–520CrossRef Papile LA (2001) The apgar score in the 21st century. N Engl J Med 344(7):519–520CrossRef
Zurück zum Zitat Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830MathSciNetMATH
Zurück zum Zitat Potash E, Brew J, Loewi A, Majumdar S, Reece A, Walsh J, Rozier E, Jorgenson E, Mansour R, Ghani R (2015) Predictive modeling for public health: Preventing childhood lead poisoning. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, KDD’15. ACM Potash E, Brew J, Loewi A, Majumdar S, Reece A, Walsh J, Rozier E, Jorgenson E, Mansour R, Ghani R (2015) Predictive modeling for public health: Preventing childhood lead poisoning. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, KDD’15. ACM
Zurück zum Zitat Powers D, Parker F (2006) Race/ethnic differences and age-variation in the effects of birth outcomes on infant mortality in the US. Demograph Res 14(10):179–216CrossRef Powers D, Parker F (2006) Race/ethnic differences and age-variation in the effects of birth outcomes on infant mortality in the US. Demograph Res 14(10):179–216CrossRef
Zurück zum Zitat Rinta-Koski OP, Särkkä S, Hollmén J, Leskinen M, Andersson S (2018) Gaussian process classification for prediction of in-hospital mortality among preterm infants. Neurocomputing 298:134–141CrossRef Rinta-Koski OP, Särkkä S, Hollmén J, Leskinen M, Andersson S (2018) Gaussian process classification for prediction of in-hospital mortality among preterm infants. Neurocomputing 298:134–141CrossRef
Zurück zum Zitat Saravanou A, Noelke C, Huntington N, Acevedo-Garcia D, Gunopulos D (2019) Infant mortality prediction using birth certificate data. DSHealth KDD workshop. arXiv preprint arXiv:1907.08968 Saravanou A, Noelke C, Huntington N, Acevedo-Garcia D, Gunopulos D (2019) Infant mortality prediction using birth certificate data. DSHealth KDD workshop. arXiv preprint arXiv:​1907.​08968
Zurück zum Zitat Saravanou A, Noelke C, Huntington N, Acevedo-Garcia D, Gunopulos D (2019b) Predicting infant mortality at the time of birth. Population Association Annual Meeting, Austin Saravanou A, Noelke C, Huntington N, Acevedo-Garcia D, Gunopulos D (2019b) Predicting infant mortality at the time of birth. Population Association Annual Meeting, Austin
Zurück zum Zitat Schölkopf B, Williamson RC, Smola AJ, Shawe-Taylor J, Platt JC (2000) Support vector method for novelty detection. In: Advances in neural information processing systems, pp 582–588 Schölkopf B, Williamson RC, Smola AJ, Shawe-Taylor J, Platt JC (2000) Support vector method for novelty detection. In: Advances in neural information processing systems, pp 582–588
Zurück zum Zitat Somanchi S, Adhikari S, Lin A, Eneva E, Ghani R (2015) Early prediction of cardiac arrest (code blue) using electronic medical records. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, KDD’15. ACM Somanchi S, Adhikari S, Lin A, Eneva E, Ghani R (2015) Early prediction of cardiac arrest (code blue) using electronic medical records. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, KDD’15. ACM
Zurück zum Zitat Wilcox AJ (2001) On the importance-and the unimportance-of birthweight. Int J Epidemiol 30:1233–1241CrossRef Wilcox AJ (2001) On the importance-and the unimportance-of birthweight. Int J Epidemiol 30:1233–1241CrossRef
Zurück zum Zitat Wilcox AJ, Skjaerven R (1992) Birth weight and perinatal mortality: the effect of gestational age. Am J Public Health 82:378–82CrossRef Wilcox AJ, Skjaerven R (1992) Birth weight and perinatal mortality: the effect of gestational age. Am J Public Health 82:378–82CrossRef
Metadaten
Titel
Predictive modeling of infant mortality
verfasst von
Antonia Saravanou
Clemens Noelke
Nicholas Huntington
Dolores Acevedo-Garcia
Dimitrios Gunopulos
Publikationsdatum
18.01.2021
Verlag
Springer US
Erschienen in
Data Mining and Knowledge Discovery / Ausgabe 4/2021
Print ISSN: 1384-5810
Elektronische ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-020-00728-2

Weitere Artikel der Ausgabe 4/2021

Data Mining and Knowledge Discovery 4/2021 Zur Ausgabe

Premium Partner