Skip to main content
Top
Published in: Annals of Data Science 4/2020

10-06-2020

Copula Approach for Developing a Biomarker Panel for Prediction of Dengue Hemorrhagic Fever

Authors: Jong-Min Kim, Hyunsu Ju, Yoonsung Jung

Published in: Annals of Data Science | Issue 4/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The choice of variable-selection methods to identify important variables for binary classification modeling is critical for producing stable statistical models that are interpretable, that generate accurate predictions, and have minimal bias. This work is motivated by the availability of data on clinical and laboratory features of dengue fever infections obtained from 51 individuals enrolled in a prospective observational study of acute human dengue infections. Our paper uses objective Bayesian method to identify important variables for dengue hemorrhagic fever (DHF) over the dengue data set. With the selected important variables by objective Bayesian method, we employ a Gaussian copula marginal regression model considering correlation error structure and a general method of semi-parametric Bayesian inference for Gaussian copula model to estimate, separately, the marginal distribution and dependence structure. We also carry out a receiver operating characteristic (ROC) analysis for the predictive model for DHF and compare our proposed model with the other models of Ju and Brasier (Variable selection methods for developing a biomarker panel for prediction of dengue hemorrhagic fever. BMC Res Notes 6:365, 2013) tested on the basis of the ROC analysis. Our results extend the previous models of DHF by suggesting that IL-10, Days Fever, Sex and Lymphocytes are the major features for predicting DHF on the basis of blood chemistries and cytokine measurements. In addition, the dependence structure of these Days Fever, Lymphocytes, IL-10 and Sex protein profiles associated with disease outcomes was discovered by the semi-parametric Bayesian Gaussian copula model and Gaussian partial correlation method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aasa K, Czadob C, Frigessic A, Bakkend H (2009) Pair-copula constructions of multiple dependence. Insur Math Econ 44(2):182–198CrossRef Aasa K, Czadob C, Frigessic A, Bakkend H (2009) Pair-copula constructions of multiple dependence. Insur Math Econ 44(2):182–198CrossRef
3.
go back to reference Bayarri MJ, Berger JO, Forte A, Garcia-Donato G (2012) Criteria for Bayesian model choice with application to variable selection. Ann Stat 40:1550–1577CrossRef Bayarri MJ, Berger JO, Forte A, Garcia-Donato G (2012) Criteria for Bayesian model choice with application to variable selection. Ann Stat 40:1550–1577CrossRef
4.
go back to reference Brasier AR, Ju H, Garcia J, Spratt HM, Victor SS, Forshey BM, Halsey ES, Comach G, Sierra G, Blair PJ, Rocha C, Morrison AC, Scott TW, Bazan I, Kochel TJ, Venezuelan Dengue Fever Working Group (2012) A Three-Component Biomarker Panel for Prediction of Dengue Hemorrhagic Fever. Am J Trop Med Hyg 86(2):341–348 Brasier AR, Ju H, Garcia J, Spratt HM, Victor SS, Forshey BM, Halsey ES, Comach G, Sierra G, Blair PJ, Rocha C, Morrison AC, Scott TW, Bazan I, Kochel TJ, Venezuelan Dengue Fever Working Group (2012) A Three-Component Biomarker Panel for Prediction of Dengue Hemorrhagic Fever. Am J Trop Med Hyg 86(2):341–348
5.
go back to reference Denuit M, Lambert P (2005) Constraints on concordance measures in bivariate discrete data. J Multivar Anal 93(1):40–57CrossRef Denuit M, Lambert P (2005) Constraints on concordance measures in bivariate discrete data. J Multivar Anal 93(1):40–57CrossRef
6.
go back to reference Garcia-Donato G, Forte A (2015) R package BayesVarSel. R Foundation for Statistical Computing, Vienna Garcia-Donato G, Forte A (2015) R package BayesVarSel. R Foundation for Statistical Computing, Vienna
8.
go back to reference Genest C, Ghoudi K, Rivest LP (1995) A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82(3):543–552CrossRef Genest C, Ghoudi K, Rivest LP (1995) A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82(3):543–552CrossRef
9.
go back to reference Hoff PD (2007) Extending the rank likelihood for semiparametric copula estimation. Ann Appl Stat 1(1):265–283CrossRef Hoff PD (2007) Extending the rank likelihood for semiparametric copula estimation. Ann Appl Stat 1(1):265–283CrossRef
10.
go back to reference Joe H (1997) Multivariate models and dependence concepts. Chapman and Hall, LondonCrossRef Joe H (1997) Multivariate models and dependence concepts. Chapman and Hall, LondonCrossRef
11.
go back to reference Ju H, Brasier AR (2013) Variable selection methods for developing a biomarker panel for prediction of dengue hemorrhagic fever. BMC Res Notes 6:365CrossRef Ju H, Brasier AR (2013) Variable selection methods for developing a biomarker panel for prediction of dengue hemorrhagic fever. BMC Res Notes 6:365CrossRef
12.
go back to reference Kim D, Kim J-M (2014) Analysis of directional dependence using asymmetric copula-based regression models. J Stat Comput Simul 84(9):1990–2010CrossRef Kim D, Kim J-M (2014) Analysis of directional dependence using asymmetric copula-based regression models. J Stat Comput Simul 84(9):1990–2010CrossRef
13.
go back to reference Kim J-M, Jung Y-S, Sungur EA, Han K, Park C, Sohn I (2008) A copula method for modeling directional dependence of genes. BMC Bioinform 9:225CrossRef Kim J-M, Jung Y-S, Sungur EA, Han K, Park C, Sohn I (2008) A copula method for modeling directional dependence of genes. BMC Bioinform 9:225CrossRef
14.
go back to reference Kim J-M, Jung Y-S, Choi T, Sungur EA (2011) Partial correlation with copula modeling. Comput Stat Data Anal 55(3):1357–1366CrossRef Kim J-M, Jung Y-S, Choi T, Sungur EA (2011) Partial correlation with copula modeling. Comput Stat Data Anal 55(3):1357–1366CrossRef
15.
go back to reference Kojadinovic I, Yan J (2010) Modeling multivariate distributions with continuous margins using the copula R Package. J Stat Softw 34(9):1–20CrossRef Kojadinovic I, Yan J (2010) Modeling multivariate distributions with continuous margins using the copula R Package. J Stat Softw 34(9):1–20CrossRef
16.
go back to reference Madsen L, Fang Y (2011) Joint regression analysis for discrete longitudinal data. Biometrics 67(3):1171–1175CrossRef Madsen L, Fang Y (2011) Joint regression analysis for discrete longitudinal data. Biometrics 67(3):1171–1175CrossRef
17.
go back to reference Masarotto G, Varin C (2012) Gaussian copula marginal regression. Electron J Stat 6:1517–1549CrossRef Masarotto G, Varin C (2012) Gaussian copula marginal regression. Electron J Stat 6:1517–1549CrossRef
18.
go back to reference Nelsen R (2006) An introduction to copulas, 2nd edn. Springer, New York Nelsen R (2006) An introduction to copulas, 2nd edn. Springer, New York
19.
go back to reference Olson D, Shi Y (2007) Introduction to business data mining. McGraw-Hill/Irwin, Boston Olson D, Shi Y (2007) Introduction to business data mining. McGraw-Hill/Irwin, Boston
20.
go back to reference Shi Y, Tian YJ, Kou G, Peng Y, Li JP (2011) Optimization based data mining: theory and applications. Springer, LondonCrossRef Shi Y, Tian YJ, Kou G, Peng Y, Li JP (2011) Optimization based data mining: theory and applications. Springer, LondonCrossRef
21.
go back to reference Sklar A (1959) Fonctions de repartition a n-dimensions et leurs marges, (French). Publ Inst Stat Univ Paris 8:229–231 Sklar A (1959) Fonctions de repartition a n-dimensions et leurs marges, (French). Publ Inst Stat Univ Paris 8:229–231
22.
go back to reference Song PX-K (2000) Multivariate dispersion models generated from Gaussian copula. Scand J Stat 27:305–320CrossRef Song PX-K (2000) Multivariate dispersion models generated from Gaussian copula. Scand J Stat 27:305–320CrossRef
23.
go back to reference Zellner A (1986) On assessing prior distributions and Bayesian regression analysis with g-prior distributions. In: Zellner A (ed) In Bayesian inference and decision techniques: essays in Honor of Bruno de Finetti. Edward Elgar Publishing Limited, Cheltenham, pp 389–399 Zellner A (1986) On assessing prior distributions and Bayesian regression analysis with g-prior distributions. In: Zellner A (ed) In Bayesian inference and decision techniques: essays in Honor of Bruno de Finetti. Edward Elgar Publishing Limited, Cheltenham, pp 389–399
Metadata
Title
Copula Approach for Developing a Biomarker Panel for Prediction of Dengue Hemorrhagic Fever
Authors
Jong-Min Kim
Hyunsu Ju
Yoonsung Jung
Publication date
10-06-2020
Publisher
Springer Berlin Heidelberg
Published in
Annals of Data Science / Issue 4/2020
Print ISSN: 2198-5804
Electronic ISSN: 2198-5812
DOI
https://doi.org/10.1007/s40745-020-00293-x

Other articles of this Issue 4/2020

Annals of Data Science 4/2020 Go to the issue