Skip to main content
Top
Published in: Annals of Data Science 2/2021

16-05-2019

Generalized Count Data Regression Models and Their Applications to Health Care Data

Authors: Carl Lee, Felix Famoye, Alfred Akinsete

Published in: Annals of Data Science | Issue 2/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A method for developing generalized parametric regression models for count data is proposed and studied. The method is based on the framework of the T-geometric family of distributions. A T-geometric family consists of discrete distributions, which are analogues to the continuous distributions for the random variable T. The general methodology is applied to derive some generalized regression models for count data. These regression models can fit count data that are under-dispersed, equi-dispersed or over-dispersed. The extension to model truncated or inflated data is addressed. Some new generalized T-geometric regression models are applied to real world data sets to illustrate the flexibility of the models. The models were fitted to four response variables from health care data and their performance compared. No single regression model outperforms other models for all the four response variables. Thus, a researcher should evaluate different models before selecting a final regression model for a count response variable.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Akinsete A, Famoye F, Lee C (2014) The Kumaraswamy-geometric distribution. J Stat Distrib Appl 1:17CrossRef Akinsete A, Famoye F, Lee C (2014) The Kumaraswamy-geometric distribution. J Stat Distrib Appl 1:17CrossRef
2.
go back to reference Aljarrah MA, Lee C, Famoye F (2014) On generating T–X family of distributions using quantile functions. J Stat Distrib Appl 1:1CrossRef Aljarrah MA, Lee C, Famoye F (2014) On generating TX family of distributions using quantile functions. J Stat Distrib Appl 1:1CrossRef
3.
go back to reference Alzaatreh A, Lee C, Famoye F (2014) T-normal family of distributions: a new approach to generalize the normal distribution. J Stat Distrib Appl 1:16CrossRef Alzaatreh A, Lee C, Famoye F (2014) T-normal family of distributions: a new approach to generalize the normal distribution. J Stat Distrib Appl 1:16CrossRef
4.
go back to reference Alzaatreh A, Lee C, Famoye F (2012) On the discrete analogues of continuous distributions. Stat Methodol 9:589–603CrossRef Alzaatreh A, Lee C, Famoye F (2012) On the discrete analogues of continuous distributions. Stat Methodol 9:589–603CrossRef
5.
go back to reference Cameron AC, Johansson P (1997) Count data regression using series expansion: with applications. J Appl Econom 12:203–223CrossRef Cameron AC, Johansson P (1997) Count data regression using series expansion: with applications. J Appl Econom 12:203–223CrossRef
6.
go back to reference Cameron AC, Trivedi PK (2013) Regression analysis of count data, 2nd edn. Cambridge University Press, CambridgeCrossRef Cameron AC, Trivedi PK (2013) Regression analysis of count data, 2nd edn. Cambridge University Press, CambridgeCrossRef
7.
go back to reference Cameron AC, Trivedi PK, Milne F, Piggott J (1988) A microeconomic model of the demand for health care and health insurance in Australia. Rev Econom Stud, LV, pp 85–106 Cameron AC, Trivedi PK, Milne F, Piggott J (1988) A microeconomic model of the demand for health care and health insurance in Australia. Rev Econom Stud, LV, pp 85–106
8.
go back to reference Chakraborty S, Chakravarty D (2012) Discrete gamma distributions: properties and parameter estimations. Commun Stat Theory Methods 41:3301–3324CrossRef Chakraborty S, Chakravarty D (2012) Discrete gamma distributions: properties and parameter estimations. Commun Stat Theory Methods 41:3301–3324CrossRef
9.
go back to reference Chambers R, Dreassi E, Salvati N (2014) Disease mapping via negative binomial regression M-quantiles. Stat Med 33(27):4805–4824CrossRef Chambers R, Dreassi E, Salvati N (2014) Disease mapping via negative binomial regression M-quantiles. Stat Med 33(27):4805–4824CrossRef
11.
go back to reference Consul PC, Famoye F (2006) Lagrangian probability distributions. Birkhäuser, Boston Consul PC, Famoye F (2006) Lagrangian probability distributions. Birkhäuser, Boston
12.
go back to reference Consul PC, Shoukri MM (1985) The generalized Poisson distribution when the sample mean is larger than the sample variance. Commun Stat Theory Methods 14:667–681 Consul PC, Shoukri MM (1985) The generalized Poisson distribution when the sample mean is larger than the sample variance. Commun Stat Theory Methods 14:667–681
13.
go back to reference Epstein ES (1969) A scoring system for probability forecasts of ranked categories. J Appl Meteorol 8:985–987CrossRef Epstein ES (1969) A scoring system for probability forecasts of ranked categories. J Appl Meteorol 8:985–987CrossRef
14.
go back to reference Famoye F (2018) Exponentiated Weibull-geometric distribution and its regression model. J Data Sci Famoye F (2018) Exponentiated Weibull-geometric distribution and its regression model. J Data Sci
15.
go back to reference Famoye F (1993) Restricted generalized Poisson regression model. Commun Stat Theory Methods 22(5):1335–1354CrossRef Famoye F (1993) Restricted generalized Poisson regression model. Commun Stat Theory Methods 22(5):1335–1354CrossRef
16.
go back to reference Famoye F, Lee C (2017) Exponentiated exponential-geometric regression model. J Appl Stat 44(16):2963–2977CrossRef Famoye F, Lee C (2017) Exponentiated exponential-geometric regression model. J Appl Stat 44(16):2963–2977CrossRef
17.
go back to reference Famoye F, Singh KP (2006) Zero-inflated generalized Poisson regression model with applications to domestic violence data. J Data Sci 4(1):117–130 Famoye F, Singh KP (2006) Zero-inflated generalized Poisson regression model with applications to domestic violence data. J Data Sci 4(1):117–130
18.
go back to reference Frome EL, Kurtner MH, Beauchamp JJ (1973) Regression analysis of Poisson-distributed data. J Am Stat Assoc 68:288–298CrossRef Frome EL, Kurtner MH, Beauchamp JJ (1973) Regression analysis of Poisson-distributed data. J Am Stat Assoc 68:288–298CrossRef
19.
go back to reference Gupta RD, Kundu D (2001) Exponentiated-exponential family: an alternative to gamma and Weibull distributions. Biom J 43:117–130CrossRef Gupta RD, Kundu D (2001) Exponentiated-exponential family: an alternative to gamma and Weibull distributions. Biom J 43:117–130CrossRef
20.
go back to reference Hilbe JM (2011) Negative binomial regression, 2nd edn. Cambridge University Press, New YorkCrossRef Hilbe JM (2011) Negative binomial regression, 2nd edn. Cambridge University Press, New YorkCrossRef
21.
go back to reference Jorgenson DW (1961) Multiple regression analysis of a Poisson process. J Am Stat Assoc 56:235–245CrossRef Jorgenson DW (1961) Multiple regression analysis of a Poisson process. J Am Stat Assoc 56:235–245CrossRef
22.
go back to reference Lawless JF (1987) Negative binomial and mixed Poisson regression. Can J Stat 15(3):209–225CrossRef Lawless JF (1987) Negative binomial and mixed Poisson regression. Can J Stat 15(3):209–225CrossRef
23.
go back to reference McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman and Hall, LondonCrossRef McCullagh P, Nelder JA (1989) Generalized linear models, 2nd edn. Chapman and Hall, LondonCrossRef
24.
go back to reference Mullahy J (1997) Heterogeneity, excess zeros, and the structure of count data models. J Appl Econom 12:337–350CrossRef Mullahy J (1997) Heterogeneity, excess zeros, and the structure of count data models. J Appl Econom 12:337–350CrossRef
25.
go back to reference Murphy AH (1969) On the ranked probability skill score. J Appl Meteorol 8:988–989CrossRef Murphy AH (1969) On the ranked probability skill score. J Appl Meteorol 8:988–989CrossRef
26.
go back to reference Murphy AH (1971) A note on the ranked probability skill score. J Appl Meteorol 10:155–156CrossRef Murphy AH (1971) A note on the ranked probability skill score. J Appl Meteorol 10:155–156CrossRef
27.
go back to reference Nelder JA, Wedderburn RWM (1972) Generalized linear models. J R Stat Soc A 135:370–384CrossRef Nelder JA, Wedderburn RWM (1972) Generalized linear models. J R Stat Soc A 135:370–384CrossRef
28.
go back to reference Nekoukhou V, Bidram H (2015) The exponentiated discrete Weibull distribution. Stat Oper Res Trans 39(1):127–146 Nekoukhou V, Bidram H (2015) The exponentiated discrete Weibull distribution. Stat Oper Res Trans 39(1):127–146
30.
go back to reference Sellers KF, Shmueli G (2010) A flexible regression model for count data. Ann Appl Stat 4(2):943–961CrossRef Sellers KF, Shmueli G (2010) A flexible regression model for count data. Ann Appl Stat 4(2):943–961CrossRef
31.
go back to reference Sun SZ, Ong SH (2016) A generalized inverse trinomial distribution with application. Stat Methodol 33:217–233CrossRef Sun SZ, Ong SH (2016) A generalized inverse trinomial distribution with application. Stat Methodol 33:217–233CrossRef
32.
go back to reference Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica 57(2):307–333CrossRef Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econometrica 57(2):307–333CrossRef
34.
go back to reference Weigel AP, Liniger MA, Appenzeller C (2006) The discrete Brier and ranked probability skill scores. Mon Weather Rev 135:118–124CrossRef Weigel AP, Liniger MA, Appenzeller C (2006) The discrete Brier and ranked probability skill scores. Mon Weather Rev 135:118–124CrossRef
35.
go back to reference Winkelmann R (2008) Econometric analysis of count data, 5th edn. Springer, Berlin Winkelmann R (2008) Econometric analysis of count data, 5th edn. Springer, Berlin
Metadata
Title
Generalized Count Data Regression Models and Their Applications to Health Care Data
Authors
Carl Lee
Felix Famoye
Alfred Akinsete
Publication date
16-05-2019
Publisher
Springer Berlin Heidelberg
Published in
Annals of Data Science / Issue 2/2021
Print ISSN: 2198-5804
Electronic ISSN: 2198-5812
DOI
https://doi.org/10.1007/s40745-019-00221-8

Other articles of this Issue 2/2021

Annals of Data Science 2/2021 Go to the issue