Let
\(Z_x\) be the continuous random frailty at age
x, with a probability density function,
\(\mu _x(z)\) be the conditional force of mortality for an individual in a population group at age
x and with a frailty level
z$$\begin{aligned} \mu _x(z) = \lim _{t \rightarrow 0} \left[ \frac{P(T_x \le t \vert Z_x = z)}{t}\right] \end{aligned}$$
(1)
where
\(T_x\) being the remaining lifetime. Note that
\(Z_x\) is invariant concerning
t. Vaupel et al. (
1979) define frailty as a multiplicative factor operating on the force of mortality
$$\begin{aligned} \mu _x(z) = \mu _x \cdot z \end{aligned}$$
(2)
The survival function of an individual at age 0 taking into account the frailty is defined as follows
$$\begin{aligned} S(x,z) = e^{-\int _{0}^{x} \mu _t(Z)dt} = e^{-zH(x)} \end{aligned}$$
(3)
With
H(
x) being the cumulative standard force of mortality in the interval (0,
x).
2.1 The Frailty-based Lee–Carter family models
Let
\(\mu _{x,t}\) the force of mortality at age
x and time
t. A Lee–Carter mortality model (LCA) (Lee and Carter
1992) is defined as follows
$$\begin{aligned} y_{x,t} = \log (\mu _{x,t}) = a_x + b_x k_t + \epsilon _{xt} \end{aligned}$$
(4)
We define the force of mortality conditional on frailty
\(\mu _{x,t}\) and the relative model
$$\begin{aligned} y_{x,t} = \log (\mu _{x,t}) = a_x + b_x k_t + z_t + \epsilon _{xt} \end{aligned}$$
(5)
where
\(z_t\) is a time-dependent multiplicative coefficient of the force of mortality.
We can express the ordinal least squares optimization for estimating parameters as minimizing the squared sum of errors
$$\begin{aligned} \min _{a_x, b_x, k_t, z_t} \sum _{x,t}^{} \left( y_{x,t,z} - \left( a_x + b_x k_t + z_t \right) \right) ^2 \end{aligned}$$
(6)
so that the objective function is minimised by equating to 0 the first derivatives with respect to
\(a_x, b_x, k_t, z_t\).
The updating for the parameters can also be obtained recursively using a normal equation. The required parameters will be obtained numerically according to the following set of equations which can be solved recursively as proposed by Renshaw and Haberman (
2003)
$$\begin{aligned} \begin{aligned} a_x = \frac{\sum _{t}^{} D_{xt} \left[ y_{x,t,z} - \hat{b}_x\hat{k}_t - \hat{z}_t \right] }{\sum _{t}^{} D_{xt}} \\ b_x = \frac{\sum _{t}^{} D_{xt}\hat{k}_t \left[ y_{x,t,z} - \hat{a}_x - \hat{z}_t \right] }{\sum _{t}^{} D_{xt}\hat{k}_t^2} \\ k_t = \frac{\sum _{t}^{} D_{xt}\hat{b}_x \left[ y_{x,t,z} - \hat{a}_x - \hat{z}_t \right] }{\sum _{t}^{} D_{xt}\hat{b}_x^2} \\ z_t = \frac{\sum _{t}^{} D_{xt} \left[ y_{x,t,z} - \hat{a}_x - \hat{b}_x\hat{k}_t \right] }{\sum _{t}^{} D_{xt}} \end{aligned} \end{aligned}$$
(7)
As the original LC model, the estimation of
\(a_x, b_x, k_t, z_t\) is performed in two stages. The first stage determines the values of the parameters and the second stage involves a re-estimation and matching of the deaths by age
x and time
t. In this sense, the re-estimation of
\(D_{xt}\) at the second stage, could be interpreted as matching observed and modelled deaths allowing for frailty.
To estimate \(z_t\) the definition of a measurable variable of frailty is required. To do this, we perform a RF and we consider the variable importance to determine the main features that affects frailty. On the basis of the results, we build a co-morbidity index (ci) that assigns a frailty score of an individual.
The LCA family of models are based on aggregated data (not individual data) so that we arrange the vector of individual co-morbidity scores in a matrix by age, which we call the Co-morbidity (Aggregated) Matrix
CI, in which the rows are the scores for each individual at age
x and the columns represent the co-morbidity scores by age from the first observed in the sample, namely 50, to the maximum age
\(\omega \).
$$\begin{aligned} CI_t = \begin{pmatrix} ci_{50,1} &{} ci_{50,2} &{} \cdots &{} ci_{50,n} \\ ci_{51,1} &{} ci_{51,2} &{} \cdots &{} ci_{51,n} \\ \vdots &{} \vdots &{} \ddots &{} \vdots \\ ci_{\omega 1} &{} ci_{\omega 2} &{} \cdots &{} ci_{\omega n} \\ \end{pmatrix} \end{aligned}$$
(8)
where
ci is the score of co-morbidity for an individual at age
x, each row collects co-morbidity scores of
n individuals of a certain age
x from 50 to
\(\omega \). Being a matrix that collects respondents by age and time, it does not necessarily have the same number of columns per each row. This aspect is not a problem because the matrix representation is only a way to represent the data to be synthesized. Since the co-morbidity score is computed for more than one time according to the waves of a panel survey (see Sect.
3 for a discussion of the data sources), we have
\(CI_1, \ldots , CI_T\) co-morbidity matrices that correspond to the calendar years being analysed.
The model is estimated following a similar procedure to the LCA model, which is effectively the application of a singular value decomposition to the matrix of deviances from the \(a_x\) and \(z_t\) parameters.
Let
\(\log (m_{x,t})\) the matrix of the log rates by age and time,
\(a_x\) the row vector of average age-specific mortality rates and
\(z_t\) the column vector of frailty varying by time. We define the matrix
\(A_{xt}\) as follows:
$$\begin{aligned} A_{xt} = \log (m_{xt}) - a_x - z_t \end{aligned}$$
(9)
\(A_{xt}\) is the matrix to be factored in to estimate the parameters
\(b_x\) and
\(k_t\):
$$\begin{aligned} A_{xt} = b_x k_t \end{aligned}$$
(10)
To do this, it is necessary to arrange the co-morbidity matrices
\(CI_t\) in a unique matrix age by time. To obtain the input matrix for the models, each co-morbidity matrix
\(CI_t\) is aggregated by row, resulting in a series of column vectors representing the average age score for each time
t. We consider three measures to aggregate the matrix: the first measure is the average number of co- morbidities. Letting
k be a generic individual with age
x at time
t and
\(ci_j\) be a co-morbidity index, that is the number co-morbidities of an individual
j, we define the aggregated vectors of the average number of co-morbidities as follows
$$\begin{aligned} C_{ave,t} = \frac{1}{J} \sum _{j=1}^{J} ci_j, t = 1,..., T \end{aligned}$$
(11)
The second measure is the average difference of co-morbidities between age
\(x+1\) and age
x at time
t. Let
\(\overline{ci}_j\) the average co-morbidity score for individuals with age
x at time
t and
\(\overline{ci}_{j'}\) the average co-morbidity score for individuals with age
\(x+1\) at time
t, we define the aggregated vectors of increase of co-morbidity as follows
$$\begin{aligned} C_{inc,t} = \frac{1}{J} \sum _{j' \ne j} \left( \overline{ci}_{j'} - \overline{ci}_j \right) , t = 1,..., T \end{aligned}$$
(12)
And the third measure is the ratio between the deviation from the mean of the co-morbidity score at age
x and time
t and its mean. Let
\(\overline{ci}_j\) be the average co-morbidity score for individuals with age
x at time
t, we define the aggregated vectors as follows
$$\begin{aligned} C_{rel,t} = \frac{ci_j - \overline{ci}_j}{\overline{ci}_j}, t = 1,\ldots , T \end{aligned}$$
(13)
The indexes in Eqs. (
11)–(
13) are obtained by fixing time
t and aggregating the co-morbidity scores by age
x.
The input frailty matrix, namely
Z, is obtained by merging the
\(C_{t}\) vectors, choosing the best method of aggregation among the three proposed. The time- varying frailty column vector
\(z_t\) can be seen as the average values by age of
Z matrix
$$\begin{aligned} z_t = \frac{1}{\omega } \sum _{x=1}^{\omega } \zeta _x \end{aligned}$$
(14)
where
\(\zeta _x\) are the rows of the
Z matrix.
From Equation (
5), we define the force of mortality conditional on frailty
\(\mu _{x,t}\). The frailty parameter
z is included in the different functional forms that we propose for
\(y_{x,t} = \log (\mu _{x,t})\), on the basis of the different characteristics being highlighted.
In addition to the model defined by Equation (
5), called the Frailty LCA (FLCA) model, we define the Age-dependent Frailty LCA (AFLCA) model, in which a frailty-based average of log-specific mortality rates,
\(z_x\) is used as a substitute for the
\(a_x\) parameter
$$\begin{aligned} y_{x,t,z} = \log (\mu _{x,t,z}) = b_x k_t + z_x + \epsilon _{xt} \end{aligned}$$
(15)
The idea of this formulation is that frailty is an age-dependent factor that affects the age-specific mortality rate, and represents the mortality effects that the LCA model fails to capture. This aspect is highlighted by the high correlation between
\(a_x\) and
\(z_x\).
The third model, called the Age and Time Interaction Frailty LCA (IFLCA) model, is formulated to take into account an interaction effect between
\(a_x\) and
\(z_t\). This model is based on the idea of identifying an effect for both age and time for frailty. However, it has not been possible to use a single parameter
\(z_{x,t}\), that is the combined age and time dependent frailty, due to problems related to model convergence.
$$\begin{aligned} y_{x,t,z} = \log (\mu _{x,t,z}) = a_x z_t + b_x k_t + \epsilon _{xt} \end{aligned}$$
(16)
In this formulation, frailty
\(z_t\) is a time-varying factor that modifies age- specific mortality rates according to a temporal ageing trend. The fourth model is a generalization of the LCA model proposed by Niu and Melenberg (
2014), called the Age-specific and Temporal Frailty LCA (ATFLCA) model, that includes an exogenous trend that affects the mortality trend
\(k_t\). In this formulation,
\(z_t\) is estimated as a time-varying variable as in Equations (
5) and (
16), but adding an age-specific coefficient of frailty to be estimated
$$\begin{aligned} y_{x,t,z} = \log (\mu _{x,t,z}) = a_x + g_x z_t + b_x k_t + \epsilon _{xt} \end{aligned}$$
(17)
In this formulation, frailty
\(z_t\) is a time-varying factor, independent of the average age- specific mortality rates
\(a_x\), but with an age-specific frailty factor
\(g_x\) to estimate in combination with
\(z_t\).
Table
1 summarises the characteristics of the models proposed in Equations (
5) and (
15)–(
17)
Table 1
Proposed models summary
Frailty Lee–Carter Model | FLCA | Includes \(z_t\) parameter representing a time-varying multiplicative factor of frailty |
Age-dependent Frailty | AFLCA | Replaces \(a_x\) with \(z_x\) parameter representing an age-specific |
Lee–Carter Model |
multiplicative factor of frailty |
Age and Time Interaction Frailty | IFLCA | Includes the product of \(a_x\) and \(z_t\) representing the interaction multiplicativeeffect between age-specific mortality and time-varying frailty |
Lee–Carter Model |
Age-specific and Temporal Frailty | ATFLCA | Includes the \(g_x\) and \(z_t\) parameters representing the age-specific and time-varying frailty respectively |
Lee–Carter Model |
To identify what are the variables that can affect mortality using individual-based data, a Random Forest (RF) algorithm based on classification trees is used. The RF algorithm was introduced by Breiman (
2001) and consists of many independent trees grown by recursively performing binary splits on the dataset. Let
\(\left[ \left( x_1,\ y_1\right) ,\ldots ,\left( x_n,\ y_n\right) \right] \) the training set. The algorithm predicts the response Y by estimating the regression function
\(m\left( x\right) =\mathbb {E}\left[ Y \vert X=x\right] \). The mean-squared generalized error for any numerical predictor h(x) is defined as in the following
$$\begin{aligned} \mathbb {E}_{X,Y} = (Y - h(X))^2 \end{aligned}$$
(18)
where the random forest predictor is the average over
\(k = 1, \ldots , n\) trees (Breiman
2001). Breiman (
1996) defines bagging as the algorithm to synthesize many trees together generating many bootstrap samples and averaging the predictors. The estimator of the target variable
\(\hat{y}_{R_j}\) is the function of the regression tree estimator
$$\begin{aligned} \hat{f}^{tree}(X)=\sum _{j \in J}{\hat{y}_{R_j} 1_{\{X \in R_{j}\}}} \end{aligned}$$
(19)
\(1_{\{.\}}\) being the indicator function and
\((R_j)_{j\in J}\) the region of the predictor space which is divided into
J distinct and non-overlapping
\(R_1, R_2,\ldots , R_J\) and obtained by minimizing the Residual Sum of Squares.