Skip to main content

2022 | Buch

Emerging Topics in Modeling Interval-Censored Survival Data

insite
SUCHEN

Über dieses Buch

This book primarily aims to discuss emerging topics in statistical methods and to booster research, education, and training to advance statistical modeling on interval-censored survival data. Commonly collected from public health and biomedical research, among other sources, interval-censored survival data can easily be mistaken for typical right-censored survival data, which can result in erroneous statistical inference due to the complexity of this type of data. The book invites a group of internationally leading researchers to systematically discuss and explore the historical development of the associated methods and their computational implementations, as well as emerging topics related to interval-censored data. It covers a variety of topics, including univariate interval-censored data, multivariate interval-censored data, clustered interval-censored data, competing risk interval-censored data, data with interval-censored covariates, interval-censored data from electric medical records, and misclassified interval-censored data. Researchers, students, and practitioners can directly make use of the state-of-the-art methods covered in the book to tackle their problems in research, education, training and consultation.

Inhaltsverzeichnis

Frontmatter

Introduction and Review

Frontmatter
Overview of Historical Developments in Modeling Interval-Censored Survival Data
Abstract
This chapter serves as an overview of the historical development of the methods for the analysis of interval-censored survival data. It begins with a description of how the interval-censored data arise from the studies where the subjects are followed periodically, and the time to the event of interest cannot be observed exactly. From a historical perspective, such data become more common with the emergence of new clinical and epidemiologic study designs. However, the well-developed methods that are used for right-censored survival data analysis could not be applied. This chapter follows on the methodological development of this area, starting in 1970 from a broad historical perspective.
Dianne Finkelstein
Overview of Recent Advances on the Analysis of Interval-Censored Failure Time Data
Abstract
As discussed by Dr. Finkelstein in Chap. 1, interval-censored failure time data are a general type of failure time or time-to-event data that often occur in many areas, including demographical studies, epidemiological studies, medical or public health research and social science. In contrast to the historic review of Chap. 1, this chapter will provide a brief review of some recent advances on several topics concerning the analysis of interval-censored data. These include the analysis of interval-censored data with time-dependent covariates, the presence of informative censoring, or the presence of a cured subgroup, respectively. Also it will cover the analysis of interval-censored data arising from case-cohort studies and the variable selection based on interval-censored data as well as the analysis of doubly interval-censored data.
Mingyue Du
Predictive Accuracy of Prediction Model for Interval-Censored Data
Abstract
Diverse measures for predictive accuracy have been developed for survival data which require additional investigation to reflect the time dependent outcomes. In this chapter, our purpose is to review several methods to evaluate prediction models and to compare their performance in a context of interval censored data. This chapter provides conceptual and practical explanation of statistical methods for the time-dependent ROC and C-index as the discrimination measure. Also, Brier score is dealt to evaluate overall performance for the prediction model. We aim to provide clarity of each method and identify software tools to carry out analysis in practice. We illustrate the methods using a dementia dataset.
Yang-Jin Kim

Emerging Topics in Methodology

Frontmatter
A Practical Guide to Exact Confidence Intervals for a Distribution of Current Status Data Using the Binomial Approach
Abstract
We review our recently developed pointwise confidence intervals for the distribution of event times for current status data. Previous existing methods were based on asymptotic distributions, but our new approach is based on binomial properties. This binomial approach can be applied with continuous and discrete assessment distributions. We discuss confidence interval (CI) versions using the binomial approach, a valid (i.e., exact) CI and an ABA (Approximate Binomial Approach) confidence interval. The valid confidence interval guarantees the nominal rate. Although the valid confidence interval is necessarily conservative for small sample sizes, asymptotically its coverage rate approaches the nominal one. The ABA confidence interval does not guarantee the nominal rate, but its coverage rate asymptotically approaches the nominal rate under certain conditions. Extensive and systematic simulations are presented to compare these new confidence intervals with existing intervals (likelihood ratio test [LRT] CIs and smoothed maximum likelihood estimation [SMLE] CIs). The valid CI has been theoretically shown to guarantee the nominal coverage rate, while simulations show where other methods can have less than nominal coverage. The ABA CIs generally have coverage rates closer to the nominal level with shorter length than existing intervals in most cases. Based on the anti-hepatitis A antibody responses in Bulgaria (Keiding, J R Stat Soc Ser A 154:371–412, 1991, Table 2), we compare our new confidence intervals with the LRT CIs and SMLE CIs for the distribution of age at which people were first exposed to hepatitis A. We introduce the R package csci and provide R code to reproduce examples in this chapter.
Sungwook Kim, Michael P. Fay, Michael A. Proschan
Accelerated Hazards Model and Its Extensions for Interval-Censored Data
Abstract
Semiparametric regression analysis of interval-censored data are often performed under popular models, such as the Cox proportional hazards, proportional odds and accelerated failure time models. There are cases in practice that such conventional model assumptions may be inappropriate for modeling survival outcomes of interest. In this chapter, we introduce an alternative, the accelerated hazards model, for the analysis of interval-censored data and its extension to a class of generalized accelerated hazards mixture cure models in the presence of a cure fraction. Inference for these models are obtained using the sieve maximum likelihood method, and the resulting estimators are shown to be consistent and asymptotically normal under mild regularity conditions. The finite sample performance of these models is examined through simulation studies and their practical applications are illustrated by real data examples.
Liming Xiang
Maximum Likelihood Estimation of Semiparametric Regression Models with Interval-Censored Data
Abstract
Interval censoring arises frequently in clinical, epidemiological, financial, and sociological studies, where the event or failure of interest is not observed at an exact time point but is rather known to occur within a time interval induced by periodic examinations. We formulate the effects of potentially time-dependent covariates on the failure time through the semiparametric Cox proportional hazards model. We study nonparametric maximum likelihood estimation with an arbitrary number of examination times for each study subject. We present an EM algorithm that involves very simple calculations and converges stably for any dataset, even in the presence of time-dependent covariates. The resulting estimators for the regression parameters are consistent, asymptotically normal, and asymptotically efficient with an easily estimated covariance matrix. In addition, we extend the EM algorithm and asymptotic theory to competing risks and multivariate failure time data. Finally, we provide applications to real medical studies.
D. Y. Lin, Donglin Zeng
Use of the INLA Approach for the Analysis of Interval-Censored Data
Abstract
The Integrated Nested Laplace Approximation (INLA) methodology is well known in various fields for fast and accurate Bayesian inference, although it has not been proposed yet for interval-censored data. Most survival models, including those with interval censoring, can be shown to be a latent Gaussian model and as such INLA can be used for near real-time Bayesian inference. We provide a brief summary of the INLA methodology and illustrate the approach on real data examples with interval censoring, including a joint model. The analysis is done using the R package INLA and all code is available for reproducibility.
Janet van Niekerk, Håvard Rue
Copula Models and Diagnostics for Multivariate Interval-Censored Data
Abstract
In studies concerning disease progression or patient survival, multivariate time-to-event outcomes are increasingly used as endpoints. The exact times from the non-fatal events are sometimes unobservable due to “interval-censoring” since the event status can only be determined at intermittent assessment times. In this chapter, we introduce a class of copula models to analyze multivariate interval-censored outcomes. It is a joint approach that directly connects the marginal (univariate) distributions through a copula function to construct the joint distribution. We allow flexible modeling of the covariate effects through a semiparametric transformation model. A sieve maximum likelihood estimation approach is proposed for parameter estimation. Moreover, as many copula models are available, it is essential to check if the chosen copula model fits the data well for analysis. In the second part of this chapter, we introduce a general goodness-of-fit test procedure for copula-based interval-censored data using the information ratio (IR). It can be applied to any copula family with a parametric form, such as the frequently used Archimedean and Gaussian families. Finally, we present an R package CopulaCenR, which is designed for analyzing multivariate interval-censored data through different copula families.
Ying Ding, Tao Sun
Efficient Estimation of the Additive Risks Model for Interval-Censored Data
Abstract
In contrast to the popular Cox model which presents a multiplicative covariate effect specification on the time to event hazards, the semiparametric additive risks model (ARM) offers an attractive additive specification, allowing for direct assessment of the changes or the differences in the hazard function for changing value of the covariates. The ARM is a flexible model, allowing the estimation of both time-independent and time-varying covariates. It has a nonparametric component and a regression component identified by a finite-dimensional parameter. This chapter presents an efficient approach for maximum-likelihood (ML) estimation of the nonparametric and the finite-dimensional components of the model via the Minorize-Maximize (MM) algorithm for case-II interval-censored data. The operating characteristics of our proposed MM approach are assessed via simulation studies, with illustration on a breast cancer dataset via the R package MMIntAdd. It is expected that the proposed computational approach will not only provide scalability to the ML estimation scenario but may also simplify the computational burden of other complex likelihoods or models.
Tong Wang, Dipankar Bandyopadhyay, Samiran Sinha

Emerging Topics in Applications

Frontmatter
Modeling and Analysis of Chronic Disease Processes Under Intermittent Observation
Abstract
A wealth of information is available from clinic-based registries of individuals with chronic diseases but modeling and analysis of disease processes can be challenging because of the ways individuals are enrolled in the registry and because they are only seen at intermittent clinic visits post-enrolment. Such features are common in observational cohort studies in general, but even in randomized trials, intermittent observation of individuals is common, resulting in incomplete information about transitions among disease states and about time-dependent covariates. In this chapter we describe independence conditions needed for valid likelihood-based inference about multistate disease processes under intermittent observation schemes. In addition, we describe how joint models for disease and observation processes can be used to address disease-related clinic visits. We also describe how joint models can be used to deal with internal time-dependent markers when marker values are observed only at clinic visits, and investigate the limiting values of regression coefficients of marker effects when the common approach of carrying forward the most recently recorded value is used. In addition to failure time processes, we consider more general multistate disease processes, and partially specified models for state occupancy probabilities. An application to a study of cancer metastatic to bone is considered, involving a serum marker of bone activity and its association with risk of skeletal complications.
Richard J. Cook, Jerald F. Lawless
Case-Cohort Studies with Time-Dependent Covariates and Interval-Censored Outcome
Abstract
In large cohort studies on rare diseases, the case-cohort design is widely used to assess associations between covariates and survival time (e.g., time until disease onset). In many settings, the event of interest is not observed at an exact time point but only known to occur between two study visits. In this chapter, we consider fitting parametric survival models to data from case-cohort studies with interval censored outcomes and both fixed and time-dependent covariates. Simulation results demonstrate the proposed estimator is approximately unbiased and the standard errors are well estimated from the sandwich estimators. The methods are applied to an observational study which examined the association between hormonal contraceptive use and risk of HIV acquisition.
Xiaoming Gao, Michael G. Hudgens, Fei Zou
The BivarIntCensored: An R Package for Nonparametric Inference of Bivariate Interval-Censored Data
Abstract
Bivariate event time data are constantly encountered in biomedical research. In many real-life applications, both event times are possibly subject to interval censoring that gives rise to bivariate interval-censored data. Nonparametric inference of bivariate interval-censored data focuses on estimation of the joint distribution function of event times or the joint survival function. The conventional nonparametric maximum likelihood estimator suffers non-uniqueness of the estimates as well as computation inefficiency due to searching for maximal intersections and high-dimensional convex programming. A spline-based sieve nonparametric maximum likelihood estimator for the joint cumulative distribution function with bivariate interval-censored data has been developed to resolve the non-uniqueness issue in estimation. Numerically, it leads to a convex programming with complicated linear constraints but much reduced number of unknown parameters. In this project, we will illustrate the key characteristics of the sieve nonparametric maximum likelihood estimation with emphasis on numerical computation. We will develop an R-based software to facilitate the public use for computing the spline-based sieve nonparametric maximum likelihood estimator of the joint cumulative distribution function.
Junyi Zhou, Yuan Wu, Ying Zhang
Joint Modeling for Longitudinal and Interval-Censored Survival Data: Application to IMPI Multi-Center HIV/AIDS Clinical Trial
Abstract
Joint models for longitudinal and survival data are a class of models that jointly analyze an outcome of interest repeatedly observed over time along with the associated event times. These models are useful in two practical applications; firstly focusing on survival outcome whilst accounting for time-varying covariates measured with error and secondly focusing on the longitudinal outcome while controlling for informative censoring. The joint modeling framework has mainly been focused on right-censored data in the survival outcome for the last decade. This chapter is then aimed to extend the classical joint modeling framework to interval-censored data using a cardiology multi-center clinical trial. We illustrate our approach using R statistical software.
Ding-Geng Chen, Isaac Singini
Regression Analysis with Interval-Censored Covariates. Application to Liquid Chromatography
Abstract
We extend the linear regression model accounting for an interval-censored covariate of Gómez et al. (Stat Med 22:409–425, 2003) to a generalized linear model that accommodates non-normal responses belonging to an exponential family. We redefine the original likelihood function to include exactly observed values of the same covariate. We propose two goodness-of-fit measures that accommodate interval-censored covariates. The data set that has motivated this work comes from the Metabolomic Analysis area when the interest is to assess the association of any metabolite extracted from a human sample and measured by liquid chromatography with anthropometric, clinical, and biochemical parameters and potentially any other response variable of interest. The concentration of compounds, such as plasma carotenoids, cannot always be exactly measured because of their low quantity in the body and the limitations of the methods and equipment. Up to this date, there is no consensus as to how data under the limit of quantitation or even detection should be treated.
In this chapter, we treat the concentration of these compounds as interval-censored random variables with lower and upper limits given by the detection and quantitation limits. As a result, the sum of some of these compounds is also an interval-censored covariate. The method proposed is illustrated with an application to a data set on the association between glucose, a completely observed response variable, and the sum of carotenoids, an interval-censored random variable acting as an individual explanatory variable. The analysis adjusts for age and daily energy intake. All the methods have been implemented in R and can be found in the Github repository https://​github.​com/​klongear/​ICbook.
Guadalupe Gómez Melis, María Marhuenda-Muñoz, Klaus Langohr
Misclassification Simulation Extrapolation Procedure for Interval-Censored Log-Logistic Accelerated Failure Time Model
Abstract
Misclassification of binary covariates often occurs in survival data and any survival data analysis ignoring such misclassification will result in estimation bias. To handle such misclassification, the misclassification simulation extrapolation (MC-SIMEX) procedure is a flexible method proposed in survival data analysis, which has been investigated extensively for right-censored survival data. However, the performance of the MC-SIMEX method has not been explored enough for interval-censored survival data. This chapter is aimed to investigate the performance of the MC-SIMEX procedure to interval-censored survival data through Monte-Carlo simulations and real data analysis. This investigation focuses on the log-logistic accelerated failure time (AFT) model since the log-logistic distribution plays an important role in evaluating non-monotonic hazards for survival data.
Varadan Sevilimedu, Lili Yu, (Din) Ding-Geng Chen, Yuhlong Lio
Backmatter
Metadaten
Titel
Emerging Topics in Modeling Interval-Censored Survival Data
herausgegeben von
Jianguo Sun
Ding-Geng Chen
Copyright-Jahr
2022
Electronic ISBN
978-3-031-12366-5
Print ISBN
978-3-031-12365-8
DOI
https://doi.org/10.1007/978-3-031-12366-5

Premium Partner