Recurrent Events Analysis with Data Collected at Informative Clinical Visits in Electronic Medical Records
- 2018-06-28 (Thu.), 14:00 PM
- Recreation Hall, 2F, Institute of Statistical Science
- Prof. Chiung-Yu Huang
- Department of Epidemiology and Biostatistics, School of Medicine, University of California, San Francisco, USA
Abstract
Although increasingly used as a data resource for assembling cohorts, electronic medical records (EMR) pose a number of analytic challenges because they are primarily collected for clinical encounters rather than for research purpose. In particular, patient's health status influences when and what data are recorded, leading to bias in the collected data. In this paper, we consider recurrent event data analysis by leveraging the EMR data. Conventional regression methods for event risk analysis usually require the values of covariates to be observed throughout the follow-up period. In EMR databases, time-dependent risk factors are intermittently measured at clinical visits, and the timing of these visits is informative in the sense that it depends on the disease course. Simple methods, such as the last-observation-carried-forward approach, can lead to biased estimation; on the other hand, complex joint models require additional assumptions on the covariate process and can not be easily extended to handle multiple longitudinal predictors. We present a novel kernel-smoothing estimation procedure for the semiparametric proportional rate model of recurrent events with time-dependent covariates by correcting the sampling bias resulting from the informative observation times in EMR-derived clinic data. The proposed method does not require model specifications for the covariate processes and can easily handle multiple time-dependent covariates. The estimator for the regression parameters is asymptotically unbiased and normally distributed with a root-${n}$ convergence rate. We further investigate the bias in scenarios where the assumptions on the observation time process are violated. Simulation studies are conducted to evaluate the performance of the proposed estimator. Our method is applied to a kidney transplant study for illustration.