Perperoglou, Aris and le Cessie, Saskia and van Houwelingen, Hans C (2006) A fast routine for fitting Cox models with time varying effects of the covariates. Computer Methods and Programs in Biomedicine, 81 (2). pp. 154-161. DOI https://doi.org/10.1016/j.cmpb.2005.11.006
Perperoglou, Aris and le Cessie, Saskia and van Houwelingen, Hans C (2006) A fast routine for fitting Cox models with time varying effects of the covariates. Computer Methods and Programs in Biomedicine, 81 (2). pp. 154-161. DOI https://doi.org/10.1016/j.cmpb.2005.11.006
Perperoglou, Aris and le Cessie, Saskia and van Houwelingen, Hans C (2006) A fast routine for fitting Cox models with time varying effects of the covariates. Computer Methods and Programs in Biomedicine, 81 (2). pp. 154-161. DOI https://doi.org/10.1016/j.cmpb.2005.11.006
Abstract
The S-plus and R statistical packages have implemented a counting process setup to estimate Cox models with time varying effects of the covariates. The data set has to be re-arranged in a repeated measurement setting: the time is divided into small time intervals where a single event occurs and for each time interval, the covariate values and outcome in the interval for each subject still under observation are stacked to a large data set. This is the known (Tstart,Tstop] algorithm implemented in Therneau's Survival library (S-plus), which has been ported into an R package by Thomas Lumley. However, the expansion of a data set leads to a larger set, which can be hard to handle even with fast modern computers. We propose the use of a fast and efficient algorithm, written in R, which works on the original data without the use of an expansion. The computations are done on the original data set, with significant less memory resources used. This improves the computational time by orders of magnitude. The algorithm can also fit reduced rank Cox models with time varying effects. We illustrate the method on a large data set of 2433 breast cancer patients, a smaller study of 358 ovarian cancer patients, and compare the computational times on simulated data of up to 10,000 cases with SAS proc phreg and survival package in R. For larger data sets our algorithm was several times faster, and was able to handle larger data sets then SAS and R. © 2005 Elsevier Ireland Ltd. All rights reserved.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | survival analysis; time-varying effects; reduced rank models; efficient computations |
Subjects: | Q Science > QA Mathematics |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Mathematics, Statistics and Actuarial Science, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 06 Nov 2012 14:17 |
Last Modified: | 04 Dec 2024 06:25 |
URI: | http://repository.essex.ac.uk/id/eprint/3830 |