Similar papers 2
March 3, 2018
The purpose of this paper is to construct confidence intervals for the regression coefficients in high-dimensional Cox proportional hazards regression models where the number of covariates may be larger than the sample size. Our debiased estimator construction is similar to those in Zhang and Zhang (2014) and van de Geer et al. (2014), but the time-dependent covariates and censored risk sets introduce considerable additional challenges. Our theoretical results, which provide ...
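As a rough schematic of the general debiasing idea only (the paper's exact construction, tuning, and regularity conditions are not reproduced here), the Cox log partial likelihood and the generic one-step corrected estimator can be written as:

```latex
% Cox log partial likelihood (no ties); \hat\beta is a lasso-penalized maximizer
\ell(\beta) = \sum_{i:\,\delta_i = 1}
  \Big[ X_i^\top \beta - \log \sum_{j \in R(t_i)} \exp\!\big(X_j^\top \beta\big) \Big],
\qquad
\hat\beta = \arg\max_{\beta}\Big\{ \tfrac{1}{n}\,\ell(\beta) - \lambda \|\beta\|_1 \Big\}.

% One-step debiased estimator; \hat\Theta approximates the inverse information
% (e.g. via nodewise regression) and \hat\Sigma estimates the information itself
\hat b = \hat\beta + \tfrac{1}{n}\,\hat\Theta\,\nabla \ell(\hat\beta),
\qquad
\hat b_j \pm z_{1-\alpha/2}\,
  \sqrt{\big(\hat\Theta \hat\Sigma \hat\Theta^\top\big)_{jj} / n},
```

where $R(t_i)$ denotes the risk set at event time $t_i$. The second display is the standard debiased-lasso template that papers in this line adapt to the partial-likelihood setting.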
July 1, 2019
New methods for time-to-event prediction are proposed by extending the Cox proportional hazards model with neural networks. Building on methodology from nested case-control studies, we propose a loss function that scales well to large data sets, and enables fitting of both proportional and non-proportional extensions of the Cox model. Through simulation studies, the proposed loss function is verified to be a good approximation for the Cox partial log-likelihood. The proposed ...
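For intuition only, here is a minimal PyTorch-style sketch of a case-control-type approximation to the negative Cox partial log-likelihood, in which each event is compared against a small random sample from its risk set rather than the full risk set. The function name `sampled_cox_loss` and the toy inputs are illustrative assumptions, not the paper's implementation.

```python
import torch

def sampled_cox_loss(g_case, g_controls):
    """Approximate negative Cox partial log-likelihood with sampled risk sets.

    g_case:     (n_events,) risk scores g(x_i) of subjects experiencing an event
    g_controls: (n_events, n_controls) risk scores of subjects drawn at random
                from each event's risk set (still at risk at the event time)
    """
    # log sum over the sampled risk set (case included) of exp(g_j), minus g_case
    all_scores = torch.cat([g_case.unsqueeze(1), g_controls], dim=1)
    return torch.mean(torch.logsumexp(all_scores, dim=1) - g_case)

# toy usage: 5 events, 3 sampled controls each; in practice the scores
# come from a neural network g(x), and backward() drives its training
torch.manual_seed(0)
g_case = torch.randn(5, requires_grad=True)
g_controls = torch.randn(5, 3, requires_grad=True)
loss = sampled_cox_loss(g_case, g_controls)
loss.backward()
```

Because each term only touches a fixed number of sampled controls, the cost per event stays constant as the data set grows, which is what makes this style of loss scale to large data.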
April 13, 2015
To go beyond standard first-order asymptotics for Cox regression, we develop parametric bootstrap and second-order methods. In general, computation of $P$-values beyond first order requires more model specification than is required for the likelihood function. Specifying a censoring mechanism in enough detail for it to be taken seriously is problematic, and conditioning on the censoring does not appear to be a viable alternative. We circumvent this matter by employing a refere...
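Purely as an illustrative sketch (the authors' reference model and second-order corrections are not reproduced), a basic parametric bootstrap for a single Cox coefficient might look like the following. The Weibull null model, the exponential censoring model, and the toy data generation are assumptions made only for this example; the `lifelines` fitters are used for convenience.

```python
import numpy as np
import pandas as pd
from lifelines import CoxPHFitter, WeibullFitter

rng = np.random.default_rng(1)

# toy right-censored data with one covariate x
n = 200
x = rng.normal(size=n)
t_event = rng.weibull(1.5, size=n) / np.exp(0.3 * x) ** (1 / 1.5)
t_cens = rng.exponential(2.0, size=n)
df = pd.DataFrame({"T": np.minimum(t_event, t_cens),
                   "E": (t_event <= t_cens).astype(int), "x": x})

def z_stat(data):
    cph = CoxPHFitter().fit(data, duration_col="T", event_col="E")
    return cph.params_["x"] / cph.standard_errors_["x"]

z_obs = z_stat(df)

# null (beta = 0) working model: Weibull event times, exponential censoring
wf = WeibullFitter().fit(df["T"], event_observed=df["E"])
cens_rate = df["E"].eq(0).sum() / df["T"].sum()   # crude censoring-rate estimate

B, z_boot = 200, []
for _ in range(B):
    t0 = wf.lambda_ * (-np.log(rng.uniform(size=n))) ** (1 / wf.rho_)
    c0 = rng.exponential(1 / max(cens_rate, 1e-8), size=n)
    boot = pd.DataFrame({"T": np.minimum(t0, c0),
                         "E": (t0 <= c0).astype(int), "x": x})
    z_boot.append(z_stat(boot))

p_boot = np.mean(np.abs(z_boot) >= abs(z_obs))   # bootstrap two-sided p-value
print(round(p_boot, 3))
```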
February 13, 2024
This paper introduces the R package INLAjoint, designed as a toolbox for fitting a diverse range of regression models addressing both longitudinal and survival outcomes. INLAjoint relies on the computational efficiency of the integrated nested Laplace approximations methodology, an efficient alternative to Markov chain Monte Carlo for Bayesian inference, ensuring both speed and accuracy in parameter estimation and uncertainty quantification. The package facilitates the constr...
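To fix ideas, a standard shared-random-effects joint model of the kind such packages fit (written generically here, not as INLAjoint's exact specification) combines a longitudinal submodel and a hazard submodel linked through the subject-specific trajectory:

```latex
% longitudinal submodel: observed biomarker for subject i at time t
y_i(t) = m_i(t) + \varepsilon_i(t), \qquad
m_i(t) = x_i(t)^\top \beta + z_i(t)^\top b_i, \qquad
b_i \sim N(0, D),\ \ \varepsilon_i(t) \sim N(0, \sigma^2),

% survival submodel: the hazard depends on the current error-free trajectory
h_i(t) = h_0(t)\, \exp\!\big( w_i^\top \gamma + \alpha\, m_i(t) \big).
```

Here $\alpha$ measures the association between the latent biomarker trajectory $m_i(t)$ and the event hazard, which is the quantity joint models are typically built to estimate.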
October 10, 2022
With the growing availability of large-scale biomedical data, it is often time-consuming or infeasible to directly perform traditional statistical analysis with relatively limited computing resources at hand. We propose a fast subsampling method to effectively approximate the full data maximum partial likelihood estimator in Cox's model, which greatly reduces the computational burden when analyzing massive survival data. We establish consistency and asymptotic normality of a ...
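As a toy illustration of the computational idea only (plain uniform subsampling; the paper's subsampling probabilities and bias corrections are not reproduced), one can compare a Cox fit on a random subsample with the full-data fit. The data generation and the use of `lifelines` are assumptions for the example.

```python
import numpy as np
import pandas as pd
from lifelines import CoxPHFitter

rng = np.random.default_rng(0)

# toy "massive" survival data set with two covariates
n = 200_000
X = rng.normal(size=(n, 2))
t = rng.exponential(1 / np.exp(X @ np.array([0.5, -0.25])))
c = rng.exponential(2.0, size=n)
df = pd.DataFrame({"T": np.minimum(t, c), "E": (t <= c).astype(int),
                   "x1": X[:, 0], "x2": X[:, 1]})

full = CoxPHFitter().fit(df, duration_col="T", event_col="E")

# a uniform subsample of r rows approximates the full-data estimate much faster
r = 5_000
sub = CoxPHFitter().fit(df.sample(n=r, random_state=1),
                        duration_col="T", event_col="E")

print(full.params_.round(3))   # full-data maximum partial likelihood estimate
print(sub.params_.round(3))    # subsample approximation
```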
June 27, 2020
The modeling of time-to-event data, also known as survival analysis, requires specialized methods that can deal with censoring and truncation, time-varying features and effects, and that extend to settings with multiple competing events. However, many machine learning methods for survival analysis only consider the standard setting with right-censored data and proportional hazards assumption. The methods that do provide extensions usually address at most a subset of these cha...
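One common reduction used by such toolkits (sketched generically here, not as this particular paper's method) is to expand right-censored records into a person-period "long" format so that any binary classifier or Poisson model can be applied; the helper `to_person_period` and the toy data are illustrative assumptions.

```python
import pandas as pd

def to_person_period(df, cuts):
    """Expand (time, event) records into one row per discrete interval.

    Each subject contributes a row for every interval they survive into,
    with y = 1 only in the interval where their event occurs.
    """
    rows = []
    for _, r in df.iterrows():
        for k, (lo, hi) in enumerate(zip(cuts[:-1], cuts[1:])):
            if r["time"] <= lo:
                break
            event_here = int(r["event"] == 1 and lo < r["time"] <= hi)
            rows.append({"id": r["id"], "interval": k, "y": event_here})
    return pd.DataFrame(rows)

df = pd.DataFrame({"id": [1, 2, 3],
                   "time": [2.5, 4.0, 1.2],
                   "event": [1, 0, 1]})
long = to_person_period(df, cuts=[0, 1, 2, 3, 4, 5])
print(long)
# the long table can be fed to logistic/Poisson regression, boosting, etc.,
# with 'interval' acting as a stand-in for the baseline hazard
```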
May 9, 2023
We study parametric inference on a rich class of hazard regression models in the presence of right-censoring. Previous literature has reported some inferential challenges, such as multimodal or flat likelihood surfaces, in this class of models for some particular data sets. We formalize the study of these inferential problems by linking them to the concepts of near-redundancy and practical non-identifiability of parameters. We show that the maximum likelihood estimators of th...
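To make the phenomenon concrete (an illustrative sketch only, using an assumed Weibull proportional-hazards working model rather than the hazard family studied in the paper), one can profile the log-likelihood over a single parameter and inspect how flat it is near the maximum:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(2)

# toy right-censored data from a Weibull PH model: h(t|x) = rho*t^(rho-1)*exp(a + b*x)
n, rho_true, a_true, b_true = 150, 1.3, -0.2, 0.5
x = rng.normal(size=n)
u = rng.uniform(size=n)
t_event = (-np.log(u) / np.exp(a_true + b_true * x)) ** (1 / rho_true)
t_cens = rng.exponential(2.0, size=n)
t, d = np.minimum(t_event, t_cens), (t_event <= t_cens).astype(float)

def negloglik(theta, rho):
    # theta = (a, b) with the shape parameter rho held fixed
    a, b = theta
    lin = a + b * x
    loghaz = np.log(rho) + (rho - 1) * np.log(t) + lin   # log h(t|x)
    cumhaz = t ** rho * np.exp(lin)                      # H(t|x)
    return -np.sum(d * loghaz - cumhaz)

# profile log-likelihood over the shape parameter rho
for rho in [0.8, 1.0, 1.2, 1.4, 1.6]:
    fit = minimize(negloglik, x0=np.zeros(2), args=(rho,), method="BFGS")
    print(f"rho = {rho:.1f}   profile loglik = {-fit.fun:.2f}")
# a nearly constant profile (or one with several local maxima) is the kind of
# flat/multimodal surface that signals near-redundancy or practical
# non-identifiability of a parameter
```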
October 6, 2021
It's regarded as an axiom that a good model is one that compromises between bias and variance. The bias is measured by the training cost, while the variance of a (say, regression) model is measured by the cost on a validation set. If reducing bias is the goal, one will strive to fit as complex a model as necessary, but complexity is invariably coupled with variance: greater complexity implies greater variance. In practice, driving training cost to near zero does not...
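The familiar picture can be reproduced in a few lines (a toy illustration, not taken from the paper): sweep the degree of a least-squares polynomial fit and watch training error fall monotonically while validation error eventually turns back up.

```python
import numpy as np

rng = np.random.default_rng(0)

# noisy samples of a smooth function, split into train / validation sets
x = np.sort(rng.uniform(-1, 1, size=80))
y = np.sin(3 * x) + rng.normal(scale=0.3, size=x.size)
x_tr, y_tr, x_va, y_va = x[::2], y[::2], x[1::2], y[1::2]

for degree in [1, 3, 5, 9, 15]:
    coefs = np.polyfit(x_tr, y_tr, degree)        # least-squares polynomial fit
    mse_tr = np.mean((np.polyval(coefs, x_tr) - y_tr) ** 2)
    mse_va = np.mean((np.polyval(coefs, x_va) - y_va) ** 2)
    print(f"degree {degree:2d}   train MSE {mse_tr:.3f}   validation MSE {mse_va:.3f}")
# training error shrinks with model complexity; validation error typically rises again
```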
December 11, 2019
Repeated measures of biomarkers have the potential to explain the hazards of survival outcomes. In practice, these biomarkers are measured intermittently and are known to be subject to substantial measurement error. Joint modelling of longitudinal and survival data enables us to associate intermittently measured, error-prone biomarkers with risks of survival outcomes. Most of the joint models available in the literature have been built on the Gaussian assumption. This makes t...
June 18, 2021
The Cox model is an indispensable tool for time-to-event analysis, particularly in biomedical research. However, medicine is undergoing a profound transformation, generating data at an unprecedented scale, which opens new frontiers to study and understand diseases. With the wealth of data collected, new challenges for statistical inference arise, as datasets are often high dimensional, exhibit an increasing number of measurements at irregularly spaced time points, and are sim...