May 22, 2024
Similar papers 2
September 13, 2024
We consider high-dimensional regression with a count response modeled by a Poisson or negative binomial generalized linear model (GLM). We propose a penalized maximum likelihood estimator with a properly chosen complexity penalty and establish its adaptive minimaxity across models of varying sparsity. To make the procedure computationally feasible for high-dimensional data, we consider its LASSO and SLOPE convex surrogates. Their performance is illustrated through simulated and ...
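As a concrete illustration of the LASSO surrogate mentioned above, here is a minimal proximal-gradient (ISTA) sketch of l1-penalized Poisson regression in NumPy. This is not the paper's estimator or penalty calibration; the step size, penalty level, and toy data are illustrative assumptions.

```python
import numpy as np

def soft_threshold(z, t):
    """Elementwise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_poisson(X, y, lam=10.0, step=1e-3, n_iter=5000):
    """Proximal-gradient (ISTA) sketch of l1-penalized Poisson regression.

    Minimizes sum_i [exp(x_i @ beta) - y_i * (x_i @ beta)] + lam * ||beta||_1.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = np.exp(X @ beta)        # fitted Poisson mean per observation
        grad = X.T @ (mu - y)        # gradient of the negative log-likelihood
        beta = soft_threshold(beta - step * grad, step * lam)
    return beta

# toy data: only the first two of five covariates matter
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
beta_true = np.array([0.8, -0.5, 0.0, 0.0, 0.0])
y = rng.poisson(np.exp(X @ beta_true))
beta_hat = lasso_poisson(X, y)
```

The same loop handles the SLOPE surrogate by replacing the scalar threshold with the sorted-l1 proximal operator; the paper's point is precisely that these convex surrogates trade some statistical optimality for tractability.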
December 5, 2023
We use statistical mechanics techniques, viz. the replica method, to model the effect of censoring on overfitting in Cox's proportional hazards model, the dominant regression method for time-to-event data. In the overfitting regime, maximum likelihood parameter estimators are known to be biased even for small values of the ratio of the number of covariates to the number of samples. The inclusion of censoring was avoided in previous overfitting analyses for mathematical c...
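For reference, the object being analyzed is Cox's proportional hazards model; a standard statement of the model and its partial likelihood under right-censoring (the textbook formulation, not the paper's replica calculation) is:

```latex
% Hazard for a subject with covariates x_i; h_0 is the baseline hazard
h(t \mid x_i) = h_0(t)\, \exp(x_i^\top \beta)

% Partial likelihood: the product runs over uncensored subjects
% (\delta_i = 1); R(t_i) is the risk set at event time t_i
L(\beta) = \prod_{i \,:\, \delta_i = 1}
    \frac{\exp(x_i^\top \beta)}{\sum_{j \in R(t_i)} \exp(x_j^\top \beta)}
```

Censoring enters only through the event indicators \(\delta_i\) and the risk sets \(R(t_i)\), which is why its effect on the overfitting bias requires a separate analysis.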
October 31, 2017
The popularity of penalized regression in high-dimensional data analysis has led to a demand for new inferential tools for these models. False discovery rate control is widely used in high-dimensional hypothesis testing, but has only recently been considered in the context of penalized regression. Almost all of this work, however, has focused on lasso-penalized linear regression. In this paper, we derive a general method for controlling the marginal false discovery rate that ...
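For context, the classical step-up procedure of Benjamini and Hochberg is the standard false-discovery-rate control tool referenced above. The sketch below is that generic procedure, not the marginal FDR method derived in the paper; the p-values and level are illustrative.

```python
import numpy as np

def benjamini_hochberg(pvals, q=0.1):
    """Benjamini-Hochberg step-up: boolean mask of rejected hypotheses.

    Controls the false discovery rate at level q for independent p-values.
    """
    p = np.asarray(pvals, dtype=float)
    m = p.size
    order = np.argsort(p)
    ranked = p[order]
    # find the largest k with p_(k) <= (k / m) * q
    below = ranked <= (np.arange(1, m + 1) / m) * q
    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.max(np.nonzero(below)[0])   # last index under the BH line
        reject[order[: k + 1]] = True      # step-up: reject all smaller p
    return reject

pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.058, 0.3, 0.5, 0.7, 0.9]
mask = benjamini_hochberg(pvals, q=0.1)
```

Note the step-up character: 0.039 and 0.041 sit above the BH line themselves yet are still rejected, because a larger p-value further down falls below it.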
October 11, 2023
Wide heterogeneity exists in cancer patients' survival, ranging from a few months to several decades. To accurately predict clinical outcomes, it is vital to build a predictive model that relates patients' molecular profiles to their survival. Given the complex relationships between survival and high-dimensional molecular predictors, it is challenging to perform non-parametric modeling and remove irrelevant predictors simultaneously. In this paper, we build a ker...
October 15, 2024
Regression analysis with missing data is a long-standing and challenging problem, particularly when there are many missing variables with arbitrary missing patterns. Likelihood-based methods, although theoretically appealing, are often computationally inefficient or even infeasible when dealing with a large number of missing variables. In this paper, we consider the Cox regression model with incomplete covariates that are missing at random. We develop an expectation-maximizat...
August 13, 2009
We consider the problem of model selection and estimation in situations where the number of parameters diverges with the sample size. When the dimension is high, an ideal method should have the oracle property [J. Amer. Statist. Assoc. 96 (2001) 1348--1360] and [Ann. Statist. 32 (2004) 928--961] which ensures the optimal large sample performance. Furthermore, the high-dimensionality often induces the collinearity problem, which should be properly handled by the ideal method. ...
October 28, 2017
This paper deals with the proportional hazards model proposed by D. R. Cox in a high-dimensional and sparse setting for the regression parameter. The Dantzig selector is applied to estimate the regression parameter, and we prove its variable selection consistency for this model. This property enables us to reduce the dimension of the parameter and to construct asymptotically normal estimators for the regression parameter and the cumulative baseline haz...
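The Dantzig selector is a linear program: minimize \(\|\beta\|_1\) subject to \(\|X^\top(y - X\beta)\|_\infty \le \lambda\). The sketch below solves that LP for a plain linear model with scipy; the paper applies the selector to Cox's partial likelihood instead, and the data, \(\lambda\), and solver choice here are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import linprog

def dantzig_selector(X, y, lam):
    """Solve min ||beta||_1  s.t.  ||X.T @ (y - X @ beta)||_inf <= lam.

    Split beta = u - v with u, v >= 0, so both the l1 objective and the
    sup-norm constraint become linear in z = (u, v).
    """
    n, p = X.shape
    G, c = X.T @ X, X.T @ y
    # G(u - v) - c <= lam  and  -(G(u - v) - c) <= lam
    A_ub = np.block([[G, -G], [-G, G]])
    b_ub = np.concatenate([lam + c, lam - c])
    res = linprog(np.ones(2 * p), A_ub=A_ub, b_ub=b_ub,
                  bounds=[(0, None)] * (2 * p), method="highs")
    u, v = res.x[:p], res.x[p:]
    return u - v

# toy sparse linear model: only the first two coefficients are nonzero
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 8))
beta_true = np.array([1.5, -1.0, 0, 0, 0, 0, 0, 0])
y = X @ beta_true + 0.1 * rng.normal(size=100)
beta_hat = dantzig_selector(X, y, lam=3.0)
```

Because the l1 objective is minimized subject to a correlation constraint rather than a residual constraint, the solution is sparse for suitable \(\lambda\), which is what drives the dimension-reduction step in the paper.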
January 31, 2025
We study the flexible piecewise exponential model in a high dimensional setting where the number of covariates $p$ grows proportionally to the number of observations $n$ and under the hypothesis of random uncorrelated Gaussian designs. We prove rigorously that the optimal ridge penalized log-likelihood of the model converges in probability to the saddle point of a surrogate objective function. The technique of proof is the Convex Gaussian Min-Max theorem of Thrampoulidis, Oym...
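As a reminder of the model class, a minimal statement of the piecewise exponential model with a ridge penalty, assuming a fixed partition \(0 = s_0 < s_1 < \dots < s_K\) of the time axis (generic notation, not necessarily the paper's):

```latex
% Piecewise constant baseline hazard \lambda_k on each interval (s_{k-1}, s_k]
h(t \mid x) = \lambda_k \exp(x^\top \beta), \qquad t \in (s_{k-1}, s_k]

% Ridge-penalized log-likelihood, penalty strength \eta > 0
\ell_\eta(\beta, \lambda) = \ell(\beta, \lambda) - \frac{\eta}{2} \lVert \beta \rVert_2^2
```

The paper's claim concerns the value of this penalized objective at its optimum in the proportional regime \(p/n \to \zeta > 0\), characterized via the Convex Gaussian Min-Max theorem.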
June 29, 2023
This paper considers a joint survival and mixed-effects model to explain the survival time from longitudinal data and high-dimensional covariates. The longitudinal data are modeled using a nonlinear effects model, whose regression function serves as a link function incorporated into a Cox model as a covariate; in this way, the longitudinal data are related to the survival time at a given time. Additionally, the Cox model takes into account the inclusion of high-dimensional ...
July 18, 2012
To better understand the interplay of censoring and sparsity, we develop finite sample properties of the nonparametric Cox proportional hazards model. Motivated by the impact of sequencing data, which carry the genetic information of each individual, we work with an over-parametrized problem and propose a general class of group penalties suitable for sparse structured variable selection and estimation. Novel non-asymptotic sandwich bounds for the partial likelihood are developed. We establish how...