Analysis of overfitting in the regularized Cox model

Approximating Partial Likelihood Estimators via Optimal Subsampling

October 10, 2022

86% Match

Haixiang Zhang, Lulu Zuo, ..., Liuquan Sun

Methodology

Computation

With the growing availability of large-scale biomedical data, it is often time-consuming or infeasible to directly perform traditional statistical analysis with relatively limited computing resources at hand. We propose a fast subsampling method to effectively approximate the full data maximum partial likelihood estimator in Cox's model, which largely reduces the computational burden when analyzing massive survival data. We establish consistency and asymptotic normality of a ...

Optimal Estimation for the Functional Cox Model

January 27, 2016

86% Match

Simeng Qu, Jane-Ling Wang, Xiao Wang

Methodology

Statistics Theory

Functional covariates are common in many medical, biodemographic, and neuroimaging studies. The aim of this paper is to study functional Cox models with right-censored data in the presence of both functional and scalar covariates. We study the asymptotic properties of the maximum partial likelihood estimator and establish the asymptotic normality and efficiency of the estimator of the finite-dimensional parameter. Under the framework of reproducing kernel Hilbert space, the e...

Scalable Sparse Cox's Regression for Large-Scale Survival Data via Broken Adaptive Ridge

December 2, 2017

86% Match

Eric S. Kawaguchi, Marc A. Suchard, ..., Gang Li

Methodology

This paper develops a new scalable sparse Cox regression tool for sparse high-dimensional massive sample size (sHDMSS) survival data. The method is a local $L_0$-penalized Cox regression via repeatedly performing reweighted $L_2$-penalized Cox regression. We show that the resulting estimator enjoys the best of $L_0$- and $L_2$-penalized Cox regressions while overcoming their limitations. Specifically, the estimator is selection consistent, oracle for parameter estimation, and...

Regularized estimation for highly multivariate log Gaussian Cox processes

May 4, 2019

86% Match

Achmad Choiruddin, Francisco Cuevas-Pacheco, ..., Rasmus Waagepetersen

Methodology

Computation

Statistical inference for highly multivariate point pattern data is challenging due to complex models with large numbers of parameters. In this paper, we develop numerically stable and efficient parameter estimation and model selection algorithms for a class of multivariate log Gaussian Cox processes. The methodology is applied to a highly multivariate point pattern data set from tropical rain forest ecology.

Optimizing Cox Models with Stochastic Gradient Descent: Theoretical Foundations and Practical Guidances

August 5, 2024

86% Match

Lang Zeng, Weijing Tang, ..., Ying Ding

Machine Learning

Optimizing Cox regression and its neural network variants poses substantial computational challenges in large-scale studies. Stochastic gradient descent (SGD), known for its scalability in model optimization, has recently been adapted to optimize Cox models. Unlike its conventional application, which typically targets a sum of independent individual losses, SGD for Cox models updates parameters based on the partial likelihood of a subset of data. Despite its empirical success, ...

Bayesian inference on Cox regression models using catalytic prior distributions

December 3, 2023

86% Match

Weihao Li, Dongming Huang

Methodology

The Cox proportional hazards model (Cox model) is a popular model for survival data analysis. When the sample size is small relative to the dimension of the model, the standard maximum partial likelihood inference is often problematic. In this work, we propose the Cox catalytic prior distributions for Bayesian inference on Cox models, which are an extension of a general class of prior distributions originally designed for stabilizing complex parametric models. The Cox catalyti...

Inference with penalized likelihood

January 30, 2014

86% Match

Daniel Commenges, Jérémie Bureau, Hein Putter

Statistics Theory

This work studies the statistical properties of the maximum penalized likelihood approach in a semi-parametric framework. We recall the penalized likelihood approach for estimating a function and review some asymptotic results. We investigate the properties of two estimators of the variance of maximum penalized likelihood estimators: a sandwich estimator and a Bayesian estimator. The coverage rates of confidence intervals based on these estimators are studied through a simulati...

A Fast Divide-and-Conquer Sparse Cox Regression

April 2, 2018

86% Match

Yan Wang, Nathan Palmer, Qian Di, Joel Schwartz, ..., Tianxi Cai

Computation

Applications

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$. The proposed algorithm achieves computational efficiency through a one-step linear approximation followed by a least squares approximation to the partial likelihood (PL). These sequences of linearization enable us to maximize the ...

On the Properties of Simulation-based Estimators in High Dimensions

October 10, 2018

85% Match

Stéphane Guerrier, Mucyo Karemera, ..., Maria-Pia Victoria-Feser

Statistics Theory

Computation

Methodology

Considering the increasing size of available data, the need for statistical methods that control the finite sample bias is growing. This is mainly due to the frequent settings where the number of variables is large and allowed to increase with the sample size bringing standard inferential procedures to incur significant loss in terms of performance. Moreover, the complexity of statistical models is also increasing thereby entailing important computational challenges in constr...

Mastering Rare Event Analysis: Optimal Subsample Size in Logistic and Cox Regressions

June 19, 2024

85% Match

Tal Agassi, Nir Keret, Malka Gorfine

Methodology

In the realm of contemporary data analysis, the use of massive datasets has taken on heightened significance, albeit often entailing considerable demands on computational time and memory. While a multitude of existing works offer optimal subsampling methods for conducting analyses on subsamples with minimized efficiency loss, they notably lack tools for judiciously selecting the optimal subsample size. To bridge this gap, our work introduces tools designed for choosing the op...
