Similar papers
March 5, 2021
The lasso and elastic net are popular regularized regression models for supervised learning. Friedman, Hastie, and Tibshirani (2010) introduced a computationally efficient algorithm for computing the elastic net regularization path for ordinary least squares regression, logistic regression and multinomial logistic regression, while Simon, Friedman, Hastie, and Tibshirani (2011) extended this work to Cox models for right-censored data. We further extend the reach of the elasti...
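For orientation, the squared-error case of the elastic net solved along such a regularization path has the standard form (generic notation, not quoted from the paper above):
\[
\hat{\beta}(\lambda) = \operatorname*{arg\,min}_{\beta_0,\,\beta}\; \frac{1}{2n}\sum_{i=1}^{n}\bigl(y_i - \beta_0 - x_i^{\top}\beta\bigr)^2 + \lambda\Bigl(\alpha\,\lVert\beta\rVert_1 + \tfrac{1-\alpha}{2}\,\lVert\beta\rVert_2^2\Bigr),
\]
with $\alpha = 1$ recovering the lasso and $\alpha = 0$ the ridge penalty; the path algorithm computes $\hat{\beta}(\lambda)$ over a decreasing grid of $\lambda$ values, and the logistic, multinomial and Cox cases replace the squared-error loss with the corresponding negative (partial) log-likelihood.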
October 17, 2016
Given data $y(n)$ and $p(n)$ covariates $x(n)$, one problem in linear regression is to decide which, if any, of the covariates to include. There are many articles on this problem, but all are based on a stochastic model for the data. This paper gives what seems to be a new approach that does not require any form of model. It is conceptually and algorithmically simple, and consistency results can be proved under appropriate assumptions.
May 7, 2016
In this paper, we derive non-asymptotic error bounds for the Lasso estimator when the penalty parameter for the estimator is chosen using $K$-fold cross-validation. Our bounds imply that the cross-validated Lasso estimator has nearly optimal rates of convergence in the prediction, $L^2$, and $L^1$ norms. For example, we show that in the model with Gaussian noise and under fairly general assumptions on the candidate set of values of the penalty parameter, the estimation er...
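For context, the tuning rule analyzed here is ordinary $K$-fold cross-validation, sketched below in generic notation ($\Lambda_n$ denotes the candidate set of penalty values):
\[
\hat{\lambda} = \operatorname*{arg\,min}_{\lambda\in\Lambda_n}\; \sum_{k=1}^{K}\sum_{i\in I_k}\bigl(y_i - x_i^{\top}\hat{\beta}_{(-k)}(\lambda)\bigr)^2,
\]
where $I_1,\dots,I_K$ partition the observations and $\hat{\beta}_{(-k)}(\lambda)$ is the lasso estimator computed without fold $k$; the cross-validated Lasso estimator is then the lasso refitted on the full sample at $\hat{\lambda}$.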
January 30, 2014
This work studies the statistical properties of the maximum penalized likelihood approach in a semi-parametric framework. We recall the penalized likelihood approach for estimating a function and review some asymptotic results. We investigate the properties of two estimators of the variance of maximum penalized likelihood estimators: the sandwich estimator and a Bayesian estimator. The coverage rates of confidence intervals based on these estimators are studied through a simulati...
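For reference, a sandwich-type variance estimator for a penalized maximum likelihood estimator $\hat{\theta}$ typically has the generic form below (a sketch in standard notation, not necessarily the exact estimator studied in the paper):
\[
\widehat{\operatorname{Var}}(\hat{\theta}) = \bigl(H(\hat{\theta}) + \lambda\Omega\bigr)^{-1}\, \widehat{I}(\hat{\theta})\, \bigl(H(\hat{\theta}) + \lambda\Omega\bigr)^{-1},
\]
where $H$ is the observed information of the unpenalized log-likelihood, $\lambda\Omega$ is the curvature added by the penalty, and $\widehat{I}$ estimates the variance of the score.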
September 4, 2023
Challenges with data in the big-data era include (i) that the dimension $p$ is often larger than the sample size $n$, and (ii) that outliers or contaminated points are frequently hidden and more difficult to detect. Challenge (i) renders most conventional methods inapplicable and has thus attracted tremendous attention from the statistics, computer science, and biomedical communities. Numerous penalized regression methods have been introduced as modern methods for analyzing high-dimensional data. ...
June 23, 2011
We consider a general high-dimensional additive hazard model in a non-asymptotic setting, including regression for censored data. In this context, we consider a Lasso estimator with a fully data-driven $\ell_1$ penalization, which is tuned for the estimation problem at hand. We prove sharp oracle inequalities for this estimator. Our analysis involves a new "data-driven" Bernstein's inequality, which is of independent interest, where the predictable variation is replaced by the...
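In generic form, a fully data-driven $\ell_1$ penalization of this kind weights each coordinate separately (a sketch only; the actual weights in the paper come out of its Bernstein-type analysis):
\[
\hat{\beta} = \operatorname*{arg\,min}_{\beta}\Bigl\{\, \ell_n(\beta) + \sum_{j=1}^{p}\hat{w}_j\,\lvert\beta_j\rvert \Bigr\},
\]
where $\ell_n$ is the empirical risk of the additive hazards model and the data-driven weights $\hat{w}_j$ calibrate the penalty on each coordinate to its estimated noise level.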
July 18, 2018
We introduce a pliable lasso method for estimation of interaction effects in the Cox proportional hazards model framework. The pliable lasso is a linear model that includes interactions between covariates X and a set of modifying variables Z and assumes sparsity of the main effects and interaction effects. The hierarchical penalty excludes interaction effects when the corresponding main effects are zero: this avoids overfitting and an explosion of model complexity. We extend ...
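For orientation, in the original squared-error pliable lasso the linear predictor with modifying variables Z has roughly the form below (a sketch in standard notation from the pliable lasso literature; the Cox extension replaces the loss with the negative partial log-likelihood):
\[
\eta(x, z) = \beta_0 + z^{\top}\theta_0 + \sum_{j=1}^{p} x_j\bigl(\beta_j + z^{\top}\theta_j\bigr),
\]
with the hierarchical penalty designed so that an interaction vector $\theta_j$ can enter only when the corresponding main effect $\beta_j$ is nonzero.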
October 3, 2022
A prevalent feature of high-dimensional data is the dependence among covariates, and model selection is known to be challenging when covariates are highly correlated. To perform model selection for the high-dimensional Cox proportional hazards model in the presence of correlated covariates with factor structure, we propose a new model, Factor-Augmented Regularized Model for Hazard Regression (FarmHazard), which builds upon latent factors that drive covariate dependence and extend...
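The latent factor structure that such a model exploits is conventionally written as (generic factor-model notation, not the paper's own):
\[
x_i = B f_i + u_i,
\]
where the low-dimensional factors $f_i$ drive the dependence among covariates, $B$ is a loading matrix, and $u_i$ is an idiosyncratic component; hazard regression can then be carried out on estimated factors and idiosyncratic parts rather than on the raw, highly correlated covariates.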
October 25, 2010
High-throughput genetic sequencing arrays with thousands of measurements per sample, together with a wealth of related censored clinical data, have created a pressing need for better measurement-specific model selection. In this paper we establish strong oracle properties of nonconcave penalized methods for non-polynomial (NP) dimensional data with censoring in the framework of Cox's proportional hazards model. A class of folded-concave penalties is employed, and both LASSO and SCA...
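For reference, the canonical folded-concave penalty is SCAD, usually specified through its derivative (standard definition, included here for orientation):
\[
p'_{\lambda}(t) = \lambda\Bigl\{\mathbf{1}(t\le\lambda) + \frac{(a\lambda - t)_{+}}{(a-1)\lambda}\,\mathbf{1}(t>\lambda)\Bigr\}, \qquad t\ge 0,\; a>2,
\]
which penalizes small coefficients like the lasso, tapers off, and leaves coefficients larger than $a\lambda$ unpenalized, thereby reducing the bias on large signals.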
September 9, 2022
In high-dimensional regression, where the number of covariates is of the order of the number of observations, ridge penalization is often used as a remedy against overfitting. Unfortunately, for correlated covariates such regularisation in generalized linear models typically induces not only shrinkage of the estimated parameter vector, but also an unwanted \emph{rotation} relative to the true vector. We show analytically how this problem can be removed by using a generalizati...
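The setting is ridge-penalized maximum likelihood in a generalized linear model, written here in generic notation:
\[
\hat{\beta}_{\lambda} = \operatorname*{arg\,min}_{\beta}\Bigl\{ -\frac{1}{n}\sum_{i=1}^{n}\log p\bigl(y_i \mid x_i^{\top}\beta\bigr) + \frac{\lambda}{2}\,\lVert\beta\rVert_2^2 \Bigr\};
\]
for correlated covariates the minimizer is not only shrunk towards zero but also rotated away from the direction of the true parameter vector, and it is this rotation that the paper's generalisation of ridge penalization is designed to remove.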