Prediction Errors for Penalized Regressi...

Tuning parameter selection in high dimensional penalized likelihood

May 11, 2016

87% Match

Yingying Fan, Cheng Yong Tang

Methodology

Machine Learning

Determining how to appropriately select the tuning parameter is essential in penalized likelihood methods for high-dimensional data analysis. We examine this problem in the setting of penalized likelihood methods for generalized linear models, where the dimensionality of covariates p is allowed to increase exponentially with the sample size n. We propose to select the tuning parameter by optimizing the generalized information criterion (GIC) with an appropriate model complexi...

Find SimilarView on arXiv

A Unified Bayesian Inference Framework for Generalized Linear Models

December 29, 2017

87% Match

Xiangming Meng, Sheng Wu, Jiang Zhu

Information Theory

In this letter, we present a unified Bayesian inference framework for generalized linear models (GLM) which iteratively reduces the GLM problem to a sequence of standard linear model (SLM) problems. This framework provides new perspectives on some established GLM algorithms derived from SLM ones and also suggests novel extensions for some other SLM algorithms. Specific instances elucidated under such framework are the GLM versions of approximate message passing (AMP), vector ...

Find SimilarView on arXiv

Replica analysis of overfitting in generalized linear models

April 14, 2020

87% Match

ACC Coolen, M Sheikh, A Mozeika, ... , Antenucci F

Disordered Systems and Neura...

Statistics Theory

Nearly all statistical inference methods were developed for the regime where the number $N$ of data samples is much larger than the data dimension $p$. Inference protocols such as maximum likelihood (ML) or maximum a posteriori probability (MAP) are unreliable if $p=O(N)$, due to overfitting. This limitation has for many disciplines with increasingly high-dimensional data become a serious bottleneck. We recently showed that in Cox regression for time-to-event data the overfit...

Find Similar View on arXiv

Approximate Message Passing with Spectral Initialization for Generalized Linear Models

October 7, 2020

87% Match

Marco Mondelli, Ramji Venkataramanan

stat.ML

cs.IT

cs.LG

math.IT

math.ST

stat.TH

We consider the problem of estimating a signal from measurements obtained via a generalized linear model. We focus on estimators based on approximate message passing (AMP), a family of iterative algorithms with many appealing features: the performance of AMP in the high-dimensional limit can be succinctly characterized under suitable model assumptions; AMP can also be tailored to the empirical distribution of the signal entries, and for a wide class of estimation problems, AM...

Find SimilarView on arXiv

Model selection and minimax estimation in generalized linear models

September 30, 2014

87% Match

Felix Abramovich, Vadim Grinshtein

Statistics Theory

Methodology

Statistics Theory

We consider model selection in generalized linear models (GLM) for high-dimensional data and propose a wide class of model selection criteria based on penalized maximum likelihood with a complexity penalty on the model size. We derive a general nonasymptotic upper bound for the expected Kullback-Leibler divergence between the true distribution of the data and that generated by a selected model, and establish the corresponding minimax lower bounds for sparse GLM. For the prope...

Find SimilarView on arXiv

A stochastic variational framework for fitting and diagnosing generalized linear mixed models

August 24, 2012

86% Match

Linda S. L. Tan, David J. Nott

Methodology

In stochastic variational inference, the variational Bayes objective function is optimized using stochastic gradient approximation, where gradients computed on small random subsets of data are used to approximate the true gradient over the whole data set. This enables complex models to be fit to large data sets as data can be processed in mini-batches. In this article, we extend stochastic variational inference for conjugate-exponential models to nonconjugate models and prese...

Find SimilarView on arXiv

Efficiency for Regularization Parameter Selection in Penalized Likelihood Estimation of Misspecified Models

February 8, 2013

86% Match

Cheryl J. Flynn, Clifford M. Hurvich, Jeffrey S. Simonoff

Machine Learning

It has been shown that AIC-type criteria are asymptotically efficient selectors of the tuning parameter in non-concave penalized regression methods under the assumption that the population variance is known or that a consistent estimator is available. We relax this assumption to prove that AIC itself is asymptotically efficient and we study its performance in finite samples. In classical regression, it is known that AIC tends to select overly complex models when the dimension...

Find SimilarView on arXiv

Semi-analytic approximate stability selection for correlated data in generalized linear models

March 19, 2020

86% Match

Takashi Takahashi, Yoshiyuki Kabashima

Machine Learning

Disordered Systems and Neura...

Statistical Mechanics

Machine Learning

Methodology

We consider the variable selection problem of generalized linear models (GLMs). Stability selection (SS) is a promising method proposed for solving this problem. Although SS provides practical variable selection criteria, it is computationally demanding because it needs to fit GLMs to many re-sampled datasets. We propose a novel approximate inference algorithm that can conduct SS without the repeated fitting. The algorithm is based on the replica method of statistical mechani...

Find SimilarView on arXiv

Theoretical Analysis of Leave-one-out Cross Validation for Non-differentiable Penalties under High-dimensional Settings

February 13, 2024

86% Match

Haolin Zou, Arnab Auddy, ... , Maleki Arian

Statistics Theory

Machine Learning

Statistics Theory

Despite a large and significant body of recent work focused on estimating the out-of-sample risk of regularized models in the high dimensional regime, a theoretical understanding of this problem for non-differentiable penalties such as generalized LASSO and nuclear norm is missing. In this paper we resolve this challenge. We study this problem in the proportional high dimensional regime where both the sample size n and number of features p are large, and n/p and the signal-to...

Find SimilarView on arXiv

Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing Algorithm

July 2, 2021

86% Match

Kan Chen, Zhiqi Bu, Shiyun Xu

Statistics Theory

Methodology

Statistics Theory

Sparse Group LASSO (SGL) is a regularized model for high-dimensional linear regression problems with grouped covariates. SGL applies $l_1$ and $l_2$ penalties on the individual predictors and group predictors, respectively, to guarantee sparse effects both on the inter-group and within-group levels. In this paper, we apply the approximate message passing (AMP) algorithm to efficiently solve the SGL problem under Gaussian random designs. We further use the recently developed s...

Find SimilarView on arXiv

Prediction Errors for Penalized Regressions based on Generalized Approximate Message Passing

Tuning parameter selection in high dimensional penalized likelihood

A Unified Bayesian Inference Framework for Generalized Linear Models

Replica analysis of overfitting in generalized linear models

Approximate Message Passing with Spectral Initialization for Generalized Linear Models

Model selection and minimax estimation in generalized linear models

A stochastic variational framework for fitting and diagnosing generalized linear mixed models

Efficiency for Regularization Parameter Selection in Penalized Likelihood Estimation of Misspecified Models

Semi-analytic approximate stability selection for correlated data in generalized linear models

Theoretical Analysis of Leave-one-out Cross Validation for Non-differentiable Penalties under High-dimensional Settings

Asymptotic Statistical Analysis of Sparse Group LASSO via Approximate Message Passing Algorithm