Semi-analytic approximate stability sele...

Estimation in Rotationally Invariant Generalized Linear Models via Approximate Message Passing

December 8, 2021

85% Match

Ramji Venkataramanan, Kevin Kögler, Marco Mondelli

stat.ML

cs.IT

cs.LG

math.IT

math.ST

stat.TH

We consider the problem of signal estimation in generalized linear models defined via rotationally invariant design matrices. Since these matrices can have an arbitrary spectral distribution, this model is well suited for capturing complex correlation structures which often arise in applications. We propose a novel family of approximate message passing (AMP) algorithms for signal estimation, and rigorously characterize their performance in the high-dimensional limit via a sta...

Find SimilarView on arXiv

Approximate message passing for nonconvex sparse regularization with stability and asymptotic analysis

November 8, 2017

85% Match

Ayaka Sakata, Yingying Xu

Machine Learning

We analyse a linear regression problem with nonconvex regularization called smoothly clipped absolute deviation (SCAD) under an overcomplete Gaussian basis for Gaussian random data. We propose an approximate message passing (AMP) algorithm considering nonconvex regularization, namely SCAD-AMP, and analytically show that the stability condition corresponds to the de Almeida--Thouless condition in spin glass literature. Through asymptotic analysis, we show the correspondence be...

Find SimilarView on arXiv

An Ensemble EM Algorithm for Bayesian Variable Selection

March 14, 2016

85% Match

Jin Wang, Feng Liang, Yuan Ji

Computation

Methodology

We study the Bayesian approach to variable selection in the context of linear regression. Motivated by a recent work by Rockova and George (2014), we propose an EM algorithm that returns the MAP estimate of the set of relevant variables. Due to its particular updating scheme, our algorithm can be implemented efficiently without inverting a large matrix in each iteration and therefore can scale up with big data. We also show that the MAP estimate returned by our EM algorithm a...

Find SimilarView on arXiv

Distributed variable screening for generalized linear models

May 7, 2024

85% Match

Tianbo Diao, Lianqiang Qu, ... , Sun Liuquan

Methodology

In this article, we develop a distributed variable screening method for generalized linear models. This method is designed to handle situations where both the sample size and the number of covariates are large. Specifically, the proposed method selects relevant covariates by using a sparsity-restricted surrogate likelihood estimator. It takes into account the joint effects of the covariates rather than just the marginal effect, and this characteristic enhances the reliability...

Find SimilarView on arXiv

Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models

June 16, 2010

85% Match

Han Liu, Kathryn Roeder, Larry Wasserman

Machine Learning

A challenging problem in estimating high-dimensional graphical models is to choose the regularization parameter in a data-dependent way. The standard techniques include $K$-fold cross-validation ($K$-CV), Akaike information criterion (AIC), and Bayesian information criterion (BIC). Though these methods work well for low-dimensional problems, they are not suitable in high dimensional settings. In this paper, we present StARS: a new stability-based method for choosing the regul...

Find SimilarView on arXiv

Variable selection with error control: Another look at Stability Selection

May 27, 2011

85% Match

Rajen D. Shah, Richard J. Samworth

Statistics Theory

Stability Selection was recently introduced by Meinshausen and Buhlmann (2010) as a very general technique designed to improve the performance of a variable selection algorithm. It is based on aggregating the results of applying a selection procedure to subsamples of the data. We introduce a variant, called Complementary Pairs Stability Selection (CPSS), and derive bounds both on the expected number of variables included by CPSS that have low selection probability under the o...

Find SimilarView on arXiv

Scalable Bayesian model averaging through local information propagation

March 10, 2014

85% Match

Li Ma

Methodology

Computation

We show that a probabilistic version of the classical forward-stepwise variable inclusion procedure can serve as a general data-augmentation scheme for model space distributions in (generalized) linear models. This latent variable representation takes the form of a Markov process, thereby allowing information propagation algorithms to be applied for sampling from model space posteriors. In particular, we propose a sequential Monte Carlo method for achieving effective unbiased...

Find SimilarView on arXiv

A Statistical View of Column Subset Selection

July 24, 2023

85% Match

Anav Sood, Trevor Hastie

Methodology

Data Structures and Algorith...

Machine Learning

We consider the problem of selecting a small subset of representative variables from a large dataset. In the computer science literature, this dimensionality reduction problem is typically formalized as Column Subset Selection (CSS). Meanwhile, the typical statistical formalization is to find an information-maximizing set of Principal Variables. This paper shows that these two approaches are equivalent, and moreover, both can be viewed as maximum likelihood estimation within ...

Find SimilarView on arXiv

Sure independence screening in generalized linear models with NP-dimensionality

March 30, 2009

85% Match

Jianqing Fan, Rui Song

Methodology

Statistics Theory

Machine Learning

Statistics Theory

Ultrahigh-dimensional variable selection plays an increasingly important role in contemporary scientific discoveries and statistical research. Among others, Fan and Lv [J. R. Stat. Soc. Ser. B Stat. Methodol. 70 (2008) 849-911] propose an independent screening framework by ranking the marginal correlations. They showed that the correlation ranking procedure possesses a sure independence screening property within the context of the linear model with Gaussian covariates and res...

Find SimilarView on arXiv

A stochastic variational framework for fitting and diagnosing generalized linear mixed models

August 24, 2012

85% Match

Linda S. L. Tan, David J. Nott

Methodology

In stochastic variational inference, the variational Bayes objective function is optimized using stochastic gradient approximation, where gradients computed on small random subsets of data are used to approximate the true gradient over the whole data set. This enables complex models to be fit to large data sets as data can be processed in mini-batches. In this article, we extend stochastic variational inference for conjugate-exponential models to nonconjugate models and prese...

Find SimilarView on arXiv

Semi-analytic approximate stability selection for correlated data in generalized linear models

Estimation in Rotationally Invariant Generalized Linear Models via Approximate Message Passing

Approximate message passing for nonconvex sparse regularization with stability and asymptotic analysis

An Ensemble EM Algorithm for Bayesian Variable Selection

Distributed variable screening for generalized linear models

Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models

Variable selection with error control: Another look at Stability Selection

Scalable Bayesian model averaging through local information propagation

A Statistical View of Column Subset Selection

Sure independence screening in generalized linear models with NP-dimensionality

A stochastic variational framework for fitting and diagnosing generalized linear mixed models