Sparse High-Dimensional Regression: Exac...

Slowly Varying Regression under Sparsity

February 22, 2021

90% Match

Dimitris Bertsimas, Vassilis Jr Digalakis, ... , Lami Omar Skali

Machine Learning

Optimization and Control

Computation

Machine Learning

We present the framework of slowly varying regression under sparsity, allowing sparse regression models to exhibit slow and sparse variations. The problem of parameter estimation is formulated as a mixed-integer optimization problem. We demonstrate that it can be precisely reformulated as a binary convex optimization problem through a novel relaxation technique. This relaxation involves a new equality on Moore-Penrose inverses, convexifying the non-convex objective function w...

Square-Root Lasso: Pivotal Recovery of Sparse Signals via Conic Programming

September 28, 2010

89% Match

Alexandre Belloni, Victor Chernozhukov, Lie Wang

Methodology

Statistics Theory

We propose a pivotal method for estimating high-dimensional sparse linear regression models, where the overall number of regressors $p$ is large, possibly much larger than $n$, but only $s$ regressors are significant. The method is a modification of the lasso, called the square-root lasso. The method is pivotal in that it neither relies on knowledge of the standard deviation $\sigma$ nor needs to pre-estimate $\sigma$. Moreover, the method does not rely on norma...

Scalable Algorithms for the Sparse Ridge Regression

June 11, 2018

89% Match

Weijun Xie, Xinwei Deng

Computation

Optimization and Control

Sparse regression and variable selection for large-scale data have been rapidly developed in the past decades. This work focuses on sparse ridge regression, which enforces the sparsity by use of the L0 norm. We first prove that the continuous relaxation of the mixed integer second order conic (MISOC) reformulation using perspective formulation is equivalent to that of the convex integer formulation proposed in recent work. We also show that the convex hull of the constraint s...

High dimensional thresholded regression and shrinkage effect

May 11, 2016

89% Match

Zemin Zheng, Yingying Fan, Jinchi Lv

Methodology

Machine Learning

High-dimensional sparse modeling via regularization provides a powerful tool for analyzing large-scale data sets and obtaining meaningful, interpretable models. The use of nonconvex penalty functions shows advantage in selecting important features in high dimensions, but the global optimality of such methods still demands more understanding. In this paper, we consider sparse regression with hard-thresholding penalty, which we show to give rise to thresholded regression. This ...

Inference in High-dimensional Linear Regression

June 22, 2021

89% Match

Heather S. Battey, Nancy Reid

Methodology

Statistics Theory

This paper develops an approach to inference in a linear regression model when the number of potential explanatory variables is larger than the sample size. The approach treats each regression coefficient in turn as the interest parameter, the remaining coefficients being nuisance parameters, and seeks an optimal interest-respecting transformation, inducing sparsity on the relevant blocks of the notional Fisher information matrix. The induced sparsity is exploited through a m...

Simultaneous support recovery in high dimensions: Benefits and perils of block $\ell_1/\ell_\infty$-regularization

May 5, 2009

89% Match

S. Negahban, M. J. Wainwright

Statistics Theory

Information Theory

Statistics Theory

Consider the use of $\ell_{1}/\ell_{\infty}$-regularized regression for joint estimation of a $p \times r$ matrix of regression coefficients. We analyze the high-dimensional scaling of $\ell_1/\ell_\infty$-regularized quadratic programming, considering both consistency in $\ell_\infty$-norm, and variable selection. We begin by establishing bounds on the $\ell_\infty$-error as well as sufficient conditions for exact variable selection for fixed and random designs. Our s...

A Survey of Tuning Parameter Selection for High-dimensional Regression

August 10, 2019

89% Match

Yunan Wu, Lan Wang

Methodology

Machine Learning

Penalized (or regularized) regression, as represented by Lasso and its variants, has become a standard technique for analyzing high-dimensional data when the number of variables substantially exceeds the sample size. The performance of penalized regression relies crucially on the choice of the tuning parameter, which determines the amount of regularization and hence the sparsity level of the fitted model. The optimal choice of tuning parameter depends on both the structure of...

Rank-one Convexification for Sparse Regression

January 29, 2019

89% Match

Alper Atamturk, Andres Gomez

Machine Learning

Optimization and Control

Methodology

Sparse regression models are increasingly prevalent due to their ease of interpretability and superior out-of-sample performance. However, the exact model of sparse regression with an $\ell_0$ constraint restricting the support of the estimators is a challenging (NP-hard) non-convex optimization problem. In this paper, we derive new strong convex relaxations for sparse regression. These relaxations are based on the ideal (convex-hull) formulations for rank-one quadratic term...

On the Sensitivity of the Lasso to the Number of Predictor Variables

March 18, 2014

89% Match

Cheryl J. Flynn, Clifford M. Hurvich, Jeffrey S. Simonoff

Machine Learning

The Lasso is a computationally efficient regression regularization procedure that can produce sparse estimators when the number of predictors (p) is large. Oracle inequalities provide probability loss bounds for the Lasso estimator at a deterministic choice of the regularization parameter. These bounds tend to zero if p is appropriately controlled, and are thus commonly cited as theoretical justification for the Lasso and its ability to handle high-dimensional settings. Unfor...

Joint variable and rank selection for parsimonious estimation of high-dimensional matrices

October 17, 2011

89% Match

Florentina Bunea, Yiyuan She, Marten H. Wegkamp

Statistics Theory

Methodology

Machine Learning

Statistics Theory

We propose dimension reduction methods for sparse, high-dimensional multivariate response regression models. Both the number of responses and that of the predictors may exceed the sample size. Sometimes viewed as complementary, predictor selection and rank reduction are the most popular strategies for obtaining lower-dimensional approximations of the parameter matrix in such models. We show in this article that important gains in prediction accuracy can be obtained by conside...
