Similar papers
January 17, 2013
We consider linear regression in the high-dimensional regime where the number of observations $n$ is smaller than the number of parameters $p$. A very successful approach in this setting uses $\ell_1$-penalized least squares (a.k.a. the Lasso) to search for a subset of $s_0 < n$ parameters that best explain the data, while setting the other parameters to zero. A considerable amount of work has been devoted to characterizing the estimation and model selection problems within this...
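A minimal sketch of this setup, assuming a synthetic Gaussian design with $n < p$ and scikit-learn's Lasso; the dimensions and penalty level below are illustrative, not the paper's:

    # Sketch: l1-penalized least squares in the n < p regime.
    import numpy as np
    from sklearn.linear_model import Lasso

    rng = np.random.default_rng(0)
    n, p, s0 = 100, 500, 10            # fewer observations than parameters
    X = rng.standard_normal((n, p))
    beta = np.zeros(p)
    beta[:s0] = 1.0                    # s0-sparse ground truth
    y = X @ beta + 0.5 * rng.standard_normal(n)

    lasso = Lasso(alpha=0.1).fit(X, y)       # alpha weights the l1 penalty
    support = np.flatnonzero(lasso.coef_)    # selected subset of parameters
    print(len(support), "coefficients estimated as nonzero")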
January 2, 2013
Traditional statistical inference considers relatively small data sets and the corresponding theoretical analysis focuses on the asymptotic behavior of a statistical estimator when the number of samples approaches infinity. However, many data sets encountered in modern applications have dimensionality significantly larger than the number of training data available, and for such problems the classical statistical tools become inadequate. In order to analyze high-dimensional da...
November 14, 2017
We consider a sparse high dimensional regression model where the goal is to recover a $k$-sparse unknown vector $\beta^*$ from $n$ noisy linear observations of the form $Y=X\beta^*+W \in \mathbb{R}^n$ where $X \in \mathbb{R}^{n \times p}$ has iid $N(0,1)$ entries and $W \in \mathbb{R}^n$ has iid $N(0,\sigma^2)$ entries. Under certain assumptions on the parameters, an intriguing asymptotic gap appears between the minimum value of $n$, call it $n^*$, for which the recovery is ...
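The observation model in this abstract is straightforward to simulate; in the sketch below, the values of $n$, $p$, $k$, and $\sigma$ are illustrative choices, not the paper's thresholds:

    # Simulating Y = X beta* + W with iid N(0,1) design and N(0, sigma^2) noise.
    import numpy as np

    rng = np.random.default_rng(1)
    n, p, k, sigma = 200, 1000, 5, 1.0
    X = rng.standard_normal((n, p))                          # X_ij ~ N(0, 1), iid
    beta_star = np.zeros(p)
    beta_star[rng.choice(p, size=k, replace=False)] = 2.0    # k-sparse signal
    W = sigma * rng.standard_normal(n)                       # W_i ~ N(0, sigma^2), iid
    Y = X @ beta_star + W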
April 23, 2015
This article deals with random projections applied as a data reduction technique for Bayesian regression analysis. We show sufficient conditions under which the entire $d$-dimensional distribution is approximately preserved under random projections by reducing the number of data points from $n$ to $k\in O(\operatorname{poly}(d/\varepsilon))$ in the case $n\gg d$. Under mild assumptions, we prove that evaluating a Gaussian likelihood function based on the projected data instea...
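A sketch of the data-reduction step described here, compressing $n$ data points to $k$ rows before fitting; the Gaussian projection matrix and the target size $k$ are assumptions for illustration, not the paper's construction:

    # Sketch: reduce n data points to k via a random projection, then regress.
    import numpy as np

    rng = np.random.default_rng(2)
    n, d, k = 10000, 20, 400
    X = rng.standard_normal((n, d))
    y = X @ rng.standard_normal(d) + rng.standard_normal(n)

    S = rng.standard_normal((k, n)) / np.sqrt(k)     # k x n projection matrix
    X_k, y_k = S @ X, S @ y                          # k surrogate data points
    beta_hat = np.linalg.lstsq(X_k, y_k, rcond=None)[0]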
August 27, 2020
The question of fast convergence in the classical problem of high-dimensional linear regression has been extensively studied. Arguably, one of the fastest procedures in practice is Iterative Hard Thresholding (IHT). Still, IHT relies strongly on knowledge of the true sparsity parameter $s$. In this paper, we present a novel fast procedure for estimation in high-dimensional linear regression. Taking advantage of the interplay between estimation, support recovery and op...
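For context, a minimal sketch of the standard IHT iteration this paper builds on, which indeed takes the sparsity level $s$ as an input; the step size and iteration count are illustrative:

    # Sketch of Iterative Hard Thresholding: gradient step, then keep the
    # s largest-magnitude coordinates. Note that s must be supplied, which
    # is the dependence the paper aims to remove.
    import numpy as np

    def iht(X, y, s, n_iter=200):
        step = 1.0 / np.linalg.norm(X, 2) ** 2       # 1 / squared spectral norm
        beta = np.zeros(X.shape[1])
        for _ in range(n_iter):
            grad_step = beta + step * X.T @ (y - X @ beta)
            keep = np.argpartition(np.abs(grad_step), -s)[-s:]
            beta = np.zeros_like(grad_step)
            beta[keep] = grad_step[keep]             # hard threshold to s entries
        return beta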
December 20, 2016
Methods of high-dimensional probability play a central role in applications in statistics, signal processing, theoretical computer science, and related fields. These lectures present a sample of particularly useful tools of high-dimensional probability, focusing on the classical and matrix Bernstein inequalities and the uniform matrix deviation inequality. We illustrate these tools with applications to dimension reduction, network analysis, covariance estimation, matrix compl...
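For reference, one standard statement of the matrix Bernstein inequality mentioned here (in Tropp's formulation; the constants used in the lectures may differ): if $X_1,\dots,X_N$ are independent, mean-zero, $d \times d$ self-adjoint random matrices with $\|X_i\| \le L$ almost surely, then for all $t \ge 0$,

$$ \mathbb{P}\Big\{\Big\|\sum_{i=1}^N X_i\Big\| \ge t\Big\} \le 2d\,\exp\Big(-\frac{t^2/2}{\sigma^2 + Lt/3}\Big), \qquad \sigma^2 = \Big\|\sum_{i=1}^N \mathbb{E}\,X_i^2\Big\|. $$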
July 9, 2018
Predicting a new response from a covariate is a challenging task in regression, one that raises new questions in the era of high-dimensional data. In this paper, we are interested in the inverse regression method from a theoretical viewpoint. Theoretical results have already been derived for the well-known linear model, but recently the curse of dimensionality has increased the interest of practitioners and theoreticians in generalizing those results to various estimator...
June 5, 2018
This chapter presents key concepts and theoretical results for analyzing estimation and inference in high-dimensional models. High-dimensional models are characterized by having a number of unknown parameters that is not vanishingly small relative to the sample size. We first present results in a framework where estimators of parameters of interest may be represented directly as approximate means. Within this context, we review fundamental results including high-dimensional c...
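Schematically, the "approximate means" representation referred to here writes a rescaled estimator as a sample average of mean-zero terms,

$$ \sqrt{n}\,(\hat\theta - \theta_0) \approx \frac{1}{\sqrt{n}} \sum_{i=1}^n \psi(W_i), \qquad \mathbb{E}[\psi(W_i)] = 0, $$

to which high-dimensional central limit theorems can then be applied; this is a generic sketch of the framework, not the chapter's exact notation.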
March 10, 2016
This paper considers point and interval estimation of the $\ell_q$ loss of an estimator in high-dimensional linear regression with random design. We establish the minimax rate for estimating the $\ell_{q}$ loss and the minimax expected length of confidence intervals for the $\ell_{q}$ loss of rate-optimal estimators of the regression vector, including commonly used estimators such as Lasso, scaled Lasso, square-root Lasso and Dantzig Selector. Adaptivity of the confidence int...
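For concreteness, the $\ell_q$ loss of an estimator $\hat\beta$ of the regression vector $\beta$ is

$$ \|\hat\beta - \beta\|_q = \Big(\sum_{j=1}^p |\hat\beta_j - \beta_j|^q\Big)^{1/q}, $$

a data-dependent (random) quantity, which is why point estimates and confidence intervals for it are nontrivial to construct.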
May 16, 2017
A great deal of interest has recently focused on conducting inference on the parameters in a high-dimensional linear model. In this paper, we consider a simple and very naïve two-step procedure for this task, in which we (i) fit a lasso model in order to obtain a subset of the variables, and (ii) fit a least squares model on the lasso-selected set. Conventional statistical wisdom tells us that we cannot make use of the standard statistical inference tools for the result...
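A minimal sketch of this two-step procedure on synthetic data with an arbitrary penalty level; the textbook standard errors in step (ii) ignore the selection in step (i), which is the concern raised above:

    # Sketch: (i) lasso selection, (ii) OLS inference on the selected set.
    import numpy as np
    import statsmodels.api as sm
    from sklearn.linear_model import Lasso

    rng = np.random.default_rng(3)
    n, p = 200, 500
    X = rng.standard_normal((n, p))
    beta = np.zeros(p)
    beta[:5] = 1.0
    y = X @ beta + rng.standard_normal(n)

    selected = np.flatnonzero(Lasso(alpha=0.1).fit(X, y).coef_)   # step (i)
    ols = sm.OLS(y, sm.add_constant(X[:, selected])).fit()        # step (ii)
    print(ols.summary())    # standard inference, ignoring the selection step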