August 28, 2007
Beginning with a discussion of R. A. Fisher's early written remarks that relate to dimension reduction, this article revisits principal components as a reductive method in regression, develops several model-based extensions and ends with descriptions of general approaches to model-based and model-free dimension reduction in regression. It is argued that the role for principal components and related methodology may be broader than previously seen and that the common practice o...
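For concreteness, a minimal sketch of principal components regression, the classical reductive method the article revisits: project the predictors onto their leading principal components and regress the response on the resulting scores. The dimensions and component count below are illustrative choices, and this is the standard textbook construction rather than the article's specific methodology.

```python
# Principal components regression (PCR): reduce X to k principal
# components, then regress y on the component scores.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)
n, p, k = 200, 50, 5                      # samples, predictors, retained components
X = rng.standard_normal((n, p))
y = X[:, :3].sum(axis=1) + 0.1 * rng.standard_normal(n)

pcr = make_pipeline(PCA(n_components=k), LinearRegression())
pcr.fit(X, y)
print("in-sample R^2:", pcr.score(X, y))
```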
June 16, 2022
The Convex Gaussian Min-Max Theorem (CGMT) has emerged as a prominent theoretical tool for analyzing the precise stochastic behavior of various statistical estimators in the so-called high-dimensional proportional regime, where the sample size and the signal dimension are of the same order. However, a well-recognized limitation of the existing CGMT machinery lies in its stringent requirement of exact Gaussianity of the design matrix, thereby rendering the obtained prec...
March 22, 2015
We consider high-dimensional inference when the assumed linear model is misspecified. We describe some correct interpretations and corresponding sufficient assumptions for valid asymptotic inference of the model parameters, which still have a useful meaning when the model is misspecified. We largely focus on the de-sparsified Lasso procedure but we also indicate some implications for (multiple) sample splitting techniques. In view of available methods and software, our result...
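A minimal sketch of the de-sparsified (debiased) Lasso in its standard nodewise-regression form, for a single coordinate; the tuning parameters and the crude noise estimate below are illustrative assumptions, not the paper's recommendations.

```python
# De-sparsified Lasso for coordinate j: initial Lasso fit, nodewise Lasso
# to build a projection direction, then a one-step bias correction.
import numpy as np
from sklearn.linear_model import Lasso

def desparsified_lasso_coord(X, y, j, lam_beta=0.1, lam_node=0.1):
    n, p = X.shape
    # Step 1: initial Lasso fit of y on X (assumes centered data).
    beta = Lasso(alpha=lam_beta).fit(X, y).coef_
    # Step 2: nodewise Lasso of column j on the remaining columns.
    idx = np.arange(p) != j
    gamma = Lasso(alpha=lam_node).fit(X[:, idx], X[:, j]).coef_
    z = X[:, j] - X[:, idx] @ gamma          # residualized direction
    # Step 3: one-step bias correction of the j-th coefficient.
    resid = y - X @ beta
    bhat = beta[j] + z @ resid / (z @ X[:, j])
    # Plug-in standard error with a crude residual-based noise estimate.
    sigma = np.std(resid)
    se = sigma * np.linalg.norm(z) / abs(z @ X[:, j])
    return bhat, se
```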
August 25, 2023
In these six lectures, we examine what can be learnt about the behavior of multi-layer neural networks from the analysis of linear models. We first recall the correspondence between neural networks and linear models via the so-called lazy regime. We then review four models for linearized neural networks: linear regression with concentrated features, kernel ridge regression, the random feature model, and the neural tangent model. Finally, we highlight the limitations of the linear theo...
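As one concrete instance of the four linearized models, here is a minimal sketch of the random feature model: ridge regression on fixed random ReLU features, a linear stand-in for a two-layer network. All dimensions and the ridge penalty are arbitrary choices for illustration.

```python
# Random feature regression: fixed random first-layer weights, ridge
# regression on the resulting nonlinear features.
import numpy as np

rng = np.random.default_rng(0)
n, d, N, lam = 300, 20, 500, 1e-3             # samples, input dim, features, ridge
X = rng.standard_normal((n, d))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(n)

W = rng.standard_normal((N, d)) / np.sqrt(d)   # fixed (untrained) weights
Phi = np.maximum(X @ W.T, 0.0)                 # random ReLU features
# Ridge solution: a = (Phi^T Phi + n*lam*I)^{-1} Phi^T y
a = np.linalg.solve(Phi.T @ Phi + n * lam * np.eye(N), Phi.T @ y)
print("train MSE:", np.mean((Phi @ a - y) ** 2))
```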
June 1, 2013
Least-squares refitting is widely used in high-dimensional regression to reduce the prediction bias of $\ell_1$-penalized estimators (e.g., the Lasso and the Square-Root Lasso). We present theoretical and numerical results that provide new insights into the benefits and pitfalls of least-squares refitting. In particular, we consider both prediction and estimation, and we pay close attention to the effects of correlations in the design matrices of linear regression models, since these corre...
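A minimal sketch of the refitting scheme under discussion, assuming a fixed regularization level: fit the Lasso, record its support, then refit unpenalized least squares on the selected columns to remove the shrinkage bias on the support.

```python
# Lasso followed by least-squares refitting on the estimated support.
import numpy as np
from sklearn.linear_model import Lasso

def lasso_with_refit(X, y, lam=0.1):
    beta_lasso = Lasso(alpha=lam).fit(X, y).coef_
    support = np.flatnonzero(beta_lasso)
    beta_refit = np.zeros_like(beta_lasso)
    if support.size:
        # Unpenalized least squares restricted to the selected columns.
        beta_refit[support], *_ = np.linalg.lstsq(X[:, support], y, rcond=None)
    return beta_lasso, beta_refit
```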
December 29, 2013
We compute a \emph{sparse} solution to the classical least-squares problem $\min_x \|Ax - b\|$, where $A$ is an arbitrary matrix. We describe a novel algorithm for this sparse least-squares problem. The algorithm operates as follows: first, it selects columns from $A$, and then it solves a least-squares problem only with the selected columns. The column selection algorithm that we use is known to perform well for the well-studied column subset selection problem. The contribution ...
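A minimal sketch of the two-stage scheme, with QR column pivoting standing in for the paper's column subset selection routine (which the abstract does not specify here):

```python
# Stage 1: select k columns of A; Stage 2: least squares on those columns.
import numpy as np
from scipy.linalg import qr

def sparse_least_squares(A, b, k):
    # Column selection via QR with column pivoting (a simple stand-in).
    _, _, piv = qr(A, mode="economic", pivoting=True)
    cols = np.sort(piv[:k])
    # Ordinary least squares restricted to the chosen columns.
    x_k, *_ = np.linalg.lstsq(A[:, cols], b, rcond=None)
    x = np.zeros(A.shape[1])
    x[cols] = x_k
    return x
```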
May 2, 2013
In the high-dimensional regression model, a response variable is linearly related to $p$ covariates, but the sample size $n$ is smaller than $p$. We assume that only a small subset of covariates is `active' (i.e., the corresponding coefficients are non-zero), and consider the model-selection problem of identifying the active covariates. A popular approach is to estimate the regression coefficients through the Lasso ($\ell_1$-regularized least squares). This is known to correct...
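A minimal sketch of this approach: estimate the coefficients by Lasso and report the covariates with non-zero estimates as the active set. Cross-validated tuning is one common choice of regularization level, not necessarily the paper's.

```python
# Lasso-based model selection: the estimated active set is the set of
# covariates with non-zero Lasso coefficients.
import numpy as np
from sklearn.linear_model import LassoCV

def active_set(X, y):
    fit = LassoCV(cv=5).fit(X, y)
    return np.flatnonzero(fit.coef_)   # indices of estimated active covariates
```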
October 17, 2016
Given data $y(n)$ and $p(n)$ covariates $x(n)$, one problem in linear regression is to decide which, if any, of the covariates to include. There are many articles on this problem, but all are based on a stochastic model for the data. This paper gives what seems to be a new approach that does not require any form of model. It is conceptually and algorithmically simple, and consistency results can be proved under appropriate assumptions.
March 18, 2014
The Lasso is a computationally efficient regression regularization procedure that can produce sparse estimators when the number of predictors ($p$) is large. Oracle inequalities provide high-probability loss bounds for the Lasso estimator at a deterministic choice of the regularization parameter. These bounds tend to zero if $p$ is appropriately controlled, and are thus commonly cited as theoretical justification for the Lasso and its ability to handle high-dimensional settings. Unfor...
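For concreteness, a standard oracle inequality of the kind referred to, stated in a common textbook form (not necessarily this paper's statement): for a deterministic tuning parameter $\lambda \asymp \sigma\sqrt{\log p / n}$ and a design satisfying a compatibility condition with constant $\phi_0 > 0$, with high probability
\[
\frac{1}{n}\,\bigl\|X(\hat\beta - \beta^0)\bigr\|_2^2 \;\le\; \frac{C\, s_0\, \lambda^2}{\phi_0^2} \;\asymp\; \frac{C\, \sigma^2 s_0 \log p}{\phi_0^2\, n},
\]
where $s_0$ is the number of non-zero coefficients of $\beta^0$. Note that the bound is tied to the fixed choice of $\lambda$, which is exactly the deterministic tuning the abstract emphasizes.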
September 28, 2017
We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective. We devise a new cutting-plane method and provide evidence that it can solve the sparse regression problem to provable optimality, in seconds, for sample sizes n and numbers of regressors p in the 100,000s, that is, two orders of magnitude beyond the current state of the art. The ability to solve the problem for very high dimensions allows us to observe...
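A minimal sketch of the convex inner function that such binary reformulations of ridge-regularized sparse regression rest on, in the style of cutting-plane/outer-approximation methods; the regularization parameter gamma and the surrounding mixed-integer solver are assumptions here, and this is not the paper's implementation.

```python
# Inner convex function c(z) = 0.5 * y^T (I + gamma * X_z X_z^T)^{-1} y
# over binary support indicators z, with its subgradient.
import numpy as np

def inner_value_and_grad(z, X, y, gamma):
    n = X.shape[0]
    Xz = X[:, z.astype(bool)]
    alpha = np.linalg.solve(np.eye(n) + gamma * (Xz @ Xz.T), y)
    c = 0.5 * y @ alpha
    # Subgradient: dc/dz_j = -0.5 * gamma * (x_j^T alpha)^2
    grad = -0.5 * gamma * (X.T @ alpha) ** 2
    return c, grad

# A cutting-plane method would iterate: solve a small mixed-integer LP over z
# with the linear cuts  eta >= c(z_t) + grad_t^T (z - z_t)  and  sum(z) <= k,
# re-evaluate c at the new incumbent, and add cuts until convergence.
```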