February 7, 2006
Technological innovations have revolutionized the process of scientific research and knowledge discovery. The availability of massive data, together with challenges arising at the frontiers of research and development, has reshaped statistical thinking, data analysis and theoretical studies. The challenges of high-dimensionality arise in diverse fields of science and the humanities, ranging from computational biology and health studies to financial engineering and risk management. In all of thes...
May 7, 2016
In this paper, we derive non-asymptotic error bounds for the Lasso estimator when the penalty parameter for the estimator is chosen using $K$-fold cross-validation. Our bounds imply that the cross-validated Lasso estimator has nearly optimal rates of convergence in the prediction, $L^2$, and $L^1$ norms. For example, we show that in the model with Gaussian noise and under fairly general assumptions on the candidate set of values of the penalty parameter, the estimation er...
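As a concrete illustration of the tuning procedure analyzed here, the following is a minimal sketch of $K$-fold cross-validated penalty selection for the Lasso; the grid, the data-generating process, and the use of scikit-learn are illustrative assumptions, not the paper's setup.

    import numpy as np
    from sklearn.linear_model import Lasso
    from sklearn.model_selection import KFold

    rng = np.random.default_rng(0)

    # Illustrative sparse design: n = 100 observations, p = 200 predictors.
    n, p, s = 100, 200, 5
    X = rng.standard_normal((n, p))
    beta = np.zeros(p)
    beta[:s] = 1.0
    y = X @ beta + rng.standard_normal(n)  # Gaussian noise, as in the paper's leading example

    # Candidate set of penalty values (a geometric grid is a common choice).
    lambdas = np.geomspace(1e-3, 1.0, 30)

    kf = KFold(n_splits=5, shuffle=True, random_state=0)
    cv_error = np.zeros(len(lambdas))
    for train, test in kf.split(X):
        for j, lam in enumerate(lambdas):
            fit = Lasso(alpha=lam).fit(X[train], y[train])
            cv_error[j] += np.mean((y[test] - fit.predict(X[test])) ** 2)

    lam_cv = lambdas[np.argmin(cv_error)]          # cross-validated penalty
    beta_cv = Lasso(alpha=lam_cv).fit(X, y).coef_  # cross-validated Lasso estimator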
September 30, 2013
It has been over 200 years since Gauss's and Legendre's famous priority dispute over who discovered the method of least squares. Nevertheless, we argue that the normal equations are still relevant in many facets of modern statistics, particularly in the domain of high-dimensional inference. Even today, we are still learning new things about the law of large numbers, first described in Bernoulli's Ars Conjectandi 300 years ago, as it applies to high-dimensional inference. The ot...
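For reference, the normal equations mentioned above are the first-order conditions of least squares:

$$ \hat{\beta} \in \operatorname*{arg\,min}_{\beta \in \mathbb{R}^p} \|y - X\beta\|_2^2 \quad \Longleftrightarrow \quad X^\top X \hat{\beta} = X^\top y. $$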
September 26, 2011
We review recent results for high-dimensional sparse linear regression in the practical case of unknown variance. Different sparsity settings are covered, including coordinate-sparsity, group-sparsity and variation-sparsity. The emphasis is put on non-asymptotic analyses and feasible procedures. In addition, a small numerical study compares the practical performance of three schemes for tuning the Lasso estimator, and references are collected for more general models,...
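One pivotal tuning scheme often studied in the unknown-variance setting is the square-root Lasso, whose criterion is free of the noise level $\sigma$; whether it is among the three schemes compared here is not stated in the excerpt:

$$ \hat{\beta} \in \operatorname*{arg\,min}_{\beta \in \mathbb{R}^p} \frac{\|y - X\beta\|_2}{\sqrt{n}} + \lambda \|\beta\|_1. $$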
May 20, 2014
This tutorial provides an exposition of a flexible geometric framework for high-dimensional estimation problems with constraints. The tutorial develops geometric intuition about high-dimensional sets, justifies it with some results of asymptotic convex geometry, and demonstrates connections between geometric results and estimation problems. The theory is illustrated with applications to sparse recovery, matrix completion, quantization, linear and logistic regression and gener...
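A representative result in this framework, stated loosely with constants and noise-level factors suppressed, bounds the error of a constrained least-squares estimator by the Gaussian mean width $w(K)$ of the constraint set $K$:

$$ \hat{x} \in \operatorname*{arg\,min}_{x' \in K} \|y - A x'\|_2, \qquad \mathbb{E}\,\|\hat{x} - x\|_2 \lesssim \frac{w(K)}{\sqrt{n}}, \qquad w(K) = \mathbb{E} \sup_{u \in K - K} \langle g, u \rangle, \; g \sim N(0, I_p). $$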
November 28, 2022
The high-dimensional linear regression model is one of the most widely used statistical models for high-dimensional data, but obtaining a sparse set of regression coefficients remains a challenging task. In this paper, we propose a simple heuristic algorithm for constructing sparse high-dimensional linear regression models, which is adapted from the shortest solution-guided decimation algorithm and is referred to as ASSD. This algorithm constructs the support of regression coefficients unde...
January 16, 2019
This article introduces lassopack, a suite of programs for regularized regression in Stata. lassopack implements lasso, square-root lasso, elastic net, ridge regression, adaptive lasso and post-estimation OLS. The methods are suitable for the high-dimensional setting where the number of predictors $p$ may be large and possibly greater than the number of observations, $n$. We offer three different approaches for selecting the penalization ('tuning') parameters: information cri...
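lassopack itself is a Stata package, so no Python interface is implied; purely as an analogue of its information-criteria approach to tuning, scikit-learn's LassoLarsIC selects the Lasso penalty by AIC or BIC (the data below are illustrative):

    import numpy as np
    from sklearn.linear_model import LassoLarsIC

    rng = np.random.default_rng(1)
    n, p = 200, 50                      # n > p here so the noise variance is estimable
    X = rng.standard_normal((n, p))
    y = X[:, :3] @ np.array([2.0, -1.5, 1.0]) + rng.standard_normal(n)

    # BIC-based selection of the penalty level, one information criterion
    # from the family that lassopack also supports.
    fit = LassoLarsIC(criterion="bic").fit(X, y)
    print(fit.alpha_, np.flatnonzero(fit.coef_))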
December 14, 2022
This paper examines LASSO, a widely used $L_{1}$-penalized regression method, in high-dimensional linear predictive regressions, particularly when the number of potential predictors exceeds the sample size and numerous unit root regressors are present. The consistency of LASSO is contingent upon two key components: the deviation bound of the cross product of the regressors and the error term, and the restricted eigenvalue of the Gram matrix. We present new probabilistic bound...
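In standard notation (generic, not necessarily the paper's, which must also accommodate unit root regressors), the two components take the following form: the deviation bound requires $\left\| n^{-1} X^\top \varepsilon \right\|_\infty \le \lambda/2$ with high probability, while the restricted eigenvalue condition requires

$$ \kappa(s, c) \;=\; \min_{|S| \le s} \; \min_{\substack{\delta \ne 0 \\ \|\delta_{S^c}\|_1 \le c\,\|\delta_S\|_1}} \frac{\|X\delta\|_2}{\sqrt{n}\,\|\delta_S\|_2} \;>\; 0. $$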
October 1, 2014
We introduce a novel scheme for choosing the regularization parameter in high-dimensional linear regression with Lasso. This scheme, inspired by Lepski's method for bandwidth selection in non-parametric regression, is equipped with both optimal finite-sample guarantees and a fast algorithm. In particular, for any design matrix such that the Lasso has low sup-norm error under an "oracle choice" of the regularization parameter, we show that our method matches the oracle perform...
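The authors' exact procedure is not reproduced in the excerpt; the sketch below shows a generic Lepski-type rule over a penalty grid, where the constant C and the sup-norm radius rule are placeholder assumptions, not the paper's choices:

    import numpy as np
    from sklearn.linear_model import Lasso

    def lepski_lasso(X, y, lambdas, C=1.0):
        """Generic Lepski-type rule: fit the Lasso on a decreasing grid of
        penalties and keep moving to smaller penalties as long as the new
        estimate stays within a sup-norm band of every earlier estimate."""
        lambdas = np.sort(np.asarray(lambdas))[::-1]  # largest (safest) first
        fits = [Lasso(alpha=lam).fit(X, y).coef_ for lam in lambdas]
        chosen = 0
        for j in range(1, len(lambdas)):
            # Placeholder radius: proportional to the sum of penalty levels.
            if all(np.max(np.abs(fits[j] - fits[k])) <= C * (lambdas[j] + lambdas[k])
                   for k in range(j)):
                chosen = j
            else:
                break
        return lambdas[chosen], fits[chosen]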
October 13, 2017
We propose a rescaled LASSO, obtained by premultiplying the LASSO with a matrix term, namely the linear unified LASSO (LLASSO), for multicollinear situations. Our numerical study shows that the LLASSO is comparable to other sparse modeling techniques and often outperforms the LASSO and elastic net. Our findings suggest new possibilities for continuing to use the LASSO for sparse modeling and variable selection. We conclude our study by pointing out that the LLASSO can be solved by the same efficient a...
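The excerpt does not define the premultiplying matrix. Purely as an illustration, and guessing from the "linear unified" (Liu-type) terminology, such a rescaling might look like the sketch below; the matrix M and parameter d are hypothetical, not the authors' definitions:

    import numpy as np
    from sklearn.linear_model import Lasso

    rng = np.random.default_rng(2)
    n, p = 100, 20
    X = rng.standard_normal((n, p))
    X[:, 1] = X[:, 0] + 0.05 * rng.standard_normal(n)  # induce multicollinearity
    y = X @ np.r_[1.5, -1.5, np.zeros(p - 2)] + rng.standard_normal(n)

    beta_lasso = Lasso(alpha=0.1).fit(X, y).coef_

    # Hypothetical Liu-type premultiplier with shrinkage parameter d in [0, 1];
    # the actual LLASSO matrix term is defined in the paper, not here.
    d = 0.5
    G = X.T @ X
    M = np.linalg.solve(G + np.eye(p), G + d * np.eye(p))
    beta_llasso = M @ beta_lasso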