December 22, 2023
Similar papers 2
January 6, 2020
Permutation testing in linear models, where the number of nuisance coefficients is smaller than the sample size, is a well-studied topic. The common approach in such tests is to permute residuals after regressing on the nuisance covariates. Permutation-based tests are valuable in particular because they can be highly robust to violations of the standard linear model, such as non-normality and heteroscedasticity. Moreover, in some cases they can be combined with existing, powe...
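The permute-residuals idea described above can be sketched as follows. This is a minimal illustration of a Freedman-Lane-style residual permutation test, not the procedure from the paper itself; the data, the nuisance design, and the choice of statistic are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
Z = rng.normal(size=(n, 3))            # nuisance covariates
x = rng.normal(size=n)                 # covariate of interest
y = Z @ np.array([1.0, -0.5, 0.3]) + rng.normal(size=n)   # x truly has no effect

def fit_and_residuals(A, y):
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    fitted = A @ beta
    return fitted, y - fitted

# Step 1: regress the response on the nuisance covariates only.
fitted_z, resid_z = fit_and_residuals(Z, y)

# Step 2: the test statistic is |coefficient of x| in the full regression.
X_full = np.column_stack([Z, x])
def stat(y_):
    beta, *_ = np.linalg.lstsq(X_full, y_, rcond=None)
    return abs(beta[-1])

# Step 3: permute the nuisance-only residuals to build the null distribution.
observed = stat(y)
null = np.array([stat(fitted_z + rng.permutation(resid_z))
                 for _ in range(999)])
p_value = (1 + np.sum(null >= observed)) / (1 + len(null))
```

Because only the residuals from the nuisance-only fit are shuffled, the permutation distribution respects the nuisance structure without assuming normal errors.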
April 26, 2017
Hypothesis testing in the linear regression model is a fundamental statistical problem. We consider linear regression in the high-dimensional regime where the number of parameters exceeds the number of samples ($p> n$). In order to make informative inference, we assume that the model is approximately sparse, that is, the effect of covariates on the response can be well approximated by conditioning on a relatively small number of covariates whose identities are unknown. We deve...
January 12, 2014
In recent years, there has been considerable theoretical development regarding variable selection consistency of penalized regression techniques, such as the lasso. However, there has been relatively little work on quantifying the uncertainty in these selection procedures. In this paper, we propose a new method for inference in high dimensions using a score test based on penalized regression. In this test, we perform penalized regression of an outcome on all but a single feat...
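The score-test construction above, regress the outcome on all but one feature, then check how correlated the held-out feature is with the residual, can be sketched as follows. This is only an illustration of the general shape of such a test: it substitutes closed-form ridge for the lasso-type penalized regression the paper uses, and the normalization of the score is simplified:

```python
import numpy as np

rng = np.random.default_rng(1)
n, p = 100, 150                        # more features than samples
X = rng.normal(size=(n, p))
y = 2.0 * X[:, 0] + rng.normal(size=n)

def ridge(A, y, lam):
    # closed-form ridge solution (A'A + lam*I)^{-1} A'y
    return np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T @ y)

j = 0                                  # feature whose coefficient we test
X_rest = np.delete(X, j, axis=1)       # penalized regression on all other features
resid = y - X_rest @ ridge(X_rest, y, lam=10.0)

# Score statistic: (normalized) inner product of the held-out feature
# with the residual; large values suggest the feature belongs in the model.
score = (X[:, j] @ resid) / np.sqrt((X[:, j] @ X[:, j]) * np.var(resid))
```

The key point is that the nuisance features are handled by a penalized fit, so the construction remains well defined when $p > n$.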
January 22, 2023
We propose a test of many zero parameter restrictions in a high dimensional linear iid regression model with $k \gg n$ regressors. The test statistic is formed by estimating key parameters one at a time based on many low-dimensional regression models with nuisance terms. The parsimoniously parametrized models identify whether the original parameter of interest is or is not zero. Estimating fixed low-dimensional sub-parameters ensures greater estimator accuracy; it does not req...
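The one-at-a-time strategy can be sketched as follows: fit many simple low-dimensional regressions, one per tested coefficient, and aggregate the resulting statistics. This is a bare-bones illustration (simple regressions with a max-statistic aggregation), not the paper's parsimoniously parametrized models or its critical values:

```python
import numpy as np

rng = np.random.default_rng(6)
n, k = 80, 300                       # many more regressors than observations
X = rng.normal(size=(n, k))
y = rng.normal(size=n)               # H0 holds: all tested coefficients are zero

def slope_t(xj, y):
    # t-statistic for the slope in a simple regression of y on xj
    xc, yc = xj - xj.mean(), y - y.mean()
    b = (xc @ yc) / (xc @ xc)
    resid = yc - b * xc
    se = np.sqrt(resid @ resid / (n - 2) / (xc @ xc))
    return b / se

# One low-dimensional regression per parameter; each fit is cheap and
# well-posed even though the full model has k >> n regressors.
tstats = np.array([slope_t(X[:, j], y) for j in range(k)])
max_t = np.max(np.abs(tstats))       # aggregate, e.g. via a max statistic
```

Each sub-model is a fixed low-dimensional problem, so no sparse estimation of the full k-vector is needed.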
October 31, 2022
We propose a new testing procedure for heteroskedasticity in high-dimensional linear regression, where the number of covariates can be larger than the sample size. Our testing procedure is based on residuals of the Lasso. We demonstrate that our test statistic has asymptotic normality under the null hypothesis of homoskedasticity. Simulation results show that the proposed testing procedure obtains accurate empirical sizes and powers. We also present results of real economic da...
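The general recipe of testing for heteroskedasticity from Lasso residuals can be sketched as follows. This uses a plain ISTA solver for the Lasso and a Breusch-Pagan-style auxiliary regression of the squared residuals on the fitted values; it is an assumed stand-in, not the paper's asymptotically normal statistic:

```python
import numpy as np

def lasso_ista(X, y, lam, steps=500):
    # proximal gradient (ISTA) for (1/2n)||y - Xb||^2 + lam*||b||_1
    n = len(y)
    beta = np.zeros(X.shape[1])
    step = 1.0 / (np.linalg.norm(X, 2) ** 2 / n)
    for _ in range(steps):
        z = beta - step * (X.T @ (X @ beta - y) / n)
        beta = np.sign(z) * np.maximum(np.abs(z) - lam * step, 0.0)
    return beta

rng = np.random.default_rng(4)
n, p = 150, 250                        # p > n
X = rng.normal(size=(n, p))
y = X[:, 0] + rng.normal(size=n)       # homoskedastic errors: the null holds

e = y - X @ lasso_ista(X, y, lam=0.15)     # Lasso residuals
f = y - e                                   # fitted values

# Auxiliary regression of e^2 on the fitted values; n*R^2 is the usual
# Breusch-Pagan-style check for variance that changes with the mean.
A = np.column_stack([np.ones(n), f])
g, *_ = np.linalg.lstsq(A, e**2, rcond=None)
r2 = 1 - np.sum((e**2 - A @ g) ** 2) / np.sum((e**2 - e.mean()**0 * np.mean(e**2)) ** 2)
stat = n * r2
```

Under homoskedasticity the squared residuals should be unrelated to the fitted values, so the statistic stays small.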
January 25, 2023
This paper presents a selective survey of recent developments in statistical inference and multiple testing for high-dimensional regression models, including linear and logistic regression. We examine the construction of confidence intervals and hypothesis tests for various low-dimensional objectives such as regression coefficients and linear and quadratic functionals. The key technique is to generate debiased and desparsified estimators for the targeted low-dimensional objec...
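The debiasing step that the survey centers on has a compact generic form: take a Lasso fit and add back a correction term built from the residuals, $\hat b^{\,d} = \hat b + M X^\top (y - X\hat b)/n$, where $M$ approximates the inverse of the sample covariance. The sketch below uses a crude diagonal $M$ purely for illustration (real constructions solve a nodewise regression for each row of $M$):

```python
import numpy as np

def lasso_ista(X, y, lam, steps=500):
    # proximal gradient (ISTA) for (1/2n)||y - Xb||^2 + lam*||b||_1
    n = len(y)
    beta = np.zeros(X.shape[1])
    step = 1.0 / (np.linalg.norm(X, 2) ** 2 / n)
    for _ in range(steps):
        z = beta - step * (X.T @ (X @ beta - y) / n)
        beta = np.sign(z) * np.maximum(np.abs(z) - lam * step, 0.0)
    return beta

rng = np.random.default_rng(2)
n, p = 120, 200
X = rng.normal(size=(n, p))
beta_true = np.zeros(p)
beta_true[:3] = [3.0, -2.0, 1.5]
y = X @ beta_true + rng.normal(size=n)

b = lasso_ista(X, y, lam=0.2)          # biased, sparse initial estimator
Sigma = X.T @ X / n
M = np.diag(1.0 / np.diag(Sigma))      # crude diagonal approximate inverse (assumption)
b_deb = b + M @ X.T @ (y - X @ b) / n  # debiased / desparsified estimator
```

The corrected estimator is no longer sparse, but each coordinate is approximately normal, which is what makes coordinate-wise confidence intervals and p-values possible.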
May 2, 2017
This article develops a framework for testing general hypotheses in high-dimensional models where the number of variables may far exceed the number of observations. Existing literature has considered fewer than a handful of hypotheses, such as testing individual coordinates of the model parameter. However, the problem of testing general and complex hypotheses remains widely open. We propose a new inference method developed around the hypothesis adaptive projection pursuit fram...
October 12, 2011
The purpose of this paper is to propose methodologies for statistical inference of low-dimensional parameters with high-dimensional data. We focus on constructing confidence intervals for individual coefficients and linear combinations of several of them in a linear regression model, although our ideas are applicable in a much broader context. The theoretical results presented here provide sufficient conditions for the asymptotic normality of the proposed estimators along with ...
August 2, 2018
Penalized likelihood approaches are widely used for high-dimensional regression. Although many methods have been proposed and the associated theory is now well-developed, the relative efficacy of different approaches in finite-sample settings, as encountered in practice, remains incompletely understood. There is therefore a need for empirical investigations in this area that can offer practical insight and guidance to users. In this paper we present a large-scale comparison o...
February 4, 2016
High-dimensional predictive models, those with more measurements than observations, require regularization to be well defined, perform well empirically, and possess theoretical guarantees. The amount of regularization, often determined by tuning parameters, is integral to achieving good performance. One can choose the tuning parameter in a variety of ways, such as through resampling methods or generalized information criteria. However, the theory supporting many regularized p...
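Choosing the tuning parameter by a generalized information criterion, one of the routes mentioned above, can be sketched as follows: score each candidate penalty level by a BIC-type criterion and keep the minimizer. The ISTA Lasso solver, the grid, and the use of the support size as degrees of freedom are all illustrative choices, not a prescription from the paper:

```python
import numpy as np

def lasso_ista(X, y, lam, steps=400):
    # proximal gradient (ISTA) for (1/2n)||y - Xb||^2 + lam*||b||_1
    n = len(y)
    beta = np.zeros(X.shape[1])
    step = 1.0 / (np.linalg.norm(X, 2) ** 2 / n)
    for _ in range(steps):
        z = beta - step * (X.T @ (X @ beta - y) / n)
        beta = np.sign(z) * np.maximum(np.abs(z) - lam * step, 0.0)
    return beta

rng = np.random.default_rng(5)
n, p = 100, 180
X = rng.normal(size=(n, p))
y = X[:, :2] @ np.array([2.0, -1.5]) + rng.normal(size=n)

def bic(lam):
    b = lasso_ista(X, y, lam)
    rss = np.sum((y - X @ b) ** 2)
    df = np.count_nonzero(b)           # degrees of freedom = support size
    return n * np.log(rss / n) + df * np.log(n)

grid = np.geomspace(0.02, 1.0, 15)     # candidate penalty levels
best_lam = min(grid, key=bic)          # BIC-selected tuning parameter
```

Resampling alternatives such as cross-validation score each candidate by held-out error instead of a penalized fit criterion; the selection loop has the same shape either way.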