ID: 2009.13229

Exact results on high-dimensional linear regression via statistical physics

September 28, 2020

Alexander Mozeika, Mansoor Sheikh, Fabian Aguirre-Lopez, Fabrizio Antenucci, Anthony CC Coolen
Mathematics
Condensed Matter
Statistics
Statistics Theory
Disordered Systems and Neura...

It is clear that conventional statistical inference protocols need to be revised to deal correctly with the high-dimensional data that are now common. Most recent studies aimed at achieving this revision rely on powerful approximation techniques that call for rigorous results against which they can be tested. In this context, the simplest case of high-dimensional linear regression has acquired significant new relevance and attention. In this paper we use the statistical physics perspective on inference to derive a number of new exact results for linear regression in the high-dimensional regime.

Similar papers

Jean Barbier
Disordered Systems and Neura...
Statistical Mechanics
Information Theory
Machine Learning

Statistical inference is the science of drawing conclusions about some system from data. In modern signal processing and machine learning, inference is done in very high dimension: very many unknown characteristics of the system have to be deduced from large amounts of high-dimensional noisy data. This "high-dimensional regime" is reminiscent of statistical mechanics, which aims at describing the macroscopic behavior of a complex system based on the knowledge of its microscopic in...

Madhu Advani, Surya Ganguli
Machine Learning
Disordered Systems and Neural Networks
Statistical Mechanics
Statistics Theory
Quantitative Methods

To model modern large-scale datasets, we need efficient algorithms to infer a set of $P$ unknown model parameters from $N$ noisy measurements. What are fundamental limits on the accuracy of parameter inference, given finite signal-to-noise ratios, limited measurements, prior information, and computational tractability requirements? How can we combine prior information with measurements to achieve these limits? Classical statistics gives incisive answers to these questions as ...
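The question of combining prior information with measurements has a classical Gaussian answer, sketched below (my own illustration with arbitrary dimensions, not the authors' analysis): with prior beta ~ N(0, tau^2 I) and Gaussian noise of variance sigma^2, the Bayes-optimal estimator is ridge regression with lambda = sigma^2/tau^2, which is markedly better than unregularized (minimum-norm) least squares when P > N:

```python
import numpy as np

# Minimal sketch (not from the paper): Bayes-optimal inference of P
# parameters from N < P noisy measurements, using a prior-matched ridge
# penalty lambda = sigma^2 / tau^2.
rng = np.random.default_rng(1)
P, N, sigma, tau = 200, 100, 1.0, 1.0     # more parameters than measurements

beta = tau * rng.normal(size=P)           # draw the truth from the prior
X = rng.normal(size=(N, P)) / np.sqrt(P)  # random measurement matrix
y = X @ beta + sigma * rng.normal(size=N)

lam = sigma**2 / tau**2                   # prior-matched regularization
ridge = np.linalg.solve(X.T @ X + lam * np.eye(P), X.T @ y)
ols, *_ = np.linalg.lstsq(X, y, rcond=None)  # minimum-norm least squares

mse_ridge = float(np.mean((ridge - beta) ** 2))
mse_ols = float(np.mean((ols - beta) ** 2))
print(mse_ridge, mse_ols)
```

The gap between the two errors is exactly the kind of quantity that the statistical-mechanics analysis characterizes in the limit N, P -> infinity with N/P fixed.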

Heather S. Battey, Nancy Reid
Methodology
Statistics Theory

This paper develops an approach to inference in a linear regression model when the number of potential explanatory variables is larger than the sample size. The approach treats each regression coefficient in turn as the interest parameter, the remaining coefficients being nuisance parameters, and seeks an optimal interest-respecting transformation, inducing sparsity on the relevant blocks of the notional Fisher information matrix. The induced sparsity is exploited through a m...

Galen Reeves, Henry Pfister
Information Theory
Statistics Theory

The ability to understand and solve high-dimensional inference problems is essential for modern data science. This article examines high-dimensional inference problems through the lens of information theory and focuses on the standard linear model as a canonical example that is both rich enough to be practically useful and simple enough to be studied rigorously. In particular, this model can exhibit phase transitions where an arbitrarily small change in the model parameters c...

Iain M. Johnstone
Statistics Theory
Probability

Multivariate statistical analysis is concerned with observations on several variables which are thought to possess some degree of inter-dependence. Driven by problems in genetics and the social sciences, it first flowered in the earlier half of the last century. Subsequently, random matrix theory (RMT) developed, initially within physics, and more recently widely in mathematics. While some of the central objects of study in RMT are identical to those of multivariate statistic...

Dimitris Bertsimas, Bart Van Parys
Optimization and Control
Machine Learning

We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective. We devise a new cutting plane method and provide evidence that it can solve the sparse regression problem to provable optimality, in seconds, for sample sizes n and numbers of regressors p in the 100,000s, two orders of magnitude beyond the current state of the art. The ability to solve the problem for very high dimensions allows us to observe...

Junan Zhu
Information Theory

Many real-world problems in machine learning, signal processing, and communications assume that an unknown vector $x$ is measured by a matrix $A$, resulting in a vector $y=Ax+z$, where $z$ denotes the noise; we call this a single measurement vector (SMV) problem. Sometimes, multiple dependent vectors $x^{(j)}, j\in \{1,...,J\}$, are measured at the same time, forming the so-called multi-measurement vector (MMV) problem. Both SMV and MMV are linear models (LMs), and the process...
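The two measurement models described above can be written down in a few lines. The sketch below is purely illustrative (the dimensions, noise level, and the way the MMV columns are made dependent are my own choices, not the paper's):

```python
import numpy as np

# Illustrative construction of SMV and MMV instances as described above.
rng = np.random.default_rng(2)
n_meas, n_dim, J = 50, 100, 3

A = rng.normal(size=(n_meas, n_dim))        # shared measurement matrix

# Single measurement vector (SMV): y = A x + z
x = rng.normal(size=n_dim)
z = 0.1 * rng.normal(size=n_meas)
y = A @ x + z

# Multi-measurement vector (MMV): J dependent unknown vectors, here
# small perturbations of a common x, all measured through the same A.
X = np.tile(x[:, None], (1, J)) + 0.05 * rng.normal(size=(n_dim, J))
Y = A @ X + 0.1 * rng.normal(size=(n_meas, J))

print(y.shape, Y.shape)   # (50,) (50, 3)
```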

Jiaze Qiu
Statistics Theory

The Naïve Mean Field (NMF) approximation is widely employed in modern Machine Learning due to the huge computational gains it bestows on the statistician. Despite its popularity in practice, theoretical guarantees for high-dimensional problems are only available under strong structural assumptions (e.g., sparsity). Moreover, existing theory often does not explain empirical observations noted in the existing literature. In this paper, we take a step towards addressing thes...

Wenxuan Guo, Panos Toulis
Methodology
Statistics Theory

In this paper, we develop invariance-based procedures for testing and inference in high-dimensional regression models. These procedures, also known as randomization tests, provide several important advantages. First, for the global null hypothesis of significance, our test is valid in finite samples. It is also simple to implement and comes with finite-sample guarantees on statistical power. Remarkably, despite its simplicity, this testing idea has escaped the attention of ea...
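The finite-sample validity of such tests comes from exchangeability: under the global null, the test statistic computed on permuted responses has the same distribution as the observed one. The sketch below is a generic permutation test of this kind, shown only to illustrate the idea; it is not the authors' specific procedure, and the statistic and dimensions are my own choices:

```python
import numpy as np

# Generic permutation test for the global null H0: y is independent of X
# (all regression coefficients are zero). Illustration only.
rng = np.random.default_rng(3)

def permutation_pvalue(X, y, n_perm=499):
    """Finite-sample valid p-value using ||X^T y|| as the test statistic."""
    stat = np.linalg.norm(X.T @ y)
    null = [np.linalg.norm(X.T @ rng.permutation(y)) for _ in range(n_perm)]
    # Under H0 the permuted statistics are exchangeable with the observed
    # one, so this p-value is valid for any sample size.
    return (1 + sum(s >= stat for s in null)) / (1 + n_perm)

n, p = 80, 120                              # more regressors than samples
X = rng.normal(size=(n, p))
y_null = rng.normal(size=n)                 # H0 holds
y_alt = 2.0 * X[:, 0] + rng.normal(size=n)  # strong signal in one column

p_null = permutation_pvalue(X, y_null)
p_alt = permutation_pvalue(X, y_alt)
print(p_null, p_alt)
```

Note that the test remains valid even with p > n, which is where classical F-tests break down.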

Huiming Zhang, Song Xi Chen
Statistics Theory
Machine Learning
Probability

This paper gives a review of concentration inequalities, which are widely employed in non-asymptotic analyses in mathematical statistics across a wide range of settings: from distribution-free to distribution-dependent, from sub-Gaussian to sub-exponential, sub-Gamma, and sub-Weibull random variables, and from concentration of the mean to concentration of the maximum. The review presents results in these settings together with some new ones. Given the increasing popularity of high-dimensional data...
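A small numerical check (my own illustration, not from the review) of the simplest bound in this family: Hoeffding's inequality for i.i.d. Bernoulli(1/2) variables states P(|mean - 1/2| >= t) <= 2 exp(-2 n t^2), and the empirical tail probability indeed sits below it:

```python
import numpy as np

# Empirical check of Hoeffding's inequality for bounded random variables.
rng = np.random.default_rng(4)
n, t, trials = 200, 0.1, 10_000

# Sample means of n i.i.d. Bernoulli(1/2) variables, repeated many times.
means = rng.integers(0, 2, size=(trials, n)).mean(axis=1)
empirical = float(np.mean(np.abs(means - 0.5) >= t))
hoeffding = float(2 * np.exp(-2 * n * t**2))

print(empirical, hoeffding)   # empirical tail stays below the bound
```

The bound is loose here (the exact binomial tail is several times smaller), which is typical: concentration inequalities trade sharpness for generality and non-asymptotic validity.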