Scalable Bayesian inference for the generalized linear mixed model

March 5, 2024

Samuel I. Berchuck, Felipe A. Medeiros, Sayan Mukherjee, Andrea Agazzi

Statistics

Computation

Methodology

Machine Learning

The generalized linear mixed model (GLMM) is a popular statistical approach for handling correlated data, and is used extensively in application areas where big data is common, including biomedical data settings. The focus of this paper is scalable statistical inference for the GLMM, where we define statistical inference as: (i) estimation of population parameters, and (ii) evaluation of scientific hypotheses in the presence of uncertainty. Artificial intelligence (AI) learning algorithms excel at scalable statistical estimation, but rarely include uncertainty quantification. In contrast, Bayesian inference provides full statistical inference, since uncertainty quantification results automatically from the posterior distribution. Unfortunately, Bayesian inference algorithms, including Markov Chain Monte Carlo (MCMC), become computationally intractable in big data settings. In this paper, we introduce a statistical inference algorithm at the intersection of AI and Bayesian inference that combines the scalability of modern AI algorithms with the guaranteed uncertainty quantification that accompanies Bayesian inference. Our algorithm is an extension of stochastic gradient MCMC with novel contributions that address the treatment of correlated data (i.e., intractable marginal likelihood) and proper posterior variance estimation. Through theoretical and empirical results we establish our algorithm's statistical inference properties, and apply the method in a large electronic health records database.
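
To illustrate the kind of method this abstract builds on, here is a minimal sketch of generic stochastic gradient Langevin dynamics, one standard form of stochastic gradient MCMC. It is not the authors' algorithm: the toy model (a Gaussian mean with known unit variance and a normal prior), the function name `sgld_gaussian_mean`, and all parameter values are assumptions chosen for illustration, and the model has no random effects, so it avoids the intractable marginal likelihood the paper addresses.

```python
import random


def sgld_gaussian_mean(y, n_steps=6000, burn_in=1000, batch_size=50,
                       step_size=1e-4, prior_sd=10.0, seed=0):
    """Draw approximate posterior samples of theta for y_i ~ N(theta, 1).

    Hypothetical toy example: prior theta ~ N(0, prior_sd^2), fixed step size.
    """
    rng = random.Random(seed)
    n_total = len(y)
    theta = 0.0
    samples = []
    for t in range(n_steps):
        # Mini-batch drawn without replacement from the full data set.
        batch = rng.sample(y, batch_size)
        # Gradient of the log prior at the current theta.
        grad_prior = -theta / prior_sd ** 2
        # Mini-batch log-likelihood gradient, rescaled to the full data size.
        grad_lik = (n_total / batch_size) * sum(yi - theta for yi in batch)
        # Langevin update: half-step along the gradient plus N(0, step_size) noise.
        noise = rng.gauss(0.0, step_size ** 0.5)
        theta += 0.5 * step_size * (grad_prior + grad_lik) + noise
        if t >= burn_in:
            samples.append(theta)
    return samples


# Simulated data (assumed true mean 2.0) and a comparison with the exact posterior mean.
data_rng = random.Random(1)
y = [data_rng.gauss(2.0, 1.0) for _ in range(1000)]
samples = sgld_gaussian_mean(y)
sgld_mean = sum(samples) / len(samples)
exact_mean = sum(y) / (len(y) + 1.0 / 10.0 ** 2)
print(sgld_mean, exact_mean)
```

With a fixed step size, the mini-batch gradient noise adds to the injected Gaussian noise, so the spread of these samples tends to exceed the exact posterior variance. That is one reason posterior variance estimation needs separate care in this family of methods.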

A stochastic variational framework for fitting and diagnosing generalized linear mixed models

August 24, 2012

91% Match

Linda S. L. Tan, David J. Nott

Methodology

In stochastic variational inference, the variational Bayes objective function is optimized using stochastic gradient approximation, where gradients computed on small random subsets of data are used to approximate the true gradient over the whole data set. This enables complex models to be fit to large data sets as data can be processed in mini-batches. In this article, we extend stochastic variational inference for conjugate-exponential models to nonconjugate models and prese...

On Scalable Inference with Stochastic Gradient Descent

July 1, 2017

91% Match

Yixin Fang, Jinfeng Xu, Lei Yang

Machine Learning

In many applications involving large datasets or online updating, stochastic gradient descent (SGD) provides a scalable way to compute parameter estimates and has gained increasing popularity due to its numerical convenience and memory efficiency. While the asymptotic properties of SGD-based estimators were established decades ago, statistical inference such as interval estimation remains largely unexplored. Traditional resampling methods such as the bootstrap are not comp...

LR-GLM: High-Dimensional Bayesian Inference Using Low-Rank Data Approximations

May 17, 2019

91% Match

Brian L. Trippe, Jonathan H. Huggins, ... , Tamara Broderick

Computation

Machine Learning

Methodology

Due to the ease of modern data collection, applied statisticians often have access to a large set of covariates that they wish to relate to some observed outcome. Generalized linear models (GLMs) offer a particularly interpretable framework for such an analysis. In these high-dimensional problems, the number of covariates is often large relative to the number of observations, so we face non-trivial inferential uncertainty; a Bayesian approach allows coherent quantification of...

MCMC for GLMMs

April 4, 2022

90% Match

Vivekananda Roy

Methodology

Applications

Computation

Generalized linear mixed models (GLMMs) are often used for analyzing correlated non-Gaussian data. The likelihood function in a GLMM is available only as a high-dimensional integral, and thus closed-form inference and prediction are not possible for GLMMs. Since the likelihood is not available in closed form, the associated posterior densities in Bayesian GLMMs are also intractable. Generally, Markov chain Monte Carlo (MCMC) algorithms are used for conditional simulation in...

Accelerating Generalized Linear Models by Trading off Computation for Uncertainty

October 31, 2023

90% Match

Lukas Tatzel, Jonathan Wenger, ... , Philipp Hennig

Machine Learning

Bayesian Generalized Linear Models (GLMs) define a flexible probabilistic framework to model categorical, ordinal and continuous data, and are widely used in practice. However, exact inference in GLMs is prohibitively expensive for large datasets, thus requiring approximations in practice. The resulting approximation error adversely impacts the reliability of the model and is not accounted for in the uncertainty of the prediction. In this work, we introduce a family of iterat...

Big Learning with Bayesian Methods

November 24, 2014

90% Match

Jun Zhu, Jianfei Chen, ... , Bo Zhang

Machine Learning

Applications

Computation

Methodology

Explosive growth in data and the availability of cheap computing resources have sparked increasing interest in Big learning, an emerging subfield that studies scalable machine learning algorithms, systems, and applications with Big Data. Bayesian methods represent one important class of statistical methods for machine learning, with substantial recent developments in adaptive, flexible and scalable Bayesian learning. This article provides a survey of the recent advances in Big lear...

Generalised linear mixed model analysis via sequential Monte Carlo sampling

October 7, 2008

90% Match

Y. Fan, D. S. Leslie, M. P. Wand

Computation

We present a sequential Monte Carlo sampler algorithm for the Bayesian analysis of generalised linear mixed models (GLMMs). These models support a variety of interesting regression-type analyses, but performing inference is often extremely difficult, even when using the Bayesian approach combined with Markov chain Monte Carlo (MCMC). The Sequential Monte Carlo sampler (SMC) is a new and general method for producing samples from posterior distributions. In this article we demo...

Extended Stochastic Gradient MCMC for Large-Scale Bayesian Variable Selection

February 7, 2020

90% Match

Qifan Song, Yan Sun, ... , Faming Liang

Computation

Machine Learning

Stochastic gradient Markov chain Monte Carlo (MCMC) algorithms have received much attention in Bayesian computing for big data problems, but they are only applicable to a small class of problems for which the parameter space has a fixed dimension and the log-posterior density is differentiable with respect to the parameters. This paper proposes an extended stochastic gradient MCMC algorithm which, by introducing appropriate latent variables, can be applied to more general lar...

Scalable Monte Carlo for Bayesian Learning

July 17, 2024

90% Match

Paul Fearnhead, Christopher Nemeth, ... , Chris Sherlock

Machine Learning

Computation

Methodology

This book aims to provide a graduate-level introduction to advanced topics in Markov chain Monte Carlo (MCMC) algorithms, as applied broadly in the Bayesian computational context. Most, if not all, of these topics (stochastic gradient MCMC, non-reversible MCMC, continuous time MCMC, and new techniques for convergence assessment) have emerged as recently as the last decade, and have driven substantial recent practical and theoretical advances in the field. A particular focus is...

Unbiased Bayes for Big Data: Paths of Partial Posteriors

January 14, 2015

90% Match

Heiko Strathmann, Dino Sejdinovic, Mark Girolami

Machine Learning

Methodology

Key quantities of interest in Bayesian inference are expectations of functions with respect to a posterior distribution. Markov Chain Monte Carlo is a fundamental tool to consistently compute these expectations via averaging samples drawn from an approximate posterior. However, its feasibility is being challenged in the era of so-called Big Data, as all data needs to be processed in every iteration. Realising that such simulation is an unnecessarily hard problem if the goal is...
