Scalable Bayesian inference for the gene...

Massive parallelization of serial inference algorithms for a complex generalized linear model

August 4, 2012

90% Match

Marc A. Suchard, Shawn E. Simpson, Ivan Zorych, ... , Madigan David

Computation

Optimization and Control

Following a series of high-profile drug safety disasters in recent years, many countries are redoubling their efforts to ensure the safety of licensed medical products. Large-scale observational databases such as claims databases or electronic health record systems are attracting particular attention in this regard, but present significant methodological and computational concerns. In this paper we show how high-performance statistical computation, including graphics processi...

Find SimilarView on arXiv

A subsampling approach for Bayesian model selection

January 31, 2022

90% Match

Jon Lachmann, Geir Storvik, ... , Hubin Aliaksadr

Methodology

Statistics Theory

Computation

Statistics Theory

It is common practice to use Laplace approximations to compute marginal likelihoods in Bayesian versions of generalised linear models (GLM). Marginal likelihoods combined with model priors are then used in different search algorithms to compute the posterior marginal probabilities of models and individual covariates. This allows performing Bayesian model selection and model averaging. For large sample sizes, even the Laplace approximation becomes computationally challenging b...

Find SimilarView on arXiv

Patterns of Scalable Bayesian Inference

February 16, 2016

90% Match

Elaine Angelino, Matthew James Johnson, Ryan P. Adams

Machine Learning

Datasets are growing not just in size but in complexity, creating a demand for rich models and quantification of uncertainty. Bayesian methods are an excellent fit for this demand, but scaling Bayesian inference is a challenge. In response to this challenge, there has been considerable recent work based on varying assumptions about model structure, underlying computational resources, and the importance of asymptotic correctness. As a result, there is a zoo of ideas with few c...

Find SimilarView on arXiv

Simple, Scalable and Accurate Posterior Interval Estimation

May 13, 2016

89% Match

Cheng Li, Sanvesh Srivastava, David B. Dunson

Computation

Statistics Theory

There is a lack of simple and scalable algorithms for uncertainty quantification. Bayesian methods quantify uncertainty through posterior and predictive distributions, but it is difficult to rapidly estimate summaries of these distributions, such as quantiles and intervals. Variational Bayes approximations are widely used, but may badly underestimate posterior covariance. Typically, the focus of Bayesian inference is on point and interval estimates for one-dimensional functio...

Find SimilarView on arXiv

Efficient variational inference for generalized linear mixed models with large datasets

July 30, 2013

89% Match

David J Nott, Minh-Ngoc Tran, ... , Kohn Robert

Methodology

The article develops a hybrid Variational Bayes algorithm that combines the mean-field and fixed-form Variational Bayes methods. The new estimation algorithm can be used to approximate any posterior without relying on conjugate priors. We propose a divide and recombine strategy for the analysis of large datasets, which partitions a large dataset into smaller pieces and then combines the variational distributions that have been learnt in parallel on each separate piece using t...

Find SimilarView on arXiv

Efficient Computation of High-Dimensional Penalized Generalized Linear Mixed Models by Latent Factor Modeling of the Random Effects

May 14, 2023

89% Match

Hillary M. Heiling, Naim U. Rashid, Quefeng Li, Xianlu L. Peng, ... , Ibrahim Joseph G.

Methodology

Computation

Modern biomedical datasets are increasingly high dimensional and exhibit complex correlation structures. Generalized Linear Mixed Models (GLMMs) have long been employed to account for such dependencies. However, proper specification of the fixed and random effects in GLMMs is increasingly difficult in high dimensions, and computational complexity grows with increasing dimension of the random effects. We present a novel reformulation of the GLMM using a factor model decomposit...

Find SimilarView on arXiv

Scaling Bayesian inference of mixed multinomial logit models to very large datasets

April 11, 2020

89% Match

Filipe Rodrigues

Computation

Machine Learning

Methodology

Machine Learning

Variational inference methods have been shown to lead to significant improvements in the computational efficiency of approximate Bayesian inference in mixed multinomial logit models when compared to standard Markov-chain Monte Carlo (MCMC) methods without compromising accuracy. However, despite their demonstrated efficiency gains, existing methods still suffer from important limitations that prevent them to scale to very large datasets, while providing the flexibility to allo...

Find SimilarView on arXiv

Deep Bayesian Gaussian Processes for Uncertainty Estimation in Electronic Health Records

March 23, 2020

89% Match

Yikuan Li, Shishir Rao, Abdelaali Hassaine, Rema Ramakrishnan, Yajie Zhu, Dexter Canoy, Gholamreza Salimi-Khorshidi, ... , Rahimi Kazem

Machine Learning

One major impediment to the wider use of deep learning for clinical decision making is the difficulty of assigning a level of confidence to model predictions. Currently, deep Bayesian neural networks and sparse Gaussian processes are the main two scalable uncertainty estimation methods. However, deep Bayesian neural network suffers from lack of expressiveness, and more expressive models such as deep kernel learning, which is an extension of sparse Gaussian process, captures o...

Find SimilarView on arXiv

Scalable expectation propagation for generalized linear models

July 2, 2024

89% Match

Niccolò Anceschi, Augusto Fasano, ... , Rebaudo Giovanni

Computation

Generalized linear models (GLMs) arguably represent the standard approach for statistical regression beyond the Gaussian likelihood scenario. When Bayesian formulations are employed, the general absence of a tractable posterior distribution has motivated the development of deterministic approximations, which are generally more scalable than sampling techniques. Among them, expectation propagation (EP) showed extreme accuracy, usually higher than many variational Bayes solutio...

Find SimilarView on arXiv

An Algorithm for Distributed Bayesian Inference in Generalized Linear Models

November 18, 2019

89% Match

Nariankadu D. Shyamalkumar, Sanvesh Srivastava

Computation

Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference in generalized linear models; however, these algorithms are prohibitively slow in massive data settings because they require multiple passes through the full data in every iteration. Addressing this problem, we develop a scalable extension of these algorithms using the divide-and-conquer (D&C) technique that divides the data into a suffi...

Find SimilarView on arXiv

Scalable Bayesian inference for the generalized linear mixed model

Massive parallelization of serial inference algorithms for a complex generalized linear model

A subsampling approach for Bayesian model selection

Patterns of Scalable Bayesian Inference

Simple, Scalable and Accurate Posterior Interval Estimation

Efficient variational inference for generalized linear mixed models with large datasets

Efficient Computation of High-Dimensional Penalized Generalized Linear Mixed Models by Latent Factor Modeling of the Random Effects

Scaling Bayesian inference of mixed multinomial logit models to very large datasets

Deep Bayesian Gaussian Processes for Uncertainty Estimation in Electronic Health Records

Scalable expectation propagation for generalized linear models

An Algorithm for Distributed Bayesian Inference in Generalized Linear Models