DDAC-SpAM: A Distributed Algorithm for F...

DECOrrelated feature space partitioning for distributed sparse regression

February 8, 2016

90% Match

Xiangyu Wang, David Dunson, Chenlei Leng

Methodology

Distributed, Parallel, and C...

Computation

Machine Learning

Fitting statistical models is computationally challenging when the sample size or the dimension of the dataset is huge. An attractive approach for down-scaling the problem size is to first partition the dataset into subsets and then fit using distributed algorithms. The dataset can be partitioned either horizontally (in the sample space) or vertically (in the feature space). While the majority of the literature focuses on sample space partitioning, feature space partitioning ...


Sparse Additive Models

November 28, 2007

89% Match

Pradeep Ravikumar, John Lafferty, ... , Larry Wasserman

Statistics Theory

We present a new class of methods for high-dimensional nonparametric regression and classification called sparse additive models (SpAM). Our methods combine ideas from sparse linear modeling and additive nonparametric regression. We derive an algorithm for fitting the models that is practical and effective even when the number of covariates is larger than the sample size. SpAM is closely related to the COSSO model of Lin and Zhang (2006), but decouples smoothing and sparsity,...


PAC-Bayesian Estimation and Prediction in Sparse Additive Models

August 6, 2012

89% Match

Benjamin Guedj, Pierre Alquier

Methodology

Statistics Theory

The present paper is about estimation and prediction in high-dimensional additive models under a sparsity assumption ($p\gg n$ paradigm). A PAC-Bayesian strategy is investigated, delivering oracle inequalities in probability. The implementation is performed through recent outcomes in high-dimensional MCMC algorithms, and the performance of our method is assessed on simulated data.


Sparse Additive Model using Symmetric Nonnegative Definite Smoothers

September 9, 2014

89% Match

Yan Li

Machine Learning

We introduce a new algorithm, called the adaptive sparse backfitting algorithm, for solving the high-dimensional Sparse Additive Model (SpAM) using symmetric, non-negative definite smoothers. Unlike the previous sparse backfitting algorithm, our method is essentially a block coordinate descent algorithm that is guaranteed to converge to the optimal solution. It bridges the gap between the population backfitting algorithm and that of the data version. We also prove variable selection...


Sparse Partially Linear Additive Models

July 17, 2014

88% Match

Yin Lou, Jacob Bien, ... , Johannes Gehrke

Methodology

Machine Learning

The generalized partially linear additive model (GPLAM) is a flexible and interpretable approach to building predictive models. It combines features in an additive manner, allowing each to have either a linear or nonlinear effect on the response. However, the choice of which features to treat as linear or nonlinear is typically assumed known. Thus, to make a GPLAM a viable approach in situations in which little is known a priori about the features, one must overcome two pri...


Debiased distributed learning for sparse partial linear models in high dimensions

August 18, 2017

88% Match

Shaogao Lv, Heng Lian

Machine Learning

Although various distributed machine learning schemes have been proposed recently for pure linear models and fully nonparametric models, little attention has been paid to distributed optimization for semi-parametric models with multiple-level structures (e.g. sparsity, linearity and nonlinearity). To address these issues, the current paper proposes a new communication-efficient distributed learning algorithm for partially sparse linear models with an increasing number of fe...


High-dimensional additive modeling

June 25, 2008

88% Match

Lukas Meier, Sara van de Geer, Peter Bühlmann

Machine Learning

We propose a new sparsity-smoothness penalty for high-dimensional generalized additive models. The combination of sparsity and smoothness is crucial for mathematical theory as well as performance for finite-sample data. We present a computationally efficient algorithm, with provable numerical convergence properties, for optimizing the penalized likelihood. Furthermore, we provide oracle results which yield asymptotic optimality of our estimator for high dimensional but sparse...


A Partition-insensitive Parallel Framework for Distributed Model Fitting

June 2, 2024

87% Match

Xiaofei Wu, Rongmei Liang, Fabio Roli, ... , Jing Yuan

Computation

Distributed model fitting refers to the process of fitting a mathematical or statistical model to the data using distributed computing resources, such that computing tasks are divided among multiple interconnected computers or nodes, often organized in a cluster or network. Most existing methods for distributed model fitting formulate it as a consensus optimization problem and then build algorithms based on the alternating direction method of multipliers (AD...


Distributed Dynamic Safe Screening Algorithms for Sparse Regularization

April 23, 2022

87% Match

Runxue Bao, Xidong Wu, ... , Heng Huang

Machine Learning

Distributed, Parallel, and C...

Machine Learning

Distributed optimization has been widely used as one of the most efficient approaches for model training with massive samples. However, large-scale learning problems with both massive samples and high-dimensional features widely exist in the era of big data. Safe screening is a popular technique to speed up high-dimensional models by discarding the inactive features with zero coefficients. Nevertheless, existing safe screening methods are limited to the sequential setting. In...


Convex-constrained Sparse Additive Modeling and Its Extensions

May 1, 2017

87% Match

Junming Yin, Yaoliang Yu

Machine Learning

Sparse additive modeling is a class of effective methods for performing high-dimensional nonparametric regression. In this work we show how shape constraints such as convexity/concavity and their extensions, can be integrated into additive models. The proposed sparse difference of convex additive models (SDCAM) can estimate most continuous functions without any a priori smoothness assumption. Motivated by a characterization of difference of convex functions, our method incorp...


DDAC-SpAM: A Distributed Algorithm for Fitting High-dimensional Sparse Additive Models with Feature Division and Decorrelation
