Similar papers 2
January 9, 2023
In multiple domains, statistical tasks are performed in distributed settings, with data split among several end machines that are connected to a fusion center. In various applications, the end machines have limited bandwidth and power, and thus a tight communication budget. In this work we focus on distributed learning of a sparse linear regression model, under severe communication constraints. We propose several two-round distributed schemes, whose communication per machine ...
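As a concrete illustration of the kind of two-round scheme this abstract describes, here is a minimal sketch in which machines first vote on a support set and then send refitted coefficients on that support. The protocol, the vote threshold, and all names below are illustrative assumptions, not the paper's actual schemes.

```python
# Hypothetical two-round scheme: round 1 communicates local lasso supports,
# round 2 communicates least-squares refits on the voted support.
import numpy as np
from sklearn.linear_model import Lasso

def two_round_sparse_regression(datasets, d, alpha=0.1, vote_frac=0.5):
    # Round 1: each machine sends only the indices of its estimated support.
    supports = [np.flatnonzero(Lasso(alpha=alpha).fit(X, y).coef_)
                for X, y in datasets]
    votes = np.zeros(d)
    for s in supports:
        votes[s] += 1
    S = np.flatnonzero(votes >= vote_frac * len(datasets))  # majority vote

    # Round 2: each machine sends its least-squares coefficients on S.
    coefs = [np.linalg.lstsq(X[:, S], y, rcond=None)[0] for X, y in datasets]
    beta = np.zeros(d)
    beta[S] = np.mean(coefs, axis=0)  # the fusion center simply averages
    return beta

# toy run: 5 machines, 3 active coefficients out of 50
rng = np.random.default_rng(0)
d, beta_true = 50, np.zeros(50)
beta_true[:3] = 1.0
datasets = [(X, X @ beta_true + 0.1 * rng.standard_normal(200))
            for X in (rng.standard_normal((200, d)) for _ in range(5))]
print(two_round_sparse_regression(datasets, d)[:5])
```

Communication here is two short messages per machine: a support set of indices, then a coefficient vector on that support.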
October 11, 2017
We consider multi-agent stochastic optimization problems over reproducing kernel Hilbert spaces (RKHS). In this setting, a network of interconnected agents aims to learn decision functions, i.e., nonlinear statistical models, that are optimal in terms of a global convex functional that aggregates data across the network, with only access to locally and sequentially observed samples. We propose solving this problem by allowing each agent to learn a local regression function wh...
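A single-agent sketch of functional stochastic gradient descent in an RKHS may help fix ideas: each observed sample contributes one kernel center, and regularization shrinks the existing expansion weights. The paper's multi-agent scheme adds network coordination and control of the growing dictionary, which this toy omits; the kernel bandwidth, step size, and names are assumptions.

```python
# Toy functional SGD in an RKHS with a Gaussian kernel; every sample adds a
# kernel center and regularization shrinks the old weights. Single agent only.
import numpy as np

def kernel_sgd(stream, bw=0.5, eta=0.2, lam=0.01):
    centers, weights = [], np.empty(0)
    for x, y in stream:
        if centers:
            k = np.exp(-np.sum((np.asarray(centers) - x) ** 2, axis=1)
                       / (2 * bw ** 2))
            fx = float(weights @ k)       # current prediction f_t(x)
        else:
            fx = 0.0
        weights = (1 - eta * lam) * weights            # regularizer shrinkage
        centers.append(x)
        weights = np.append(weights, -eta * (fx - y))  # squared-loss gradient
    return np.asarray(centers), weights

# noisy sine stream
rng = np.random.default_rng(1)
X = rng.uniform(-3, 3, size=(300, 1))
centers, w = kernel_sgd(zip(X, np.sin(X[:, 0]) + 0.1 * rng.standard_normal(300)))
print(centers.shape, w.shape)
```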
August 1, 2019
We consider learning in decentralized heterogeneous networks: agents seek to minimize a convex functional that aggregates data across the network, while only having access to their local data streams. We focus on the case where agents seek to estimate a regression \emph{function} that belongs to a reproducing kernel Hilbert space (RKHS). To incentivize coordination while respecting network heterogeneity, we impose nonlinear proximity constraints. To solve the constrained stoc...
December 7, 2021
In this chapter, we will mainly focus on collaborative training across wireless devices. Training an ML model is equivalent to solving an optimization problem, and many distributed optimization algorithms have been developed over the last decades. These distributed ML algorithms provide data locality; that is, a joint model can be trained collaboratively while the data available at each participating device remains local. This addresses, to some extent, the privacy concern. Th...
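To make the data-locality point concrete, here is a minimal FedAvg-style loop (one of many such algorithms, chosen here as an assumption) in which each device takes gradient steps on its own data and only model weights, never raw data, are exchanged:

```python
# FedAvg-style loop: raw data never leaves a device, only model weights do.
import numpy as np

def local_steps(w, X, y, eta=0.1, steps=5):
    for _ in range(steps):
        w = w - eta * X.T @ (X @ w - y) / len(y)  # local least-squares gradient
    return w

def federated_averaging(devices, d, rounds=20):
    w = np.zeros(d)
    for _ in range(rounds):
        # each device trains on its own data; the server averages the weights
        w = np.mean([local_steps(w.copy(), X, y) for X, y in devices], axis=0)
    return w

rng = np.random.default_rng(3)
w_true = np.array([1.0, -2.0])
devices = [(X, X @ w_true + 0.05 * rng.standard_normal(100))
           for X in (rng.standard_normal((100, 2)) for _ in range(4))]
print(federated_averaging(devices, d=2))
```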
May 5, 2015
In modern scientific research, massive datasets with huge numbers of observations are frequently encountered. To facilitate the computational process, a divide-and-conquer scheme is often used for the analysis of big data. In such a strategy, a full dataset is first split into several manageable segments; the final output is then averaged from the individual outputs of the segments. Despite its popularity in practice, it remains largely unknown whether such a distributiv...
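The split-then-average strategy described above is straightforward to sketch; the base estimator, split count, and regularization below are illustrative choices:

```python
# Divide-and-conquer: split the full dataset, fit each segment, average.
import numpy as np
from sklearn.linear_model import Ridge

def divide_and_conquer_ridge(X, y, n_splits=10, alpha=1.0):
    coefs = [Ridge(alpha=alpha).fit(Xs, ys).coef_
             for Xs, ys in zip(np.array_split(X, n_splits),
                               np.array_split(y, n_splits))]
    return np.mean(coefs, axis=0)  # final output averages the segment fits

rng = np.random.default_rng(2)
X = rng.standard_normal((5000, 8))
beta = rng.standard_normal(8)
y = X @ beta + 0.1 * rng.standard_normal(5000)
print(np.linalg.norm(divide_and_conquer_ridge(X, y) - beta))  # small error
```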
January 28, 2020
This paper studies the decentralized optimization and learning problem where multiple interconnected agents aim to learn an optimal decision function defined over a reproducing kernel Hilbert space by jointly minimizing a global objective function, with access to their own locally observed dataset. As a non-parametric approach, kernel learning faces a major challenge in distributed implementation: the decision variables of local objective functions are data-dependent and thus...
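One standard workaround for the data-dependent representer expansion, which may or may not be what this paper adopts, is to map every agent into a shared random Fourier feature space, so each agent's decision variable becomes a common fixed-length weight vector that neighbors can average. A hedged sketch:

```python
# Shared random Fourier features turn the data-dependent kernel expansion
# into a common D-dimensional weight vector that agents can average.
import numpy as np

def rff(X, W, b):
    return np.sqrt(2.0 / W.shape[1]) * np.cos(X @ W + b)

def decentralized_kernel_fit(datasets, mixing, D=200, eta=0.1, lam=0.01,
                             iters=300, seed=0):
    rng = np.random.default_rng(seed)   # common seed -> identical feature maps
    d = datasets[0][0].shape[1]
    W, b = rng.standard_normal((d, D)), rng.uniform(0, 2 * np.pi, D)
    feats = [(rff(X, W, b), y) for X, y in datasets]
    theta = np.zeros((len(datasets), D))       # one weight vector per agent
    for _ in range(iters):
        theta = mixing @ theta                 # consensus with neighbors
        for i, (Z, y) in enumerate(feats):     # local gradient step
            theta[i] -= eta * (Z.T @ (Z @ theta[i] - y) / len(y)
                               + lam * theta[i])
    return theta, (W, b)

# ring of 4 agents, each holding its own noisy-sine data
rng = np.random.default_rng(4)
A = np.eye(4)
for i in range(4):
    A[i, (i - 1) % 4] = A[i, (i + 1) % 4] = 1.0
datasets = [(X, np.sin(X[:, 0]) + 0.1 * rng.standard_normal(100))
            for X in (rng.uniform(-3, 3, (100, 1)) for _ in range(4))]
theta, _ = decentralized_kernel_fit(datasets, A / A.sum(axis=1, keepdims=True))
print(theta.shape)
```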
March 6, 2021
A key functionality of emerging connected autonomous systems such as smart cities, smart transportation systems, and the industrial Internet-of-Things, is the ability to process and learn from data collected at different physical locations. This is increasingly attracting attention under the terms of distributed learning and federated learning. However, in connected autonomous systems, data transfer takes place over communication networks with often limited resources. This pa...
August 7, 2017
Distributed learning is an effective way to analyze big data. In distributed regression, a typical approach is to divide the big data into multiple blocks, apply a base regression algorithm on each of them, and then simply average the output functions learnt from these blocks. Since the averaging process decreases the variance but not the bias, bias correction is expected to improve the learning performance if the base regression algorithm is a biased one. Regularization kerne...
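As one generic bias-reduction device (not necessarily this paper's correction), each block can refit its own residuals once, a "twicing" step, before the block outputs are averaged. A sketch using scikit-learn's kernel ridge regression, with illustrative hyperparameters:

```python
# Distributed KRR with a per-block "twicing" correction: fit, refit the
# residuals, add the two fits, then average across blocks.
import numpy as np
from sklearn.kernel_ridge import KernelRidge

def block_fit(X, y, alpha=1.0, gamma=0.5):
    f1 = KernelRidge(alpha=alpha, kernel="rbf", gamma=gamma).fit(X, y)
    resid = y - f1.predict(X)          # regularization bias lives here
    f2 = KernelRidge(alpha=alpha, kernel="rbf", gamma=gamma).fit(X, resid)
    return f1, f2

def distributed_krr_predict(blocks, X_test):
    preds = [f1.predict(X_test) + f2.predict(X_test)
             for f1, f2 in (block_fit(X, y) for X, y in blocks)]
    return np.mean(preds, axis=0)      # averaging shrinks the variance

rng = np.random.default_rng(5)
X = rng.uniform(-3, 3, (2000, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(2000)
blocks = list(zip(np.array_split(X, 10), np.array_split(y, 10)))
print(distributed_krr_predict(blocks, np.linspace(-3, 3, 5).reshape(-1, 1)))
```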
March 23, 2015
Kernel Principal Component Analysis (KPCA) is a key machine learning algorithm for extracting nonlinear features from data. In the presence of a large volume of high dimensional data collected in a distributed fashion, it becomes very costly to communicate all of this data to a single data center and then perform kernel PCA. Can we perform kernel PCA on the entire dataset in a distributed and communication efficient fashion while maintaining provable and strong guarantees in ...
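One plausible communication-efficient route, offered only to illustrate the problem setup rather than the paper's algorithm or guarantees, has every node push its data through a shared random feature map and ship a small feature covariance to the center:

```python
# Each node ships a D x D covariance of shared random features (independent
# of its local sample size); the center eigendecomposes the aggregate.
import numpy as np

def rff(X, W, b):
    return np.sqrt(2.0 / W.shape[1]) * np.cos(X @ W + b)

def distributed_kpca(datasets, D=300, n_components=2, seed=0):
    rng = np.random.default_rng(seed)
    d = datasets[0].shape[1]
    W, b = rng.standard_normal((d, D)), rng.uniform(0, 2 * np.pi, D)
    # communication per node: one D x D matrix, regardless of data volume
    C = sum(rff(X, W, b).T @ rff(X, W, b) for X in datasets)
    C /= sum(len(X) for X in datasets)
    vals, vecs = np.linalg.eigh(C)
    top = vecs[:, np.argsort(vals)[::-1][:n_components]]
    return lambda X_new: rff(X_new, W, b) @ top   # projection for new points

datasets = [np.random.default_rng(i).standard_normal((200, 3)) for i in range(4)]
project = distributed_kpca(datasets)
print(project(datasets[0][:2]).shape)  # (2, 2)
```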
November 7, 2016
The scale of modern datasets necessitates the development of efficient distributed optimization methods for machine learning. We present a general-purpose framework for distributed computing environments, CoCoA, that has an efficient communication scheme and is applicable to a wide variety of problems in machine learning and signal processing. We extend the framework to cover general non-strongly-convex regularizers, including L1-regularized problems like lasso, sparse logist...
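A schematic of the CoCoA communication pattern for lasso with features partitioned across machines: each machine runs a local solver on its own coordinate block against a shared prediction vector, and a round communicates only each block's change to that vector. The local solver, the averaging parameter gamma, and all names below are simplifications, not the framework's exact subproblem formulation.

```python
# CoCoA-like pattern for lasso with features split across K machines: per
# round, each machine does local coordinate descent on its block against the
# shared prediction vector v = A @ x, and only the v-updates are aggregated.
import numpy as np

def soft_threshold(z, t):
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def cocoa_like_lasso(A, y, K=4, lam=0.1, rounds=100):
    n, d = A.shape
    blocks = np.array_split(np.arange(d), K)
    x, v = np.zeros(d), np.zeros(n)      # invariant: v == A @ x
    gamma = 1.0 / K                      # conservative averaging aggregation
    for _ in range(rounds):
        dx = np.zeros(d)
        for blk in blocks:               # runs on machine k (in parallel)
            xk, dvk = x.copy(), np.zeros(n)
            for j in blk:                # one local coordinate-descent pass
                aj = A[:, j]
                rho = aj @ (y - v - dvk) + (aj @ aj) * xk[j]
                new = soft_threshold(rho, lam * n) / (aj @ aj)
                dvk += aj * (new - xk[j])
                xk[j] = new
            dx[blk] = xk[blk] - x[blk]   # machine k's proposed block update
        x += gamma * dx                  # driver combines the block updates
        v = A @ x                        # (communicated as the summed dvk's)
    return x

rng = np.random.default_rng(6)
A = rng.standard_normal((200, 40))
x_true = np.zeros(40)
x_true[:4] = 2.0
y = A @ x_true + 0.1 * rng.standard_normal(200)
print(cocoa_like_lasso(A, y, K=4, lam=0.05)[:6])
```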