Complexities of convex combinations and ...

Random-projection ensemble classification

April 17, 2015

84% Match

Timothy I. Cannings, Richard J. Samworth

Methodology

We introduce a very general method for high-dimensional classification, based on careful combination of the results of applying an arbitrary base classifier to random projections of the feature vectors into a lower-dimensional space. In one special case that we study in detail, the random projections are divided into disjoint groups, and within each group we select the projection yielding the smallest estimate of the test error. Our random projection ensemble classifier then ...


Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles

November 15, 2017

84% Match

Dhruv Mahajan, Vivek Gupta, S Sathiya Keerthi, Sellamanickam Sundararajan, ... , Kidambi Rahul

Machine Learning

For many applications, an ensemble of base classifiers is an effective solution. The tuning of its parameters (number of classes, amount of data on which each classifier is to be trained, etc.) requires G, the generalization error of a given ensemble. The efficient estimation of G is the focus of this paper. The key idea is to approximate the variance of the class scores/probabilities of the base classifiers over the randomness imposed by the training subset by normal/beta ...


Minimizing The Misclassification Error Rate Using a Surrogate Convex Loss

June 27, 2012

84% Match

Shai Ben-David (University of Waterloo), David Loker (University of Waterloo), ... , Sridharan Karthik (University of Pennsylvania)

Machine Learning

We carefully study how well minimizing convex surrogate loss functions corresponds to minimizing the misclassification error rate for the problem of binary classification with linear predictors. In particular, we show that amongst all convex surrogate losses, the hinge loss gives essentially the best possible bound for the misclassification error rate of the resulting linear predictor in terms of the best possible margin error rate. We also pro...


On the complexity of PAC learning in Hilbert spaces

March 3, 2023

84% Match

Sergei Chubanov

Machine Learning

Functional Analysis


We study the problem of binary classification from the point of view of learning convex polyhedra in Hilbert spaces, to which one can reduce any binary classification problem. The problem of learning convex polyhedra in finite-dimensional spaces is sufficiently well studied in the literature. We generalize this problem to that in a Hilbert space and propose an algorithm for learning a polyhedron which correctly classifies at least $1-\varepsilon$ of the distribution, with a ...


Ensembles of Kernel Predictors

February 14, 2012

84% Match

Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

Machine Learning

This paper examines the problem of learning with a finite and possibly large set of p base kernels. It presents a theoretical and empirical analysis of an approach addressing this problem based on ensembles of kernel predictors. This includes novel theoretical guarantees based on the Rademacher complexity of the corresponding hypothesis sets, the introduction and analysis of a learning algorithm based on these hypothesis sets, and a series of experiments using ensembles of ke...


A complexity analysis of statistical learning algorithms

December 19, 2012

84% Match

Mark A. Kon

Machine Learning

We apply information-based complexity analysis to support vector machine (SVM) algorithms, with the goal of a comprehensive continuous algorithmic analysis of such algorithms. This involves complexity measures in which some higher order operations (e.g., certain optimizations) are considered primitive for the purposes of measuring complexity. We consider classes of information operators and algorithms made up of scaled families, and investigate the utility of scaling the comp...


A Note on Improved Loss Bounds for Multiple Kernel Learning

June 30, 2011

84% Match

Zakria Hussain, John Shawe-Taylor, Mario Marchand

Machine Learning

In this paper, we correct an upper bound, presented in \cite{hs-11}, on the generalisation error of classifiers learned through multiple kernel learning. The bound in \cite{hs-11} uses Rademacher complexity and has an \emph{additive} dependence on the logarithm of the number of kernels and the margin achieved by the classifier. However, there are some errors in parts of the proof which are corrected in this paper. Unfortunately, the final result turns out to be a risk bound wh...


Risk bounds for statistical learning

February 23, 2007

84% Match

Pascal Massart, Élodie Nédélec

Statistics Theory

We propose a general theorem providing upper bounds for the risk of an empirical risk minimizer (ERM). We essentially focus on the binary classification framework. We extend Tsybakov's analysis of the risk of an ERM under margin type conditions by using concentration inequalities for conveniently weighted empirical processes. This allows us to deal with ways of measuring the "size" of a class of classifiers other than entropy with bracketing as in Tsybakov's work. In particu...


Convex Optimization for Binary Classifier Aggregation in Multiclass Problems

January 16, 2014

84% Match

Sunho Park, TaeHyun Hwang, Seungjin Choi

Machine Learning

Multiclass problems are often decomposed into multiple binary problems that are solved by individual binary classifiers whose results are integrated into a final answer. Various methods, including all-pairs (APs), one-versus-all (OVA), and error-correcting output code (ECOC), have been studied to decompose multiclass problems into binary problems. However, little study has been made of how to optimally aggregate binary problems to determine a final answer to the multiclass problem....


New Generalization Bounds for Learning Kernels

December 17, 2009

84% Match

Corinna Cortes, Mehryar Mohri, Afshin Rostamizadeh

Artificial Intelligence

This paper presents several novel generalization bounds for the problem of learning kernels based on the analysis of the Rademacher complexity of the corresponding hypothesis sets. Our bound for learning kernels with a convex combination of p base kernels has only a log(p) dependency on the number of kernels, p, which is considerably more favorable than the previous best bound given for the same problem. We also give a novel bound for learning with a linear combination of p b...


Complexities of convex combinations and bounding the generalization error in classification
