October 5, 2004
Consider a (possibly infinite) exchangeable sequence X={X_n:1\leqn<N}, where N\in N\cup {\infty}, with values in a Borel space (A,A), and note X_n=(X_1,...,X_n). We say that X is Hoeffding decomposable if, for each n, every square integrable, centered and symmetric statistic based on X_n can be written as an orthogonal sum of n U-statistics with degenerated and symmetric kernels of increasing order. The only two examples of Hoeffding decomposable sequences studied in the literature are i.i.d. random variables and extractions without replacement from a finite population. In the first part of the paper we establish a necessary and sufficient condition for an exchangeable sequence to be Hoeffding decomposable, that is, called weak independence. We show that not every exchangeable sequence is weakly independent, and, therefore, that not every exchangeable sequence is Hoeffding decomposable. In the second part we apply our results to a class of exchangeable and weakly independent random vectors X_n^{(\alpha, c)}=(X_1^{(\alpha, c)},...,X_n^{(\alpha, c)}) whose law is characterized by a positive and finite measure \alpha (\cdot) on A and by a real constant c. For instance, if c=0, X_n^{(\alpha, c)} is a vector of i.i.d. random variables with law \alpha (\cdot)/\alpha (A); if A is finite, \alpha (\cdot) is integer valued and c=-1, X_n^{(\alpha, c)} represents the first n extractions without replacement from a finite population; if c>0, X_n^{(\alpha, c)} consists of the first n instants of a generalized P\'olya urn sequence.
Similar papers 1
May 23, 2012
We study Hoeffding decomposable exchangeable sequences with values in a finite set D. We provide a new combinatorial characterization of Hoeffding decomposability and use this result to show that, if the cardinality of D is strictly greater than 2, then there exists a class of neither P\'{o}lya nor i.i.d. D-valued exchangeable sequences that are Hoeffding decomposable. The construction of such sequences is based on some ideas appearing in Hill, Lane and Sudderth [1987] and an...
April 9, 2024
The aim of this paper is to establish Hoeffding and Bernstein type concentration inequalities for weighted sums of exchangeable random variables. A special case is the i.i.d. setting, where random variables are sampled independently from some distribution (and are therefore exchangeable). In contrast to the existing literature on this problem, our results provide a natural unified view of both the i.i.d. and the exchangeable setting.
March 7, 2008
We point out that a proper use of the Hoeffding--ANOVA decomposition for symmetric statistics of finite urn sequences, previously introduced by the author, yields a decomposition of the space of square-integrable functionals of a Dirichlet--Ferguson process, written $L^2(D)$, into orthogonal subspaces of multiple integrals of increasing order. This gives an isomorphism between $L^2(D)$ and an appropriate Fock space over a class of deterministic functions. By means of a well-k...
January 15, 2024
We consider $U$-statistics on row-column exchangeable matrices. We derive a decomposition for them, based on orthogonal projections on probability spaces generated by sets of Aldous-Hoover-Kallenberg variables. The specificity of these sets is that they are indexed by bipartite graphs, which allows for the use of concepts from graph theory to describe this decomposition. The decomposition is used to investigate the asymptotic behavior of $U$-statistics of row-column exchangea...
January 25, 2015
A length-$n$ random sequence $X_1,\ldots,X_n$ in a space $S$ is finitely exchangeable if its distribution is invariant under all $n!$ permutations of coordinates. Given $N > n$, we study the extendibility problem: when is it the case that there is a length-$N$ exchangeable random sequence $Y_1,\ldots, Y_N$ so that $(Y_1,\ldots,Y_n)$ has the same distribution as $(X_1,\ldots,X_n)$? In this paper, we give a necessary and sufficient condition so that, for given $n$ and $N$, the ...
July 24, 2015
We provide a systematic approach to deal with the following problem. Let $X_1,\ldots,X_n$ be, possibly dependent, $[0,1]$-valued random variables. What is a sharp upper bound on the probability that their sum is significantly larger than their mean? In the case of independent random variables, a fundamental tool for bounding such probabilities is devised by Wassily Hoeffding. In this paper we consider analogues of Hoeffding's result for sums of dependent random variables for ...
November 12, 2014
Let $\{X_n, n \ge 1\}$ be a sequence of stationary associated random variables. We discuss another set of conditions under which a central limit theorem for U-statistics based on $\{X_n, n \ge 1\}$ holds. We look at U-statistics based on differentiable kernels of degree 2 and above. We also discuss some applications.
June 16, 2021
We present a novel analogue for finite exchangeable sequences of the de Finetti, Hewitt and Savage theorem and investigate its implications for multi-marginal optimal transport (MMOT) and Bayesian statistics. If $(Z_1,...,Z_N)$ is a finitely exchangeable sequence of $N$ random variables taking values in some Polish space $X$, we show that the law $\mu_k$ of the first $k$ components has a representation of the form $\mu_k=\int_{{\mathcal P}_{\frac{1}{N}}(X)} F_{N,k}(\lambda)...
February 6, 2021
We study the distribution of a general class of asymptoticallylinear statistics which are symmetric functions of $N$ independent observations. The distribution functions of these statistics are approximated by an Edgeworth expansion with a remainder of order $o(N^{-1})$. The Edgeworth expansion is based on Hoeffding's decomposition which provides a stochastic expansion into a linear part, a quadratic part as well as smaller higher order parts. The validity of this Edgeworth e...
July 13, 2016
We study the convergence in total variation distance for series of the form $$ S_{N}(c,Z)=\sum_{l=1}^{N}\sum_{i_{1}<\cdots<i_{l}}c(i_{1},...,i_{l})Z_{i_{1}}\cdots Z_{i_{l}}, $$ where $Z_{k},k\in {\mathbb{N}}$ are independent centered random variables with ${\mathbb{E}}(Z_{k}^{2})=1$. This enters in the framework of the $U$--statistics theory which plays a major role in modern statistic. In the case when $Z_{k},k\in {\mathbb{N}}$ are standard normal, $S_{N}(c,Z)$ is an element...