Skip to main content

Open Access 2020 | Open Access | Buch

Buchtitelbild

An Invitation to Statistics in Wasserstein Space

insite
SUCHEN

Über dieses Buch

This open access book presents the key aspects of statistics in Wasserstein spaces, i.e. statistics in the space of probability measures when endowed with the geometry of optimal transportation. Further to reviewing state-of-the-art aspects, it also provides an accessible introduction to the fundamentals of this current topic, as well as an overview that will serve as an invitation and catalyst for further research.

Statistics in Wasserstein spaces represents an emerging topic in mathematical statistics, situated at the interface between functional data analysis (where the data are functions, thus lying in infinite dimensional Hilbert space) and non-Euclidean statistics (where the data satisfy nonlinear constraints, thus lying on non-Euclidean manifolds). The Wasserstein space provides the natural mathematical formalism to describe data collections that are best modeled as random measures on Euclidean space (e.g. images and point processes). Such random measures carry the infinite dimensional traits of functional data, but are intrinsically nonlinear due to positivity and integrability restrictions. Indeed, their dominating statistical variation arises through random deformations of an underlying template, a theme that is pursued in depth in this monograph.

Inhaltsverzeichnis

Frontmatter

Open Access

Chapter 1. Optimal Transport
Abstract
In this preliminary chapter, we introduce the problem of optimal transport, which is the main concept behind Wasserstein spaces. General references on this topic are the books by Rachev and Rüschendorf [107], Villani [124, 125], Ambrosio et al. [12], Ambrosio and Gigli [10], and Santambrogio [119]. This chapter includes only few proofs, when they are simple, informative, or are not easily found in one of the cited references.
Victor M. Panaretos, Yoav Zemel

Open Access

Chapter 2. The Wasserstein Space
Abstract
The Kantorovich problem described in the previous chapter gives rise to a metric structure, the Wasserstein distance, in the space of probability measures \(P(\mathcal X)\) on a space \(\mathcal X\). The resulting metric space, a subspace of \(P(\mathcal X)\), is commonly known as the Wasserstein space \(\mathcal W\) (although, as Villani [125, pages 118–119] puts it, this terminology is “very questionable”; see also Bobkov and Ledoux [25, page 4]). In Chap. 4, we shall see that this metric is in a sense canonical when dealing with warpings, that is, deformations of the space \(\mathcal X\) (for example, in Theorem 4.​2.​4). In this chapter, we give the fundamental properties of the Wasserstein space. After some basic definitions, we describe the topological properties of that space in Sect. 2.2. It is then explained in Sect. 2.3 how \(\mathcal W\) can be endowed with a sort of infinite-dimensional Riemannian structure. Measurability issues are dealt with in the somewhat technical Sect. 2.4.
Victor M. Panaretos, Yoav Zemel

Open Access

Chapter 3. Fréchet Means in the Wasserstein Space
Abstract
If H is a Hilbert space (or a closed convex subspace thereof) and x 1, …, x N ∈ H, then the empirical mean \(\overline x_N=N^{-1}\sum x_i\) is the unique element of H that minimises the sum of squared distances from the x i’s.
Victor M. Panaretos, Yoav Zemel

Open Access

Chapter 4. Phase Variation and Fréchet Means
Abstract
Why is it relevant to construct the Fréchet mean of a collection of measures with respect to the Wasserstein metric? A simple answer is that this kind of average will often express a more natural notion of “typical” realisation of a random probability distribution than an arithmetic average.
Victor M. Panaretos, Yoav Zemel

Open Access

Chapter 5. Construction of Fréchet Means and Multicouplings
Abstract
When given measures μ 1, …, μ N are supported on the real line, computing their Fréchet mean \(\bar \mu \) is straightforward (Sect. 3.​1.​4). This is in contrast to the multivariate case, where, apart from the important yet special case of compatible measures, closed-form formulae are not available. This chapter presents an iterative procedure that provably approximates at least a Karcher mean with mild restrictions on the measures μ 1, …, μ N. The algorithm is based on the differentiability properties of the Fréchet functional developed in Sect. 3.​1.​6 and can be interpreted as classical steepest descent in the Wasserstein space \({\mathcal {W}}_2({\mathbb {R}}^d)\). It reduces the problem of finding the Fréchet mean to a succession of pairwise transport problems, involving only the Monge–Kantorovich problem between two measures. In the Gaussian case (or any location-scatter family), the latter can be done explicitly, rendering the algorithm particularly appealing (see Sect. 5.4.1).
Victor M. Panaretos, Yoav Zemel
Backmatter
Metadaten
Titel
An Invitation to Statistics in Wasserstein Space
verfasst von
Prof. Victor M. Panaretos
Yoav Zemel
Copyright-Jahr
2020
Electronic ISBN
978-3-030-38438-8
Print ISBN
978-3-030-38437-1
DOI
https://doi.org/10.1007/978-3-030-38438-8