Categorification of Persistent Homology

Bubenik, Peter; Scott, Jonathan A.

doi:10.1007/s00454-014-9573-x

Categorification of Persistent Homology

Published: 28 January 2014

Volume 51, pages 600–627, (2014)
Cite this article

Download PDF

Discrete & Computational Geometry Aims and scope Submit manuscript

Categorification of Persistent Homology

Download PDF

Peter Bubenik¹ &
Jonathan A. Scott¹

4156 Accesses
76 Citations
6 Altmetric
Explore all metrics

Abstract

We redevelop persistent homology (topological persistence) from a categorical point of view. The main objects of study are $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams in some target category. A set of such diagrams has an interleaving distance, which we show generalizes the previously studied bottleneck distance. To illustrate the utility of this approach, we generalize previous stability results for persistence, extended persistence, and kernel, image, and cokernel persistence. We give a natural construction of a category of ε-interleavings of $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams in some target category and show that if the target category is abelian, so is this category of interleavings.

Persistence Diagrams as Diagrams: A Categorification of the Stability Theorem

Persistent cup product structures and related invariants

Article Open access 07 October 2023

Facundo Mémoli, Anastasios Stefanou & Ling Zhou

Distributing Persistent Homology via Spectral Sequences

Article Open access 20 September 2023

Álvaro Torras-Casas

1 Introduction

The ideas of topological persistence [13] and persistent homology [21] have had a great impact on computational geometry and the newer field of applied topology. This method applies geometric and algebraic constructions to input from applications, followed by clever modifications of tools from algebraic topology. It has found many uses, and the results can be global qualitative descriptions inaccessible to other methods. Subsequent theoretical work in this subject has given stronger results and adapted the basic constructions so that they might be applied in more diverse situations. For surveys and books on this subject, see [3, 11, 12, 15, 20].

1.1 Motivation

Throughout its history, algebraic topology has frequently undergone a process in which previous results were redeveloped from a more abstract point of view. This has had two main advantages. First, abstraction clarified the key ideas and proofs. Second, and more importantly, the more abstract setting allowed previous results to be vastly generalized and applied in ways never considered in the original. The development and use of category theory has been a critical part of this process.

The main motivation of this paper is to subject the ideas and results of topological persistence to this process.

1.2 Prior Work

In the descriptions below, we will make anachronistic use of this paper’s point of view, in particular, its focus on diagrams (see (1) and Sect. 2.1).

Two foundational papers in this subject are [13] and [21]. In the first, Edelsbrunner, Letscher, and Zomorodian define persistent homology for $(\mathbb {Z}_{+},\leq)$-indexed diagrams of finite-dimensional vector spaces that are obtained from filtered finite simplicial complexes by taking simplicial homology with coefficients in a field. In the second, Zomorodian and Carlsson take a purely algebraic point of view. They define persistent homology for tame $(\mathbb {Z}_{+},\leq)$-indexed diagrams of finite-dimensional vector spaces and prove a bijection between isomorphism classes of such tame diagrams and finite barcodes whose endpoints lie in $\mathbb {Z}_{+} \cup\{\infty\}$. $(\mathbb {Z}_{+},\leq)$-indexed diagrams of finite-dimensional vector spaces are called persistence modules.

These papers are rounded out by [7], where Cohen-Steiner, Edelsbrunner, and Harer prove that persistent homology is useful in applications by showing that it is stable in the following sense. Let $f,g:X \to \mathbb {R}$ be continuous functions on a triangulable space. Define an $\mathbf {(\mathbb {R},\leq)}$-indexed diagram of topological spaces, F, by setting F(a)=f ⁻¹(−∞,a] and letting F(a≤b) be given by inclusion. Define G similarly using g. Let H be the singular homology functor with coefficients in a field. Assume that HF and HG are diagrams of finite-dimensional vector spaces and that they are tame. Then the bottleneck distance between HF and HG is bounded by the supremum norm between f and g.

This stability result is significantly strengthened by Chazal, Cohen-Steiner, Glisse, Guibas, and Oudot in [5]. They drop the assumptions that X be triangulable, that f,g be continuous, and that HF and HG be tame. Their approach is crucial to this paper. They explicitly work with $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams, though they consider them from an algebraic, not categorical, point of view. They define the interleaving distance, d, between such diagrams, and define the bottleneck distance, d _B, between such diagrams using limits of discretizations, and show that d _B≤d.

The basic idea of persistent homology has been extended in numerous ways. Here we focus on two particularly useful extensions, given in [8] and [9]. In the first, Cohen-Steiner, Edelsbrunner, and Harer define extended persistence for finite-dimensional simplicial complexes with a finite filtration and homology with coefficients in $\mathbb {Z}/ 2\mathbb {Z}$. They show that in this case the stability result of [7] applies. In the second, Cohen-Steiner, Edelsbrunner, Harer, and Morozov consider a triangulated space X with subcomplex Y and maps $f,f':X \to \mathbb {R}$ and $g,g':Y \to \mathbb {R}$ such that for all y∈Y, f(y)≤g(y) and f′(y)≤g′(y). They assume that f,f′,g,g′ are continuous and tame. Then there are maps of the corresponding $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams HG→HF and HG′→HF′. Let ε=max{∥f−f′∥_∞,∥g−g′∥_∞}. The authors show that the bottleneck distances between the kernels, images, and cokernels, respectively, of these maps are each bounded above by ε.

An early categorical approach to persistence can be found in [2].

1.3 Our Contributions

We redevelop persistent homology from a categorical point of view. In particular, we consider diagrams indexed by $\mathbf {(\mathbb {R},\leq)}$ to be the main objects of study. An $\mathbf {(\mathbb {R},\leq)}$-indexed diagram consists of a set of objects X(a) for each $a \in \mathbb {R}$ and morphisms

$$ X(a) \to X(b) $$

(1)

for each a≤b, satisfying certain composition and unit axioms (see Sect. 2.1). The objects and morphisms lie in some fixed category, such as topological spaces and continuous maps, or finite-dimensional vector spaces and linear transformations. In Sect. 2.2, we show that the basic constructions of persistent homology are special cases of this construction. We will show that in this setting, functoriality provides concise and powerful results.

In Sect. 3, we define an ε-interleaving for $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams (Definition 3.1) and show that this induces a metric (Theorem 3.3 and Corollary 3.5).

We specialize to $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams of finite-dimensional vector spaces in Sect. 4. These are also called (real) persistence modules. We study barcodes, persistence diagrams, and the bottleneck and interleaving distances. We define finite type diagrams to be direct sums of certain indecomposable diagrams (Definition 4.1). We show that these are exactly the tame diagrams (Theorem 4.6). Furthermore, we show that they satisfy a Krull–(Remak–)Schmidt theorem. That is, the direct sum decomposition is essentially unique (Corollary 4.7). We show that the metric space of finite barcodes together with the bottleneck distance embeds isometrically into the metric space of $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams of finite-dimensional vector spaces with the interleaving distance (Theorem 4.16). This result justifies our assertion that our stability theorems, which use the interleaving distance, are generalizations of previously established stability theorems, which use the bottleneck distance.

In Sect. 5, we give a simple formal argument for a stability theorem for the interleaving distance. By the previous work identifying the interleaving and bottleneck distances, this allows us to both remove assumptions and to significantly generalize, the stability result of [7]. Given any functions $f,g:X \to \mathbb {R}$ on any topological space X and any functor H on topological spaces, we show that the interleaving distance of HF and HG is bounded above by the supremum norm between f and g (Theorem 5.1).

We generalize the extended persistence construction of [8] in Sect. 6. For any (not necessarily continuous) map $f:X \to(-\infty,M] \subset \mathbb {R}$, we define an $\mathbf {(\mathbb {R},\leq)}$-indexed diagram of pairs of topological spaces. We prove a stability theorem for extended persistence. Given f,g:X→(−∞,M] and corresponding diagrams F and G of pairs of spaces, and any functor H on pairs of spaces, the interleaving distance between HF and HG is bounded above by the supremum norm between f and g (Theorem 6.1).

In Sect. 7, we define a category of interleavings of $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams in a given base category (Definition 7.1). We show that in the case that the base category is an abelian category, then so is this category of interleavings (Theorem 7.10). As a result, this category has direct sums, kernels, images, and cokernels. As an application, we generalize the stability theorem of [9], dropping the assumptions that X and Y are triangulated, that f,f′,g,g′ are continuous and tame, replacing the subcomplex condition with a continuous map Y→X and replacing singular homology with coefficients in $\mathbb {Z}/2\mathbb {Z}$, with any functor from topological spaces to an abelian category (Theorem 7.13). We also give a version of this theorem for extended persistence (Theorem 7.14).

1.4 Comparison with Other Recent Work

The material in Sect. 4 has been studied in greater detail in the algebraic setting by Lesnick [17] and by Chazal, de Silva, Glisse, and Oudot [6]. In particular, Lesnick proves a more general Isometry Theorem [17, Theorem 2.4.2], removing the condition that the persistence modules have finite type from our Theorem 4.16. This is further generalized to q-tame persistence modules in [6, Theorem 4.11]. We also remark that one of the directions in our isometry theorem is due to [5]. Crawley-Boevey [10] has shown that any $(\mathbb {R},\leq)$-indexed diagram of finite-dimensional vector spaces is a direct sum of interval modules. In light of this result, it may be possible to generalize some of our work in Sect. 4.

Our Stability Theorem (Theorem 5.1) is quite general and once the categorical machinery has been set up, has a very simple proof. However it applies to persistence modules, not for their corresponding persistence diagrams. In the language of [1], it is a soft stability theorem. Hard stability theorems [5–7] giving stability for persistence diagrams require more detailed analysis. For example, an Isometry Theorem can be used to show that soft stability implies hard stability. On the other hand, our Stability Theorem is more general in that it applies to functors to arbitrary categories. For a simple example, consider homology with integer coefficients. It also clarifies what part of stability is purely formal and what part requires detailed analysis. This viewpoint is expanded upon in [1].

2 Background

In Sect. 2.1, we give the basic definitions of category theory that we will use throughout the paper. In Sect. 2.2, we show how the standard constructions of persistent homology fit within our categorical approach. The last two sections give more specialized background. In Sect. 2.3, we define abelian categories, which we use in Sect. 7. In Sect. 2.4, we give some algebraic definitions used in the proof of Theorem 4.6.

2.1 Categorical Terminology

A category, C, consists of a class of objects, C ₀, and for each pair of objects X,Y∈C ₀, a set of morphisms, C(X,Y). We often write f:X→Y if f∈C(X,Y). For every triple X,Y,Z∈C ₀, there is a set mapping,

$$\mathbf {C}(Y,Z) \times \mathbf {C}(X,Y) \rightarrow \mathbf {C}(X,Z), \quad(g,f) \mapsto gf, $$

called composition. Composition must be associative, in the sense that (hg)f=h(gf). Finally, for all X∈C, there is an identity morphism, Id_X:X→X, that satisfies Id_X f=f and gId_X=g for all f:W→X and all g:X→Y. The identity morphism is unique. We will regularly abuse notation and write X∈C to mean X∈C ₀.

A category C is called small if C ₀ is a set rather than a proper class.

Example 2.1

Let Top be the category whose objects are all topological spaces and whose morphisms are all continuous maps. Here, composition is the composition of mappings, and the identity morphisms are what one would expect.

A related category is Pair, whose objects are pairs (X,A), where X is a topological space, and A is a subspace of X. A morphism from (X,A) to (Y,B) is a continuous map f:X→Y such that f(A)⊂B. We express this condition by saying that the diagram

commutes, where j _A and j _B are the canonical inclusions, and f|_A is f restricted to A.

Example 2.2

Let Vec be the category of finite-dimensional vector spaces over a fixed ground field $\mathbb {F}$, along with the linear transformations between them. Again, composition is that of mappings, and the identities are simply the identity mappings.

A graded vector space is a collection $V_{*} = \{ V_{n} \}_{n \in \mathbb {Z}}$ with each V _n∈Vec. A morphism, f _∗:V _∗→W _∗, of graded vector spaces is a sequence, f _∗={f _n:V _n→W _n}. Denote by grVec the category of graded vector spaces and their morphisms.

A reflexive, antisymmetric, and transitive relation ≤ on a set P is called a partial order. A set P equipped with a partial order is called a poset. We identify each poset P with the small category P that has P ₀=P, and P(x,y) has precisely one element if x≤y and is otherwise empty. Conversely, let P be a small category in which each set of morphisms contains at most one element, and if P(x,y) and P(y,x) are both nonempty, then x=y. Then P ₀ is a poset, with partial ordering defined by x≤y if and only if P(x,y)≠∅.

Example 2.3

The set of real numbers, $\mathbb {R}$, with its usual ordering is a poset. The set of integers, $\mathbb {Z}$, of nonnegative integers $\mathbb {Z}_{+}$, and $\mathbf {[n]} = \{ 0, \ldots, n \}$, are subposets. For a partial order that is not a total order, consider the set $\mathbb {R}^{n}$ with n>1 and the ordering (x ₁,…,x _n)≤(y ₁,…,y _n) if and only if x _i≤y _i for all i=1,…,n.

Two objects X,Y∈C ₀ are said to be isomorphic if there exist morphisms f:X→Y and g:Y→X such that gf=Id_X and fg=Id_Y. In this case, f and g are called isomorphisms. Clearly, isomorphism is an equivalence relation. In Top, isomorphism becomes homeomorphism.

The notion of functor expresses relationships between categories. Let A and C be categories. A functor, F:A→C, consists of a mapping F:A ₀→C ₀, and for each pair X,Y∈A ₀, a mapping F:A(X,Y)→C(F(X),F(Y)). These mappings must be compatible with the composition and identity structure of the categories, in the sense that if f:X→Y and g:Y→Z, then F(gf)=F(g)F(f), and if X∈A ₀, then F(Id_X)=Id_F(X).

Example 2.4

Denote by H _∗(−) singular homology with coefficients in some fixed field, $\mathbb {F}$. Then H _∗(X) is a graded $\mathbb {F}$-vector space for all X∈Top ₀. Furthermore, if f:X→Y is continuous, then we get the induced homomorphism, H _∗(f):H _∗(X)→H _∗(Y). Since H _∗(gf)=H _∗(g)H _∗(f), singular homology defines a functor H _∗:Top→grVec. If we consider only homology in degree k, then we get a functor H _k:Top→Vec.

Let F,G:A→C be functors. A natural transformation η:F⇒G consists of, for all A∈A ₀, a morphism η _A:F(A)→G(A) in C such that whenever φ:A→A′ is a morphism, the diagram

(2)

commutes. If for all A∈A ₀, η _A is an isomorphism, then η is called a natural isomorphism, and we write F≅G.

Example 2.5

Consider the poset $(\mathbb {R},\leq)$, and let ε≥0. Define $T_{\varepsilon }:(\mathbb {R}, \leq) \rightarrow(\mathbb {R},\leq)$ by T _ε(x)=x+ε. If x≤y, then x+ε≤y+ε, so T _ε defines a functor to $(\mathbb {R}, \leq)$ to itself. We call T _ε translation by ε. Since ε≥0 and x≤x+ε for all $x \in \mathbb {R}$, we get a natural transformation η:I⇒T _ε, where $I : \mathbb {R}\rightarrow \mathbb {R}$ is the identity functor.

The collection of all small categories, and the functors between them, itself forms a category, denoted by Cat.

Let C and D be categories with C small. A functor, F:C→D, is called a diagram in D indexed by C. The collection of all such functors, and natural transformations between them, forms a category, D ^C.

Example 2.6

Let C be the discrete category whose objects are the integers; the only morphisms are the identity morphisms. Then Vec ^C=grVec.

Example 2.7

A diagram F in a category D indexed by $(\mathbb {Z}_{+}, \leq)$ is a sequence of morphisms in D:

$$F(0) \rightarrow F(1) \rightarrow F(2) \rightarrow\cdots. $$

If D=Top, then each F(n) is a topological space, and the morphisms are continuous maps. If D=Vec, then each F(n) is a finite-dimensional vector space, and the morphisms are linear maps.

Indexed by $(\mathbb {Z},\leq)$, the diagram extends in both directions:

$$\cdots\rightarrow F(-2) \rightarrow F(-1) \rightarrow F(0) \rightarrow F(1) \rightarrow F(2) \rightarrow\cdots. $$

If the indexing category is $(\mathbb {R},\leq)$, then we have objects F(a) for all $a \in \mathbb {R}$, and for each a≤b, a morphism F(a)→F(b).

Given two natural transformations φ:F⇒G and ψ:G⇒H, their (vertical) composition ψ∘φ is the natural transformation given by the composition of morphisms $F(A) \xrightarrow {\varphi_{A}} G(A) \xrightarrow {\psi_{A}} H(A)$ and the composition of the corresponding commutative squares (2).

For i=1,2, let F _i,G _i:A _i→A _i+1 be functors, and let φ _i:F _i⇒G _i be a natural transformation. The (horizontal) composition of φ ₁ and φ ₂ is the natural transformation, φ ₂ φ ₁:F ₂ F ₁⇒G ₂ G ₁, defined on morphisms by (φ ₂ φ ₁)(f)=φ ₂(φ ₁(f)). For every functor H, there is the identity natural isomorphism Id_H:H⇒H. We abuse notation and refer to the horizontal composition of a natural transformation φ with Id_H as the composition of φ with H.

2.2 Categorical Persistent Homology

In this section, we consider two prototypical examples in which persistent homology is applied and show how they fit into our categorical framework. We also show how diagrams indexed by $\mathbf {[n]}$, $\mathbf {(\mathbb {Z}_{+},\leq)}$, and $\mathbf {(\mathbb {Z},\leq)}$ are special cases of diagrams indexed by $\mathbf {(\mathbb {R},\leq)}$. Finally, we define persistent homology.

2.2.1 Filtered Simplicial Complexes

First, let K be a finite simplicial complex with filtration

$$ \emptyset= K_0 \subseteq K_1 \subseteq\cdots\subseteq K_n = K. $$

Then this gives an $\mathbf {[n]}$-indexed diagram of topological spaces, i.e., $K \in \mathbf {Top}^{\mathbf {[n]}}$, with K(i)=K _i and K(i≤j) given by inclusion.

Let H _k be the degree k simplicial homology functor with coefficients in a field $\mathbb {F}$. Then H _k K is an $\mathbf {[n]}$-indexed diagram of finite-dimensional vector spaces. That is, $H_{k} K(i) = H_{k}(K_{i},\mathbb {F})$ and H _k K(i≤j) is the map induced on homology by the inclusion K _i↪K _j. So $H_{k} K \in \mathbf {Vec}^{\mathbf {[n]}}$.

We can sum homology in all degrees to get $HF \in \mathbf {Vec}^{\mathbf {[n]}}$, given by $HF(i) = \bigoplus_{k} H_{k}(K_{i},\mathbb {F})$.

2.2.2 Sublevel Sets

Second, let X be a topological space, and let $f: X \to \mathbb {R}$ be a not necessarily continuous real-valued function on X. Let $a \in \mathbb {R}$. We consider the sublevel set (or lower excursion set, also called a half space)

$$ f^{-1}\bigl((-\infty,a]\bigr) = \bigl\{ x \in X \ | \ f(x) \leq a \bigr\} . $$

For simplicity, we will usually write f ⁻¹(−∞,a]. We consider it as a topological space using the subspace topology. Notice that if a≤b, then f ⁻¹(−∞,a]⊆f ⁻¹(−∞,b], and this inclusion is a continuous map.

This data can be assembled into an $\mathbf {(\mathbb {R},\leq)}$-indexed diagram of topological spaces, $F \in { \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$. For $a \in \mathbb {R}$, we define F(a)=f ⁻¹(−∞,a]. For a≤b, we define F(a≤b) to be the inclusion f ⁻¹(−∞,a]↪f ⁻¹(−∞,b]. It is easy to check that this defines a functor $F: \mathbf {(\mathbb {R},\leq)} \to \mathbf {Top}$.

Let H _k be the kth singular homology functor with coefficients in some field $\mathbb {F}$. Then H _k F is an $\mathbf {(\mathbb {R},\leq)}$-indexed diagram of (not necessarily finite-dimensional) vector spaces. That is, $H_{k}F(a) = H_{k}(f^{-1}(-\infty,a], \mathbb {F})$, and for a≤b, H _k F(a≤b) is the map induced on homology by the inclusion f ⁻¹(−∞,a]↪f ⁻¹(−∞,b]. If f has the property that for all $a \in \mathbb {R}$, $H_{k}(f^{-1}(-\infty ,a], \mathbb {F})$ is a finite-dimensional vector space, then $H_{k}F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$.

If f has the property that for all $a \in \mathbb {R}$, $H_{*}(f^{-1}(\infty ,a],\mathbb {F})$ is finite-dimensional, then $HF \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ is given by $HF(a) = \bigoplus_{k} H_{k}(f^{-1}(-\infty,a],\mathbb {F})$.

2.2.3 Diagrams by $\mathbf {[n]}$, $\mathbf {(\mathbb {Z}_{+},\leq)}$, and $\mathbf {(\mathbb {Z},\leq)}$

In this paper, we will only consider the indexing category $\mathbf {(\mathbb {R},\leq)}$. However, this case also includes the cases $\mathbf {[n]}$, $\mathbf {(\mathbb {Z}_{+},\leq),}$ and $\mathbf {(\mathbb {Z},\leq)}$, by the following observation. Consider $F \in \mathbf {Top}^{\mathbf {[n]}}$. Then we can extend F to an $\mathbf {(\mathbb {R},\leq)}$-indexed diagram as follows. The inclusion functor $\mathbf {i: [n] \to \mathbf {(\mathbb {R},\leq)}}$ given by i(j)=j has a retraction functor $\mathbf {r: (\mathbb {R},\leq) \to[n]}$ given by

$$ \mathbf {r}(a) = \begin{cases} 0 & \mbox{if}\ a \leq0, \\ \lfloor a \rfloor& \mbox{if}\ 0 < a < n,\\ n & \mbox{if}\ a \geq n. \end{cases} $$

Thus, the composite functor F r is an element of ${ \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$, and F ri=F. There are similarly defined retraction functors to $\mathbf {(\mathbb {Z}_{+},\leq)}$ and to $\mathbf {(\mathbb {Z},\leq)}$.

2.2.4 Persistent Homology

Given a diagram $F \in { \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$, we define the p-persistent kth homology group of F(a) to be the image of the map H _k F(a≤a+p).

2.2.5 Persistence Modules

Diagrams in $\mathbf {Vec}^{\mathbf {[n]}}$, $\mathbf {Vec}^{(\mathbb{Z_{+}},\leq)}$, and ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ are often called persistence modules.

2.3 Abelian Categories

In this section, we recall standard definitions from category theory that we will use in Sect. 7. Details can be found in, for example, [18]. Throughout this section, C denotes a category.

2.3.1 Initial, Terminal, and Final Objects

We say that an object ∅ of C is initial if, for every object X in C, there is a unique morphism ∅→X. An object ∗ is terminal if, for every object X, there is a unique morphism X→∗. It follows from these definitions that initial and terminal objects, if they exist, are unique up to canonical isomorphism. If an object is both initial and terminal, we say that it is zero and denote it by 0. In the presence of a zero object, for every pair of objects X,Y∈C, we can define the zero morphism 0:X→Y to be the composite of the unique morphisms, X→0→Y. It follows by uniqueness that if f is any morphism, then f0=0f=0.

2.3.2 Monomorphisms, Epimorphisms, Kernels, and Cokernels

Let f:X→Y be a morphism. We say that f is a monomorphism if, whenever g,h:W→X are morphisms such that fg=fh, we have that g=h. Dually, f is an epimorphism if, whenever k,ℓ:Y→Z are morphisms such that kf=ℓf, then k=ℓ. An isomorphism class of monomorphisms to Y is called a subobject of Y. Dually, isomorphism classes of epimorphisms are called quotient objects.

Suppose that C has a zero object, 0. Let f:X→Y be a morphism in C. The kernel of f is the equalizer of f and 0:X→Y. That is, the kernel is a morphism, j:kerf→X, such that fj=0 and that is “universal” in the sense that whenever g:W→X is a morphism satisfying fg=0, then there is a unique morphism $\tilde{g} : W \rightarrow\ker f$ such that $j \tilde{g} = g$. Since j is an equalizer, it follows that j is a monomorphism. So kerf represents a subobject of X. Thus, the kernel is the appropriate categorical notion for the part of X that f sends to 0. We use the word “kernel” to mean both the object kerf and the universal morphism, kerf→X, according to the context. We remark that it follows from the definition that all such universal objects are unique up to unique isomorphism. That is, if g:W→X and g′:W′→X are both kernels of f:X→Y, then there is a unique isomorphism $\tilde{g}:W \to W'$ such that $g'\tilde{g} = g$.

Dually, the cokernel of f:X→Y is the coequalizer of f and 0. That is, the cokernel is a universal morphism $q : Y \rightarrow \operatorname {coker}f$, such that whenever h:Y→Z satisfies hf=0, there exists a unique morphism $\tilde{h} : \operatorname {coker}f \rightarrow Z$ such that $\tilde{h}q = h$. Again, we sometimes abuse notation and use “cokernel” to refer to the object, $\operatorname {coker}f$. Since q is a coequalizer, it is an epimorphism, and $\operatorname {coker}f$ represents a quotient object of Y. Again, cokernels, if they exist, are unique up to canonical isomorphism.

2.3.3 Products, Coproducts, Pull-Backs, and Push-Outs

Let X,Y∈C. The product of X and Y, if it exists in C, is an object denoted by X×Y, along with morphisms p _X:X×Y→X and p _Y:X×Y→Y satisfying the following universal property. For every object W together with a pair of morphisms f _X:W→X and f _Y:W→Y, there is a unique morphism f:W→X×Y such that f _X=p _X f and f _Y=p _Y f. The product, if it exists, is unique up to canonical isomorphism.

Dually, the coproduct of X and Y, if it exists in C, is an object X⊕Y, along with morphisms j _X:X→X⊕Y and j _Y:Y→X⊕Y satisfying the following universal property. For every object U together with a pair of morphisms g _X:X→U and g _Y:Y→U, there is a unique morphism g:X⊕Y→U such that g _X=gj _X and g _Y=gj _Y. The coproduct, if it exists, is unique up to canonical isomorphism.

Consider the diagram $X \xrightarrow{f} Z \xleftarrow{g} Y$. The pull-back of f and g consists of an object P and morphisms $X \xleftarrow{p_{X}} P \xrightarrow{p_{Y}}Y$ satisfying fp _X=gp _Y and the following universal property. For each diagram

where the outer paths commute, there is a unique morphism W→P that makes the entire diagram commute. The pull-back is unique up to canonical isomorphism and is denoted by P=X×_Z Y when there can be no ambiguity concerning f and g.

Dually, the push-out of the diagram $Y \xleftarrow{f} X \xrightarrow{g} Z$ consists of an object Q along with universal morphisms $Y \xrightarrow{j_{Y}} Q \xleftarrow{j_{Z}} Z$ satisfying j _Y f=j _Z g and the following universal property. Whenever the outer paths in the diagram

commute, there is a unique morphism k:Q→U making the entire diagram commute. The push-out is unique up to canonical isomorphism, and is denoted by Q=Y⨁_X Z.

2.3.4 Abelian Categories

An abelian category is a category that contains a zero object and all products and coproducts, in which every morphism has a kernel and cokernel, every monomorphism is a kernel, and every epimorphism is a cokernel. By Freyd [14], every abelian category, A, is preadditive, that is, it is naturally enriched in abelian groups. This means that for all pairs of objects X and Y, the set of morphisms A(X,Y) is an abelian group, and composition is bilinear. Furthermore, binary products and coproducts coincide, in the sense that the natural morphism X⊕Y→X×Y is an isomorphism.

We say that an object X of an abelian category is indecomposable if whenever X≅U⊕V, either U≅0 or V≅0.

Example 2.8

Let Vec be the category of finite-dimensional vector spaces over some fixed field, $\mathbb {F}$. The morphisms are linear transformations. The zero object is (an element of the isomorphism class of) the trivial vector space, 0={0}. The product of V and W is the Cartesian (direct) product, V×W. The coproduct is the direct sum, V⊕W, which is canonically isomorphic to the direct product.

If f:V→W is linear, we set kerf={v∈V∣f(v)=0}, f(V)={f(v)∣v∈V}, and $\operatorname {coker}f = W / f(V)$. It is a straightforward exercise to show that monomorphisms are simply injective linear transformations and that if f is injective, then V≅f(V), and so V is (isomorphic to) the kernel of the quotient map, $W \rightarrow \operatorname {coker}f$. Similarly, epimorphisms are surjective linear transformations, and by the First Homomorphism Theorem, if f:V→W is surjective, then W is the cokernel of kerf→V. Thus, Vec is an abelian category.

2.4 Algebra

We will need the following definitions in Lemma 4.5, which we use in the proof of Theorem 4.6.

A (nonnegatively) graded ring is a ring, R, along with a direct-sum decomposition, $R = \bigoplus_{n=0}^{\infty} R_{n}$, such that 1∈R ₀, and if a∈R _m and b∈R _n, then ab∈R _m+n. Our primary example will be the polynomial ring $\mathbb {F}[t]$ for a field $\mathbb {F}$ that is graded by degree.

A graded $\mathbb {F}[t]$ -module is an $\mathbb {F}[t]$-module, M, with a decomposition $M = \bigoplus_{n=0}^{\infty} M_{n}$ that satisfies t ^m x∈M _m+n whenever x∈M _n. We say that M has finite type if each M _n is finite dimensional over $\mathbb {F}$.

We will also make use of the following structure theorem for finitely generated modules over a principal ideal domain.

Theorem 2.9

[16, Theorem 6.12(ii), p. 225]

Let A be a finitely generated module over a principal ideal domain R. Then A is the direct sum of a free submodule E of finite rank and a finite number of cyclic torsion modules. The cyclic torsion summands (if any) are of orders $p_{1}^{s_{1}}, \ldots, p_{k}^{s_{k}}$, where p ₁,…,p _k are (not necessarily distinct) positive integers. The rank of E and the list of ideals $(p_{1}^{s_{1}}), \ldots, (p_{k}^{s_{k}})$ are uniquely determined by A (except for the order of the p _i).

3 Interleavings of Diagrams

In this section, we define ε-interleavings for $\mathbf {(\mathbb {R},\leq )}$-indexed diagrams and show that they induce a metric on a set of $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams. Our definition is a categorical version of the definition in [5].

We consider the category $\mathbf {(\mathbb {R},\leq)}$ whose objects are the real numbers and the set of morphisms from a to b consists of a single morphism if a≤b and is otherwise empty. For b≥0, define $T_{b}: \mathbf {(\mathbb {R},\leq)} \to \mathbf {(\mathbb {R},\leq)}$ to be the functor given by T _b(a)=a+b, and define $\eta_{b}:\mathop {\mathrm {Id}}_{\mathbf {(\mathbb {R},\leq)}} \Rightarrow T_{b}$ to be the natural transformation given by η _b(a):a≤a+b. Note that T _b T _c=T _b+c and that η _b η _c=η _b+c.

Let D be any category, and let ε≥0. Let $F, G \in { \mathbf {D}^{\mathbf {(\mathbb {R},\leq )}}}$.

Definition 3.1

An ε-interleaving of F and G consists of natural transformations φ:F⇒GT _ε and ψ:G⇒FT _ε, i.e.,

such that

$$ (\psi T_{\varepsilon }) \varphi= F \eta_{2\varepsilon } \quad \mbox{and}\quad (\varphi T_{\varepsilon }) \psi= G \eta_{2\varepsilon }. $$

(3)

If (F,G,φ,ψ) is an ε-interleaving, then we say that F and G are ε-interleaved.

The existence of the natural transformations φ and ψ implies that we have the following commutative diagrams for all a≤b:

Identities (3) imply that the following diagrams commute for all a:

Definition 3.2

Say that d(F,G)≤ε if F and G are ε-interleaved. Explicitly,

$$ d(F,G) = \inf\{ \varepsilon \geq0 \ | \ F \mbox{ and } G \mbox{ are } \varepsilon \mbox{-interleaved} \}, $$

where we set d(F,G)=∞ if F and G are not ε-interleaved for any ε≥0.

We will show that this function d is a generalized metric. It fails to be a metric because it can take the value ∞ and d(F,G)=0 does not imply that F≅G. Notice that if F and G are 0-interleaved, then F≅G. However, d(F,G)=0 only implies that F and G are ε-interleaved for all ε>0. This does not imply that F≅G. For an example, consider $F,G \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ where F=0 and G(a) is the ground field for a=0 but is otherwise 0. However, it does satisfy the other conditions of a metric, so it is an extended pseudometric.

Theorem 3.3

The function d defined above is an extended pseudometric on any subset of the class of $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams in D.

To prove the theorem, we will need the following lemma, which shows that the set of ε for which two diagrams are ε-interleaved form a ray.

Lemma 3.4

If the $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams F and G are ε-interleaved, then they are also ε′-interleaved for any ε′≥ε.

Proof

Let φ:F⇒GT _ε and ψ:G⇒FT _ε be such that (ψT _ε)φ=Fη _2ε and (φT _ε)ψ=Gη _2ε.

Let ε′≥ε and set $\bar{\varepsilon } = \varepsilon ' - \varepsilon $. Recall that we have the natural transformation, $\eta_{\bar{\varepsilon }}: \mathop {\mathrm {Id}}_{\mathbf {(\mathbb {R},\leq)}} \Rightarrow T_{\bar{\varepsilon }}$, and thus, $\eta_{\bar{\varepsilon }} T_{\varepsilon }: T_{\varepsilon } \Rightarrow T_{\bar{\varepsilon }} T_{\varepsilon } = T_{\varepsilon '}$. Therefore, $G\eta_{\bar{\varepsilon }} T_{\varepsilon }: GT_{\varepsilon } \Rightarrow GT_{\varepsilon '}$. Define $\hat{\varphi} = (G \eta_{\bar{\varepsilon }} T_{\varepsilon }) \varphi$. For example,

$$ \hat{\varphi}(a): F(a) \xrightarrow {\varphi(a)} G(a+\varepsilon ) \xrightarrow {G\eta_{\bar{\varepsilon }} T_{\varepsilon } (a)} G\bigl(a + \varepsilon '\bigr). $$

Similarly, define $\hat{\psi} = (F \eta_{\bar{\varepsilon }} T_{\varepsilon }) \psi$.

We see that $(\hat{\psi} T_{\varepsilon '}) \hat{\varphi} = F \eta_{2\varepsilon '}$ from the following commutative diagram:

Similarly, one can check that $(\hat{\varphi} T_{\varepsilon '}) \hat{\psi} = G \eta_{2\varepsilon '}$. □

Proof of Theorem 3.3

The identity natural transformation shows that d(F,F)=0 for any diagram F. By the symmetry of the definition of ε-interleaving, we see that d(F,G)=d(G,F) for any diagrams F and G. It remains to show the triangle inequality.

Consider diagrams F, G, and H. Let a=d(F,G) and b=d(G,H). Let ε>0. Then by Lemma 3.4 and the definition of infimum, F and G are (a+ε)-interleaved, and G and H are (b+ε)-interleaved. Let φ′:F⇒GT _a+ε and ψ′:G⇒FT _a+ε and φ″:G⇒HT _b+ε and ψ″:H⇒GT _b+ε be the corresponding natural transformations. We will show that composing these natural transformations gives the desired natural transformations for an interleaving of F and H.

Let φ=(φ″T _a+ε)φ′:F⇒HT _b+ε T _a+ε=HT _a+b+2ε and ψ=(ψ′T _b+ε)ψ″:H⇒FT _a+ε T _b+ε=FT _a+b+2ε. The first composition comes from the following diagram. The second is similar.

We claim that (ψT _a+b+2ε)φ=Fη _2(a+b+2ε) and (φT _a+b+2ε)ψ=Hη _2(a+b+2ε). The first identity comes from the following diagram. The second is similar.

Thus, F and H are (a+b+2ε)-interleaved for all ε>0. Therefore d(F,H)≤a+b. □

Let us declare F equivalent to G if d(F,G)=0; this is an equivalence relation, and we obtain the following corollary.

Corollary 3.5

If we identify diagrams whose interleaving distance is 0, then d is an extended metric on this set of equivalence classes.

One of the mostly useful aspects of the categorical view of interleavings is that if we apply a functor to ε-interleaved diagrams, then the resulting diagrams are also ε-interleaved. That is,

Proposition 3.6

Let $F,G: \mathbf {(\mathbb {R},\leq)} \to \mathbf {D}$ and H:D→E. If F and G are ε-interleaved, then so are HF and HG. Thus,

$$ d(HF, HG ) \leq d(F,G). $$

Proof

Assume that F and G are ε-interleaved. Let φ:F⇒GT _ε, ψ:G⇒FT _ε be the corresponding natural transformations. Then by functoriality, Hφ:HF⇒HGT _ε and Hψ:HG⇒HFT _ε, and (HψT _ε)(Hφ)=(HF)η _2ε and (HφT _ε)(Hψ)=(HG)η _2ε, as pictured in the following diagram:

Therefore, HF and HG are ε-interleaved. □

4 Diagrams of Vector Spaces

From our categorical point of view, persistent homology calculations are done on diagrams in the category Vec of finite-dimensional vector spaces over a fixed ground field $\mathbb {F}$. In this section, we study $(\mathbb {R},\leq)$-indexed diagrams in Vec and define some of the usual characters in topological persistence in this setting: barcodes, persistence diagrams, and the bottleneck distance. Our main result is an isometric embedding of the set of finite barcodes with the bottleneck distance into the set of objects of ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ with the interleaving distance.

The category Vec is one of the motivating examples of an abelian category. If the target category in a diagram category is an abelian category, then the diagram category inherits this structure. The necessary constructions are done objectwise. In particular, ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ is an abelian category.

4.1 Finite-Type Diagrams

In this section we define finite-type and tame diagrams in ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ and show that the two conditions are equivalent. As a corollary, we obtain a Krull–Schmidt theorem.

Definition 4.1

Given an interval $I \subseteq \mathbb {R}$, define the diagram $\chi_{I} \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ by

$$\chi_I(a) = \begin{cases} \mathbb {F}& \mbox{if}\ a \in I,\\ 0& \mbox{otherwise}, \end{cases} \quad\quad \chi_I(a\leq b) = \begin{cases} \mathop {\mathrm {Id}}_{\mathbb {F}}& \mbox{if}\ a,b \in I,\\ 0& \mbox{otherwise}. \end{cases} $$

We say that a diagram $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ has finite type if $F \cong \bigoplus_{k=1}^{N} \chi_{I_{k}}$.

We remark that $\chi_{\mathbb {R}}$ and χ _∅ are the constant functors $\mathbb {F}$ and 0, respectively.

Lemma 4.2

For an interval $I \subseteq \mathbb {R}$, the diagram χ _I is indecomposable.

Proof

Assume that χ _I≅P⊕Q. If there is some c∉I, then P(c)⊕Q(c)≅χ _I(c)=0, and therefore P(c)=Q(c)=0.

Let a∈I. Then $P(a) \oplus Q(a) \cong \chi_{I}(a) \cong \mathbb {F}$. Without loss of generality, assume that $P(a) \cong \mathbb {F}$ and Q(a)=0. Let a≤b∈I. Since Q(a)=0, Q(a≤b)=0. Thus, it follows from $P(a\leq b) \oplus Q(a\leq b) = (P\oplus Q)(a\leq b) \cong \chi_{I}(a\leq b) = \mathop {\mathrm {Id}}_{\mathbb {F}}$ that $P(a\leq b) \cong \mathop {\mathrm {Id}}_{\mathbb {F}}$. Hence, from $P(b) \oplus Q(b) \cong \chi_{I}(b) \cong \mathbb {F}$ we get that $P(b) \cong \mathbb {F}$ and Q(b)=0. Similarly, if d≤a∈I, we get that Q(d≤a)=0, $P(d\leq a) \cong \mathop {\mathrm {Id}}_{\mathbb {F}}$, $P(d) \cong \mathbb {F}$ and Q(d)=0.

We have shown that P≅χ _I and Q=0. Therefore, χ _I is indecomposable. □

The following definitions are variations of those in [7].

Definition 4.3

Let $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$. Let $I \subseteq \mathbb {R}$ be an interval. Say that F is constant on I if for all a≤b∈I, F(a≤b) is an isomorphism. We call $a \in \mathbb {R}$ a regular value of F if there is some open interval I∋a such that F is constant on I. Otherwise, we call a a critical value of F.^{Footnote 1} We call F tame if it has a finite number of critical values.

Lemma 4.4

(Critical Value Lemma)

If an interval I does not contain any critical values of F, then F is constant on I.

Proof

Let a≤b∈I. By assumption, for all c∈[a,b], there exists an interval I _c∋c such that F is constant on I _c. Since [a,b] is compact, the cover {I _c | c∈[a,b]} has a finite subcover $\{I_{c_{1}}, \ldots, I_{c_{n}}\}$. Choose a sequence a=d ₀≤d ₁≤⋯≤d _m+1=b such that for all 0≤k≤m, $d_{k},d_{k+1} \in I_{c_{j}}$ for some 1≤j≤n. Then F(d _k≤d _k+1) is an isomorphism for 0≤k≤m, and thus F(a≤b) is an isomorphism. □

We will need the following lemma in the proof of Theorem 4.6. We refer the reader to Sect. 2.4 for the definition of a finite-type graded $\mathbb {F}[t]$-module.

Lemma 4.5

The category ${ \mathbf {Vec}^{\mathbf {(\mathbb {Z},\leq )}}}$ is isomorphic to the category of finite-type graded $\mathbb {F}[t]$-modules.

Proof

To each diagram $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {Z},\leq )}}}$, we can assign the finite-type graded $\mathbb {F}[t]$-module M, where for $k \in \mathbb {Z}$, M _k=F(k), and for a∈M _k, t⋅a=F(k≤k+1)(a).

To each finite-type graded $\mathbb {F}[t]$-module M, we can assign the diagram $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {Z},\leq )}}}$ given by F(k)=M _k and whose morphisms are generated by F(k≤k+1)(a)=t⋅a for a∈F(k).

Both composites of these two functors are equal to the identity functor. □

Theorem 4.6

A diagram in ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ is tame if and only if it has finite type.

Proof

To prove the “if” statement, we consider an interval $I \subseteq \mathbb {R}$. By definition, $a \in \mathbb {R}$ is a critical value of χ _I if and only if a is an endpoint of I. Let $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ such that $F \cong \bigoplus_{k=1}^{N} \chi_{I_{k}}$. Then $a \in \mathbb {R}$ is a critical value of F if and only if it is an endpoint of one of the intervals I _k, and so F is tame.

The remainder of the proof is devoted to establishing the “only if” statement. Assume that $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ has critical values a ₁<a ₂<⋯<a _n. Choose b ₀,…,b _n such that b ₀<a ₁ for k∈{1,…,n−1}, a _k<b _k<a _k+1, and a _n<b _n. For convenience, set a ₀=−∞, a _n+1=∞ and F(a ₀)=F(b ₀), F(a _n+1)=F(b _n). We have the ordered sequence

$$ -\infty= a_0 < b_0 < a_1 < b_1 < a_2 < \cdots< b_{n-1} < a_n < b_n < a_{n+1} = \infty. $$

We now identify the finite-valued part of this sequence with the integers from 0 to 2n.

More precisely, we define a functor $i:[2n] \to \mathbf {(\mathbb {R},\leq)}$ given by

$$ k \mapsto \begin{cases} b_{\frac{k}{2}} & \mbox{if}\ k\ \mbox{is even,} \\ a_{\frac{k+1}{2}} & \mbox{if}\ k\ \mbox{is odd.} \end{cases} $$

We also define a functor $r: \mathbf {(\mathbb {R},\leq)} \to[2n]$ given by

$$ c \mapsto \begin{cases} 2k-1 & \mbox{if } c=a_k, \ k \in\{1,\ldots,n\}, \\ 2k & \mbox{if } a_k < c < a_{k+1}, \ k \in\{0,\ldots,n\}. \end{cases} $$

Then we have the composite functor $ir: \mathbf {(\mathbb {R},\leq)} \to \mathbf {(\mathbb {R},\leq)}$ given by

$$ c \mapsto \begin{cases} a_k & \mbox{if } c=a_k, \ k \in\{1,\ldots,n\}, \\ b_k & \mbox{if } a_k < c < a_{k+1}, \ k \in\{0,\ldots,n\}. \end{cases} $$

Precomposing F with this functor gives us an induced functor $(ir)^{*}F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$. That is, (ir)^∗ F(c)=F(irc).^{Footnote 2} Notice that by Lemma 4.4, F(irc)≅F(c) for all $c \in \mathbb {R}$. Thus, (ir)^∗:F↦(ir)^∗ F is a natural isomorphism.

Next, i ^∗ F:[2n]→Vec can be extended to a functor $i^{*}F: {(\mathbb {Z},\leq )} \to \mathbf {Vec}$ by setting (i ^∗ F)(k)=(i ^∗ F)(0) for k<0, with i ^∗ F(k≤0) the identity, and for k>2n, setting (i ^∗ F)(k)=(i ^∗ F)(2n) with i ^∗ F(2n≤k) the identity. By Lemma 4.5, we can consider i ^∗ F to be a graded $\mathbb {F}[t]$-module. Note that by assumption, i ^∗ F is a finitely generated graded $\mathbb {F}[t]$-module.

By the structure theorem for finitely generated graded modules over a principal ideal domain (Theorem 2.9), there is a unique decomposition,

$$ i^*F \cong \Biggl( \bigoplus_{i=1}^{n_1} t^{c_i} \mathbb {F}[t] \Biggr) \oplus \Biggl( \bigoplus _{j=1}^{n_2} t^{d_j} \! \bigl( \mathbb {F}[t] / \bigl(t^{e_j}\bigr) \bigr) \Biggr). $$

It follows that as elements of ${ \mathbf {Vec}^{\mathbf {(\mathbb {Z},\leq )}}}$,

$$ i^*F \cong \Biggl( \bigoplus_{i=1}^{n_1} \chi_{[c_i,\infty)} \Biggr) \oplus \Biggl( \bigoplus _{j=1}^{n_2} \chi_{[d_j,d_j+e_j)} \Biggr). $$

Therefore,

$$ F \cong (ir)^*F = r^*i^*F \cong \Biggl( \bigoplus_{i=1}^{n_1} r^*\chi _{[c_i,\infty)} \Biggr) \oplus \Biggl( \bigoplus _{j=1}^{n_2} r^*\chi_{[d_j,d_j+e_j)} \Biggr), $$

where

$$ r^*\chi_{[k,\infty)} = \begin{cases} \chi_{[a_{\frac{k+1}{2}},\infty)} & \mbox{if}\ k\ \mathrm{odd}, \\ \chi_{(a_{\frac{k}{2}},\infty)} & \mbox{if}\ k \mathrm{even}, \end{cases} $$

and

$$ r^*\chi_{[k,\ell)} = \begin{cases} \chi_{[a_{\frac{k+1}{2}},a_{\frac{\ell+1}{2}})} & \mbox{if } k,\ell \mbox{ odd}, \\ \chi_{[a_{\frac{k+1}{2}},a_{\frac{\ell}{2}}]} & \mbox{if}\ k\ \mathrm{odd}, \ell\ \mathrm{even}, \\ \chi_{(a_{\frac{k}{2}},a_{\frac{\ell+1}{2}})} & \mbox{if}\ k\ \mathrm{even},\ \ell\ \mathrm{odd}, \\ \chi_{(a_{\frac{k}{2}},a_{\frac{\ell+1}{2}}]} & \mbox{if}\ k, l\ \mathrm{even}. \end{cases} $$

Thus, F has finite type. □

By the uniqueness of the decomposition in the structure theorem for graded modules over a graded PID in the previous proof, we get that finite-type diagrams in ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ satisfy the following Krull–Schmidt theorem. Compare this with [4, Proposition 2.2].

Corollary 4.7

(Krull–Schmidt)

If $F \cong \bigoplus_{k=1}^{n} \chi_{I_{k}}$ and $F \cong \bigoplus_{j=1}^{m} \chi _{I'_{j}}$, then n=m and the sequences I ₁,…,I _n and $I'_{1},\ldots ,I'_{m}$ are the same up to reordering.

4.2 Barcodes and Persistence Diagrams

Here we define barcodes and persistence diagrams for finite-type diagrams in ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$. We observe that finite-type diagrams in ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ are a categorification of finite barcodes.

Definition 4.8

Assume that $F \in { \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ has finite type. A barcode is a multiset of intervals. The barcode of F is the multiset $\{I_{k}\}_{k=1}^{n}$ where $F \cong \bigoplus_{k=1}^{n} \chi_{I_{k}}$. This is well defined by Corollary 4.7, which follows from Theorem 4.6.

A persistence diagram is a multiset of increasing pairs of extended real numbers. The persistence diagram of F is the multiset $\{(a_{k},b_{k})\}_{k=1}^{n}$, where a _k≤b _k and {a _k,b _k} are the endpoints of I _k, with $F \cong \bigoplus_{k=1}^{n} \chi_{I_{k}}$. Again, this is well defined by Corollary 4.7.

By Corollary 4.7, we immediately have the following. Compare with [21, Corollary 3.1] and [12, Persistence Equivalence Theorem]. Note that a finite barcode is a finite multiset of intervals, not a multiset of finite intervals.

Corollary 4.9

(Categorification of Barcodes)

There is a bijection between isomorphism classes of finite-type diagrams in ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ and finite barcodes.

4.3 Bottleneck Distance

In this section, we define the bottleneck distance between two barcodes in terms of the interleaving distance. We show that this results in the usual definition of [7]. We end by proving an isometric embedding of the set of finite barcodes with the bottleneck distance into the set of $\mathbf {(\mathbb {R},\leq)}$-indexed diagrams in Vec with the interleaving distance.

Definition 4.10

Given multisets A and B, define the multiset A _B to be the disjoint union of A and the multiset containing the empty interval ∅ with cardinality |B|. A stable bijection or partial matching between two multisets A and B is a bijection, f:A _B→B _A. Write f:A⇌B.

Definition 4.11

Let B and B′ be two barcodes. Define the bottleneck distance between B and B′ by

$$ d_B\bigl(B,B'\bigr) = \inf _{f: B \rightleftharpoons B'} \sup_{I \in \operatorname {dom}{f}} d(\chi_I, \chi_{f(I)}). $$

(4)

On the right-hand side of (4) we have the interleaving distance. It follows from the following two propositions that this definition of bottleneck distance is equivalent to that in [7].

Proposition 4.12

Let I and I′ be two finite intervals.

(1)
If I=I′=∅, then d(χ _I,χ _I′)=0.
(2)
If I′=∅ and I has endpoints a and b, then $d(\chi_{I},\chi_{I'}) = \frac{b-a}{2}$.
(3)
If I and I′ have endpoints a,b and a′,b′, respectively, then
$$ d(\chi_I,\chi_{I'}) = \min \biggl( \max \bigl(\bigl\lvert a-a'\bigr\rvert ,\bigl\lvert b-b'\bigr\rvert \bigr), \max \biggl( \frac{b-a}{2},\frac{b'-a'}{2} \biggr) \biggr). $$

Proposition 4.13

Let I and I′ be two intervals, at least one of which is infinite.

(1)
If $I = I' = \mathbb {R}$, then d(χ _I,χ _I′)=0.
(2)
If inf(I)=inf(I′)=−∞ and I and I′ have right endpoints b and b′, then d(χ _I,χ _I′)=|b−b′|.
(3)
If sup(I)=sup(I′)=∞ and I and I′ have left endpoints a and a′, then d(χ _I,χ _I′)=|a−a′|.
(4)
In all other cases, d(χ _I,χ _I′)=∞.

Propositions 4.12 and 4.13 follow from the following two lemmas. The proofs are technical yet straightforward, and we leave them to the motivated reader.

Assume that the intervals I and I′ are finite. Let h and h′ each denote half the length of the interval I and I′, respectively, where the length of ∅ is 0. If I and I′ are nonempty, let m and m′ denote their respective midpoints.

Lemma 4.14

Assume that I and I′ are finite intervals. d(χ _I,χ _I′)≤max(h,h′).

Proof

Let ε>max(h,h′). Then χ _I η _2ε=0=χ _I′ η _2ε. Let φ=0 and ψ=0. Then φ and ψ give an ε-interleaving of χ _I and χ _I′. □

Lemma 4.15

Assume that I and I′ are finite intervals. If m∉I′, then d(χ _I,χ _I′)≥h.

Proof

Let ε<h. Then [m−ε,m+ε]⊂I. Thus, $\chi_{I} \eta_{2\varepsilon }(m-\varepsilon ) = \mathop {\mathrm {Id}}_{\mathbb {F}}$. Suppose m∉I′. Assume that there exists an ε-interleaving (φ, ψ) of χ _I and χ _I′. Then $(\psi T_{\varepsilon }) \varphi(m-\varepsilon ) = \mathop {\mathrm {Id}}_{\mathbb {F}}$. But φ(m−ε)∈χ _I′(m)=0. Therefore, (ψT _ε)φ(m−ε)=0, which is a contradiction. Thus, d(χ _I,χ _I′)≥h. □

In the statement of the following theorem, we abuse notation slightly by using ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$ to denote the set of objects in the category ${ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}}$.

Theorem 4.16

(Categorification of the Metric Space of Persistence Diagrams)

Let $\mathcal {B}$ be the set of finite barcodes, d _B the bottleneck distance, and d the interleaving distance. The mapping χ defined by $\chi(\{I_{k}\}_{k=1}^{n}) = \bigoplus_{k=1}^{n} \chi_{I_{k}}$ gives an isometric embedding of metric spaces

$$ \chi: (\mathcal {B},d_B) \hookrightarrow \bigl({ \mathbf {Vec}^{\mathbf {(\mathbb {R},\leq )}}},d\bigr). $$

Proof

Let $B, B' \in \mathcal {B}$. By [5, Theorem 4.4], we know that d _B(B,B′)≤d(χ(B),χ(B′)). It remains to show that

$$d\bigl(\chi(B),\chi\bigl(B'\bigr)\bigr) \leq d_B \bigl(B,B'\bigr). $$

If d _B(B,B′)=∞, then this is trivial. Assume that d _B(B,B′)<∞.

Let f:B⇌B′ such that $\sup_{I \in \operatorname {dom}(f)} d(\chi _{I},\chi_{f(I)}) < \infty$. Choose $\varepsilon > \sup_{I \in \operatorname {dom}(f)} d(\chi_{I},\chi_{f(I)})$. By Lemma 3.4, for each $I \in \operatorname {dom}(f)$, χ _I and χ _f(I) are ε-interleaved. By Corollary 7.11, χ(B) and χ(B′) are ε-interleaved.

Thus, χ(B) and χ(B′) are ε-interleaved for all ε>d _B(B,B′). It follows that d(χ(B),χ(B′))≤d _B(B,B′). □

5 Stability

In [7], Cohen-Steiner, Edelsbrunner, and Harer prove that persistent homology of sublevel sets of a function is stable with respect to perturbations of the function as measured by the supremum norm. In this section, we use our categorical framework to generalize this Stability Theorem, as well as its generalization in [5].

Let X∈Top. Assume that $f,g: X \to \mathbb {R}$. Note that we do not require that f and g be continuous. Let $F \in { \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$ be defined by F(a)=f ⁻¹(∞,a] for $a \in \mathbb {R}$ and F(a≤b) is given by inclusion. Define G similarly using g. Let H:Top→D be any functor, e.g., singular homology with coefficients in a field $\mathbb {F}$, or rational homotopy groups. Recall that ∥f−g∥_∞=sup_x∈X|f(x)−g(x)|.

Theorem 5.1

(Stability Theorem)

$$ d(HF,HG) \leq \|f-g\|_{\infty}. $$

Proof

Let ε=∥f−g∥_∞. First, we observe that by the assumption,

$$ F(a) = f^{-1}(-\infty,a] \subseteq g^{-1}(-\infty,a+\varepsilon ] = G(a+\varepsilon ), $$

and similarly, G(a)⊆F(a+ε). Thus, F and G are ε-interleaved. It follows that HF and HG are ε-interleaved (Proposition 3.6), and thus

$$ d(HF,HG) \leq\|f-g\|_{\infty}. $$

□

6 Extended Persistence

In [8], Cohen-Steiner, Edelsbrunner, and Harer, define extended persistence to obtain a sequence of vector spaces in which the homology classes of the total space do not live forever. Given a simplicial complex K on n ordered vertices, let K _i be the subcomplex spanned by the first i vertices, and let L _i be the subcomplex spanned by the last i vertices. Let H _k denote degree k relative simplicial homology with coefficients in the field $\mathbb {Z}/ 2\mathbb {Z}$. Then they construct the sequence

$$\begin{aligned} 0 =& H_k(K_0,\emptyset) \to H_k(K_1, \emptyset) \to\cdots\to H_k(K_n,\emptyset) \\ =& H_k(K,L_0) \to H_k(K,L_1) \to\cdots\to H_k(K,L_n) = 0. \end{aligned}$$

They show that the Stability Theorem of [7] can be applied in this case. Here we give a generalization of this construction and the corresponding stability theorem.

Let X∈Top. Assume that $f: X \to \mathbb {R}$, where f need not be continuous, and there exists an $M \in \mathbb {R}$ such that f(x)≤M for all x∈X. Let s>0 be an arbitrary amount to space out the upward and downward filtrations. Define the $\mathbf {(\mathbb {R},\leq)}$-indexed diagram of pairs of topological spaces, $F \in \mathbf {Pair}^{\mathbf {(\mathbb {R},\leq )}}$, as follows.

For c<M+s, let F(c)=(f ⁻¹(−∞,c],∅). For c≥M+s, let F(c)=(X,f ⁻¹[2M+s−c,∞)). Notice that for M≤c<M+s, F(c)=(X,∅), and F(M+s)=(X,f ⁻¹(M)).

For c≤d, F(c≤d) is given by inclusion. Indeed, if c≤d<M+s, we have (f ⁻¹(−∞,c],∅)⊆(f ⁻¹(−∞,d],∅), if M+s≤c≤d, then (X,f ⁻¹[2M+s−c,∞))⊆(X,f ⁻¹[2M+s−d,∞)), and if c<M+s≤d, then (f ⁻¹(−∞,c],∅)⊆(X,f ⁻¹[2M+s−d,∞)).

In the special case that there exists an $m \in \mathbb {R}$ such that f(x)≥m for all x∈X, then for c<m, F(c)=(∅,∅), and for c≥M+s+(M−m)=2M+s−m, F(c)=(X,X).

Now assume that we also have another (not necessarily continuous) map g:X→(−∞,M]. Define $G \in \mathbf {Pair}^{\mathbf {(\mathbb {R},\leq )}}$ similarly. Let H:Pair→D be any functor, e.g., relative homology with coefficients in some field $\mathbb {F}$.

Theorem 6.1

(Stability Theorem for Extended Persistence)

$$ d(HF,HG) \leq \lVert f-g\rVert _{\infty}. $$

Proof

Let ε=∥f−g∥_∞. Let $c \in \mathbb {R}$. Then by assumption, (f ⁻¹(−∞,c],∅)⊆(g ⁻¹(−∞,c+ε],∅), (X,f ⁻¹[2M+s−c,∞))⊆(X,g ⁻¹[2M+s−(c+ε),∞)), and (f ⁻¹(−∞,c],∅)⊆(X,g ⁻¹[2M+s−(c+ε),∞)). Also, by assumption, we have the same relations with f and g switched. Thus, F and G are ε-interleaved. It follows that HF and HG are ε-interleaved (Proposition 3.6), and thus

$$ d(HF,HG) \leq d(F,G) \leq \lVert f-g\rVert _{\infty}. $$

□

7 Abelian Structure of Interleavings

Let D be a category, and let ε≥0. In this section, we consider the category $\mathbf {Int_{\varepsilon }({D})}$ of ε-interleavings of diagrams in ${ \mathbf {D}^{\mathbf {(\mathbb {R},\leq )}}}$, which we define below. We will show that this construction is functorial and that if D is an abelian category, then so is $\mathbf {Int_{\varepsilon }({D})}$. As a corollary, we obtain stability theorems for kernels, images, and cokernels in persistence and extended persistence.

First, let us recall that the functor $T_{\varepsilon } : \mathbb{R} \rightarrow\mathbb{R}$, T _ε(x)=x+ε, T _ε(x≤y)=x+ε≤y+ε, comes equipped with a “unit” natural transformation, η _ε:Id⇒T _ε, since x≤x+ε. We will write $\eta_{\varepsilon }^{2}$ for the iteration $\mathop {\mathrm {Id}}\Rightarrow T_{\varepsilon } \Rightarrow T_{\varepsilon }^{2}$.

Definition 7.1

The objects of $\mathbf {Int_{\varepsilon }({D})}$ are ε-interleavings, (F,G,φ,ψ), where φ:F⇒GT _ε, ψ:G⇒FT _ε, such that $(\psi T_{\varepsilon }) \varphi= F \eta_{\varepsilon }^{2}$ and $(\varphi T_{\varepsilon }) \psi= G \eta_{\varepsilon }^{2}$ (Definition 3.1). A morphism (α,β):(F,G,φ,ψ)⇒(F′,G′,φ′,ψ′) consists of a pair of natural transformations, α:F⇒F′ and β:G→G′, such that the diagrams

commute.

Let us also verify the naturality of the above construction.

Proposition 7.2

Definition 7.1 of $\mathbf {Int_{\varepsilon }({D})}$ is functorial in ε and in D.

Proof

Let ε≤ε′. The functor, $\mathbf {Int_{\varepsilon }({D})} \to \mathbf {Int_{\varepsilon '}({D})}$, is defined on objects by Lemma 3.4. To be precise, we have $(F,G,\varphi,\psi) \mapsto(F,G,\hat{\varphi },\hat{\psi})$, where $\hat{\varphi} = (G\eta_{\varepsilon '-\varepsilon }T_{\varepsilon })\varphi$, and $\hat{\psi} = (F\eta_{\varepsilon '-\varepsilon }T_{\varepsilon })\psi$. Let $(\alpha,\beta): (F,G,\varphi,\psi) \to(F',G',\varphi',\psi') \in \mathbf {Int_{\varepsilon }({D})}$. Then the commutative diagram

and a similar one show that $(\alpha,\beta): (F,G,\hat{\varphi},\hat {\psi}) \to(F',G',\hat{\varphi}',\hat{\psi}') \in \mathbf {Int_{\varepsilon '}({D})}$. From this it follows that $\mathbf {Int_{\varepsilon }({D})}$ is functorial in ε.

Now consider a functor H:D→D′. The induced functor $\mathbf {Int_{\varepsilon }({D})} \to \mathbf {Int_{\varepsilon }({D'})}$ is defined by composition with H (for the details of the definition on objects, see the proof of Proposition 3.6). It follows that $\mathbf {Int_{\varepsilon }({D})}$ is functorial in D. □

Let A be an abelian category. Then, as discussed at the start of Sect. 4, so is ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$. We claim that $\mathbf {Int_{\varepsilon }({A})}$ is also an abelian category. Recall (Sect. 2.3.4) that a category is abelian if it has a zero object, all finite products and coproducts, every morphism has a kernel and a cokernel, and all monomorphisms and epimorphisms are kernels and cokernels, respectively.

Lemma 7.3

The category $\mathbf {Int_{\varepsilon }({A})}$ has a zero object.

Proof

The zero object of $\mathbf {Int_{\varepsilon }({A})}$ comes from the zero object, 0, of A. The diagram category $\mathbf {Vec}^{(\mathbb{R},\leq)}$ then inherits the constant zero diagram, O _x=0 for all $x \in\mathbb{R}$, with the identity morphism O _x→O _y for x≤y. In turn, we define the trivial ε-interleaving, (O,O,ω,ω), where ω _x:O _x→O _x+ε is again the identity. It is easy to see that (O,O,ω,ω) is the desired zero object. Indeed, to see that it is initial, we note that for every interleaving (F,G,φ,ψ) and for every $x \in\mathbb{R}$, there are unique morphisms O _x→F _x and O _x→G _x because O _x is initial in A, and the appropriate diagrams commute. Similarly, (O,O,ω,ω) is final, and hence the desired zero object in $\mathbf {Int_{\varepsilon }({A})}$. □

In particular, for any objects $X,Y \in \mathbf {Int_{\varepsilon }({A})}$, we now have the zero morphism 0:X→Y, that is, the composite X→O→Y.

Lemma 7.4

The category $\mathbf {Int_{\varepsilon }({A})}$ has all pull-backs and push-outs, and their components in ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ are given by the respective pull-backs and push-outs in ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$.

Proof

We show that $\mathbf {Int_{\varepsilon }({A})}$ has all pull-backs. The arguments and constructions for push-outs are dual.

Consider the diagram

$$\bigl(F',G',\varphi', \psi'\bigr) \xrightarrow{(\alpha', \beta')} (F,G,\varphi,\psi ) \xleftarrow{( \alpha'',\beta'')} \bigl(F'',G'',\varphi'', \psi''\bigr). $$

The category ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ is abelian. Thus, we may form the pull-back functors,

To simplify the notation, let T=T _ε and η=η _ε. Observe that

$$G'T \times_{GT} G''T = \bigl( G' \times_{G} G'' \bigr) T, $$

and so from the universal property of pull-backs we obtain the natural transformation

$$\varPhi= \varphi' \times_{\varphi} \varphi'' : F' \times_{F} F'' \rightarrow \bigl(G' \times_{G} G'' \bigr) T. $$

Similarly, we get a natural transformation

$$\varPsi= \psi' \times_{\psi} \psi'' : G' \times_{G} G'' \rightarrow \bigl( F' \times_{F} F'' \bigr) T. $$

We need to check that (F′×_F F″,G′×_G G″,Φ,Ψ) is an ε-interleaving that is indeed the relevant pull-back.

To see that we have an interleaving, it only remains to show that (ΦT)Ψ=(G′×_G G″)η ² and (ΨT)Φ=(F′×_F F″)η ². We prove the second identity. The verification of the first is symmetric.

We observe that (ΨT)Φ is one morphism F′×_F F″→(F′×_F F″)T ² that provides the unique dotted arrow (by the universal property of the pull-back) making the diagram

commute. Since (F′×_F F″)η ² also fits the diagram, by uniqueness, (ΨT)Φ=(F′×_F F″)η ². □

Corollary 7.5

The category $\mathbf {Int_{\varepsilon }({A})}$ has all finite products and coproducts. Every morphism in $\mathbf {Int_{\varepsilon }({A})}$ has a kernel and a cokernel.

Proof

Since $\mathbf {Int_{\varepsilon }({A})}$ has a terminal object and pull-backs, it has finite products. Since the category has an initial object and push-outs, it has finite coproducts. Since $\mathbf {Int_{\varepsilon }({A})}$ has a zero object and pull-backs, and the kernel of (α,β):(F,G,φ,ψ)→(F′,G′,φ′,ψ′) can be obtained by pulling back along the initial morphism (O,O,ω,ω)→(F′,G′,φ′,ψ′), every morphism has a kernel. Similarly, every morphism has a cokernel since $\mathbf {Int_{\varepsilon }({A})}$ has a zero object and push-outs. □

It remains to show that every monomorphism is a kernel and that every epimorphism is a cokernel. Before doing so, we show that in a preadditive category, monomorphisms and epimorphisms are characterized by their trivial kernels and cokernels, respectively.

Lemma 7.6

[19] Let C be a category with zero object, kernels, and cokernels. If f:X→Y is a monomorphism in C, then kerf=0. Dually, if f is an epimorphism, then $\operatorname {coker}f = 0$.

If the category is preadditive, then we have the following converse for Lemma 7.6.

Lemma 7.7

Let C be a preadditive category with zero object, kernels, and cokernels. If f:X→Y is a morphism in C with trivial kernel, then f is a monomorphism. Dually, if f has trivial cokernel, then f is an epimorphism.

Proof

Suppose that kerf=0 and that g,h:W→X are morphisms that satisfy fg=fh. Then using the preadditive structure of C, we find that f(g−h)=0, and so g−h factors through kerf=0. It follows that g−h=0, so g=h. Therefore, f is a monomorphism.

The proof of the dual statement is dual. □

Next, we show that the above characterization of monomorphisms and epimorphisms applies to our setting.

Lemma 7.8

The category $\mathbf {Int_{\varepsilon }({A})}$ is preadditive.

Proof

Let X=(F,G,φ,ψ) and X′=(F′,G′,φ,ψ) be ε-interleavings. By definition, $\mathbf {Int_{\varepsilon }({A})}(X,Y) \subset { \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}(F,F') \times { \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}(G,G')$, which is itself an abelian group. One can readily verify that $(0,0) \in \mathbf {Int_{\varepsilon }({A})}(X,Y)$. Since ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ is preadditive, composition distributes over the addition of morphisms. If $(\alpha,\beta),(\alpha',\beta') \in \mathbf {Int_{\varepsilon }({A})}(X,Y)$, then φ′(α+α′)=φ′α+φ′α′=(βT)φ+(β′T)φ=((β+β′)T)φ, so $(\alpha+\alpha',\beta+ \beta') \in \mathbf {Int_{\varepsilon }({A})}(X,Y)$. Finally, we verify that if $(\alpha,\beta) \in \mathbf {Int_{\varepsilon }({A})}(X,Y)$, then so is its additive inverse. Since ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ is preadditive, we have natural transformations −α and −β. Since 0=φ′0=φ′(α+(−α))=φ′α+φ′(−α), it follows that φ′(−α)=−φ′α. Similarly, (−β)φ=−βφ. It follows that $(-\alpha,-\beta) \in \mathbf {Int_{\varepsilon }({A})}(X,Y)$. Since (α,β)+(−α,−β)=(0,0), $\mathbf {Int_{\varepsilon }({A})}(X,Y)$ is an abelian group.

Composition is bilinear since it is the restriction of composition in the additive category ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}\times { \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$. □

Lemma 7.9

In $\mathbf {Int_{\varepsilon }({A})}$, every monomorphism is a kernel, and every epimorphism is a cokernel.

Proof

Let (α,β):(F,G,φ,ψ)→(F′,G′,φ,ψ) be a monomorphism in $\mathbf {Int_{\varepsilon }({A})}$. We will show that (α,β) is the kernel of the natural morphism

$$\pi:\bigl(F', G', \varphi, \psi\bigr) \rightarrow \operatorname {coker}(\alpha,\beta). $$

First, we calculate ker(α,β) in terms of kerα and kerβ. If we pull back the morphism (α,β):(F,G,φ,ψ)→(F′,G′,φ′,ψ′) along the initial morphism (O,O,ω,ω)→(F′,G′,φ′,ψ′), we obtain the interleaving (kerα,kerβ,Φ,Ψ) constructed in the proof of Lemma 7.4.

Cokernels are obtained in a dual manner; we have that $\operatorname {coker}(\alpha ,\beta) = (\operatorname {coker}\alpha, \operatorname {coker}\beta, \bar{\varPhi}, \bar{\varPsi})$.

By Lemma 7.6, every monomorphism has trivial kernel. Thus, ker(α,β)=(kerα,kerβ,Φ,Ψ)=(O,O,ω,ω). This means, in particular, that kerα=kerβ=O. It follows from Lemmas 7.7 and 7.8 that α and β are monomorphisms. Since ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ is abelian, α is the kernel of the quotient map, $F' \rightarrow \operatorname {coker}\alpha$, and likewise β is the kernel of $G' \rightarrow \operatorname {coker}\beta$. It then follows that (α,β) is the kernel of the natural morphism, $(F',G',\varPhi,\varPsi) \rightarrow(\operatorname {coker}\alpha,\operatorname {coker}\beta,\bar{\varPhi},\bar{\varPsi})$.

The dual statement follows from the dual proof. □

Combining Lemma 7.3, Corollary 7.5, and Lemma 7.9, we have the following.

Theorem 7.10

Given an abelian category A and ε≥0, the category $\mathbf {Int_{\varepsilon }({A})}$ of ε-interleavings in A is an abelian category.

From Theorem 7.10 and Lemma 7.4 we immediately have the following two applications.

Corollary 7.11

If the two pairs of diagrams (F,G) and (F′,G′) in ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ are ε-interleaved, then so is the pair (F⊕F′,G⊕G′).

Corollary 7.12

Let (α,β) be a morphism in $\mathbf {Int_{\varepsilon }({A})}$. Then each of the following three pairs of diagrams in ${ \mathbf {A}^{\mathbf {(\mathbb {R},\leq )}}}$ are ε-interleaved: (kerα,kerβ), $(\operatorname {im}\alpha, \operatorname {im}\beta)$, and $(\operatorname {coker}\alpha, \operatorname {coker}\beta)$.

As an application of Corollary 7.12, we get the following generalization of the Stability Theorem of [9].

Theorem 7.13

(Stability Theorem for Kernels, Images, and Cokernels)

Let h:Y→X be a continuous map of topological spaces. Let $f,f': X \to \mathbb {R}$ and $g,g':Y \to \mathbb {R}$ be (not necessarily continuous) maps such that

$$ \mathit{for\ all}\ y \in Y,\quad fh(y) \leq g(y),\quad \mathit{and}\quad f'h(y) \leq g'(y). $$

(5)

Let $F \in { \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$ be given by F(a)=f ⁻¹(−∞,a] and inclusion. Define F′, G, and G′ similarly. By (5), h induces maps α:G→F and β:G′→F′ in ${ \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$. Let A be an abelian category and H:Top→A be some functor. Let ε=max{∥f−f′∥_∞,∥g−g′∥_∞}. Then

$$ d(\ker H\alpha, \ker H\beta),\qquad d(\operatorname {im}H\alpha, \operatorname {im}H\beta),\qquad d(\operatorname {coker}H\alpha, \operatorname {coker}H\beta) \leq \varepsilon . $$

Proof

By the definition of ε, F↪F′T _ε, F′↪FT _ε, G↪G′T _ε, and G′↪GT _ε in ${ \mathbf {Top}^{\mathbf {(\mathbb {R},\leq )}}}$. Since α and β are both induced by h, the diagrams

commute, and thus $(\alpha,\beta) \in \mathbf {Int_{\varepsilon }({ \mathbf {Top}})}$. By functoriality $(H\alpha,H\beta) \in \mathbf {Int_{\varepsilon }({A})}$ (see Proposition 7.2). Apply Corollary 7.12 and Definition 3.2 to obtain the desired result. □

Strengthening (5), we obtain an extended persistence version of this theorem. It has essentially the same proof, so we omit it.

Theorem 7.14

(Stability Theorem for Kernels, Images, and Cokernels in Extended Persistence)

Let h:Y→X be a continuous map of topological spaces. Let f,f′:X→(−∞,M] be (not necessarily continuous) maps. Let g=fh and g′=f′h. Let s>0. Let $F \in \mathbf {Pair}^{\mathbf {(\mathbb {R},\leq )}}$ be given by F(c)=(f ⁻¹(−∞,c],∅) if c<b+s and F(c)=(X,f ⁻¹[2b+s−c,∞)) if c≥b+s, and inclusion. Define F′, G, and G′ similarly. Then h induces maps α:G→F and β:G′→F′ in $\mathbf {Pair}^{\mathbf {(\mathbb {R},\leq )}}$. Let A be an abelian category, and H:Pair→A be some functor. Then

$$\begin{aligned} &d(\ker H\alpha, \ker H\beta),\qquad d(\operatorname {im}H\alpha, \operatorname {im}H\beta), \\ & d(\operatorname {coker}H\alpha, \operatorname {coker}H\beta) \leq \bigl\lVert f-f'\bigr\rVert _{\infty}. \end{aligned}$$

8 Future Work

In this paper, we have studied persistence by considering diagrams indexed by $\mathbf {(\mathbb {R},\leq)}$. However, there are versions of persistence in which the objects of study can be viewed as diagrams with more general indexing categories. For example, we would like to be able to consider diagrams indexed by $(\mathbb {R}^{n},\leq)$ for multidimensional persistence, S ¹ for circle-valued persistence, and the category ⋅→⋅←⋅→⋯←⋅ for zig-zag persistence. This generalization will be presented in [1].

It would be nice to have a categorical definition of bottleneck distance arbitrary $(\mathbb {R},\leq)$-indexed diagrams of vector spaces and to have a corresponding Isometry Theorem.

Notes

Even if F is induced by sublevel sets, it is inadequate to define $a \in \mathbb {R}$ to be a critical value of F if for all sufficiently small ε>0, the map F(a−ε≤a+ε) is not an isomorphism [7]. Consider the example $X = \{(x,y) \in \mathbb {R}^{2} \ | \ 0\leq x \leq1, \ 0 < y \leq1\}$ and f(x,y) equal to 0 if x=0, −1 if x=1, and y otherwise. Then 0 is not a critical value under this stricter definition, but the map H ₀(f ⁻¹(−∞,0])→H ₀(f ⁻¹(−∞,1]) induced by inclusion is not an isomorphism, contradicting the Critical Value Lemma.
Our notation comes from category theory; an arrow f:x→y defines a natural map $f^{*}:\operatorname{Hom}(y,z) \rightarrow\operatorname{Hom}(x,z)$ obtained by precomposing a given arrow y→z with f.

References

Bubenik, P., de Silva, V., Scott, J.A.: Metrics for generalized persistence modules (2013). arXiv:1312.3829
Cagliari, F., Ferri, M., Pozzi, P.: Size functions from a categorical viewpoint. Acta Appl. Math. 67(3), 225–235 (2001)
Article MATH MathSciNet Google Scholar
Carlsson, G.: Topology and data. Bull. Am. Math. Soc. (N.S.) 46(2), 255–308 (2009)
Article MATH MathSciNet Google Scholar
Carlsson, G., de Silva, V.: Zigzag persistence. Found. Comput. Math. 10(4), 367–405 (2010)
Article MATH MathSciNet Google Scholar
Chazal, F., Cohen-Steiner, D., Glisse, M., Guibas, L.J., Oudot, S.Y.: Proximity of persistence modules and their diagrams. In: Proceedings of the 25th Annual Symposium on Computational Geometry, SCG’09, pp. 237–246. ACM, New York (2009)
Chapter Google Scholar
Chazal, F., de Silva, V., Glisse, M., Oudot, S.: The structure and stability of persistence modules (2012). arXiv:1207.3674
Cohen-Steiner, D., Edelsbrunner, H., Harer, J.: Stability of persistence diagrams. Discrete Comput. Geom. 37(1), 103–120 (2007)
Article MATH MathSciNet Google Scholar
Cohen-Steiner, D., Edelsbrunner, H., Harer, J.: Extending persistence using Poincaré and Lefschetz duality. Found. Comput. Math. 9(1), 79–103 (2009)
Article MATH MathSciNet Google Scholar
Cohen-Steiner, D., Edelsbrunner, H., Harer, J., Morozov, D.: Persistent homology for kernels, images, and cokernels. In: Proceedings of the Twentieth Annual ACM–SIAM Symposium on Discrete Algorithms, pp. 1011–1020. SIAM, Philadelphia (2009)
Chapter Google Scholar
Crawley-Boevey, W.: Decomposition of pointwise finite-dimensional persistence modules (2012). arXiv:1210.0819
Edelsbrunner, H., Harer, J.: Persistent homology—a survey. In: Surveys on Discrete and Computational Geometry. Contemp. Math., vol. 453, pp. 257–282. Am. Math. Soc., Providence (2008)
Chapter Google Scholar
Edelsbrunner, H., Harer, J.L.: Computational Topology. Am. Math. Soc., Providence (2010). An introduction
MATH Google Scholar
Edelsbrunner, H., Letscher, D., Zomorodian, A.: Topological persistence and simplification. Discrete Comput. Geom. 28(4), 511–533 (2002). Discrete and computational geometry and graph drawing (Columbia, SC, 2001)
Article MATH MathSciNet Google Scholar
Freyd, P.: Abelian Categories. An Introduction to the Theory of Functors. Harper’s Series in Modern Mathematics. Harper & Row, New York (1964). xi+164
MATH Google Scholar
Ghrist, R.: Barcodes: the persistent topology of data. Bull. Am. Math. Soc. (N.S.) 45(1), 61–75 (2008)
Article MATH MathSciNet Google Scholar
Hungerford, T.W.: Algebra. Graduate Texts in Mathematics, vol. 73. Springer, New York (1980). Reprint of the 1974 original
Book MATH Google Scholar
Lesnick, M.: Multidimensional interleavings and applications to topological inference (2012). arXiv:1106.5305
Mac Lane, S.: Categories for the Working Mathematician, 2nd edn. Graduate Texts in Mathematics, vol. 5. Springer, New York (1998)
MATH Google Scholar
Mitchell, B.: Theory of Categories. Pure and Applied Mathematics, vol. XVII. Academic Press, New York (1965)
MATH Google Scholar
Zomorodian, A.J.: Topology for Computing. Cambridge Monographs on Applied and Computational Mathematics, vol. 16. Cambridge University Press, Cambridge (2005)
Book MATH Google Scholar
Zomorodian, A., Carlsson, G.: Computing persistent homology. Discrete Comput. Geom. 33(2), 249–274 (2005)
Article MATH MathSciNet Google Scholar

Download references

Acknowledgements

The first author would like to thank Vin de Silva, Robert Ghrist, David Lipsky, John Oprea, and Radmila Sazdanovic for helpful conversations. Both authors thank Peter Landweber for many helpful corrections. The first author also gratefully acknowledges the support from AFOSR grant # FA9550-13-1-0115.

Author information

Authors and Affiliations

Department of Mathematics, Cleveland State University, 2121 Euclid Ave., RT 1515, Cleveland, OH, 44115, USA
Peter Bubenik & Jonathan A. Scott

Authors

Peter Bubenik
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan A. Scott
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jonathan A. Scott.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bubenik, P., Scott, J.A. Categorification of Persistent Homology. Discrete Comput Geom 51, 600–627 (2014). https://doi.org/10.1007/s00454-014-9573-x

Download citation

Received: 08 March 2013
Revised: 12 November 2013
Accepted: 10 January 2014
Published: 28 January 2014
Issue Date: April 2014
DOI: https://doi.org/10.1007/s00454-014-9573-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Categorification of Persistent Homology

Abstract

Similar content being viewed by others

Persistence Diagrams as Diagrams: A Categorification of the Stability Theorem

Persistent cup product structures and related invariants

Distributing Persistent Homology via Spectral Sequences

1 Introduction

1.1 Motivation

1.2 Prior Work

1.3 Our Contributions

1.4 Comparison with Other Recent Work

2 Background

2.1 Categorical Terminology

Example 2.1

Example 2.2

Example 2.3

Example 2.4

Example 2.5

Example 2.6

Example 2.7

2.2 Categorical Persistent Homology

2.2.1 Filtered Simplicial Complexes

2.2.2 Sublevel Sets

2.2.3 Diagrams by \(\mathbf {[n]}\), \(\mathbf {(\mathbb {Z}_{+},\leq)}\), and \(\mathbf {(\mathbb {Z},\leq)}\)

2.2.4 Persistent Homology

2.2.5 Persistence Modules

2.3 Abelian Categories

2.3.1 Initial, Terminal, and Final Objects

2.3.2 Monomorphisms, Epimorphisms, Kernels, and Cokernels

2.3.3 Products, Coproducts, Pull-Backs, and Push-Outs

2.3.4 Abelian Categories

Example 2.8

2.4 Algebra

Theorem 2.9

3 Interleavings of Diagrams

Definition 3.1

Definition 3.2

Theorem 3.3

Lemma 3.4

Proof

Proof of Theorem 3.3

Corollary 3.5

Proposition 3.6

Proof

4 Diagrams of Vector Spaces

4.1 Finite-Type Diagrams

Definition 4.1

Lemma 4.2

Proof

Definition 4.3

Lemma 4.4

Proof

Lemma 4.5

Proof

Theorem 4.6

Proof

Corollary 4.7

4.2 Barcodes and Persistence Diagrams

Definition 4.8

Corollary 4.9

4.3 Bottleneck Distance

Definition 4.10

Definition 4.11

Proposition 4.12

Proposition 4.13

Lemma 4.14

Proof

Lemma 4.15

Proof

Theorem 4.16

Proof

5 Stability

Theorem 5.1

Proof

6 Extended Persistence

Theorem 6.1

Proof

7 Abelian Structure of Interleavings

Definition 7.1

Proposition 7.2