Limiting behaviour of a geometric-type estimator for tail indices

doi:10.1016/S0167-6687(03)00135-5

Insurance: Mathematics and Economics

Volume 33, Issue 2, 20 October 2003, Pages 211-226

https://doi.org/10.1016/S0167-6687(03)00135-5 Get rights and content

Abstract

We propose a consistent estimator for the exponential tail coefficient of a d.f., that is directly related to least squares estimators of Schultze and Steinebach [Statist. Decis. 14 (1996) 353]. We investigate here the weak asymptotic properties of this geometric-type estimator, showing in particular that, under general conditions, its distribution is asymptotically normal. The results are then applied to the related problem of estimating the adjustment coefficient in risk theory [Insur.: Math. Econ. 10 (1991) 37]. A simulation study is performed in order to illustrate the finite sample behaviour of the proposed estimator.

Introduction

Let Z₁,Z₂,… be independent, non-negative random variables with common distribution function (d.f.) F satisfying $1−F(z)=P(Z_{1} >z)=r(z) e^{−Rz}, z>0,$ where r is a regularly varying function at infinity and R a positive constant. Denoting by F⁻¹ the left continuous inverse of F, i.e., F⁻¹(s)≔inf{x:F(x)≥s}, (1) is equivalent to $F^{−1} (1−s)=− 1 R log s+ log L ̃ (s), 0<s<1,$ where L̃ is a slowly varying function at zero (see, e.g. Schultze and Steinebach, 1996 and references therein).

We shall be concerned here with the estimation of the tail coefficient R in (1) or, equivalently, in (2). The problem of estimating R or other related tail indices has received considerable attention and common applications may be found in a big variety of domains. We consider here an important application in risk theory, namely the estimation of the adjustment coefficient (see Csörgő and Steinebach, 1991). For a comprehensive overview of this subject we refer to Csörgő and Viharos (1998).

Based on least squares considerations, Schultze and Steinebach (1996) proposed three estimators for the exponential tail coefficient R, given as follows. Let Z_1,n≤Z_2,n≤⋯≤Z_n,n denote the order statistics of the sample Z₁,Z₂,…,Z_n and assume that (k_n) is a sequence of positive integers satisfying $1≤k_{n} <n, lim n→∞ k_{n} =∞ and lim n→∞ k_{n} n =0.$ The Schultze and Steinebach estimators are defined by $R ̂_{1} (k_{n})= ∑_{i=1}^{k_{n}} log^{2} (n/i)−(1/k_{n}) ∑_{i=1}^{k_{n}} log (n/i)^{2} ∑_{i=1}^{k_{n}} log (n/i)Z_{n−i+1,n} −(1/k_{n}) ∑_{i=1}^{k_{n}} Z_{n−i+1,n} ∑_{i=1}^{k_{n}} log (n/i), R ̂_{2} (k_{n})= ∑_{i=1}^{k_{n}} log^{2} (n/i) ∑_{i=1}^{k_{n}} log (n/i)Z_{n−i+1,n}$ and $R ̂_{3} (k_{n})= ∑_{i=1}^{k_{n}} log (n/i)Z_{n−i+1,n} −(1/k_{n}) ∑_{i=1}^{k_{n}} Z_{n−i+1,n} ∑_{i=1}^{k_{n}} log (n/i) ∑_{i=1}^{k_{n}} Z_{n−i+1,n}^{2} −(1/k_{n}) ∑_{i=1}^{k_{n}} Z_{n−i+1,n}^{2} .$ Recently, Brito and Moreira (2001) have introduced a new estimator of R, $R ̂ (k_{n})$ , related to $R ̂_{1} (k_{n})$ and $R ̂_{3} (k_{n})$ . This estimator arises in a natural way from a geometrical adaptation of the procedure used by Schultze and Steinebach in the construction of $R ̂_{i} (k_{n})$ , i=1,3. These estimators are motivated by the fact that, for large z, −log(1−F(z)) is approximately linear with slope R, since $z^{−1} log r(z)→0$ as z→∞. If the regularly varying function r was constant, say r(z)=e^d, d∈R, then $− log (1−F(z))=Rz−d.$ We thus expect that the above linear relation approximately holds for the largest observations realized in the sample (Z₁,Z₂,…,Z_n), which we simply denote by z_(i)=z_n−i+1,n, i=1,…,k_n. Approximating F(z_(i)) by F_n(z_(i)⁻), where F_n is the empirical d.f., this gives that −log(1−F_n(z_(i)⁻))=log(n/i) is “close” to Rz_(i)−d, or z_(i) is “close” to $R^{−1} log (n/i)+R^{−1} d$ , i=1,…,k_n. Setting a=R⁻¹ and b=R⁻¹d, a least squares estimator may then be obtained by minimizing $f_{1} (a,b)=∑_{i=1}^{k_{n}} (z_{(i)} −a log (n/i)−b)^{2}$ , leading to the estimator $R ̂_{1} (k_{n})≡ a ̂^{−1}$ . In the particular case where d=0, the minimization of f₂(a)=f₁(a,0) yields the estimator $R ̂_{2} (k_{n})$ . On the other hand, the direct minimization of f₃(R,d)=∑_i=1^k_n(log(n/i)−Rz_(i)+d)², leads to the least squares estimator $R ̂_{3} (k_{n})$ .

Considering the two points of view simultaneously, by minimizing the global sum of the areas of the rectangles indicated in Fig. 1, we obtain the estimator $R ̂ (k_{n})$ .

In this way, $R ̂ (k_{n})$ results from minimizing $f(R,d)=∑_{i=1}^{k_{n}} (log (n/i)−Rz_{(i)} +d)(R^{−1} log (n/i)+R^{−1} d−z_{(i)})$ , and is given by the geometric mean of $R ̂_{1} (k_{n})$ and $R ̂_{3} (k_{n})$ , that is $R ̂ (k_{n})= ∑_{i=1}^{k_{n}} log^{2} (n/i)−(1/k_{n}) ∑_{i=1}^{k_{n}} log (n/i)^{2} ∑_{i=1}^{k_{n}} Z_{n−i+1,n}^{2} −(1/k_{n}) ∑_{i=1}^{k_{n}} Z_{n−i+1,n}^{2} .$ Schultze and Steinebach (1996) established the consistency of the estimators $R ̂_{i} (k_{n})$ , i=1,2,3 and their corresponding asymptotic behaviour was subsequently investigated by Csörgő and Viharos (1997). Independent of these authors, Kratz and Resnick (1996) introduced an equivalent form of $1/ R ̂_{1} (k_{n})$ , designated by qq-estimator, in reference to the quantile–quantile plots (for this interpretation and application of qq-plots in this estimation problem, see also Beirlant et al., 1996). Kratz and Resnick proved the consistency and the asymptotic normality of the qq-estimator centred at 1/R. Not forcing the centring at 1/R, Csörgő and Viharos (1997) have shown that, for suitable sequences (k_n), $1/ R ̂_{i} (k_{n})$ , i=1,2,3, are universally asymptotically normal over the family (1), in the usual sense, that is, with deterministic centring sequences converging to 1/R. Moreover, for $1/ R ̂_{i} (k_{n})$ , i=1,3, the norming sequence is k_n^1/2, and as Csörgő and Viharos (1997) pointed out, these were the first estimators asymptotically normal over the whole family (1), with the ideal factor k_n^1/2.

The above estimation problem is equivalent to the estimation of the tail index of a Pareto type distribution. In fact, setting X_i=e^Z_i with Z_i, i=1,2,… as above, we have $1−G(x)=P(X_{1} >x)=x^{−1/α} L(x), x>0,$ where α=1/R and $L(x)=r(log x)$ is slowly varying at infinity. The qq-estimator was actually introduced under (7). In this context, several estimators have been proposed. One of the most commonly used estimators for α, is the Hill estimator (1975), defined by $H_{n} (k_{n})= 1 k_{n} ∑_{i=1}^{k_{n}} log X_{n−i+1,n} − log X_{n−k_{n},n},$ where X_1,n≤X_2,n≤⋯≤X_n,n denote the order statistics of the sample X₁,X₂,…,X_n (for related estimators, see, e.g. De Haan and Resnick, 1980, Csörgő et al., 1985, Bacro and Brito, 1993). The asymptotic properties of the Hill estimator have been much studied and it is well known that, under certain conditions, H_n(k_n) is a strongly consistent estimator (cf. Deheuvels et al., 1988) with asymptotic normal distribution (cf. Haeusler and Teugels, 1985).

In this paper, we investigate the asymptotic properties of the geometric-type estimator $R ̂ (k_{n})$ . In particular, we shall give conditions which ensure the asymptotic normality of $R ̂ (k_{n})$ when centred at R. We shall also see that $1/ R ̂ (k_{n})$ is universally asymptotically normal over the family (1). We recall that this property is not shared by the Hill estimator (see, e.g. Csörgő and Viharos, 1998). Moreover, the norming sequence is again the ideal factor k_n^1/2. This specific property, jointly with the fact that $R ̂ (k_{n})$ takes values between those of $R ̂_{1} (k_{n})$ and $R ̂_{3} (k_{n})$ , makes the use of the estimator $R ̂ (k_{n})$ specially attractive for the case where R is expected to be small. The application in risk theory considered here is of this kind. Our results are given in Section 2 and the proofs are collected in Section 3. The application in the estimation of the adjustment coefficient is discussed in Section 4. One complex practical problem is the choice of the number of observations included in the estimation of R. We consider here an heuristic method suggested by Schultze and Steinebach (1996) and adapt it to our estimator $R ̂ (k_{n})$ . This procedure is applied in a small-scale simulation study and the corresponding results are contained in Section 5.

Section snippets

Results

We begin by considering the consistency of the estimator $R ̂ (k_{n})$ defined by (6). In the sequel, $→ D$ and $= D$ stand, respectively, for convergence and equality in distribution. In the same way, $→ P$ denotes convergence in probability.

Theorem 1

Assume that F satisfies condition (1) and k_n is a sequence of positive integers satisfying (3) and such that $lim_{n→∞} log^{2} n/k_{n} =0$ . If F⁻¹ is continuous on (s₀,1) for some s₀∈(0,1), then, $R ̂ (k_{n}) → P R.$

As noted in Section 1, $R ̂ (k_{n})$ is the geometric mean of the estimators $R ̂_{1} (k_{n})$

Proofs

Throughout this section we shall assume that (1) holds. We assume also that U₁,U₂,… is a sequence of independent uniform U(0,1) random variables. The order statistics of the sample (U₁,U₂,…,U_n) are denoted by U_1,n≤U_2,n≤⋯≤U_n,n.

Proof of Theorem 1

Schultze and Steinebach (1996) proved that if k_n satisfies (3) and $log^{2} n/k_{n} →0$ as n→∞, then $R ̂_{1} (k_{n})$ is a consistent estimator of R. Moreover, if F⁻¹ is continuous on (s₀,1) for some s₀∈(0,1), then $R ̂_{3} (k_{n})$ is also a consistent estimator of R.

Thus, since $R ̂ (k_{n})= R ̂_{1} (k_{n}) R ̂_{3} (k_{n})$

Estimating the adjustment coefficient in risk theory

The problem of estimating the coefficient R in Eq. (1) is motivated by an important problem in risk theory. Consider the Sparre Andersen model for claims arriving at an insurance company, and assume that the sequence C₁,C₂,… of claims occur at times T₁,T₁+T₂,…, where {C_i} and {T_i} are independent sequences of i.i.d. r.v.’s. Starting with initial capital x and with incoming premiums in the time interval [0,t] equal to γt, the risk reserve is $S(t)=x+γt−∑_{i=1}^{N(t)} C_{i},$ where N(t)=max{n≥0:∑_i=1ⁿT_i≤t} is

Simulation results

Below we extend the simulation study of Schultze and Steinebach (1996) to the estimator $R ̂ (k_{n})$ , where samples Z₁,Z₂,…,Z_n have been simulated making use of the exact distribution F of the above example, or more precisely, its quantile function $F^{−1} (1−s)= β 1−a log a(1−a) s +a^{2}, 0<s< a 1+a, 0 elsewhere,$ where a=β/α<1. For sake of comparison with related studies (see Schultze and Steinebach, 1996 and references therein) we take $(α,β)=(24 000,10 000)$ , resulting in R=5.8(3)×10⁻⁵.

In this section we illustrate the

Acknowledgements

The research of the second author was partially supported by PRODEP III, Action 5.3. The authors also thank the referee for his comments.

References (18)

J.N. Bacro et al.
A tail bootstrap procedure for estimating the tail Pareto index
Journal of Statistical Planning and Inference
(1998)
M. Csörgő et al.
On the estimation of the adjustment coefficient in risk theory via intermediate order statistics
Insurance: Mathematics and Economics
(1991)
Bacro, J.N., Brito, M., 1993. Strong limiting behaviour of a simple tail Pareto-index estimator. Statistics and...
J. Beirlant et al.
Tail estimation, Pareto quantile plots, and regression diagnostics
Journal of the American Statistical Association
(1996)
Bingham, N.H., Goldie, C.M., Teugels, J.L., 1987. Regular Variation. Cambridge University Press,...
Brito, M., Moreira, A.C., 2001. Estimação do Coeficiente de Cauda Exponencial. In: Oliveira, P., Athayde, E. (Eds.), Um...
S. Csörgő et al.
Asymptotic normality of least squares estimators of tail indices
Bernoulli
(1997)
Csörgő, S., Viharos, L., 1998. Estimating the tail index. In: Szyszkowicz, B. (Ed.), Asymptotic Methods in Probability...
S. Csörgő et al.
Kernel estimates of the tail index of a distribution
The Annals of Statistics
(1985)

There are more references available in the full text version of this article.

Cited by (11)

On tail index estimation using a sample with missing observations
2012, Statistics and Probability Letters
For the sequence of heavy-tailed, dependent and heterogeneous random variables with the missing observations the estimation of the tail-index is considered. Under minimal but verifiable assumption of “extremal dependence” we proved the consistency of a geometric-type estimator (Brito and Freitas, 2003). We extended results from Mladenović and Piterbarg (2008) and proved the consistency and the asymptotic normality of the Hill estimator. Illustrative examples are provided.
Consistent estimation of the tail index for dependent data
2010, Statistics and Probability Letters
Edgeworth expansion for an estimator of the adjustment coefficient
2008, Insurance: Mathematics and Economics
Citation Excerpt :
The general conditions ensuring the asymptotic normality are stated in the theorem below. This result follows directly from Proposition 1 of Brito and Freitas (2003). Now consider the normalized estimator
We establish an Edgeworth expansion for an estimator of the adjustment coefficient $R$ , directly related to the geometric-type estimator for general exponential tail coefficients, proposed in [Brito, M., Freitas, A.C.M., 2003. Limiting behaviour of a geometric-type estimator for tail indices. Insurance Math. Econom. 33, 211–226].Using the first term of the expansion, we construct improved confidence bounds for $R$ . The accuracy of the approximation is illustrated using an example from insurance (cf. [Schultze, J., Steinebach, J., 1996. On least squares estimates of an exponential tail coefficient. Statist. Dec. 14, 353–372]).
Weak convergence of a bootstrap geometric-type estimator with applications to risk theory
2006, Insurance: Mathematics and Economics
Based on least square considerations, Brito and Moreira Freitas [Brito, M., Moreira Freitas, A.C., 2003. Limiting behaviour of a geometric-type estimator for tail indices. Insurance: Math. Econ. 33, 211–226] proposed a geometric-type estimator for estimating an exponential tail coefficient. We consider here the tail bootstrap method introduced by Bacro and Brito [Bacro, J.N., Brito, M., 1998. A tail bootstrap procedure for estimating the tail Pareto index. J. Stat. Plan. Infer. 71, 245–260] and show that this procedure works for this estimator. Moreover, we extend the application given in Brito and Moreira Freitas [Brito, M., Moreira Freitas, A.C., 2003. Limiting behaviour of a geometric-type estimator for tail indices. Insurance: Math. Econ. 33, 211–226], by showing that the results obtained may be applied to the related problem of estimating the adjustment coefficient in the Sparre Andersen model, under the standard conditions.
The climate niche of Homo Sapiens
2023, arXiv
Testing the Dismal Theorem
2022, Journal of the Association of Environmental and Resource Economists

View all citing articles on Scopus

View full text

Limiting behaviour of a geometric-type estimator for tail indices

Abstract

Introduction

Section snippets

Results

Proofs

Estimating the adjustment coefficient in risk theory

Simulation results

Acknowledgements

Journal of Statistical Planning and Inference

Insurance: Mathematics and Economics

Tail estimation, Pareto quantile plots, and regression diagnostics

Journal of the American Statistical Association

Asymptotic normality of least squares estimators of tail indices

Bernoulli

Kernel estimates of the tail index of a distribution

The Annals of Statistics