Loading [MathJax]/jax/output/SVG/jax.js
Research article

A study on the estimator for the extreme value index of heavy-tailed distribution generated from moment statistic

  • Received: 27 December 2024 Revised: 09 April 2025 Accepted: 15 April 2025 Published: 18 April 2025
  • In extreme value statistics, the extreme value index of heavy-tailed distribution is closely related to the probability of occurrence of extreme events, and its estimator has become a major research topic. Based on the moment statistic, we constructed a class of estimators with four parameters for the extreme value index of heavy-tailed distribution. The consistency and asymptotic normality of the proposed estimator were proved under the first-order regular variation condition and the second-order regular variation condition. Specific expressions for ten estimators were given by the specific values of the parameters, which contain both existing estimators in the literature and newly derived ones. The asymptotical unbiasedness of specific new estimators was discussed, and some of the asymptotical unbiased estimators were compared with existing ones in terms of asymptotic variance. The results show that the new estimators perform better. In addition, in the finite sample case, using Monte-Carlo simulation, it can be seen from the simulated mean value and mean square error that the obtained results are in line with the theoretical analysis among the asymptotical unbiased estimators compared. Furthermore, it can be concluded that some of the new estimators perform better at the optimal level.

    Citation: Shuai Chang, Jinrui Guan. A study on the estimator for the extreme value index of heavy-tailed distribution generated from moment statistic[J]. Electronic Research Archive, 2025, 33(4): 2295-2311. doi: 10.3934/era.2025101

    Related Papers:

    [1] Mahmoud El-Morshedy, Mohamed S. Eliwa, Mohamed El-Dawoody, Hend S. Shahen . A weighted hybrid discrete probability model: Mathematical framework, statistical analysis, estimation techniques, simulation-based ranking, and goodness-of-fit evaluation for over-dispersed data. Electronic Research Archive, 2025, 33(4): 2061-2091. doi: 10.3934/era.2025091
    [2] Ke Liu, Hanzhong Liu . Testing for individual and time effects in unbalanced panel data models with time-invariant regressors. Electronic Research Archive, 2022, 30(12): 4574-4592. doi: 10.3934/era.2022232
    [3] Lei Liu, Jun Dai . Estimation of partially linear single-index spatial autoregressive model using B-splines. Electronic Research Archive, 2024, 32(12): 6822-6846. doi: 10.3934/era.2024319
    [4] Showkat Ahmad Lone, Hanieh Panahi, Sadia Anwar, Sana Shahab . Estimations and optimal censoring schemes for the unified progressive hybrid gamma-mixed Rayleigh distribution. Electronic Research Archive, 2023, 31(8): 4729-4752. doi: 10.3934/era.2023242
    [5] Chi Liu, Ruiqin Tian, Dengke Xu . Generalized estimation equations method for fixed effects panel interval-valued data models. Electronic Research Archive, 2025, 33(6): 3733-3755. doi: 10.3934/era.2025166
    [6] Amal S. Hassan, Najwan Alsadat, Mohammed Elgarhy, Laxmi Prasad Sapkota, Oluwafemi Samson Balogun, Ahmed M. Gemeay . A novel asymmetric form of the power half-logistic distribution with statistical inference and real data analysis. Electronic Research Archive, 2025, 33(2): 791-825. doi: 10.3934/era.2025036
    [7] Yusen Lin . Periodic measures of reaction-diffusion lattice systems driven by superlinear noise. Electronic Research Archive, 2022, 30(1): 35-51. doi: 10.3934/era.2022002
    [8] Xintao Li, Rongrui Lin, Lianbing She . Periodic measures for a neural field lattice model with state dependent superlinear noise. Electronic Research Archive, 2024, 32(6): 4011-4024. doi: 10.3934/era.2024180
    [9] Sadia Anwar, Showkat Ahmad Lone, Aysha Khan, Salmeh Almutlak . Stress-strength reliability estimation for the inverted exponentiated Rayleigh distribution under unified progressive hybrid censoring with application. Electronic Research Archive, 2023, 31(7): 4011-4033. doi: 10.3934/era.2023204
    [10] Wenhui Niu, Xinhong Zhang, Daqing Jiang . Dynamics and numerical simulations of a generalized mosquito-borne epidemic model using the Ornstein-Uhlenbeck process: Stability, stationary distribution, and probability density function. Electronic Research Archive, 2024, 32(6): 3777-3818. doi: 10.3934/era.2024172
  • In extreme value statistics, the extreme value index of heavy-tailed distribution is closely related to the probability of occurrence of extreme events, and its estimator has become a major research topic. Based on the moment statistic, we constructed a class of estimators with four parameters for the extreme value index of heavy-tailed distribution. The consistency and asymptotic normality of the proposed estimator were proved under the first-order regular variation condition and the second-order regular variation condition. Specific expressions for ten estimators were given by the specific values of the parameters, which contain both existing estimators in the literature and newly derived ones. The asymptotical unbiasedness of specific new estimators was discussed, and some of the asymptotical unbiased estimators were compared with existing ones in terms of asymptotic variance. The results show that the new estimators perform better. In addition, in the finite sample case, using Monte-Carlo simulation, it can be seen from the simulated mean value and mean square error that the obtained results are in line with the theoretical analysis among the asymptotical unbiased estimators compared. Furthermore, it can be concluded that some of the new estimators perform better at the optimal level.



    The heavy-tailed distribution is an important model in extreme value statistics with applications in finance, insurance, meteorology, and hydrology. Its primary parameter is the positive extreme value index (also known as extreme value index of heavy-tailed distribution, abbreviated as heavy-tailed index), which characterizes the probability of extreme events such as the catastrophic flood on the 100-year return period, enormous earthquakes, large insurance claims, and so on (see [1,2]). Therefore, estimators for the extreme value index has become one of the main research problems in extreme value statistics.

    The estimation of the extreme value index started earlier. However, in the 1970s, researchers started working on semi-parametric estimators of extreme value index. The seminal and most famous estimator is the Hill estimator [3], which is favored by many scholars for its simplicity in form; however, it is sensitive to the threshold k. In general, the Hill estimator has a large variance for small values of k, while large values of k usually induce a high bias. This means that an inadequate selection of k can result in large expected errors. Therefore, the threshold selection is also one of the most fundamental problems in extreme value statistics. The existing methods include graphical diagnostics and heuristic procedures based on sample paths' stability as a function of k, as well as minimization of the estimator of the mean square error, also as functions of k. More comprehensive reviews about threshold selection can be found in [4,5].

    Afterward, Dekkers et al. [6] proposed a moment estimator by constructing a moment statistic, which improved the Hill estimator. By use of the moment statistic, many estimators for the extreme value index have been proposed, such as the moment ratio estimator [7], the estimator proposed by Gomes and Martins [8], the estimator proposed by Caeiro and Gomes [9], and Lehmer's mean-of-order-p(Lp) estimator [10]; more estimators can be found in [11,12]. As a further study, based on the moment statistic, this paper constructs a class of heavy-tailed index estimators with four parameters. Many estimators can be obtained from specific parameter values, which include not only the existing estimators in the literature but also newly derived ones. The consistency and asymptotic normality of the proposed estimators are investigated under the first-order regular variation and second-order regular variation conditions. The asymptotical unbiasedness of the specified estimators is discussed, and a comparison of the main components of the asymptotic variance is performed in the asymptotical unbiased estimators. In addition, their finite sample performance is analyzed through a Monte-Carlo simulation in terms of the simulated mean value and mean square error. The results show that some of the new estimators perform better.

    Let X be a non-negative random variable with a distribution function F(x), for sufficiently large x>0, satisfying

    ¯F(x):=1F(x)=x1/γL(x),γ>0, (1.1)

    we call F(x) a heavy-tailed distribution, and γ is the heavy-tailed index, one of the primary parameters of extreme events, where L(x)>0 is a slow varying function at infinity, that is, for all t>0,

    limx+L(tx)L(x)=1.

    If a positive measure function g(x) has infinite right endpoints and satisfies that for all t>0, there are

    limx+g(tx)g(x)=tα,

    then g(x) is said to be a regularly varying function with index α, denoted gRα.

    Let us denote U(t):=F(11/t)t>1 as the tail quantile function, and F(t):=inf{x:F(x)t},0<t<1 as a generalized inverse function of F(x); from [13], the heavy-tailed distribution F(x) has the following equivalence relation

    ¯FR1/γURγ. (1.2)

    In general, the equivalence relation (1.2) is called the first-order regular variation condition.

    The classical heavy-tailed index estimators are all composed of the number k of the top order statistics, whose consistency are obtained under the first-order regular variation condition (1.2) and the following condition on the sequences k,

    k=k(n)+,kn0,n+. (1.3)

    In general, we call this k intermediate if it satisfies (1.3).

    In addition, in order to obtain the asymptotic normality of the heavy-tailed index estimator, the second-order regular variation condition is often needed. The F(x) is said to satisfy the second-order regular variation condition if there exists an eventually positive regular varying function A with index ρ0, that is |A(t)|Rρ, such that

    limt+lnU(tx)lnU(t)γlnxA(t)={xρ1ρ,ifρ<0,lnx,ifρ=0, (1.4)

    for every x>0, where ρ is the second-order parameter. In this paper, we only consider the case ρ<0.

    To construct new estimators, we shall consider the following moment statistic and give some heavy-tailed index estimators.

    Let X1,X2,,Xn be a sample of n independent observations from a common unknown distribution function F(x), X1,nX2,nXn,n denote the ascending order statistics associated with the sample X1,X2,,Xn.

    Let us consider the moment statistic

    M(α)n(k):=1kki=1(lnXni+1,nlnXnk,n)α,α0. (1.5)

    Now we introduce several heavy-tailed index estimators constructed from the moment statistic.

    The Hill estimator is expressed as follows:

    ˆγH(k)H(k):=M(1)n(k). (1.6)

    The monent estimator is expressed as follows:

    ˆγM(k)M(k):=M(1)n(k)+112(1(M(1)n(k))2M(2)n(k))1. (1.7)

    The moment estimator is an asymptotical unbiased estimator when γ,ρ satisfy γγρ+ρ=0.

    The moment ratio estimator is expressed as follows:

    ˆγMR(k)MR(k):=M(2)n(k)2H(k). (1.8)

    The estimators in [8] are expressed as follows:

    ˆγαGM1(k)=GM1(α):=M(α)n(k)Γ(α+1)[M(1)n(k)]α1,α>0, (1.9)

    and

    ˆγαGM2(k)=GM2(α):=(M(α)n(k)Γ(α+1))1/α,α>0, (1.10)

    where for any ρ<0, when α>2, there exists α0 satisfying (1ρ)α01[1+ρ(α02)]=1, such that GM1(α0) is an asymptotical unbiased estimator. In addition, note that the estimator GM1(α) becomes the moment ratio estimator when α=2.

    The estimator in [9] is expressed as follows:

    ˆγαCG(k)=CG(α):=Γ(α)M(α1)n(k)(M(2α)n(k)Γ(2α+1))1/2,α1, (1.11)

    where for any ρ<0, CG(α) is an asymptotical unbiased estimator when α=α0=ln[1ρ(1ρ)21]ln(1ρ).

    The Lp estimator in [10] is expressed as follows:

    ˆγLp(k)=Lp(k):=M(p)n(k)pM(p1)n(k),p1, (1.12)

    where the Lp estimator becomes the moment ratio estimator when p=2.

    Based on the moment statistic (1.5), we construct a class of heavy-tailed index estimators with four parameters, whose expression is as follows:

    ˆγMG(k,a,b,α,β)MG(a,b,α,β):=H(k)(M(a)n(k)Γ(a+1)Ha(k))α(Γ(b+1)Hb(k)M(b)n(k))β, (2.1)

    where a,b0,α,βR, M(a)n(k) and H(k), see (1.5) and (1.6), respectively.

    Before discussing the properties of the proposed estimator, the following two lemmas are introduced.

    Lemma 2.1. [8] Let X1,X2,,Xn be a sample of n independent observations from a common unknown distribution function F(x), X1,nX2,nXn,n denote the ascending order statistics associated with the sample X1,X2,,Xn. If F(x) satisfies the first-order regular variation condition (1.2) and k is intermediate, then the moment statistic

    M(α)n(k)PΓ(α+1)γα. (2.2)

    Lemma 2.2. [8] Let X1,X2,,Xn be a sample of n independent observations from a common unknown distribution function F(x), X1,nX2,nXn,n denote the ascending order statistics associated with the sample X1,X2,,Xn. If F(x) satisfies the second-order regular variation condition (1.4) and k is intermediate, then the moment statistic has the following asymptotic distributional representation

    M(α)n(k)d=Γ(α+1)γα+γασM(α)kZ(α)k+γα1bM(α)A(n/k)+op(A(n/k)),

    where bM(α)=Γ(α+1)ρ(1(1ρ)α1), σM(α)=Γ(2α+1)Γ2(α+1), Z(α)k:=1kki=1(EαiΓ(α+1))σM(α)/k is an asymptotical standard normal random variable, {Ei}, i=1,2,,n are independent identically distributed standard exponential random variables, and the covariance of Z(α)k and Z(β)k is denoted by

    σM(α,β):=cov(Z(α)k,Z(β)k)=Γ(α+β+1)Γ(α+1)Γ(β+1)σM(α)σM(β).

    The consistency and asymptotic normality of the new estimators are discussed below.

    Theorem 2.1. Let X1,X2,,Xn be a sample of n independent observations from a common unknown distribution function F(x), X1,nX2,nXn,n denote the ascending order statistics associated with the sample X1,X2,,Xn. If F(x) satisfies the first-order regular variation condition (1.2) and k is intermediate, then

    ˆγMG(k,a,b,α,β)Pγ. (2.3)

    Proof. According to Lemma 2.1 and the continuous mapping theorem, we have H(k)Pγ, (M(a)n(k)Γ(a+1)Ha(k))αP1, and (Γ(b+1)Hb(k)M(b)n(k))βP1. Using the continuous mapping theorem again, we obtain ˆγMG(k,a,b,α,β)Pγ.

    Theorem 2.2. Let X1,X2,,Xn be a sample of n independent observations from a common unknown distribution function F(x), X1,nX2,nXn,n denote the ascending order statistics associated with the sample X1,X2,,Xn. If F(x) satisfies the second-order regular variation condition (1.4) and k is intermediate, then

    ˆγMG(k,a,b,α,β)d=γ+σMG(a,b,α,β)kPMGk+bMG(a,b,α,β)A(n/k)(1+op(1)), (2.4)

    where

    bMG(a,b,α,β)=αρ1(1ρ)a(1ρ)aβρ1(1ρ)b(1ρ)b+(βbαa+1)11ρ,
    σ2MG(a,b,α,β)=α2γ2(Γ(2a+1)Γ2(a+1))Γ2(a+1)+β2γ2(Γ(2b+1)Γ2(b+1))Γ2(b+1)+2αβγ2(Γ(a+b+1)Γ(a+1)Γ(b+1))Γ(a+1)Γ(b+1)+[1(βbαa)2]γ2,
    PMGk:=αγσM(a)Γ(a+1)Z(a)kβγσM(b)Γ(b+1)Z(b)k+(βbαa+1)γσM(1)Z(1)kσMG(a,b,α,β)

    is an asymptotical standard normal random variable.

    Proof. By Lemma 2.2, we obtain

    H(k)=M(1)n(k)d=γ+γσM(1)kZ(1)k+bM(1)A(n/k)(1+op(1)), (2.5)

    and

    M(a)n(k)Γ(a+1)d=γa[1+σM(a)Γ(a+1)Z(a)kk+bM(a)γΓ(a+1)A(n/k)(1+op(1))]. (2.6)

    Using (1+x)a=1+ax+o(x),x0, one gets

    Ha(k)d=γa[1+aσM(1)kZ(1)k+abM(1)γA(n/k)(1+op(1))].

    From 11+x=1x+o(x),x0, it follows that

    1Ha(k)d=γa[1aσM(1)kZ(1)kabM(1)γA(n/k)(1+op(1))]. (2.7)

    Combining with (2.6) and (2.7), we obtain

    M(a)n(k)Γ(a+1)Ha(k)d=γa[1+σM(a)Γ(a+1)Z(a)kk+bM(a)γΓ(a+1)A(n/k)(1+op(1))]×γa[1aσM(1)kZ(1)kabM(1)γA(n/k)(1+op(1))]=1+σM(a)Γ(a+1)Z(a)kkaσM(1)kZ(1)k+[bM(a)γΓ(a+1)abM(1)γ]A(n/k)(1+op(1)).

    Applying (1+x)α=1+αx+o(x),x0 again, we obtain

    (M(a)n(k)Γ(a+1)Ha(k))αd=1+ασM(a)Γ(a+1)Z(a)kkαaσM(1)kZ(1)k+αγ[bM(a)Γ(a+1)abM(1)]A(n/k)(1+op(1)). (2.8)

    Similar to the proof of (2.8), we have

    (Γ(b+1)Hb(k)M(b)n(k))βd=1βσM(b)Γ(b+1)Z(b)kk+βbσM(1)kZ(1)kβγ[bM(b)Γ(b+1)bbM(1)]A(n/k)(1+op(1)). (2.9)

    Thus, combining with (2.5), (2.8), and (2.9), it follows that

    ˆγMG(k,a,b,α,β)=H(k)(M(a)n(k)Γ(a+1)Ha(k))α(Γ(b+1)Hb(k)M(b)n(k))βd=[γ+γσM(1)kZ(1)k+bM(1)A(n/k)(1+op(1))]×{1+ασM(a)Γ(a+1)Z(a)kkαaσM(1)kZ(1)k+αγ[bM(a)Γ(a+1)abM(1)]A(n/k)(1+op(1))}×{1βσM(b)Γ(b+1)Z(b)kk+βbσM(1)kZ(1)kβγ[bM(b)Γ(b+1)bbM(1)]A(n/k)(1+op(1))}=γ+αγσM(a)Γ(a+1)Z(a)kkβγσM(b)Γ(b+1)Z(b)kk+(βbαa+1)γσM(1)Z(1)kk+[αbM(a)Γ(a+1)βbM(b)Γ(b+1)+(βbαa+1)σM(1)]A(n/k)(1+op(1))=:γ+σMG(a,b,α,β)kPMGk+bMG(a,b,α,β)A(n/k)(1+op(1)).

    From the above equation, we can see that

    bMG(a,b,α,β)=αbM(a)Γ(a+1)βbM(b)Γ(b+1)+(βbαa+1)σM(1)=αρ1(1ρ)a(1ρ)aβρ1(1ρ)b(1ρ)b+(βbαa+1)11ρ,
    PMGk:=αγσM(a)Γ(a+1)Z(a)kβγσM(b)Γ(b+1)Z(b)k+(βbαa+1)γσM(1)Z(1)kσMG(a,b,α,β).

    Let A=αγΓ(a+1), B=βγΓ(b+1) and C=(βbαa+1)γ, we have

    σMG(a,b,α,β)PMGk=AσM(a)Z(a)k+BσM(b)Z(b)k+CσM(1)Z(1)k=Akki=1(EaiΓ(a+1))+Bkki=1(EbiΓ(b+1))+Ckki=1(Ei1)=1kki=1[AEai+BEbi+CEi(AΓ(a+1)+BΓ(b+1)+C)]=:1kki=1[Yi(AΓ(a+1)+BΓ(b+1)+C)]

    From Lemma 2.2, one can deduce that {Yi}, i=1,2,,k are independent, identically distributed random variables. It is easy to obtain that E(Yi)=AΓ(a+1)+BΓ(b+1)+C and Var(Yi)=σ2MG(a,b,α,β). Applying Lindeberg-Lévy central limit theorem, we get that PMGk is an asymptotical standard normal random variable, and

    σ2MG(a,b,α,β)=Var(αγσM(a)Γ(a+1)Z(a)kβγσM(b)Γ(b+1)Z(b)k+(βbαa+1)γσM(1)Z(1)k)=α2γ2σ2M(a)Γ2(a+1)+β2γ2σ2M(b)Γ2(b+1)+(βbαa+1)2γ2σ2M(1)2αβγ2σM(a)σM(b)Γ(a+1)Γ(b+1)σM(a,b)+2α(βbαa+1)γ2σM(a)σM(1)Γ(a+1)σM(a,1)2β(βbαa+1)γ2σM(b)σM(1)Γ(b+1)σM(b,1)=α2γ2(Γ(2a+1)Γ2(a+1))Γ2(a+1)+β2γ2(Γ(2b+1)Γ2(b+1))Γ2(b+1)+(βbαa+1)2γ22αβγ2(Γ(a+b+1)Γ(a+1)Γ(b+1))Γ(a+1)Γ(b+1)+2(αaβb)(βbαa+1)γ2=α2γ2(Γ(2a+1)Γ2(a+1))Γ2(a+1)+β2γ2(Γ(2b+1)Γ2(b+1))Γ2(b+1)+2αβγ2(Γ(a+b+1)Γ(a+1)Γ(b+1))Γ(a+1)Γ(b+1)+[1(βbαa)2]γ2.

    Corollary 2.1. Under the conditions of Theorem 2.2, suppose that kA(n/k)λ<+,asn+, then

    k(ˆγMG(k,a,b,α,β)γ)dN(λbMG(a,b,α,β),σ2MG(a,b,α,β)).

    Remark 1. For every ρ<0, if there exist a,b,α,β, such that bMG(a,b,α,β)=0, then the corresponding MG(a,b,α,β) is an asymptotical unbiased estimator of γ, even when λ0.

    The estimator MG(a,b,α,β) has four parameters, and for the sake of dealing with practical problems, one can consider specifying the parameters to obtain different specific estimator expressions. We give ten specific estimators below.

    (E1) MG(a,b,0,0)=H(k):=M(1)n(k);

    (E2) MG(a,b,1,0)=GM1(a):=M(a)n(k)Γ(a+1)[M(1)n(k)]a1,a>0;

    (E3) MG(a,b,1/a,0)=GM2(a):=(M(a)n(k)Γ(a+1))1/a,a>0;

    (E4) MG(2a,a1,1/2,1)=CG(a):=Γ(a)M(a1)n(k)(M(2a)n(k)Γ(2a+1))1/2,a1;

    (E5) MG(a,a1,1,1)=La(k):=M(a)n(k)aM(a1)n(k),a1;

    (E6) MG(2a,a,1/2,1)=H(k)(M(2a)n(k)Γ(2a+1))1/2Γ(a+1)M(a)n(k);

    (E7) MG(2a,a,1/2,0)=H(k)(M(2a)n(k)Γ(2a+1)H2a(k))1/2;

    (E8) MG(2,b,a,0)=H(k)(M(2)n(k)2H2(k))a;

    (E9) MG(a,2,1,a/2)=2a/2H(k)M(a)n(k)Γ(a+1)(M(2)n(k))a/2;

    (E10) MG(2,a,1,1)=M(2)n(k)M(a)n(k)2Γ(a+1)Ha+1(k).

    Among the above-mentioned ten estimators, (E1)–(E5) are existing estimators and (E6)–(E10) are new estimators. As described in the literature [3,8,9,10], (E1), (E3), and (E5) are asymptotical biased estimators, while (E2) and (E4) are asymptotical unbiased estimators when appropriate parameters are chosen. The asymptotic unbiasedness of the new estimators (E6)–(E10) is discussed below.

    Let f(x)=1(1ρ)xρ(1ρ)x,x>0,ρ<0, then f(x)=ln(1ρ)ρ1(1ρ)x>0. Therefore, f(x) is an increasing function for a given ρ.

    Thus, the main component of the asymptotic bias of the new estimator bMG(a,b,α,β) in Theorem 2.2 can be rewritten as

    bMG(a,b,α,β)=αf(a)βf(b)+(βbαa+1)f(1).

    Since the main component of the asymptotic bias of the estimator (E6)–(E10) involves only one parameter, it can be abbreviated as bMGi(a),i=6,7,8,9,10 for convenience. The asymptotic unbiasedness of the estimators (E6)–(E10) is discussed below.

    For the estimator (E6), the main component of the asymptotic bias is bMG6(a)=12f(2a)f(a)+f(1). Since dbMG6(a)da=f(2a)f(a)=ln(1ρ)ρ(1(1ρ)2a1(1ρ)a)<0, bMG6(a) is a decreasing function on a for a given ρ<0. Also, bMG6(0)=11ρ>0, and lima+bMG6(a)=1+ρ2ρ(1ρ). It is easy to see that 1+ρ2ρ(1ρ)<0 when 1+ρ>0. There exists a0 such that bMG6(a0)=0, i.e., the estimator (E6) with the parameter a0 is an asymptotical unbiased estimator.

    For the estimator (E7), the main component of the asymptotic bias is bMG7(a)=12f(2a)+(1a)f(1). Since bMG7(0)=11ρ>0, and lima+bMG7(a)=, there always exists a0 such that bMG7(a0)=0, i.e., the estimator (E7) with the parameter a0 is asymptotically unbiased, where a0 satisfies (1ρ)2a01[(2a03)ρ+1]=1.

    For the estimator (E8), the main component of the asymptotic bias is bMG8(a)=af(2)+(12a)f(1). Since bMG8(0)=11ρ>0, and lima+bMG8(a)=, there always exists a0 such that bMG8(a0)=0, i.e., the estimator (E8) with the parameter a0 is asymptotically unbiased, where a0=11/ρ.

    For the estimator (E9), the main component of the asymptotic bias is bMG9(a)=f(a)a2f(2)+f(1). Since bMG9(0)=11ρ>0, and lima+bMG9(a)=, there always exists a0 such that bMG9(a0)=0, i.e., the estimator (E9) with the parameter a0 is asymptotically unbiased, where a0 satisfies (1ρ)a02[(4a0)ρ2+(2a06)ρ+2]=2.

    For the estimator (E10), the main component of the asymptotic bias is bMG10(a)=f(a)+f(2)(1+a)f(1). Since bMG10(0)=f(2)f(1)>0, and lima+bMG10(a)=, there always exists a0 such that bMG10(a0)=0, i.e., the estimator (E10) with the parameter a0 is asymptotically unbiased, where a0 satisfies (1ρ)a02[(1a0)ρ2+(a03)ρ+1]=1.

    In summary, the estimator (E6) can be asymptotically unbiased only if it satisfies 1+ρ>0 and a suitable a is chosen. However, for any ρ<0, one can always find a such that the estimators (E7)–(E10) are asymptotically unbiased and the value of a depends only on ρ.

    Next, in the above-mentioned asymptotical unbiased estimators, we compare the main components of their asymptotic variances σ2MG(a,b,α,β). The asymptotical unbiased estimators (E2), (E4), (E7), (E8), (E9), and (E10) are considered here. For convenience, the main components of the asymptotic variance of the estimators involved are denoted as σ2i,i=2,4,7,8,9,10. Given different values of ρ, we compute the ratio of σ2i,i=2,4,7,8,9,10 to γ2 in the compared estimators (the ratio depends only on ρ) and the corresponding values of a0. The results of the calculations are shown in Table 1. From Table 1, we can draw the following conclusions.

    Table 1.  Comparison of σ2i/γ2,i=2,4,7,8,9,10 in the asymptotic unbiased estimators and the corresponding values of a0.
    ρ -0.25 -0.5 -0.75 -1 -1.25 -1.5 -1.75 -2
    σ22/γ2 47.22 14.64 8.707 6.443 5.287 4.596 4.140 3.818
    (a0) (3.91) (3.17) (2.86) (2.69) (2.58) (2.50) (2.44) (2.39)
    σ24/γ2 277.8 38.75 17.21 10.92 8.133 6.609 5.661 5.019
    (a0) (3.11) (2.37) (2.07) (1.90) (1.79) (1.71) (1.65) (1.60)
    σ27/γ2 121.7 29.62 16.37 11.72 9.452 8.131 7.274 6.678
    (a0) (2.78) (2.26) (2.05) (1.93) (1.86) (1.80) (1.76) (1.73)
    σ28/γ2 26.00 10.00 6.444 5.000 4.240 3.778 3.469 3.250
    (a0) (5.00) (3.00) (2.33) (2.00) (1.80) (1.67) (1.57) (1.50)
    σ29/γ2 120.5 42.60 28.70 23.74 21.49 20.38 19.84 19.60
    (a0) (4.74) (4.12) (3.91) (3.81) (3.76) (3.73) (3.72) (3.71)
    σ210/γ2 35.72 11.18 6.700 5.000 4.142 3.637 3.310 3.083
    (a0) (3.52) (2.64) (2.24) (2.00) (1.84) (1.72) (1.63) (1.56)

     | Show Table
    DownLoad: CSV

    1) All ratios decrease as ρ decreases.

    2) For a given ρ, σ22/γ2, σ28/γ2 and σ210/γ2 are smaller, followed by σ24/γ2 and σ27/γ2, and finally σ29/γ2.

    3) For σ22/γ2, σ28/γ2 and σ210/γ2, when ρ>1, σ28/γ2<σ210/γ2<σ22/γ2; when ρ<1, σ210/γ2<σ28/γ2<σ22/γ2.

    Overall, among the asymptotical unbiased estimators compared, the estimators (E8) and (E10) perform better with smaller values of asymptotic variance.

    In order to investigate the performance of the estimators (E2), (E4), (E7)–(E10) mentioned in the previous section in the finite sample case, Monte-Carlo simulations are used to generate N=100 samples of sample size n=1000 from the following model.

    1) Frˊechet(γ) model, F(x)=exp(x1/γ),x0,ρ=1;

    2) Burr(γ,ρ) model, F(x)=1(1+xρ/γ)1/ρ,x0.

    For convenience, the estimators involved are denoted as ˆγi,i=2,4,7,8,9,10. And for each estimator, we computed the simulated mean value (E) and mean square error (MSE) of the estimators. The calculation formulas are as follows:

    E[ˆγi]:=1NNj=1ˆγij,MSE[ˆγi]:=1NNj=1(ˆγijγ)2.

    From the discussion in the previous section, it can be seen that the parameter values of the asymptotic unbiased estimators ˆγi,i=2,4,7,8,9,10 only depend on the parameter ρ. Therefore, the following simulations will be divided into two cases: ρ is known and ρ is unknown. For the case where ρ is unknown, we will adopt the approach of the literature [14,15] to give an estimator of ρ, which in turn can be used to obtain an estimator of the parameter a0 in the estimators compared. Specifically, the following is discussed.

    We estimate the parameter ρ by the following estimator proposed in [14].

    ˆρ=ˆρ(τ)n(k):=|3(R(τ)n(k)1)R(τ)n(k)3| (4.1)

    where

    R(τ)n(k):={(M(1)n(k))τ(M(2)n(k)/2)τ/2(M(2)n(k)/2)τ/2(M(3)n(k)/6)τ/3,ifτ0,ln(M(1)n(k))(1/2)ln(M(2)n(k)/2)(1/2)ln(M(2)n(k)/2)(1/3)ln(M(3)n(k)/6),ifτ=0.

    To decide which value (0 or 1) of the parameter τ to take in the aforementioned estimator, we use the algorithm provided in [15]. And for the estimator ˆρ, following the recommendation from [15], we use k=min{n1,[2n0.995/ln(ln(n))]}.

    The simulation results are shown in Figures 14. Figure 1 shows the simulated mean values and MSEs of the estimators involved under study for sample of size n=1000 from the Frˊechet(1) model when ρ is unknown. The simulated mean values show that all estimators are almost asymptotically unbiased. Regarding the simulated MSEs, for almost all values of k, we can see that

    MSE(ˆγ10)<MSE(ˆγ8)<MSE(ˆγ2)<MSE(ˆγ4)<MSE(ˆγ7)<MSE(ˆγ9)

    and MSE(ˆγ10) is almost equal to MSE(ˆγ8). This is generally consistent with the theoretical analysis.

    Figure 1.  Simulated mean values (left) and MSEs (right) of the estimators involved under study for sample of size n=1000 from Frˊechet(1) model (ρ is unknown).
    Figure 2.  Simulated mean values (left) and MSEs (right) of the estimators involved under study for sample of size n=1000 from Frˊechet(1) model (ρ is known).
    Figure 3.  Simulated mean values (left) and MSEs (right) of the estimators involved under study for sample of size n=1000 from Burr(1, -1) model (ρ is unknown).
    Figure 4.  Simulated mean values (left) and MSEs (right) of the estimators involved under study for sample of size n=1000 from Burr(1, -1) model (ρ is known).

    Figure 2 is the equivalent of Figure 1 when ρ is known and similar to Figure 7 in [9]. The simulated mean values show that all estimators are asymptotically unbiased. Regarding the simulated MSEs, for every k, we can see that

    MSE(ˆγ10)<MSE(ˆγ8)<MSE(ˆγ2)<MSE(ˆγ4)<MSE(ˆγ7)<MSE(ˆγ9).

    In addition, MSE(ˆγ10) is almost equal to MSE(ˆγ8) and MSE(ˆγ4) is closer to MSE(ˆγ7). This is consistent with the theoretical analysis.

    Figures 3 and 4 show the simulated mean values and MSEs of the estimators involved under study for sample of size n=1000 from the Burr(1, -1) model when ρ is unknown and known, respectively. They perform essentially the same in terms of the simulated mean value and MSEs. The simulated mean values show that the estimators seem to be asymptotically unbiased for some of the values of k.

    At this time, for most of the values of k, it is clear from the simulated MSEs that

    MSE(ˆγ10)<MSE(ˆγ8)<MSE(ˆγ2)<MSE(ˆγ4)<MSE(ˆγ7)<MSE(ˆγ9).

    For other different values of Burr's model, the conclusions obtained are similar to the results of the above analysis.

    In the following, the above-mentioned estimators are considered to be compared in terms of the simulated mean value and mean square error at the optimal level k0,

    k0:=argminkMSE(),

    where denote ˆγi,i=2,4,7,8,9,10. The estimator of the parameter α is given by the estimator of ρ in (4.1). And the estimators ˆγi,i=2,4,7,8,9,10 are denoted as ˆγi(αi0,ki0),i=2,4,7,8,9,10 at the optimal level. We have implemented Monte-Carlo simulation experiments of size 100 for sample sizes n=100,200,500,1000, and 2000, from the Frˊechet and Burr models. For each model, mean values with the smallest squared bias and the smallest root mean square error (RMSE) are presented in bold, where RMSE:=MSE. The simulated results are shown in Tables 25.

    Table 2.  Simulated mean values/RMSE of ˆγi(αi0,ki0),i=2,4,7,8,9,10 at their simulated optimal level from Frˊechet(1) model.
    n 100 200 500 1000 2000
    ˆγ2(α20,k20) 1.0186 / 0.1983 1.0606 / 0.1689 1.0128 / 0.1067 1.0135 / 0.0713 1.0139 / 0.0593
    ˆγ4(α40,k40) 1.0059 / 0.2051 1.0525 / 0.1772 1.0092 / 0.1139 1.0079 / 0.0770 1.0123 / 0.0648
    ˆγ7(α70,k70) 1.0037 / 0.2062 1.0587 / 0.1817 1.0185 / 0.1158 1.0113 / 0.0778 1.0134 / 0.0658
    ˆγ8(α80,k80) 1.0239 / 0.1965 1.0639 / 0.1656 1.0166 / 0.1042 1.0165 / 0.0692 1.0135 / 0.0574
    ˆγ9(α90,k90) 1.0137 / 0.2049 1.0866 / 0.1889 1.0475 / 0.1205 1.0278 / 0.0760 1.0268 / 0.0675
    ˆγ10(α100,k100) 1.0241 / 0.1966 1.0634 / 0.1650 1.0123 / 0.1044 1.0167 / 0.0692 1.0119 / 0.0573

     | Show Table
    DownLoad: CSV
    Table 3.  Simulated mean values/RMSE of ˆγi(αi0,ki0),i=2,4,7,8,9,10 at their simulated optimal level from Burr(1, -0.5) model.
    n 100 200 500 1000 2000
    ˆγ2(α20,k20) 1.2094 / 0.3335 1.0682 / 0.1914 1.1336 / 0.2139 1.1185 / 0.1626 1.0658 / 0.1133
    ˆγ4(α40,k40) 1.1723 / 0.3194 1.0807 / 0.1801 1.1297 / 0.2097 1.1073 / 0.1700 1.0608 / 0.1173
    ˆγ7(α70,k70) 1.1741 / 0.3219 1.0833 / 0.1805 1.1317 / 0.2097 1.1086 / 0.1690 1.0527 / 0.1169
    ˆγ8(α80,k80) 1.2247 / 0.3408 1.0907 / 0.1986 1.1461 / 0.2127 1.1246 / 0.1606 1.0566 / 0.1112
    ˆγ9(α90,k90) 1.2031 / 0.3417 1.0558 / 0.1821 1.1324 / 0.2228 1.1210 / 0.1856 1.0643 / 0.1228
    ˆγ10(α100,k100) 1.2227 / 0.3392 1.0875 / 0.1973 1.1444 / 0.2127 1.1234 / 0.1608 1.0547 / 0.1113

     | Show Table
    DownLoad: CSV
    Table 4.  Simulated mean values/RMSE of ˆγi(αi0,ki0),i=2,4,7,8,9,10 at their simulated optimal level from Burr(1, -1) model.
    n 100 200 500 1000 2000
    ˆγ2(α20,k20) 1.0693 / 0.2338 1.0050 / 0.1174 1.0125 / 0.0844 0.9968 / 0.0764 1.0142 / 0.0609
    ˆγ4(α40,k40) 1.0625 / 0.2392 1.0049 / 0.1213 1.0158 / 0.0945 1.0078 / 0.0818 1.0158 / 0.0670
    ˆγ7(α70,k70) 1.0826 / 0.2444 1.0169 / 0.1191 1.0216 / 0.0931 1.0101 / 0.0795 1.0156 / 0.0671
    ˆγ8(α80,k80) 1.0724 / 0.2312 1.0050 / 0.1170 1.0053 / 0.0826 0.9881 / 0.0770 1.0123 / 0.0578
    ˆγ9(α90,k90) 1.1114 / 0.2863 1.0260 / 0.1162 1.0450 / 0.1053 1.0310 / 0.0847 1.0275 / 0.0748
    ˆγ10(α100,k100) 1.0731 / 0.2313 1.0058 / 0.1176 1.0068 / 0.0834 0.9899 / 0.0773 1.0125 / 0.0581

     | Show Table
    DownLoad: CSV
    Table 5.  Simulated mean values/RMSE of ˆγi(αi0,ki0),i=2,4,7,8,9,10 at their simulated optimal level from Burr(1, -2) model.
    n 100 200 500 1000 2000
    ˆγ2(α20,k20) 0.9839 / 0.1887 0.9784 / 0.1530 0.9680 / 0.1306 0.9826 / 0.0944 0.9897 / 0.0646
    ˆγ4(α40,k40) 0.9753 / 0.1977 0.9833 / 0.1657 0.9774 / 0.1377 0.9851 / 0.1026 0.9865 / 0.0706
    ˆγ7(α70,k70) 0.9914 / 0.2070 0.9960 / 0.1652 0.9799 / 0.1362 0.9837 / 0.1030 0.9915 / 0.0722
    ˆγ8(α80,k80) 0.9682 / 0.1879 0.9649 / 0.1480 0.9680 / 0.1266 0.9820 / 0.0911 0.9860 / 0.0622
    ˆγ9(α90,k90) 1.0491 / 0.2175 1.0209 / 0.1553 1.0259 / 0.1136 1.0140 / 0.0878 1.0130 / 0.0601
    ˆγ10(α100,k100) 0.9675 / 0.1870 0.9652 / 0.1480 0.9681 / 0.1265 0.9821 / 0.0911 0.9859 / 0.0615

     | Show Table
    DownLoad: CSV

    From Tables 25, we now provide a few comments.

    1) For all simulated models, the simulated values of ˆγ8(α80,k80) and ˆγ10(α100,k100) are almost equal especially in terms of RMSE.

    2) For models with ρ1, ˆγ4(α40,k40) has a smaller squared bias than other estimators in terms of the simulated mean value.

    3) For models with ρ<1, ˆγ7(α70,k70) has a smaller squared bias than other estimators in terms of the simulated mean value. However, ˆγ9(α90,k90) has a smaller RMSE than other estimators when k500.

    Overall, the new estimators perform well within a certain range.

    In extreme value statistics, the estimator for the heavy-tailed index is one of the most important current research topics. By means of the moment statistic, this paper constructs a class of heavy-tailed index estimators with four parameters. The consistency and asymptotic normality of the proposed estimators are proved under first-order and second-order regular variation conditions. For the proposed estimators, ten estimators are given by the specific values of the parameters, which include both the existing estimators in the literature and new estimators. The asymptotic unbiasedness is discussed for specific new estimators. In the asymptotical unbiased estimators, some of the new estimators are compared with the existing ones in terms of asymptotic variance, and the new estimators perform better. In the finite sample case, the simulated mean and mean square error of the estimators compared were calculated by Monte-Carlo simulation. The results show that the size relationship of the simulated mean square error is consistent with the theoretical analysis in asymptotical unbiased estimators. In addition, we compare the simulated mean value and mean square error of the mentioned estimators at the optimal level. It is concluded that the new estimators perform better within a certain range. Although we propose a class of parameterized heavy-tailed index estimators, the selection of parameters is still an open problem and will require further study.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    The authors would like to express their gratitude to the editor and two anonymous reviewers for their valuable comments and suggestions, which have greatly improved the quality of the paper. This work was supported by the National Natural Science Foundation of China (12001395).

    The authors declare there is no conflicts of interest.



    [1] P. Embrechts, C. Kluppelberg, T. Mikosch, Modelling Extremal Events for Insurance and Finance, Springer-Verlag, 1997.
    [2] A. Z. Alexandros, E. F. Nickolaos, A. P. Athanasios, Modeling earthquake risk via extreme value theory and pricing the respective catastrophe bonds, Astin Bull.: J. IAA, 37 (2007), 163–183. https://doi.org/10.2143/AST.37.1.2020804 doi: 10.2143/AST.37.1.2020804
    [3] B. M. Hill, A simple general approach to inference about the tail of a distribution, Ann. Stat., 3 (1975), 1163–1174. https://www.jstor.org/stable/2958370
    [4] C. Scarrott, A. MacDonald, A review of extreme value threshold estimation and uncertainty quantification, Revstat-Stat. J., 10 (2012), 33–60.
    [5] D. Y. Dey, J. Yan, Extreme Value Modeling and Risk Analysis, Taylor and Francis Group, 2016.
    [6] A. L. M. Dekker, J. H. J. Einamhl, L. de Haan, A moment estimator for the index of an extreme-value distribution, Ann. Stat., 17 (1989), 1833–1855.
    [7] J. Danielsson, D. W. Jansen, C. G. De Vries, The method of moments ratio estimator for the tail shape parameter, Commun. Stat-Theor. M., 25 (1996), 711–720. https://doi.org/10.1080/03610929608831727 doi: 10.1080/03610929608831727
    [8] M. I. Gomes, M. J. Martins, Generalizations of the Hill estimator-asymptotic versus finite sample behaviour, J. Stat. Plann. Inference, 93 (2001), 161–180. https://doi.org/10.1016/S0378-3758(00)00201-9 doi: 10.1016/S0378-3758(00)00201-9
    [9] F. Caeiro, M. I. Gomes, A class of asymptotically unbiased semi-parametric estimators of the tail index, Test, 11 (2002), 345–364. https://doi.org/10.1007/BF02595711 doi: 10.1007/BF02595711
    [10] H. Penalva, M. I. Gomes, F. Caeiro, M. M. Neves, A couple of non reduced bias generalized means in extreme value theory: An asymptotic comparison, Revstat-Stat. J., 18 (2020), 281–298.
    [11] M. I. Gomes, A. Guillou, Extreme value theory and statistics of univariate extremes: A review, Int. Stat. Rev., 83 (2015), 263–292. https://doi.org/10.1111/insr.12058 doi: 10.1111/insr.12058
    [12] I. Fedotenkov, A review of more than one hundred Pareto-tail index estimators, Statistica, 80 (2020), 245–299. https://doi.org/10.6092/issn.1973-2201/9533 doi: 10.6092/issn.1973-2201/9533
    [13] L. de Haan, A. Ferreira, Exreme Value Theory: A Introduction, New York, Springer, 2006.
    [14] M. I. F. Alves, M. I. Gomes, L. de Haan, A new class of semi-parametric estimators of the second order parameter, Port. Math., 60 (2003), 193–214.
    [15] M. I. Gomes, D. Pestana, F. Caeiro, A note on the asymptotic variance at optimal levels of a bias-corrected Hill estimator, Stat. Probab. Lett., 79 (2009), 295–303. https://doi.org/10.1016/j.spl.2008.08.016 doi: 10.1016/j.spl.2008.08.016
  • Reader Comments
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(329) PDF downloads(44) Cited by(0)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog