Processing math: 68%
Research article

Asymptotic behavior for a class of population dynamics

  • Received: 06 February 2020 Accepted: 19 March 2020 Published: 01 April 2020
  • MSC : 34C25, 34K13, 34K25

  • This paper investigates the asymptotic behavior for a class of n-dimensional population dynamics systems described by delay differential equations. With the help of technique of differential inequality, we show that each solution of the addressed systems tends to a constant vector as t → ∞, which includes many generalizations of Bernfeld-Haddock conjecture. By the way, our results extend some existing literatures.

    Citation: Chuangxia Huang, Luanshan Yang, Jinde Cao. Asymptotic behavior for a class of population dynamics[J]. AIMS Mathematics, 2020, 5(4): 3378-3390. doi: 10.3934/math.2020218

    Related Papers:

    [1] Zuliang Lu, Xiankui Wu, Fei Huang, Fei Cai, Chunjuan Hou, Yin Yang . Convergence and quasi-optimality based on an adaptive finite element method for the bilinear optimal control problem. AIMS Mathematics, 2021, 6(9): 9510-9535. doi: 10.3934/math.2021553
    [2] Zuliang Lu, Fei Cai, Ruixiang Xu, Chunjuan Hou, Xiankui Wu, Yin Yang . A posteriori error estimates of hp spectral element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(4): 5220-5240. doi: 10.3934/math.2022291
    [3] Xin Zhao, Xin Liu, Jian Li . Convergence analysis and error estimate of finite element method of a nonlinear fluid-structure interaction problem. AIMS Mathematics, 2020, 5(5): 5240-5260. doi: 10.3934/math.2020337
    [4] Zuliang Lu, Ruixiang Xu, Chunjuan Hou, Lu Xing . A priori error estimates of finite volume element method for bilinear parabolic optimal control problem. AIMS Mathematics, 2023, 8(8): 19374-19390. doi: 10.3934/math.2023988
    [5] Changling Xu, Hongbo Chen . A two-grid P20-P1 mixed finite element scheme for semilinear elliptic optimal control problems. AIMS Mathematics, 2022, 7(4): 6153-6172. doi: 10.3934/math.2022342
    [6] Tiantian Zhang, Wenwen Xu, Xindong Li, Yan Wang . Multipoint flux mixed finite element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(9): 17461-17474. doi: 10.3934/math.2022962
    [7] Yuelong Tang . Error estimates of mixed finite elements combined with Crank-Nicolson scheme for parabolic control problems. AIMS Mathematics, 2023, 8(5): 12506-12519. doi: 10.3934/math.2023628
    [8] Miao Xiao, Zhe Lin, Qian Jiang, Dingcheng Yang, Xiongfeng Deng . Neural network-based adaptive finite-time tracking control for multiple inputs uncertain nonlinear systems with positive odd integer powers and unknown multiple faults. AIMS Mathematics, 2025, 10(3): 4819-4841. doi: 10.3934/math.2025221
    [9] Jie Liu, Zhaojie Zhou . Finite element approximation of time fractional optimal control problem with integral state constraint. AIMS Mathematics, 2021, 6(1): 979-997. doi: 10.3934/math.2021059
    [10] Cagnur Corekli . The SIPG method of Dirichlet boundary optimal control problems with weakly imposed boundary conditions. AIMS Mathematics, 2022, 7(4): 6711-6742. doi: 10.3934/math.2022375
  • This paper investigates the asymptotic behavior for a class of n-dimensional population dynamics systems described by delay differential equations. With the help of technique of differential inequality, we show that each solution of the addressed systems tends to a constant vector as t → ∞, which includes many generalizations of Bernfeld-Haddock conjecture. By the way, our results extend some existing literatures.


    With the development of computer technology, the finite element method follows closely in the early 1960s. The research on the reliability and validity of finite element analysis promoted the development of the finite element method [3,4,5,25,27]. Applying finite element methods, the emergence of errors has captured the attention of scholars. One of the main sources of errors is the error caused by the discretisation of the model, while researchers dissect various aspects of finite element analysis. In addition, the generation of finite element mesh is also a concern for scholars. In the early conventional FEA, scholars usually used experience, intuition or even guesses to generate meshes to simple judge whether the approximation results are reasonable or not. If it is not reasonable, the grid needs to be redesigned whenever necessary for the efficiency of the analysis and the reliability of the results. Therefore, the emergence of the adaptive finite element method is because the computer, after checking the current conditions, decides whether the solution is accurate enough to meet the determination needs according to the error information obtained during the adjustment process.

    To the best of our knowledge, the adaptive finite element method is a numerical method that can automatically adjust the algorithm to improve the process of solving [2]. An appropriate mesh can greatly reduce the errors arising from the discretisation of the finite element approximation process during the replication. According to the current situation, solutions of optimal control problems for nonlinear systems are usually not available. Besides the complexity and diversity of nonlinear equations, it is also very practical to use the adaptive finite element method for solving nonlinear equations.

    Adaptive finite element methods have been widely and successfully applied in various linear optimal control problems, for example, Eriksson and Johnson proposed that the adaptive finite element algorithm produces a series of successively refined meshes in which the final mesh satisfies a given error tolerance [11]. Gaevaskaya and Hope et al. proposed an adaptive finite element method for a class of distributed optimal control problems with control constraints is analysed by applying the reliability and discrete local efficiency of the posterior estimator and the quasi-orthogonality property as basic tools [13]. Braess and Carstensen et al. Investigate the residual jump contributions of a standard explicit residual-based a posteriori error estimator for an adaptive finite element method [1]. Hu and Xu et al. performed research on the convergence and optimality of the adaptive nonconforming linear element method for the stokes problem [17].

    However, a large number of literature show that the researches on adaptive finite element method for nonlinear optimal control problems have not reached its peak yet.

    Recently, for instance, Wriggers and Scherf propose an adaptive finite element technique for nonlinear contact problems [29]. Eriksson and Johnson consider adaptive finite element methods for parabolic problems to a class of nonlinear scalar problems, the authors obtain posteriori error estimates and design corresponding adaptive algorithms [12]. Nowadays, the problem of nonlinear optimal control problems, similar to big data, is the focus of scholars worldwide. Hence it is worthy of investigating the adaptive finite element method for such nonlinear problems.

    Many scholars have also studied the prior error estimates of the bilinear optimal control problem. For example, Yang, Demlow, and Dobrowolski et al. investigated the prior error estimates and superconvergence of optimal control problems for bilinear models and give the optimal L2-norm error estimates and the almost optimal L-norm estimates about the state and co-state variables [7,8,31]. Lu investigated a second-order parabolic bilinear optimal control problems and provided a priori error estimates for the finite element solutions of the state equations describing the system [24]. Shen and Yang et al. investigated a quadratic optimal control problem governed by a linear hyperbolic integro differential equation and its finite element approximation, a priori estimates have been carried out using the standard functional analysis techniques, and the existence and regularity of the solution are provided by using these estimates. At the same time, some scholars have analysed the posteriori error estimates of the finite element method for bilinear optimal control problems [15,26]. Lu, Chen, and Leng et al. discussed the discretisation of Raviart-Thomas mixed finite element for general bilinear optimal control problems, a posteriori error estimates are derived for both the coupled state and the control solutions [18,23]. Although bilinear optimal control problems are frequently met in applications, they are much more difficult to handle than linear or nonlinear cases. There is little work on nonlinear optimal control problems.

    In this paper, we focus on nonlinear optimal control problem with integral control constraints where we deal with the control via adopting piecewise constant discetization while applying continuous piecewise linear discretization for the state and the co-state, respectively. Then a posteriori error estimates is gained. For the convergence and the quasi-optimality, we prove them relying on quasi-orthogonality and discrete local upper bound. Based on the mild assumption to the initial grids, we obtain the proof of convergence and quasi-optimality by means of the solution operator of nonlinear elliptic equations. Finally, some numerical simulations are provided support for our theoretical analysis.

    Here are some notations will be used in this paper. Let Ω be a bounded Lipschitz domain in R2 and Ω denote the boundary of Ω. We use the standard notation Wm,q(ω) with norm m,q,ω and seminorm ||m,q,ω to express the standard Sobolev space for ωΩ. Moreover, we will omit the subscription if ω=Ω. For q=2, we denote Wm,2(Ω) by Hm(Ω) and m=m,2. Also for m=0 and q=2, we denote W0,2(ω)=L2(ω) and 0,2,ω=0,ω. Additionally, we observe that H10(Ω)={vH1(Ω):v=0 on Ω}. Let Th0 be the initial partition of ˉΩ into disjoint triangles. By newest-vertex bisections for Th0, we can obtain a class T of conforming partitions. For Th,˜ThT, we use Th˜Th to indicate that ˜Th is a refinement of Th and hT=|T|1/2, T is the partition diameter. In addition (,) denotes the L2 inner product. Beyond that, let C be a constant which independent of grids size, then we use AB to represent cABCA.

    The rest of the paper is organized as follows. In Section 2, we introduce the optimal control problem of our interest and obtain a posteriori error estimates. The relevant algorithms are introduced in Section 3. In Section 4, we use quasi-orthogonality and discrete local upper bounds to prove the convergence of the adaptive finite element method, as well as quasi-optimality in Section 5. We provide an adaptive finite element algorithm and some numerical simulations to verify our theoretical analysis in Section 6. Finally, we summarize the results of this paper and develop a plan for future work.

    In this paper we mainly enter into meaningful discussions with the following nonlinear optimal control problem governed by nonlinear elliptic equations:

    minuUad{12yyd20+α2u20},Δy+ϕ(y)=f+u,in Ω,y=0,on Ω,

    where y is the state variable, u is the control variable, f is a function of the control variable, α is a constant greater than zero, ydL2(Ω), Uad={v:vL2(Ω), Ωv dx0} is a closed convex subset of U=L2(Ω) and ϕ()W2,(R,R) for any R>0, ϕ(y)L2(Ω) for any yH1(Ω), ϕ0. Let V=H10(Ω), we give the weak formulation to deal with state equation, namely, find yV such that

    a(y,v)+(ϕ(y),v)=(f+u,v), vV,

    where

    a(y,v)=Ωyv dx.

    Then the nonlinear optimal control problem can be restated as follows

    minuUad{12yyd20+α2u20}, (2.1)
    a(y,v)+(ϕ(y),v)=(f+u,v), vV. (2.2)

    It is well known [14,20] that the nonlinear optimal control problem has at least one solution (y,u), and that if a pair (y,u) is the solution of the nonlinear optimal control problem, then there is a co-state pV such that the triplet (y,p,u) satisfies the following optimality conditions:

    a(y,v)+(ϕ(y),v)=(f+u,v), vV, (2.3)
    a(q,p)+(ϕ(y)p,q)=(yyd,q), qV, (2.4)
    (αu+p,vu)0, vUad. (2.5)

    Due to coercivity of a(,), we define a linear operator S:L2(Ω)H10(Ω) such that S(f+u)=y and let S be the adjoint of S such that S(yyd)=p where Vh is the continuous piecewise linear finite element space with respect to the partition ThT. For ThT, we define Uh as the piecewise constant finite element space with respect to Th. Let Uhad={vhUh:Ωvhdx0}. Then we derive the standard finite element discretization for the nonlinear optimal control problem as follows:

    minuhUhad{12yhyd20+α2uh20}, (2.6)
    a(yh,v)+(ϕ(yh),v)=(f+uh,v), vVh. (2.7)

    Similarly the nonlinear optimal control problem (2.6)–(2.7) has a solution (yh,uh), and that if a pair (yh,uh)Vh×Uh is the solution of (2.6)–(2.7), then there is a co-state phVh such that the triplet (yh,ph,uh) satisfies the following optimality conditions:

    a(yh,v)+(ϕ(yh),v)=(f+uh,v), vVh, (2.8)
    a(q,ph)+(ϕ(yh)ph,q)=(yhyd,q), qVh, (2.9)
    (αuh+ph,vhuh)0, vhUhad. (2.10)

    Here we define some error indicators we largely and frequently use in this paper where η() are error indicators and osc() represent the data oscillations. For ThT, TTh, we define

    η21,Th(ph,T)=h2Tph20,T,η22,Th(uh,yh,T)=h2Tf+uhϕ(yh)20,T+hT[yh]n20,TΩ,η23,Th(yh,ph,T)=h2Tyhydϕ(yh)ph20,T+hT[ph]n20,TΩ,osc2Th(f,T)=h2TffT20,T,osc2Th(yhyd,T)=h2T(yhyd)(yhyd)T20,T,

    where uhUhad, yh,phVh, and where fT is L2-projection of f onto piecewise constant space on T and fT=Tf|T|. For ωTh, we have

    η21,Th(ph,ω)=Tωη21,Th(ph,T),osc2Th(f,ω)=Tωosc2Th(f,T),

    by which η22,Th(uh,yh,ω), η23,Th(uh,yh,ω) and osc2Th(yhyd,ω) can be denoted similarly.

    Also, a reliable and effective a posteriori error estimates for the nonlinear optimal control problem (2.1)–(2.2) which is presented in next Theorem.

    Theorem 2.1. For ThT, let (y,p,u) be the exactsolution of problem (2.3), (2.4) and (2.5) and (yh,ph,uh) be the solution of problem (2.8), (2.9) and (2.10) with respect to Th. Then thereexist constants c and C such that

     uuh20+yyh21+pph21C(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th)), (2.11)

    and

     c(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th))uuh20+yyh21+pph21+osc2Th(f,Th)+osc2Th(yhyd,Th). (2.12)

    Proof. Step 1. We seek functions yh,phV satisfying the following auxiliary problems

    a(yh,v)+(ϕ(yh),v)=(f+uh,v), vV, (2.13)
    a(q,ph)+(ϕ(yh)ph,q)=(yhyd,q), qV. (2.14)

    It follows that

    uuh20C(αu,uuh)C(αuh,uuh)C(αuh,uuh)=C(αuh,uhu)+C(αuhαuh,uuh).

    With the help of the proof of Lemma 3.4 in [14] and the lemma 3.4 in [21], we have

    (αuh,uhu)=(αuh+ph,uhu)C(TThphπhph20,T+uuh20)C(σ)η21,Th(ph,Th)+Cσuuh20, (2.15)

    where η21,Th(ph,Th)=TThη21,Th(ph,T), and

    (αuhαuh,uuh)=(phph,uuh)C(σ)phph20+Cσuuh20, (2.16)

    where πh is the L2-projection operator onto piecewise constant space on Th, C(σ) is a universal constant, which depends on σ, σ is an arbitrary positive number, and C is a general universal constant, which can include C(σ). Obviously, from (2.15) and (2.16) there holds

    uuh20C(η21,Th(ph,Th)+phph20).

    Let ey=yhyh, and eyI=ˆπhey, where ˆπh is the average interpolation operator defined in Lemma 3.2 of [14], then we can obtain

    cey21((yhyh),ey)+(ϕ(yh)ϕ(yh),ey)=((yhyh),(eyeyI))+(ϕ(yh)ϕ(yh),eyeyI)=TThT(f+uhϕ(yh))(eyeyI)dxTThT([yh]n)(eyeyI)dxC(σ)TThh2TT(f+uhϕ(yh))2dx+C(σ)TΩhT([yh]n)2dx+Cσey21=C(σ)η22,Th(uh,yh,Th)+Cσey21,

    where σ is an arbitrary positive number. The definition of η2 will be given later on. Then let σ=c2C, we have

    yhyh21Cη22,Th(uh,yh,Th).

    Similarly, let ep=phph and epI be the average interpolation of ep, then we can get

    cep21(ep,(phph))+((ϕ(yh)(phph),ep)=(ep,(phph))+(ϕ(yh)phϕ(yh)ph,ep)+((ϕ(yh)ϕ(yh))ph,ep)=((epepI),(phph))+(ϕ(yh)phϕ(yh)ph,epepI)+(epI,(phph))+(ϕ(yh)phϕ(yh)ph,epI)+((ϕ(yh)ϕ(yh))ph,ep)=TThT(yhydϕ(yh)ph)(epepI)dxTΩT([ph]n)(epepI)(yhyh,epI)+((ϕ(yh)ϕ(yh))ph,ep)dxC(σ)TThh2TT(yhydϕ(yh)ph)2dx+C(σ)TΩhTT([ph]n)2dx+CσTThh2TT|ep|2dx+Cyhyh0ep0+Cϕ(yh)ϕ(yh)0ph0,4ep0,4C(σ)η23,Th(yh,ph,Th)+C(σ)yhyh20+C(σ)ph21yhyh20+Cσphph21)C(σ)η23,Th(yh,ph,Th)+Cyhyh20+Cσphph21,

    in which we apply the embedding theorem v0,4Cv1 (see [6]) and the property: ph1C, C(σ)yhyh20+C(σ)ph21yhyh20C(σ)yhyh20+C(σ)Cyhyh20Cyhyh20. The definition of η3 will be given later on. Absolutely, we can obtain

    phph21Cη23,Th(yh,ph,Th)+Cyhyh20.

    Hence we gain

    phph20Cphph21+Cyhyh21C(η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th)).

    By the triangular inequality we obtain that

    yyh1yyh1+yhyh1C(uuh0+yhyh1),pph1pph1+phph1C(yyh1+phph1).

    In connection with what we discussed above, we have

    uuh20+pph21+yyh21C(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th)).

    Step 2. Now we are in the position to get the lower bound. Consulting to the proof of Lemma 3.6 in [14] and Lemma 3.4 in [21], we conclude that

    h2Tph20TThphπhph20,TC(uuh20+pph21). (2.17)

    Proof. It is easily seen that

    TThphπhph20,T=TThphπhph0,Tphp+pπhp+πhpπhph0,TTThphπhph0,Tpπhp+13TThphπhph20,T+Cphp21.

    Since u+p=max(0,ˉp)=const, hence

    πh(u+p)=u+p,

    such that

    TThphπhph0,Tpπhp0,T=TThphπhph0,Tp+uπh(p+u)+πhpu0,T=TThphπhph0,Tπh(uuh)(uuh)0,T13TThphπhph20,T+Cuhu21.

    This completes the proof.

    According to [28], we empoly the standard bubble function technique to estimate error indicators η2,Th(uh,yh,Th) and η3,Th(yh,ph,Th). Similar to Chapter 7.2 in [22], there exists polynomials wTH10(T) and wTH10(TΩ) such that

    ThT([ph]n)2dx=T([ph]n)wTdx, (2.18)
    Th2T((yhyd)Tϕ(yh)ph)2dx=T((yhyd)Tϕ(yh)ph)wTdx, (2.19)

    and apparently

    wT21,TΩCThT([ph]n)2dx, (2.20)
    h2TwT20,TΩCThT([ph]n)2dx, (2.21)
    wT21,TChh2T((yhyd)Tϕ(yh)ph)2dx, (2.22)
    h2TwT20,TChh2T((yhyd)Tϕ(yh)ph)2dx. (2.23)

    By using the Schwarz inequality, it follows from (2.18), (2.20), and (2.21) that

    ThT([ph]n)2dx=T([ph]n)wTdx=T([ph]n[p]n)wTdx=T(php)wTdx+div(Δ(php))wT=T(php)wTdx+T(yyd)dxϕ(y)p)wTdx=T(php)wTdx+T(yhydϕ(yh)ph)wTdx+T((yyd)(yhyd))wTdx+T(ϕ(y)pϕ(yh)ph)wTdxC(σ)php21,TΩ+C(σ)yyh20,TΩ+Tϕ(y)(pph)wTdx+T(ϕ(y)ϕ(yh))phwTdx+C(σ)T(yhydϕ(yh)ph)2dx+Cσ(wT21,TΩ+h2TwT20,TΩ)C(σ)php21,TΩ+C(σ)yyh20,TΩ+C(σ)ϕ(y)20,TΩ(pph)20,TΩ+T˜ϕ(yh)(yyh)phwTdx+C(σ)T(yhydϕ(yh)ph)2dx+Cσ(wT21,TΩ+h2TwT20,TΩ)C(σ)php21,TΩ+C(σ)yyh20,TΩ+C(σ)˜ϕ(yh)20,TΩyyh20,TΩph20,TΩ+C(σ)T(yhydϕ(yh)ph)2dx+Cσ(wT21,TΩ+h2TwT20,TΩ)C(σ)php21,TΩ+C(σ)yyh20,TΩ+C(σ)T(yhydϕ(yh)ph)2dx+CσThT([ph]n)2dx,

    where σ is an arbitrary positive number and ϕ()W2,(Ω) has been used. Then let σ=12C and we have

    ThT([ph]n)2dxCphp21,TΩ+Cyyh20,TΩ+CT(yhydϕ(yh)ph)2dx. (2.24)

    Next, it follows from (2.19), (2.22), and (2.23) that

    Th2T((yhyd)Tϕ(yh)ph)2dx=T((yhyd)Tϕ(yh)ph)wTdx=T(yhydϕ(yh)ph)wTdx+T((yhyd)(yhyd)T)wTdxT(yhydϕ(yh)ph(yyd)+ϕ(y)p)wTdx+C(σ)Th2T((yhyd)(yhyd)T)2dx+Cσh2TwT20,T=T(php)wTdx+T((yhyd)(yyd))wTdx+T(ϕ(yh)phϕ(y)p)wTdx+C(σ)Th2T((yhyd)(yhyd)T)2dx+Cσh2TwT20,TC(σ)php21,T+C(σ)(yyd)(yhyd)20,T+Tϕ(yh)(php)wTdx+T(ϕ(yh)ϕ(y))pwTdx+C(σ)Th2T((yhyd)(yhyd)T)2dx+Cσ(wT21,T+h2TwT20,T)C(σ)php21,T+C(σ)yyh20,T+C(σ)Th2T((yhyd)(yhyd)T)2dx+C(σ)ϕ(yh)20,Tphp20+C(σ)˜ϕ(y)20,Tyhy20,Tp20,T+Cσ(wT21,T+h2TwT20,T)C(σ)php21,T+C(σ)yyh20,T+C(σ)Th2T((yhyd)(yhyd)T)2dx+CσTh2T((yhyd)Tϕ(yh)ph)2dx,

    where ϕ()W2,(Ω) has been used. Absolutely, we can deduce that

    Th2T((yhyd)Tϕ(yh)ph)2dxCphp21,T+Cyyh20,T+CTh2T((yhyd)(yhyd)T)2dx.

    Then we have

    Th2T(yhydϕ(yh)ph)2dxCTh2T((yhyd)Tϕ(yh)ph)2dx+CTh2T((yhyd)(yhyd)T)2dxCphp21,T+Cyyh20,T+CTh2T((yhyd)(yhyd)T)2dx. (2.25)

    In connection with (2.24) and (2.25) we are easy to gain

    η23,Th(yh,ph,Th)=TThTh2T(yhydϕ(yh)ph)2dx+TΩThT([ph]n)2dxCphp21+Cyyh20+CTThTh2T((yhyd)(yhyd)T)2dxCphp21+Cyyh21+Cosc2Th(yhyd,Th).

    It can also be deduced that

    η22,Th(uh,yh,Th)=TThTh2T(f+uhϕ(yh))2dx+TΩThT([yh]n)2dxCyhy21+Cuuh20+CTThTh2T(ffT)2dxCyhy21+Cuuh20+Cosc2Th(f,Th).

    Above-mentioned results tell the proof of Theorem 2.1 is accomplished.

    In this section, we introduce two related algorithms as follows:

    Algorithm 3.1. Adaptive finite element algorithm for nonlinear optimal controlproblems:

    (0) Given an initial grids Th0 and construct finiteelement space Uh0ad and Vh0. Select marking parameter 0<θ1 and set k:=0.

    (1) Solve the discrete nonlinear optimal control problem (2.8)–(2.10), then obtain approximate solution (uhk,yhk,phk) with respect to Thk.

    (2) Compute the local error estimator ηThk(T) for all TThk.

    (3) Select a minimal subset Mhk of Thk such that

    η2Thk(Mhk)θη2Thk(Thk),

    where η2Thk(ω)=η21,Th(ph,ω)+η22,Th(uh,yh,ω)+η23,Th(yh,ph,ω) for all ωThk.

    (4) Refine Mhk by bisecting b1 times inpassing from Thk to Thk+1 andgenerally additional elements are refined in the process in order toensure that Thk+1 is conforming.

    (5) Solve the discrete nonlinear optimal control problem (2.8)–(2.10), then obtain approximate solution (uhk+1,yhk+1,phk+1) with respect to Thk+1.

    (6) Set k=k+1 and go to step (2).

    Algorithm 3.2. Given an initial control u0hUhad, then seek (ykh,pkh,ukh) such that

    a(ykh,wh)+(ϕ(ykh),wh)=(f+uk1h,wh), whVh,a(qh,pk1h)+(ϕ(ykh)pk1h,qh)=(ykhyd,qh), qhVh,(αukh+pk1h,vhukh)0, vkhUhad,

    for k=1,2,, and apparently

    ukh=1α(Phpkh+max(0,ˉpkh)),

    where Ph is the L2-projection from L2(Ω) to Uh and ˉpkh=Ωpkh|Ω|.

    In this section we first consider the nonlinear elliptic equations as follows:

    {Δy+ϕ(y)=f,in Ω,y=0,on Ω, (4.1)

    where fL2(Ω). We introduce the quantity J(h) in view of the idea in [30] as follows:

    J(h):=supfL2(Ω), f0,Ω=1infvhvTSvh1,

    where S is the solution operator for nonlinear elliptic equations. Obviously, J(h)1 for h(0,h0) if h01. Hereinafter there holds the Lemma about the quantity referring to [10,30].

    Lemma 4.1. For each fL2(Ω), there exists a constant C such that

    SfShf1CJ(h)f0, (4.2)

    and

    SfShf0CJ(h)SfShf1, (4.3)

    where Sh is the discretesolution operator for nonlinear elliptic equations.

    Here, Lemma 4.1 is a preparation for Lemma 4.4.

    According to [19], the local perturbation property plays an important role for the proof of the convergence. It impels us to combine the sum of the error estimates.

    Lemma 4.2. For ThT, TTh, let uh1,uh2Uhad, yh1,yh2,ph1,ph2Vh, we have

    η1,Th(ph1,T)η1,Th(ph2,T)ChTph1ph21,T, (4.4)
    η2,Th(uh1,yh1,T)η2,Th(uh2,yh2,T)C(hTuh1uh20,T+yh1yh21,T), (4.5)
    η3,Th(yh1,ph1,T)η3,Th(yh2,ph2,T)C(hTyh1yh20,T+ph1ph21,T), (4.6)
    oscTh(yh1yd,T)oscTh(yh2yd,T)Ch2Tyh1yh21,T. (4.7)

    Proof. Step 1. According to [16,21] we have

    v0,TΩC(h1/2Tv0,T+h1/2Tv1,T). (4.8)

    By adopting the inverse estimates and (4.8) we obtain

    [(yh1yh2)]n0,TΩCh1/2Tyh1yh21,ωT, (4.9)
    [(ph1ph2)]n0,TΩCh1/2Tph1ph21,ωT, (4.10)

    where ωT denotes the patch of elements that share an edge with T. By the definition of η1,Th(ph,T) and (4.10) we can deduce that

    η1,Th(ph1,T)η1,Th(ph2,T)+ChT[(ph1ph2)]n0,TΩ.

    Then we have

    η1,Th(ph1,T)η1,Th(ph2,T)ChT[(ph1ph2)]n0,TΩChTph1ph21,T,

    where the proof of (4.4) is finished.

    Step 2. By the definition of η2,Th(uh,yh,T) and (4.9), we can deduce that

    η2,Th(uh1,yh1,T)η2,Th(uh2,yh2,T)hTuh1uh20,T+h1/2T[(yh1yh2)]n0,TΩ+hTϕ(yh1)ϕ(yh2)0,ThTuh1uh20,T+Cyh1yh21,ωT+ChT˜ϕ(yh1)0,Tyh1yh20,TC(hTuh1uh20,T+yh1yh21,T),

    where ϕ()W2,(Ω) has been used and the proof of (4.5) is finished.

    Step 3. By the definition of η3,Th(yh,ph,T) and (4.10) we can derive that

    η3,Th(yh1,ph1,T)η3,Th(yh2,ph2,T)hTyh1yh20,T+h1/2T[(ph1ph2)]n0,TΩ+hTϕ(yh1)ph1ϕ(yh2)ph20,ThTyh1yh20,T+Cph1ph21,ωT+hTϕ(yh1)0,Tph1ph20,T+hT˜ϕ(yh1)0,Tyh1yh20,Tph20,ThTyh1yh20,T+Cph1ph21,ωT+ChTph1ph20,TC(hTyh1yh20,T+ph1ph21,T),

    where ϕ()W2,(Ω) has been used and the proof of (4.6) is finished. \\Step 4. Similarly, we have

     oscTh(yh1yd,T)oscTh(yh2yd,T)h2Tyh1yh20,T.

    In brief, Lemma 4.2 is proved.

    The authors in [25] demonstrate an error reduction provided the current errors are larger than the desired errors, that is to say, the errors may not be reduced in the process of coarse grids refinement before introducing a node of the refined grids inside each marked element while D¨orfler proves a similar result assumption [9].

    Lemma 4.3. Let Th~Th for Th,~ThT.MhTh denotes the set of elements whichare marked from Th to ˜Th. Then for uhUhad, ˜uhU˜had, yh,phVh, ˜yh,˜phV˜h and any δ,δ1(0,1], we have

     η21,˜Th(˜ph,~Th)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}C(1+δ11)h20ph˜ph21, (4.11)

    and

     η22,˜Th(˜uh,˜yh,˜Th)(1+δ){η22,T(uh,yh,Th)λη22,Th(uh,yh,Mh)}C(1+δ1)(h20uh˜uh20+yh˜yh21), (4.12)

    and

     η23,˜Th(˜yh,˜ph,˜Th)(1+δ){η23,Th(yh,ph,Th)λη23,Th(uh,yh,Mh)}C(1+δ1)(h20yh˜yh20+ph˜ph21), (4.13)

    and

     osc2Th(yhyd,Th˜Th)2osc2˜Th(˜yhyd,Th˜Th)2Ch40yh˜yh21, (4.14)

    where λ=12b2, h0=maxTTh0hT and Rh denotes the set ofelements which are refined from Th to ~Th.

    Proof. Step 1. Applying the Young's inequality with parameter δ1 and (4.4) we get

    η21,~Th(˜ph,~Th)η21,~Th(ph,~Th)Ch20ph˜ph21+2η1,~Th(˜ph,~Th)η1,~Th(ph,~Th)C(1+δ11)h20ph˜ph21+δ1η21,~Th(ph,~Th). (4.15)

    Note that T will be bisected at least one time for the element TRhTh, then we have

    TTη1,~Th(ph,T)21/2η1,Th(ph,T).

    For TThRh, we gain

    η21,~Th(ph,T)=η21,Th(ph,T).

    In connection with the above estimates we demonstrate that

    η21,˜Th(˜ph,~Th)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}=η21,~Th(ph,Rh)+η21,~Th(ph,ThRh)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}21/2η21,~Th(ph,Rh)+η21,~Th(ph,ThRh)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}C(1+δ11)h20ph˜ph21,

    which illustrates (4.11) has been proved.

    Step 2. Employing the Young's inequality with parameter δ and (4.5) we obtain

    η23,~Th(˜yh,˜ph,~Th)η23,~Th(yh,ph,~Th)C(1+δ1)h2T(TThhT˜yhyh20,T+˜phph21)+δη23,~Th(yh,ph,~Th),

    where it is similar to (4.15). Then let ˜ThT={T~Th:TT} where TMh is a marked element. For arbitrary phVThV~Th, we find the jump [ph]=0 on the interior sides of ˜ThT. Suppose b is the number of bisections, we can deduce that

    hT=|T|1/2(2b|T|)1/22b2hT,

    due to refinement by bisection, then we obtain

    T~ThTη23,~Th(yh,ph,T)2b2η23,Th(yh,ph,T).

    It is easy to find that

    η23,~Th(yh,ph,T)η23,Th(yh,ph,T),

    for any TThMh. In connection with the above estimates we expound that

    η23,~Th(yh,ph,~Th)=η23,~Th(yh,ph,Mh)+η23,~Th(yh,ph,ThMh)2b2η23,Th(yh,ph,Mh)+η23,Th(yh,ph,ThMh)=η23,Th(yh,ph,Th)(12b2)η23,Th(yh,ph,Mh).

    As has been said, we can achieve (4.13) connecting with the above estimates to which the proof of (4.12) is similar.

    Step 3. For arbitrary TTh~Th via using (4.7) and the Young's inequality we obtain that

    oscTh(yhyd,T)=osc~Th(yhyd,T),osc2Th(yhyd,T)2osc2~Th(˜yhyd,T)2Ch4T˜yhyh21,T.

    Obviously, (4.14) can be got by summing the above inequality over TTh~Th. To sum up, Lemma 4.3 is proved.

    As to the proof of the convergence, one of the main obstacle is that there do not have the orthogonality while it is vital to prove the convergence. Thus getting back to the second place we transfer proof of the quasi-orthogonality. The latter is popularly adopted in the adaptive mixed and the nonconforming adaptive finite element methods [19]. Apparently it is true for the following basic relationships with Thk,Thk+1T and ThkThk+1,

    uuhk+120=uuhk20uhkuhk+1202(uuhk+1,uhk+1uhk), (4.16)
    yyhk+121=yyhk21yhkyhk+1212a(yyhk+1,yhk+1yhk), (4.17)
    pphk+121=pphk21phkphk+1212a(pphk+1,phk+1phk), (4.18)

    where (u,y,p) are the solution of (2.3)–(2.5), (uhk,yhk,phk) and (uhk+1,yhk+1,phk+1) are the solution of (2.8)–(2.10) with respect to Thk and Thk+1, respectively. Accordingly we have the quasi-orthogonality below.

    Lemma 4.4. For Thk,Thk+1T and ThkThk+1, we have

     (1δ)uuhk+120uuhk20+uhkuhk+120Cδ1(η21,Thk(phk,Rh)+J2(h0)(η22,Thk(uhk,yhk,Rh)+η23,Thk(yhk,phk,Rh))), (4.19)

    and

     (1δ)yyhk+121yyhk21yhkyhk+121δuuhk+120Cδ1(η21,Thk(phk,Rh)+J2(h0)(η22,Thk(uhk,yhk,Rh)+η23,Thk(yhk,phk,Rh))), (4.20)

    and

     (1δ)pphk+121pphk21+phkphk+121δyyhk+121Cδ1(η21,Thk(phk,Rh)+J2(h0)(η22,Thk(uhk,yhk,Rh)+η23,Thk(yhk,phk,Rh))). (4.21)

    Proof. Step 1. It follows from Lemma 4.3 in [19] that for UhkadUhk+1ad we have

    αuhk+1uhk20(phk+1phk,uhkuhk+1)+(αuhk+phk,uhkuhk+1)(Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd),uhkuhk+1)+Cη1,Thk(phk,Rh)uhkuhk+10, (4.22)

    where Rh is the set of elements which are refined from Thk to Thk+1.

    For the right-hand first item of (4.22) we let ζhH10(Ω) be the solution of the following problem based on Lemma 4.1

    a(ζh,q)=(Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd),q), qH10(Ω).

    Hence we can get

    Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd)20=a(ζh,Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd))=a(ζhζhk,Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd))+(Shk+1(f+uhk)Shk(f+uhk),ζhkζh)+(Shk+1(f+uhk)Shk(f+uhk),ζh).

    From the proof of Lemma 3.3 in [19], we infer that

    a(ζhζhk,Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd))J(h0)Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd)0(Shk+1(f+uhk)Shk(f+uhk)1+Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)1),

    and

    (Shk+1(f+uhk)Shk(f+uhk),ζhkζh)CJ(h0)Shk+1(f+uhk)Shk(f+uhk)0Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0,

    and

    (Shk+1(f+uhk)Shk(f+uhk),ζ)Shk+1(f+uhk)Shk(f+uhk)0Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0.

    Similar to Lemma 4.6 in [4], we derive that

    Shk+1(f+uhk)Shk(f+uhk)1Cη2,Thk(uhk,yhk,Rh),Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)1Cη3,Thk(yhk,phk,Rh).

    In connection with the above estimates we conclude that

     Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0C(J(h0)(η2,Thk(uhk,yhk,Rh)+η3,Thk(yhk,phk,Rh))+Shk+1(f+uhk)Shk(f+uhk)0).

    For the third term at the right end of the above inequality, we let φhH10(Ω) be the solution of the following problem

    a(q,φh)=(Shk+1(f+uhk)Shk(f+uhk),q), qH10(Ω).

    According to the standard duality theory, we can deduce that

    Shk+1(f+uhk)Shk(f+uhk)20=a(Shk+1(f+uhk)Shk(f+uhk),φhφhk)CJ(h0)Shk+1(f+uhk)Shk(f+uhk)1Shk+1(f+uhk)Shk(f+uhk)0,

    where φhk is the standard finite element estimate of φh with respect to VThk. So we have

    Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0C(J(h0)(η2,Thk(uk,yk,Rh)+η3,Thk(yhk,phk,Rh))).

    As mentioned above, we can get

    (phk+1phk,uhkuhk+1)C(J(h0)(η2,Thk(uhk,yhk,Rh)+η3,Thk(yhk,phk,Rh))uhkuhk+10).

    In combination with (4.22) and above inequality, we deduct that

    uhkuhk+10C(η1,Thk(phk,Rh)+J(h0)(η2,Thk(uhk,yhk,Rh)+η3,Thk(yhk,phk,Rh))). (4.23)

    It is easy to derive the desired result (4.19) with the help of (4.16) and (4.23).

    Step 2. Our task now is to prove (4.20) and so is (4.21). Obviously we have

    yhk+1yhk0=Shk+1(f+uhk+1)Shk(f+uhk)0Shk+1(f+uhk+1)Shk+1(f+uhk)0+Shk+1(f+uhk)Shhk(f+uhk)0C(uhkuhk+10+J(h0)η2,Thk(uhk,yhk,Rh)). (4.24)

    By using the Cauchy inequality, we obtain

    2a(yyhk+1,yhk+1yhk)=2(uuhk+1,yhk+1yhk)2(ϕ(y)ϕ(yhk+1),yhk+1yhk)δuuhk+120+1δyhk+1yhk20+(˜ϕ(y)(yyhk+1),yhk+1yhk)δuuhk+120+1δyhk+1yhk20+˜ϕ(y)0yyhk+10yhk+1yhk0δuuhk+120+1δyhk+1yhk20+C(δyyhk+120+1δyhk+1yhk20). (4.25)

    It is easy to derive the desired result (4.20) with the assistance of (4.17) and (4.23)–(4.25).

    For ThkT, we will denote Uhkad, Vhk and the solution (uhk,yhk,phk) of (2.8)–(2.10) with respect to Thk by Uhkad, Vhk, and (uhk,yhk,phk) and we define some notations before we prove the convergence of Algorithm 3.1 as follows:

    e2hk=uuhk20+yyhk21+pphk21,E2hk=uhkuhk+120+yhkyhk+121+phkphk+121,˜η2Thk(ω)=η22,Thk(uhk,yhk,ω)+η23,Thk(yhk,phk,ω),

    for ωΩ.

    Theorem 4.1. Let (Thk,Uhkad,Vhk,uhk,yhk,phk) be the sequence of grids, finite element spaces and discretesolutions produced by the Algorithm 3.1. Then there existconstants γ1>0, γ2>0 and α(0,1), onlydepending on the shape regularity of initial grids Th0, b, and the marking parameter θ(0,1], such that

    e2hk+1+γ1η21,Thk+1(phk+1,Thk+1)+γ2˜η2Thk+1(Thk+1)α(e2hk+γ1η21,Thk(phk,Thk)+γ2˜η2Thk(Thk)), (4.26)

    provided h01.

    Proof. We get the following results from Theorem 2.1, Lemma 4.3 and Lemma 4.4,

    e2hkCη2Thk(Thk), (4.27)
    ˜η2Thk+1(Thk+1)(1+δ){˜η2Thk(Thk)λ˜η2Thk(Mhk)}+C(1+1δ)E2hk, (4.28)
    η21,Thk+1(phk+1,Thk+1)(1+δ1){η21,Thk(phk,Thk)(121/2)η21,Thk(phk,Rhk)}+C(1+1δ1)h20phkphk+121, (4.29)
    (12δ)e2hk+1e2hkE2hk+C1δ(η21,Thk(phk,Rhk)+J(h0)˜η2Thk(Thk)), (4.30)

    where Rhk is the set of elements which are refined from Thk to Thk+1. Applying the upper bound in Theorem 2.1, (4.29) can be simplified into

    η21,Thk+1(phk+1,Thk+1)(1+δ1){η21,Thk(phk,Thk)(121/2)η21,Thk(phk,Rhk)}+C(1+1δ1)h20(η21,Thk+1(phk+1,Thk+1)+˜η2Thk+1(Thk+1)+η21,Thk(phk,Thk)+˜η2Thk(Thk)). (4.31)

    Multiplying (4.28) and (4.31) with ~γ2 and ~γ1, respectively, and adding the results to (4.30) yields

     (12δ)e2hk+1+~γ1η21,Thk+1(phk+1,Thk+1)+~γ2˜η2Thk+1(Thk+1)e2hk+~γ2(1+δ){˜η2Thk(Thk)λ˜η2Thk(Mhk)}+~γ2C(1+1δ)E2hkE2hk(~γ1(1+δ1)(121/2)C1δ)η21,Thk(phk,Rhk)+~γ1(1+δ1)η21,Thk(phk,Thk)+C1δJ2(h0)˜η2Thk(Thk)+~γ1C(1+1δ1)h20(η21,Thk+1(phk+1,Thk+1)+˜η2Thk+1(Thk+1)+η21,Thk(phk,Thk)+˜η2Thk(Thk)).

    If \tilde{\gamma_{1}} is such that

    \tilde{\gamma_{1}}(1+\delta_{1})(1-2^{-1/2})-C\frac{1}{\delta} > 0,

    and one chooses \tilde{\gamma_{2}} such that

    \begin{equation} \tilde{\gamma_{2}}C\Big(1+\frac{1}{\delta}\Big) = 1, \end{equation} (4.32)

    then we have

    \begin{align*} &\quad(1-2\delta)e_{h_{k+1}}^2+\tilde{\gamma_{1}}\Big(1-C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2\Big)\eta_{1, \mathcal{T}_{h_{k+1}}}^2(p_{h_{k+1}}, \mathcal{T}_{h_{k+1}})\\ &\quad +\Big(\tilde{\gamma_{2}}-\tilde{\gamma_{1}}C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2\Big)\tilde{\eta}_{\mathcal{T}_{h_{k+1}}}^2(\mathcal{T}_{h_{k+1}})\\ &\leq e_{h_k}^2+\tilde{\gamma_{1}}\Big((1+\delta_{1})+C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2\Big)\eta_{1, \mathcal{T}_{h_k}}^2(p_{h_k}, \mathcal{T}_{h_k})\\ &\quad +\Big(\tilde{\gamma_{2}}(1+\delta)+\tilde{\gamma_{1}}C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2+C\frac{1}{\delta}\mathcal{J}^2(h_0)\Big)\tilde{\eta}_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k})\\ &\quad -c\Big(\eta_{1, \mathcal{T}_{h_k}}^2(p_{h_k}, \mathcal{M}_{h_k})+\tilde{\eta}_{\mathcal{T}_{h_k}}^2(\mathcal{M}_{h_k})\Big), \end{align*}

    where

    c = \min\Big\{\tilde{\gamma_{2}}(1+\delta)\lambda, \tilde{\gamma_{1}}(1+\delta_{1})(1-2^{-1/2})-C\frac{1}{\delta}\Big\}.

    By using the marking strategy in Algorithm 3.1 and the upper bound in Theorem 2.1 to arrive at

    \begin{align*} &\quad(1-2\delta)e_{h_{k+1}}^2+\tilde{\gamma_{1}}\Big(1-C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2\Big)\eta_{1, \mathcal{T}_{h_{k+1}}}^2(p_{h_{k+1}}, \mathcal{T}_{h_{k+1}})\\ &\quad +\Big(\tilde{\gamma_{2}}-\tilde{\gamma_{1}}C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2\Big)\tilde{\eta}_{\mathcal{T}_{h_{k+1}}}^2(\mathcal{T}_{h_{k+1}})\\ &\leq(1-C\theta\beta)e_{h_k}^2+\Big(\tilde{\gamma_{1}}\Big((1+\delta_{1})+C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2\Big)-c\theta(1-\beta)\Big)\eta_{1, \mathcal{T}_{h_k}}(p_{h_k}, \mathcal{T}_{h_k})\\ &\quad +\Big(\tilde{\gamma_{2}}(1+\delta)+\tilde{\gamma_{1}}C\Big(1+\frac{1}{\delta_{1}}\Big)h_{0}^2+C\frac{1}{\delta}\mathcal{J}^2(h_{0})-c\theta(1-\beta)\Big) \tilde{\eta}_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k}), \end{align*}

    where \beta\in(0, 1) . Then we deduce that

    \begin{align} &\ \quad e_{h_{k+1}}^2+\gamma_{1}\eta_{1, \mathcal{T}_{h_{k+1}}}^2(p_{h_{k+1}}, \mathcal{T}_{h_{k+1}})+\gamma_{2}\tilde{\eta}_{\mathcal{T}_{h_{k+1}}}^2(\mathcal{T}_{h_{k+1}})\\ &\leq\alpha_{1}e_{h_k}^2+\alpha_{2}\gamma_{1}\eta_{1, \mathcal{T}_{h_k}}^2(p_{h_k}, \mathcal{T}_{h_k})+\alpha_{3}\gamma_{2}\tilde{\eta}_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k}), \end{align}

    where

    \begin{align*} &\alpha_{1} = \frac{1-C\theta\beta}{1-2\delta}, \\ &\gamma_{1} = \frac{\tilde{\gamma_{1}}\Big(1-C(1+\delta_{1}^{-1})h_{0}^2\Big)}{1-2\delta}, \\ &\gamma_{2} = \frac{\tilde{\gamma_{2}}-\tilde{\gamma_{1}}C(1+\delta_{1}^{-1})h_{0}^2}{1-2\delta}, \\ &\alpha_{2} = \frac{\tilde{\gamma_{1}}\Big((1+\delta_{1})+C(1+\delta_{1}^{-1})h_{0}^2\Big)-c\theta(1-\beta)}{\tilde{\gamma_{1}}\Big(1-C(1+\delta_{1}^{-1})h_{0}^2\Big)}, \\ &\alpha_{3} = \frac{\tilde{\gamma_{2}}(1+\delta)+\tilde{\gamma_{1}}C(1+\delta_{1}^{-1})h_{0}^2+C\delta^{-1}\mathcal{J}^2(h_0)-c\theta(1-\beta)} {\tilde{\gamma_{2}}-\tilde{\gamma_{1}}C(1+\delta_{1}^{-1})h_{0}^2}. \end{align*}

    As long as \delta < 1 and \beta is small enough, \alpha_{1}\in(0, 1) can be guaranteed. To facilitate judgment, we transfer the following adjustments to the above formula:

    \begin{align*} &\alpha_{2} = \frac{(1-C(1+\delta_{1}^{-1})h_{0}^2)+\delta_{1}+2C(1+\delta_{1}^{-1})h_{0}^2-\frac{c\theta(1-\beta)}{\tilde{\gamma_{1}}}}{1-C(1+\delta_{1}^{-1})h_{0}^2}, \\ &\alpha_{3} = \frac{\tilde{\gamma_{2}}(1+\delta)-\tilde{\gamma_{1}} C(1+\delta_{1}^{-1})h_{0}^2+2\tilde{\gamma_{1}} C(1+\delta_{1}^{-1})h_{0}^2+C\delta^{-1}\mathcal{J}^2(h_{0})-c\theta(1-\beta)}{\tilde{\gamma_{2}}-\tilde{\gamma_{1}}C(1+\delta_{1}^{-1})h_{0}^2}. \end{align*}

    It is absolutely clear that

    \alpha_{2}\in(0, 1),

    if h_{0}\ll 1 and \delta_{1} is sufficiently small. Then consider (4.32) to deduce that

    \gamma_{2} = \frac{\delta^2}{C(1+\delta)},

    which can say

    \alpha_{3}\in(0, 1),

    if h_{0}\ll 1 and \delta is sufficiently small. Therefore if choose \alpha = \max\{\alpha_{1}, \alpha_{2}, \alpha_{3}\} , we can derive the expected results.

    Theorem 4.2. Let (\mathcal{T}_{h_k}, U_{ad}^{h_k}, V_{h_k}, u_{h_k}, y_{h_k}, p_{h_k}) be the sequence of grids, finite element spaces and discretesolutions produced by the Algorithm 3.1 and the conditionsof Theorem 4.1 keep. Then we have

    \begin{align*} \|u-u_{h_k}\|_0^2+\|y-y_{h_k}\|^2_1+\|p-p_{h_k}\|_1^2\rightarrow 0 \quad\mathrm{as}\quad k\rightarrow \infty. \end{align*}

    Proof. It is obviously true combining Theorem 2.1 and Theorem 4.1.

    In this section we consider the quasi-optimality for the adaptive finite element method. Firstly we give the notations interpretation. For \mathcal{T}_h, \mathcal{T}_{h_1}, \mathcal{T}_{h_2}\in \mathbb{T} , let \#\mathcal{T}_h be the number of elements in \mathcal{T}_h , and \mathcal{T}_{h_1}\oplus\mathcal{T}_{h_2} be the smallest common conforming refinement of \mathcal{T}_{h_1} and \mathcal{T}_{h_2} and satisfies [4,27]

    \begin{equation} \#(\mathcal{T}_{h_1}\oplus\mathcal{T}_{h_2})\leq \#\mathcal{T}_{h_1}+\#\mathcal{T}_{h_2}-\#\mathcal{T}_{h_0}. \end{equation} (5.1)

    According to [19], we need defining a function approximation class

    \begin{align*} \mathcal{A}^s: = &\{(u, y, p, y_{d}, f)\in L^2(\Omega)\times H_{0}^1(\Omega)\times H_{0}^1(\Omega)\times L^2(\Omega)\\ &\times L^2(\Omega):|(u, y, p, y_{d}, f)|_{s} < +\infty\}, \end{align*}

    where

    \begin{align*} |(u, y, p, y_{d}, f)|{_s}: = &\sup\limits_{N > 0}N^s\inf\limits_{\mathcal{T}_h\in\mathbb{T}_{N}}\inf\limits_{(u_{h}, y_{h}, p_{h})\in U_{ad}^{h}\times V_{h}\times V_{h}}\{\|u-u_{h}\|_{0}^2\\ &+\|y-y_{h}\|_{1}^2+\|p-p_{h}\|_{1}^2+osc_{\mathcal{T}_h}^2(f, \mathcal{T}_h))+osc_{\mathcal{T}_h}^2(y_{h}-y_{d}, \mathcal{T}_h)\}^{\frac{1}{2}}, \end{align*}

    and

    \mathbb{T}_{N}: = \{\mathcal{T}_h\in\mathbb{T}:\#\mathcal{T}_h-\#\mathcal{T}_{h_0}\leq N_{0}\}.

    To illustrate the quasi-optimality of the adaptive finite element method, we need a local upper bound on the distance between nested solutions [4], since the error of this method can only be estimated by using the indicators of refined elements without a buffer layer.

    Lemma 5.1. For \mathcal{T}_h, \tilde{\mathcal{T}}_h\in \mathbb{T} and \mathcal{T}_h\subset\tilde{\mathcal{T}}_h , let \mathcal{R}_h bethe set of refined elements from \mathcal{T}_h to \tilde{\mathcal{T}}_h . Let (u_{h}, y_{h}, p_{h}) and (\tilde{u}, \tilde{y}, \tilde{p}) be the solutions of (2.8)–(2.10) with respect to \mathcal{T}_h and \tilde{\mathcal{T}}_h , respectively. Then there exists a constant C , depending on the shape regularity of initial grids \mathcal{T}_{h_{0}} and b such that

    \begin{equation} \|u_{h}-\tilde{u}\|_{0}^2+\|y_{h}-\tilde{y}\|_{1}^2+\|p_{h}-\tilde{p}\|_{1}^2\leq C\eta_{\mathcal{T}_h}^2(\mathcal{R}_h), \end{equation} (5.2)

    where

    \eta_{\mathcal{T}_h}^2(\mathcal{R}_h) = \eta_{1, \mathcal{T}_h}^2(p_{h}, \mathcal{R}_h)+\eta_{2, \mathcal{T}_h}^2(u_{h}, y_{h}, \mathcal{R}_h)+\eta_{3, \mathcal{T}_h}^2(y_{h}, p_{h}, \mathcal{R}_h).

    Proof. From (4.23) of Lemma 4.4, we have

    \begin{equation} \|u_{h}-\tilde{u}\|_{0}^2\leq\eta_{\mathcal{T}_h}^2(\mathcal{R}_h). \end{equation} (5.3)

    By Lemma 4.6 in [4], we deduce that

    \begin{align} \|y_{h}-\tilde{y}\|_{1}& = \|S_{h}(f+u_{h})-S_{\tilde{h}}(f+\tilde{u})\|_{1}\\ &\leq\|S_{h}(f+u_{h})-S_{\tilde{h}}(f+u_{h})\|_{1}+\|S_{\tilde{h}}(f+u_{h})-S_{\tilde{h}}(f+\tilde{u})\|_{1}\\ &\leq C(\eta_{2, \mathcal{T}_h}(u_{h}, y_{h}, \mathcal{R}_h)+\|u_{h}-\tilde{u}\|_{0}). \end{align} (5.4)

    The corresponding result is attained by the similar method above

    \begin{align} \|p_{h}-\tilde{p}\|_{1}& = \|S_{h}^*(S_{h}(f+u_{h})-y_{d})-S_{\tilde{h}}^*(S_{\tilde{h}}(f+\tilde{u})-y_{d})\|_{1}\\ \ &\leq C(\eta_{3, \mathcal{T}_h}(y_{h}, p_{h}, \mathcal{R}_h)+\|y_{h}-\tilde{y}\|_{1}). \end{align} (5.5)

    In connection with (5.3), (5.4) and (5.5) we can derive (5.2).

    Dörfler introduced a crucial marking and proved strict energy error reduction for the Laplacian provided the initial grids \mathcal{T}_{h_0} satisfying a mild assumption [9]. If the sum of errors satisfy suitable error reductions, the error indicators on the coarse grids must satisfy a Dörfler property on the refined one [19].

    Lemma 5.2. Assume that the marking parameter \theta\in(0, \theta^*) , where

    \theta^* = \frac{C}{2C(1+h_{0}^4)+1}.

    For \mathcal{T}_h, \tilde{\mathcal{T}}_h\in\mathbb{T} and \mathcal{T}_h\subset\tilde{\mathcal{T}}_h , let (u_{h}, y_{h}, p_{h}) and (\tilde{u}, \tilde{y}, \tilde{p}) be thesolutions of (2.8)–(2.10) with respect to \mathcal{T}_h and \tilde{\mathcal{T}}_h , respectively. If

    \begin{equation} e_{\tilde{\mathcal{T}}_h}^2+osc_{\tilde{\mathcal{T}}_h}^2(\tilde{\mathcal{T}}_h)\leq \mu[e_{\mathcal{T}_h}^2+osc_{\mathcal{T}_h}^2(\mathcal{T}_h)], \end{equation} (5.6)

    is satisfied for \mu: = \frac{1}{2}\Big(1-\frac{\theta}{\theta^*}\Big) . Then, the set \mathcal{R}_h of elements which are refined from \mathcal{T}_h to \tilde{\mathcal{T}}_h satisfies the Dörfler property

    \eta_{\mathcal{T}_h}^2(\mathcal{R}_h)\geq\theta\eta_{\mathcal{T}_h}^2(\mathcal{T}_h),

    where

    \begin{align*} &e_{\mathcal{T}_h}^2 = \|u-u_{h}\|_{0}^2+\|y-y_{h}\|_{1}^2+\|p-p_{h}\|_{1}^2, \\ &osc_{\mathcal{T}_h}^2(\omega) = osc_{\mathcal{T}_h}^2(f, \omega)+osc_{\mathcal{T}_h}^2(y_{h}-y_{d}, \omega), \end{align*}

    for \omega\subset\mathcal{T}_h and e_{\tilde{\mathcal{T}}_h}^2 , osc_{\tilde{\mathcal{T}}_h}^2(\tilde{\mathcal{T}}_h) similarly defined.

    Proof. By the lower bound in Theorem 2.1 and (5.6) to obtain that

    \begin{align} (1-2\mu)C\eta_{\mathcal{T}_h}^2(\mathcal{T}_h)&\leq(1-2\mu)(e_{\mathcal{T}_h}^2+osc_{\mathcal{T}_h}^2(\mathcal{T}_h))\\ &\leq e_{\mathcal{T}_h}^2-2e_{\tilde{\mathcal{T}}_h}^2+osc_{\mathcal{T}_h}^2(\mathcal{T}_h)-2osc_{\tilde{\mathcal{T}}_h}^2(\tilde{\mathcal{T}}_h). \end{align}

    It is well-known that there exists the fundamental relationships:

    \begin{align} &\|u-u_{h}\|_{0}^2\leq2\|u-\tilde{u}\|_{0}^2+2\|u_{h}-\tilde{u}\|_{0}^2, \\ &\|y-y_{h}\|_{1}^2\leq2\|y-\tilde{y}\|_{1}^2+2\|y_{h}-\tilde{y}\|_{1}^2, \\ &\|p-p_{h}\|_{1}^2\leq2\|p-\tilde{p}\|_{1}^2+2\|p_{h}-\tilde{p}\|_{1}^2. \end{align}

    Hence we can get the following result form Lemma 5.1

    \begin{equation} e_{\mathcal{T}_h}^2-2e_{\tilde{\mathcal{T}}_h}^2\leq2C\eta_{\mathcal{T}_h}^2(\mathcal{R}_h). \end{equation} (5.7)

    For T\in \mathcal{T}_h\cap\tilde{\mathcal{T}}_h , we can get the following result from (4.14) of Lemma 4.3

    \begin{equation} osc_{\mathcal{T}_h}^2(y_{h}-y_{d}, \mathcal{T}_h\cap\tilde{\mathcal{T}}_h)-2osc_{\tilde{\mathcal{T}}_h}^2(\tilde{y}-y_{d}, \mathcal{T}_h\cap\tilde{\mathcal{T}}_h)\leq 2C(h_{0}^4\eta_{\mathcal{T}_h}^2(\mathcal{R}_h)). \end{equation} (5.8)

    According to Remark 2.1 in [4], we get the following result as the indicator \eta_{\mathcal{T}_h}(T) dominates oscillation osc_{\mathcal{T}_h}(T)

    \begin{equation} osc_{\mathcal{T}_h}^2(T)\leq\eta_{\mathcal{T}_h}^2(T), \end{equation} (5.9)

    for all T\in\mathcal{R}_h . Then in connection with (5.7), (5.8) and (5.9) one obtains

    \begin{align*} (1-2\mu)C\eta_{\mathcal{T}_h}^2(\mathcal{T}_h)&\leq(2C(1+h_{0}^4)+1)\eta_{\mathcal{T}_h}^2(\mathcal{R}_h), \\ (1-2\mu)\theta^*\eta_{\mathcal{T}_h}^2(\mathcal{T}_h)&\leq\eta_{\mathcal{T}_h}^2(\mathcal{R}_h), \\ \theta\eta_{\mathcal{T}_h}^2(\mathcal{T}_h)&\leq\eta_{\mathcal{T}_h}^2(\mathcal{R}_h), \end{align*}

    where

    \begin{align*} &\theta^* = \frac{C}{2C(1+h_{0}^4)+1}, \quad\mathrm{and}\quad\theta = (1-2\mu)\theta^*. \end{align*}

    Lemma 5.3. Let (u, y, p) and (\mathcal{T}_{h_k}, U_{ad}^{h_k}, V_{h_k}, u_{h_k}, y_{h_k}, p_{h_k}) be the solution of (2.3)–(2.5) and the sequence ofgrids, finite element spaces and discrete solutions produced by Algorithm 3.1, respectively. Assume that the markingparameter \theta satisfies the condition in Lemma 5.2, then the following estimate is valid

    \begin{equation} \#\mathcal{M}_{h_k}\leq C\Big( N^{\frac{1}{2s}}|(u, y, p, y_{d}, f)|_{s}^{\frac{1}{s}}\mu^{-\frac{1}{2s}}(e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k}))^{-\frac{1}{2s}}\Big), \end{equation} (5.10)

    if (u, y, p, y_{d}, f)\in \mathcal{A}^s .

    Proof. Let \epsilon^2: = \mu N^{-1}(e_{\mathcal{T}_{k}}^2+osc_{\mathcal{T}_{k}}^2(\mathcal{T}_{k})) , where N shall be produced in the proof of (5.13) and \mu is defined in Lemma 5.2. Because of (u, y, p, y_{d}, f)\in \mathcal{A}^s , there exists a \mathcal{T}_{h_\epsilon}\in\mathbb{T} and a (u_{h_\epsilon}, y_{h_\epsilon}, p_{h_\epsilon})\in U_{ad}^{h_{\epsilon}}\times V_{h_{\epsilon}}\times V_{h_{\epsilon}} such that

    \begin{equation} \#\mathcal{T}_{h_\epsilon}-\#\mathcal{T}_{h_0}\leq|(u, y, p, y_{d}, f)|_{s}^{1/s}\in\epsilon^{-1/s}, \end{equation} (5.11)

    and

    \begin{equation} \|u-u_{h_\epsilon}\|_{0}^2+\|y-y_{h_\epsilon}\|_{1}^2+\|p-p_{h_\epsilon}\|_{1}^2+osc_{\mathcal{T}_{h_\epsilon}}^2(f, \mathcal{T}_{h_\epsilon}) +osc_{\mathcal{T}_{h_\epsilon}}^2(y_{h_\epsilon}-y_{d}, \mathcal{T}_{h_\epsilon})\leq\epsilon^2. \end{equation} (5.12)

    Let (u_{h_*}, y_{h_*}, p_{h_*}) be the solution of (2.8)–(2.10) with respect to \mathcal{T}_{h_*} , where \mathcal{T}_{h_*} = \mathcal{T}_{h_\epsilon}\oplus\mathcal{T}_{h_k} is the smallest common refinement of \mathcal{T}_{h_\epsilon} and \mathcal{T}_{h_k} . In the following we will prove the following inequality firstly

    \begin{equation} e_{h_*}^2+osc_{\mathcal{T}_{h_*}}^2(\mathcal{T}_{h_*})\leq N\Big(e_{\mathcal{T}_{h_\epsilon}}^2+osc_{\mathcal{T}_{h_\epsilon}}^2(\mathcal{T}_{h_\epsilon})\Big), \end{equation} (5.13)

    where

    \begin{align*} &e_{\mathcal{T}_{h_\epsilon}}^2: = \|u-u_{h_\epsilon}\|_{0}^2+\|y-y_{h_\epsilon}\|_{1}^2+\|p-p_{h_\epsilon}\|_{1}^2, \\ &osc_{\mathcal{T}_{h_\epsilon}}^2(\mathcal{T}_{h_\epsilon}): = osc_{\mathcal{T}_{h_\epsilon}}^2(f, \mathcal{T}_{h_\epsilon}) +osc_{\mathcal{T}_{h_\epsilon}}^2(y_{h_\epsilon}-y_{d}, \mathcal{T}_{h_\epsilon}). \end{align*}

    According to the principle of adding one item and subtracting one item, then we have

    \begin{align*} &\|u-u_{h_\epsilon}\|_{0}^2 = \|u-u_{h_*}\|^2_{0}+\|u_{h_\epsilon}-u_{h_*}\|_{0}^2+2(u-u_{h_*}, u_{h_*}-u_{h_\epsilon}), \\ &\|y-y_{h_\epsilon}\|_{1}^2 = \|y-y_{h_*}\|_{1}^2+\|y_{h_\epsilon}-y_{h_*}\|_{1}^2+2a(y-y_{h_*}, y_{h_*}-y_{h_\epsilon}), \\ &\|p-p_{h_\epsilon}\|_{1}^2 = \|p-p_{h_*}\|_{1}^2+\|p_{h_\epsilon}-p_{h_*}\|_{1}^2+2a(p-p_{h_*}, p_{h_*}-p_{h_\epsilon}). \end{align*}

    With the help of the Young's inequality one obtains

    \begin{align*} (u-u_{h_*}, u_{h_*}-u_{h_\epsilon})& = (u-u_{h_\epsilon}, u_{h_*}-u_{h_\epsilon})-(u_{h_*}-u_{h_\epsilon}, u_{h_*}-u_{h_\epsilon})\\ &\leq (u-u_{h_\epsilon}, u_{h_*}-u_{h_\epsilon})\\ &\leq\|u-u_{h_\epsilon}\|_{0}^2+\frac{1}{4}\|u_{h_*}-u_{h_\epsilon}\|_{0}^2, \end{align*}

    and so are a(y-y_{h_*}, y_{h_*}-y_{h_\epsilon}) and a(p-p_{h_*}, p_{h_*}-p_{h_\epsilon}) . Hence in connection with what we get above we deduce that

    \begin{align} &\ \quad \|u-u_{h_*}\|_{0}^2+\|u_{h_*}-u_{h_\epsilon}\|_{0}^2+\|y-y_{h_*}\|_{a}^2+\|y_{h_*}-y_{h_\epsilon}\|_{1}^2+\|p-p_{h_*}\|_{1}^2+\|p_{h_*}-p_{h_\epsilon}\|_{1}^2\\ &\leq 6\Big(\|u-u_{h_\epsilon}\|_{0}^2+\|y-y_{h_\epsilon}\|_{1}^2+\|p-p_{h_\epsilon}\|_{1}^2\Big). \end{align} (5.14)

    From Remark 2.1 in [4] and (4.14) in Lemma 4.3 with \mathcal{T}_h = \tilde{\mathcal{T}}_h = \mathcal{T}_{h_*}, \ y = y_{h_*} and \tilde{y} = y_{h_\epsilon} , we obtain that

    \begin{align} osc_{\mathcal{T}_{h_*}}^2(y_{h_*}-y_{d}, \mathcal{T}_{h_*})-2osc_{\mathcal{T}_{h_\epsilon}}^2(y_{h_\epsilon}-y_{d}, \mathcal{T}_{h_\epsilon})&\leq osc_{\mathcal{T}_{h_*}}^2(y_{h_*}-y_{d}, \mathcal{T}_{h_*})-2osc_{\mathcal{T}_{h_*}}^2(y_{h_\epsilon}-y_{d}, \mathcal{T}_{h_*})\\ &\leq2Nh_{0}^4\|y_{h_*}-y_{h_\epsilon}\|_{1}^2. \end{align} (5.15)

    For any T'\in\mathcal{T}_{h_\epsilon} , let \mathcal{T}_{h_{T'}}: = \{T\in \mathcal{T}_{h_*}:T\in T'\} . From the proof of Lemma 4.3 in [19], we derive that

    \begin{align*} \sum\limits_{T\in\mathcal{T}_{h_{T'}}}\|f-f_{T}\|_{0, T}^2\leq N\|f-f_{T'}\|_{0, T'}^2. \end{align*}

    And then we can get

    \begin{equation} osc_{\mathcal{T}_{*}}^2(f, \mathcal{T}_{*})\leq N (osc_{\mathcal{T}_{\epsilon}}^2(f, \mathcal{T}_{\epsilon})). \end{equation} (5.16)

    Combining (5.14)–(5.16) to obtain (5.13) and using (5.12) and the definition of \epsilon^2 , we have

    \begin{align} e_{h_*}^2+osc_{\mathcal{T}_{h_*}}^2(\mathcal{T}_{h_*})\leq N\Big(e_{\mathcal{T}_{h_\epsilon}}^2+osc_{\mathcal{T}_{h_\epsilon}}^2(\mathcal{T}_{h_\epsilon})\Big)\leq N\epsilon^2 = \mu\Big(e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k})\Big). \label{4-17} \end{align}

    It is true for the following result from Lemma 5.2

    \begin{equation} \#\mathcal{M}_{h_k}\leq\#\mathcal{R}_h\leq\#\mathcal{T}_{h_*}-\#\mathcal{T}_{h_k}\leq\#\mathcal{T}_{h_\epsilon}-\#\mathcal{T}_{h_0}. \end{equation} (5.17)

    Combining (5.11), (5.17) and the definition of \epsilon^2 to derive the desired result (5.10).

    The following consequence is the result of previous estimates where the truth is that the number of elements is a dwindle for the errors. Namely, if a given adaptive method is used to approximate the exact solution at a certain convergence rate, the iteratively constructed grids sequence will achieve this rate until a constant factor.

    Theorem 5.1. Let (u, y, p) and (\mathcal{T}_{h_k}, U_{ad}^{h_k}, V_{h_k}, u_{h_k}, y_{h_k}, p_{h_k}) be the solution of (2.3)–(2.5) and the sequence ofgrids, finite element spaces and discrete solutions produced by Algorithm 3.1, respectively. Assume that \mathcal{T}_{h_0} satisfies the condition (b) of Section 4 in [27]. Let (u, y, p, y_{d}, f)\in \mathcal{A}_{s} , then we have

    \begin{equation} \#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\leq C|(u, y, p, y_{d}, f)|_{s}^{\frac{1}{s}}\Big(e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k})\Big)^{-\frac{1}{2s}}, \end{equation} (5.18)

    provided h_{0}\ll 1 .

    Proof. It follows from Theorem 2.1 and Lemma 5.3 that

    \begin{equation} e_{h_k}^2+\gamma_{1}\eta_{1, \mathcal{T}_{h_k}}^2(p_{h_k}, \mathcal{T}_{h_k})+\gamma_{2}\tilde{\eta}_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\approx e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k}). \end{equation} (5.19)

    From Lemma 2.3 in [4], we have

    \begin{equation} \#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\leq C\sum\limits_{i = 0}^{k-1}\mathcal{M}_{h_i}. \end{equation} (5.20)

    With the assistance of Lemma 5.3 and (5.20) to gain

    \begin{equation} \#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\leq C\sum\limits_{i = 0}^{k-1}\mathcal{M}_{h_i}\leq C\Big( M\sum\limits_{i = 0}^{k-1}\Big(e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k})\Big)^{-\frac{1}{2s}}\Big), \end{equation} (5.21)

    where

    M = N^{\frac{1}{2s}}|(u, y, p, y_{d}, f)|_{s}^{\frac{1}{s}}\alpha^{-\frac{1}{2s}}.

    Then it follows from (5.19), (5.21) and Theorem 4.1 that

    \begin{align} \#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}&\leq C\Big( M\Big(e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k})\Big)^{-\frac{1}{2s}}\sum\limits_{i = 1}^k\alpha^{-\frac{1}{2s}}\Big)\\ &\leq C|(u, y, p, y_{d}, f)|_{s}^{\frac{1}{s}}\Big(e_{h_k}^2+osc_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k})\Big)^{-\frac{1}{2s}}, \end{align}

    which tells the proof of Theorem 5.1.

    In this section, we firstly present an adaptive finite element and then give the adaptive iteration method where the purpose is to provide empirical analysis for our theory.

    Example 1. We consider the nonlinear optimal control problem governed by nonlinear elliptic equations subject to the state equation

    \begin{align*} -\Delta y+y^3 = f+u, \quad-\Delta p+3y^2p = y-y_d, \end{align*}

    where we choose \alpha = 1 and \Omega = [0, 1]\times[0, 1] and apparently exact solution

    \begin{align*} u = &\frac{1}{\alpha}(\max(0, \bar{p})-p), \\ y = &\sin(\pi x_1)+\sin(\pi x_2), \\ p = &-y. \end{align*}

    By simple calculation we have \int_\Omega pdx = -\frac{4}{\pi} which satisfies u\in U_{ad} .

    We choose 15 adaptive loops for Example 1, then we plot the profiles of the exact state and the numerical state on adaptively refined grids with \theta = 0.5 in Figure 1. It is easy to see that the solution is smooth, but we can find larger gradients in some regions, hence comparing with uniform refinement, adaptive finite element method can provide smaller error. In Figure 2, we provide the triangle refined grids after 6 and 12 adaptive iterations of Algorithm 3.1.

    Figure 1.  The exact state (left) and the numerical state (right) for Example.
    Figure 2.  The adaptive grids after 6 steps (left) and 12 steps (right) for Example.

    In Figure 3, we plot the convergence history for the errors where the left is adaptive refinement (\theta = 0.5) and the right is uniform refinement (\theta = 1) . We can find it very intuitively when we provide the optimal convergence rate slope -1 via adopting the linear finite elements, the error reduction can be observed.

    Figure 3.  The error estimate compares between adaptively (left) and uniformly (right) refined grids for Example 1.

    Example 2. We consider the same nonlinear optimal control problem as Example 1 with \alpha = 0.1 , \Omega = (-1, 1)\times(0, 1)\cup(-1, 0)\times(-1, 0] and apparently exact solution

    \begin{align*} &p = \begin{cases} -5\times10^{10}e^\frac{1}{m}, \quad&m < 0, \\0, &m\geq 0, \end{cases}\\ &y = -p, \\ &m = (x_1-0.2)^2+(x_2-0.6)^2-0.04, \end{align*}

    where u\in U_{ad} can be guaranteed.

    Comparing with Example 1, we provide some plots concerning with 21 adaptive loops for Example 2 with \theta = 0.5 . In Figure 4, it is more easy to say that the solution is smooth while the large gradients can be found in some regions illustrating that adaptive refinement can obtain smaller errors than the uniformly refinement where we offer the error estimate graphs to explain. In Figure 5, the left plot tells us the error estimates on adaptive refinement (\theta = 0.5) and the right shows the error estimates on uniform refinement (\theta = 1) . With the slope -1 being the optimal convergence rate expected, we see the error reduction from Figure 5. Meanwhile we can also find that the convergence order of the total-error and the error estimate indicators are approaching to straight line slope -1 in which they are roughly parallel where it is showed that the posteriori error estimates we obtained in Section 2 are reliable.

    Figure 4.  The exact state (upper-middle) and the numerical state (left) and the adjoint state (right) variables for Example 2.
    Figure 5.  The error estimate compares between adaptively (left) and uniformly (right) refined grids for Example 2.

    In Figure 6, we show the adaptive grids after 9 and 19 adaptive iterations for Example 2 of 21 adaptive loops with \theta = 0.5 . We can find that the grids are concentrated on the regions where the solutions have larger gradients. Only can we note that reduced orders are observed for the uniform refinement because of the singularity of the solutions.

    Figure 6.  The adaptive grids after 9 steps (left) and 19 steps (right) for Example 2.

    In this paper, we first study the adaptive finite element method for nonlinear optimal control problems and give the corresponding adaptive algorithm. To evaluate the adaptive finite element method, we obtain the a posteriori error estimates for the nonlinear elliptic equations with upper and lower bound convergence and optimality, which are also important indicators for evaluating the algorithms. Therefore, we prove that the sum of the posterior errors of the control, state and covariance variables are convergent, as shown in Theorem 4.2. Based on the local upper bound, we prove the quasi-optimality of the proposed adaptive algorithm, see Section 5. To verify our theoretical analysis, we finally provide some numerical simulations. In previous research papers, the finite element methods for linear optimal control problems were studied. Our innovation is to extend the method of linear optimal control problems to a series of nonlinear optimal control problems.

    There are a lot of problems which can not be tackled, such as the L^2-L^2 posteriori error estimates for nonlinear elliptic equations as well as the convergence and quasi-optimality for nonlinear parabolic equations. Furthermore, we note that the analysis in this paper can be generalized to common nonlinear parabolic problems and boundary problems, and we will work on these problems.

    This work is supported by National Science Foundation of China (11201510), National Social Science Fund of China (19BGL190), Chongqing Research Program of Basic Research and Frontier Technology (cstc2019jcyj-msxmX0280), General project of Chongqing Natural Science Foundation (cstc2021jcyj-msxmX0949), Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJZD-K20200120), Chongqing Key Laboratory of Water Environment Evolution and Pollution Control in Three Gorges Reservoir Area (WEPKL2018YB-04), Science and technology research project of Chongqing Education Commission (KJQN202101212), Research Center for Sustainable Development of Three Gorges Reservoir Area (2019sxxyjd07), Guangdong Basic and Applied Basic Research Foundation of Joint Fund Project (2021A1515111048), and Guangdong Province Characteristic Innovation Project (2021WTSCX120).

    The authors declare that they have no competing interests.



    [1] D. Yang, X. Li, J. Qiu, Output tracking control of delayed switched systems via state-dependent switching and dynamic output feedback, Nonlinear Anal. Hybrid Syst., 32 (2019), 294-305. doi: 10.1016/j.nahs.2019.01.006
    [2] X. Yang, X. Li, Q. Xi, et al. Review of stability and stabilization for impulsive delayed systems, Math. Biosci. Eng., 15 (2018), 1495-1515. doi: 10.3934/mbe.2018069
    [3] X. Li, X. Yang, T. Huang, Persistence of delayed cooperative models: Impulsive control method, Appl. Math. Comput., 342 (2019), 130-146.
    [4] Y. Tan, C. Huang, B. Sun, et al. Dynamics of a class of delayed reaction-diffusion systems with Neumann boundary condition, J. Math. Anal. Appl., 458 (2018), 1115-1130. doi: 10.1016/j.jmaa.2017.09.045
    [5] C. Huang, X. Long, J. Cao, Stability of anti-periodic recurrent neural networks with multiproportional delays, Math. Method Appl. Sci., 2020.
    [6] J. Zhang, C. Huang, Dynamics analysis on a class of delayed neural networks involving inertial terms, Adv. Differ. Equations, 120 (2020), 1-12. doi: 10.1186/s13662-019-2438-0
    [7] X. Long, S. Gong, New results on stability of Nicholson's blowflies equation with multiple pairs of time-varying delays, Appl. Math. Lett., 100 (2020), 106027. doi: 10.1016/j.aml.2019.106027
    [8] C. Huang, Y. Qiao, L. Huang, et al. Dynamical behaviors of a food-chain model with stage structure and time delays, Adv. Differ. Equations, 186 (2018).
    [9] C. Huang, J. Cao, F. Wen, et al. Stability Analysis of SIR Model with Distributed Delay on Complex Networks, Plos One, 11 (2016), e0158813.
    [10] H. Hu, X. Yuan, L. Huang, et al. Global dynamics of an SIRS model with demographics and transfer from infectious to susceptible on heterogeneous networks, Math. Biosci. Eng., 16 (2019), 5729-5749. doi: 10.3934/mbe.2019286
    [11] H. Hu, X. Zou, Existence of an extinction wave in the fisher equation with a shifting habitat, Proc. Am. Math. Soc., 145 (2017), 4763-4771. doi: 10.1090/proc/13687
    [12] H. Hu, T. Yi, X. Zou, On spatial-temporal dynamics of Fisher-KPP equation with a shifting environment, Proc. Amer. Math. Soc., 148 (2020), 213-221. doi: 10.1090/proc/14659
    [13] J. Wang, C. Huang, L. Huang, Discontinuity-induced limit cycles in a general planar piecewise linear system of saddle-focus type, Nonlinear Anal. Hybrid Syst., 33 (2019), 162-178. doi: 10.1016/j.nahs.2019.03.004
    [14] J. Wang, X. Chen, L. Huang, The number and stability of limit cycles for planar piecewise linear systems of nodeCsaddle type, J. Math. Anal. Appl., 469 (2019), 405-427. doi: 10.1016/j.jmaa.2018.09.024
    [15] C. Huang, Z. Yang, T. Yi, et al. On the basins of attraction for a class of delay differential equations with non-monotone bistable nonlinearities, J. Differ. Equations, 256 (2014), 2101-2114. doi: 10.1016/j.jde.2013.12.015
    [16] C. Huang, H. Zhang, J. Cao, et al. Stability and Hopf bifurcation of a delayed prey-predator model with disease in the predator, Int. J. Bifurcation Chaos, 29 (2019), 1950091.
    [17] C. Huang, H. Zhang, L. Huang, Almost periodicity analysis for a delayed Nicholson's blowflies model with nonlinear density-dependent mortality term, Commun. Pure Appl. Anal., 18 (2019), 3337-3349. doi: 10.3934/cpaa.2019150
    [18] C. Qian, Y. Hu, Novel stability criteria on nonlinear density-dependent mortality Nicholson's blowflies systems in asymptotically almost periodic environments, J. Inequal. Appl., 13 (2020), 1-18.
    [19] C. Huang, X. Long, L. Huang, et al. Stability of almost periodic Nicholson's blowflies model involving patch structure and mortality terms, Can. Math. Bull., (2019), 1-18.
    [20] C. Huang, H. Yang, J. Cao, Weighted Pseudo Almost Periodicity of Multi-Proportional Delayed Shunting Inhibitory Cellular Neural Networks with D operator, Discrete Contin. Dyn. Syst. Ser. S, 2020.
    [21] S. R. Bernfeld, J. R. A. Haddock, A variation of Razumikhin's method for retarded functional equations, In: Nonlinear systems and applications, An International Conference, New York: Academic Press, 1977, 561-566.
    [22] C. Jehu, Comportement asymptotique des solutions de equation x'(t) = -f (t, x(t)) + f (t, x(t - 1)) + h(t) (in French), Ann. Soc. Sci. Brux. I, 92 (1979), 263-269.
    [23] T. Ding, Asymptotic behavior of solutions of some retarded differential equations, Sci. China Ser. A-Math., 25 (1982), 363-371.
    [24] T. Yi, L. Huang, Asymptotic behavior of solutions to a class of systems of delay differential equations, Acta Math. Sin. (Engl. Ser.), 23 (2007), 1375-1384. doi: 10.1007/s10114-005-0932-7
    [25] M. Xu, W. Chen, X. Yi, New generalization of the two-dimensional Bernfeld-Haddock conjecture and its proof, Nonlinear Anal. Real World Appl., 11 (2010), 3413-3420. doi: 10.1016/j.nonrwa.2009.12.001
    [26] Q. Zhou, W. Wang, Q. Fan, A generalization of the three-dimensional Bernfeld-Haddock conjecture and its proof, J. Comput. Appl. Math., 233 (2009), 473-481. doi: 10.1016/j.cam.2009.07.047
    [27] B. S. Chen, Asymptotic behavior of solutions of some infinite retarded differential equations(in Chinese), Acta Math. Sin. (Engl. Ser.), 3 (1990), 353-358.
    [28] T. Ding, Applications of the qualitative methods in ordinary differential equations (in Chinese), Peking: China Higher Education Press, 2004, 155-163.
    [29] T. Yi, L. Huang, Convergence of solution to a class of systems of delay differential equations, Nonlinear Dyn. Syst. Theory, 5 (2005), 189-200.
    [30] Q. Zhou, Convergence for a two-neuron network with delays, Appl. Math. Lett., 22 (2009), 1181-1184. doi: 10.1016/j.aml.2009.01.028
    [31] S. Hu, L. Huang, T. Yi. Convergence of bounded solutions for a class of systems of delay differential equations, Nonlinear Anal., 61 (2005), 543-549.
    [32] B. S. Chen, Asymptotic behavior of a class of nonautonomous retarded differential equations (in Chinese), Chinese Sci. Bull., 6 (1988), 413-415.
    [33] T. Yi, L. Huang, Convergence for pseudo monotone semi-flows on product ordered topological spaces, J. Differ. Equations, 214 (2005), 429-456. doi: 10.1016/j.jde.2005.02.005
    [34] Q. Zhou, Asymptotic behavior of solutions to a first-order non-homogeneous delay differential equation, Electron. J. Differ. Equations, 103 (2011), 1-8.
    [35] B. Liu, Asymptotic behavior of solutions to a class of non-autonomous delay differential equations, J. Math. Anal. Appl., 446 (2017), 580-590. doi: 10.1016/j.jmaa.2016.09.001
    [36] B. Liu, A generalization of the Bernfeld-Haddock conjecture, Appl. Math. Lett., 65 (2017), 7-13. doi: 10.1016/j.aml.2016.09.018
    [37] S. Xiao, Asymptotic behavior of solutions to a non-autonomous system of two-dimensional differential equations, Electron. J. Differ. Equations, 2017 (2017), 1-12.
  • Reader Comments
  • © 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(4441) PDF downloads(424) Cited by(33)

Figures and Tables

Figures(2)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog