Loading [MathJax]/jax/output/SVG/jax.js
Research article

Convergence and quasi-optimality based on an adaptive finite element method for the bilinear optimal control problem

  • Received: 05 March 2021 Accepted: 24 May 2021 Published: 24 June 2021
  • MSC : 49J20, 65M08

  • This paper investigates the adaptive finite element method for an optimal control problem governed by a bilinear elliptic equation. We establish the finite element discrete scheme for the bilinear optimal control problem and use a dual argument, linearization method, bubble function, and new bubble function to obtain a posteriori error estimates. To prove the convergence and the quasi-optimality for adaptive finite element methods, we introduce the adaptive finite element algorithm, local perturbation, error reduction, discrete local upper bound, Dörfler property, dual argument method, and quasi orthogonality. A few numerical examples are given at the end of the paper to demonstrate our theoretical analysis.

    Citation: Zuliang Lu, Xiankui Wu, Fei Huang, Fei Cai, Chunjuan Hou, Yin Yang. Convergence and quasi-optimality based on an adaptive finite element method for the bilinear optimal control problem[J]. AIMS Mathematics, 2021, 6(9): 9510-9535. doi: 10.3934/math.2021553

    Related Papers:

    [1] Zuliang Lu, Fei Cai, Ruixiang Xu, Lu Xing . Convergence and proposed optimality of adaptive finite element methods for nonlinear optimal control problems. AIMS Mathematics, 2022, 7(11): 19664-19695. doi: 10.3934/math.20221079
    [2] Zuliang Lu, Ruixiang Xu, Chunjuan Hou, Lu Xing . A priori error estimates of finite volume element method for bilinear parabolic optimal control problem. AIMS Mathematics, 2023, 8(8): 19374-19390. doi: 10.3934/math.2023988
    [3] Zuliang Lu, Fei Cai, Ruixiang Xu, Chunjuan Hou, Xiankui Wu, Yin Yang . A posteriori error estimates of hp spectral element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(4): 5220-5240. doi: 10.3934/math.2022291
    [4] Shengying Mu, Yanhui Zhou . An analysis of the isoparametric bilinear finite volume element method by applying the Simpson rule to quadrilateral meshes. AIMS Mathematics, 2023, 8(10): 22507-22537. doi: 10.3934/math.20231147
    [5] Zuliang Lu, Xiankui Wu, Fei Cai, Fei Huang, Shang Liu, Yin Yang . Error estimates in L2 and L norms of finite volume method for the bilinear elliptic optimal control problem. AIMS Mathematics, 2021, 6(8): 8585-8599. doi: 10.3934/math.2021498
    [6] Changling Xu, Hongbo Chen . A two-grid P20-P1 mixed finite element scheme for semilinear elliptic optimal control problems. AIMS Mathematics, 2022, 7(4): 6153-6172. doi: 10.3934/math.2022342
    [7] Tiantian Zhang, Wenwen Xu, Xindong Li, Yan Wang . Multipoint flux mixed finite element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(9): 17461-17474. doi: 10.3934/math.2022962
    [8] Yuelong Tang . Error estimates of mixed finite elements combined with Crank-Nicolson scheme for parabolic control problems. AIMS Mathematics, 2023, 8(5): 12506-12519. doi: 10.3934/math.2023628
    [9] Jie Liu, Zhaojie Zhou . Finite element approximation of time fractional optimal control problem with integral state constraint. AIMS Mathematics, 2021, 6(1): 979-997. doi: 10.3934/math.2021059
    [10] Cagnur Corekli . The SIPG method of Dirichlet boundary optimal control problems with weakly imposed boundary conditions. AIMS Mathematics, 2022, 7(4): 6711-6742. doi: 10.3934/math.2022375
  • This paper investigates the adaptive finite element method for an optimal control problem governed by a bilinear elliptic equation. We establish the finite element discrete scheme for the bilinear optimal control problem and use a dual argument, linearization method, bubble function, and new bubble function to obtain a posteriori error estimates. To prove the convergence and the quasi-optimality for adaptive finite element methods, we introduce the adaptive finite element algorithm, local perturbation, error reduction, discrete local upper bound, Dörfler property, dual argument method, and quasi orthogonality. A few numerical examples are given at the end of the paper to demonstrate our theoretical analysis.



    Bilinear optimal control problems are types of quintessential optimal control problems governed by a partial differential equation, which investigated in material mechanics, engineering mechanics, and engineering design, etc [9,11,22]. Effective numerical methods, such as in [1,12,21] for finite element methods, [4,5] for spectral Galerkin methods, and [6] for fast algorithms, are the key to the successful application of optimal control problems in the practical field. Accordingly, the study of the efficient numerical algorithm for the bilinear optimal control problem has far-reaching theoretical value and application prospects.

    The finite element method is frequently applied to solve optimal control problems and uses the thought of mathematical approximation to model the actual physical system. It replaces complex problems with simpler ones by assuming that each element has a suitable, simple approximate solution and then derives a universal satisfaction condition for the solution domain [10,23]. Since the actual problem is replaced by a simpler one, this solution is not precise, but approximate. Given that the finite element method has high computational accuracy and can adapt to a variety of complex shapes, it has become an effective engineering analysis method.

    Babuška first proposed the adaptive finite element method [2]. The adaptive finite element method is an accurate and efficient method of finite element discretization, which saves a lot of calculation time based on ensuring the specified accuracy. Appropriate grids can enormously reduce the errors resulting from the discretization of finite element approximation procedure in copying with an optimal control problem. As the case stands, the solutions for the optimal control problem of the nonlinear system are generally not available. In addition to the complexity and diversity of the nonlinear equations, it is practicable to solve the nonlinear equations with the thought of the adaptive finite element method.

    Adaptive finite element methods had popularized for many years, but the theoretical analysis of the entire algorithm has just been actualized. The subject work in the early stage was made by Dörfler [13], in which he investigated an adaptive finite element for Poisson's equation by a reduction of energy errors under a mild assumption on the initial grid Th0. Later, Demlow, Morin, et-al. [14,24,25] proved the convergence of adaptive finite element methods without the mild assumption. Then another decisive issue of adaptive finite element methods has been investigated by Binev et al. [3]. In the last decades, much work has been done on the concept of total error and the sum of error energy plus oscillations [8,12,26]. Recently, Liu and Yan [22] found that adaptive finite element methods could be successfully applied in constrained optimal control problems. The later investigations could be found in [15,19]. Inspired by these works, Gong and Yan [18] investigated the convergence and quasi-optimality of an adaptive finite element for control constrained optimal control problems via applying variation control discretization. Later, Leng and Chen [20] studied the convergence of an adaptive finite element method for optimal control problems with integral control constraints.

    To proceed, Section 2 next focuses on the bilinear optimal control problem with integral control constraints, which adopt piecewise constant discretization to deal with the control via. For the state and the co-state, by applying continuous piecewise linear discretization, respectively. Section 3 deduce a posteriori error estimate. For the convergence and the quasi-optimality, we prove them relying on quasi-orthogonality and discrete local upper bound. Section 4 based on the mild assumption to the initial grids, obtain the proof of convergence and quasi-optimality employing the solution operator of nonlinear elliptic equations. Section 5 then advances some numerical illustration.

    Here are some notations that will be employed in this paper. Let Ω be a bounded Lipschitz domain on R2 and Ω denote the boundary of Ω. We adopt the standard notation Wm,q(ω) for Sobolev space with norm m,q,ω and seminorm ||m,q,ω. Where ωΩ, and we will omit the sub description if ω=Ω. We denote Wm,2(Ω) by Hm(Ω) and set H10(Ω)={vH1(Ω):v=0 on Ω}. For m=0 and q=2, we denote W0,2(ω)=L2(ω) and set 0,2,ω=0,ω. Let Th0 denote the initial partition of ˉΩ into disjoint triangles. By newest-vertex bisections for Th0, we obtain a class T of conforming partitions. For Th,˜ThT, we use Th˜Th to indicate that ˜Th is a refinement of Th, where hT=|T|1/2. Denote the L2 inner product by (,), and let C be a constant independent of grids size.

    In this paper, we focus our attention on the following bilinear convex optimal control problem:

    minuUad{12yyd20+α2u20}, (2.1)
    Δy+uy=f,in Ω,y|Ω=0, (2.2)

    where Ω and ΩU are bounded open sets in R2 with a Lipschitz boundary Ω and ΩU. Let fL2(Ω), and Uad be a closed convex set defined as follows

    Uad={uL2(ΩU):ΩUu0}.

    We take the state space V=H10(Ω), the control space U=L2(ΩU), and H=L2(Ω) to determine the concept of finite element approximation for our discussion with the bilinear optimal control problems (2.1)-(2.2). We first give a weak formula for the state equation

    a(y,v)=Ωyvdx, y,vV,(f1,f2)=Ωf1f2dx, (f1,f2)H×H,(y,v)=ΩUyvdx, (f1,f2)U×U.

    We set a norm ||v||a=a(v,v), which is equivalent to ||v||1. Exist constants c and C that satisfies

    a(v,v)c||v||2V,a(y,v)C||y||V||v||V, y,vV. (2.3)

    Then the bilinear optimal control problem (2.1)-(2.2) can be restated as

    minuUad{12yyd20+α2u20}, (2.4)
    a(y,v)+(uy,v)=(f,v), vV. (2.5)

    It is well known that the optimal control problem (2.4)-(2.5) has a solution (y,u) which was proved in the Theorem 2.2.4 of [22], and that if a pair (y,u) is the solution of (2.4)-(2.5), then there exist a costate pV such that the triplet (y,u,p) satisfies the following optimality conditions:

    a(y,v)+(uy,v)=(f,v),vV, (2.6)
    a(q,p)+(up,q)=(yyd,q),qV, (2.7)
    (αuyp,vu)U0,vUadU. (2.8)

    For (2.6), since the coercivity of a(,), and we set a operator S:L2(Ω)H10(Ω) such that Su=y. For (2.7), let S be the adjoint operator of S, we assume that yyd is given, such that S(yyd)=p. Let Vh be the continuous piecewise linear finite element space for the partition ThT. We define Uh as the piecewise constant finite element space of Th. Set Uhad={vhUh:Ωvh0}, then we derive the standard finite element discretization for the bilinear optimal control problem

    minuUad{12yhyd20+α2uh20}, (2.9)
    a(yh,v)+(uhyh,v)=(f,v), vVh. (2.10)

    It is well known that the optimal control problem (2.9)-(2.10) has a solution (yh,uh) and that if a pair (yh,uh)Vh×Uhad is the solution of (2.9)-(2.10), then there is a co-state phVh such that the triplet (yh,uh,ph) satisfies the following optimality conditions

    a(yh,v)+(uhyh,v)=(f,v),vVhV, (2.11)
    a(q,ph)+(uhph,q)=(yhyd,q),qVhV, (2.12)
    (αuhyhph,vhuh)U0,vUhadU. (2.13)

    Similarly, we set a operator Sh: UhadVh such that Suh=yh. Let Sh be the adjoint operator of Sh such that Sh(yhyd)=ph. Let us now introduce some error indicators which we will frequently use in this paper, namely the error η() and oscillation osc(). For ThT, TTh, that are defined by:

    η21,Th(ph,T)=h2Tph20,T,η22,Th(uh,yh,T)=h2Tfuhyh20,T+hT[yh]n20,TΩ,η23,Th(yh,ph,T)=h2Tyhyduhph20,T+hT[ph]n20,TΩ,osc2Th(f,T)=h2TffT20,T,osc2Th(yhyd,T)=h2T(yhyd)(yhyd)T20,T,

    where uhUhad and yh,phVh, fT is L2-projection of f onto piecewise constant space on T and fT=Tf|T|. For ωTh, such that

    η21,Th(ph,ω)=Tωη21,Th(ph,T),osc2Th(f,ω)=Tωosc2Th(f,T),

    Similarly, η22,Th(uh,yh,ω), η23,Th(uh,yh,ω) and osc2Th(yhyd,ω) can be denoted as well.

    Before we discuss residual-based posteriori error estimates for the bilinear optimal control problem, we introduce the well-known result for the integral averaging operator [12].

    Lemma 2.1. Let πah:L1(ΩU)UhU be the integral averaging operator such that

    (πahv)|T:=1|T|Tv, TTh.

    Then for m=0,1 and 1q, here

    ||vπahv||0,q,TChmT|v|m,q,T, vWm,q(ΩU). (2.14)

    We next discuss the residual-based posteriori error estimates for the bilinear optimal control problem. Moreover, we introduce the following auxiliary problems

    a(yh,v)+(uhyh,v)=(f,v), vV, (2.15)
    a(q,ph)+(yhph,q)=(yhyd,q), vV. (2.16)

    Here we state the theorem about a posteriori error estimates.

    Theorem 2.1. Let (y,u,p)H10(Ω)×Uad×H10(Ω) be the solution of problems (2.6)-(2.8), and (yh,uh,ph)Vh×Uhad×Vh be the solution of problems (2.11)-(2.13). Then exist constants c and C holds

    ||uuh||20+||yyh||2a+||pph||2ac(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th)), (2.17)

    and

    C(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th))||uuh||20+||yyh||2a+||pph||2a+osc2Th(f,Th)+osc2Th(yhyd,Th). (2.18)

    Proof. By applying Lemma 2.1, we infer that

    TTh||phπhph||20,TCη21,Th(ph,T),

    where πh is the L2projection operator onto piecewise constant space on Th. Therefore, it follows from Theorem 3.1 of [16], we derive that

    ||uuh||20,TCTTh||phπhph||20,T+Cη22,Th(uh,yh,T)+Cη23,Th(yh,ph,T)C(η21,Th(ph,T)+η22,Th(uh,yh,T)+η23,Th(yh,ph,T)). (2.19)

    Assume that ep=phph, and epI=ˆπhep, where ˆπh is the average interpolation operator defined in Lemma 3.2 of [17], then we obtain

    c||phph||2a(ep,phph)+(uh(phph),ep)=((epepI),(phph))+(uh(phph),epepI)+(epI,phph)+(uh(phph),epI)=TThT(yhyduhph)(epepI)TΩT([ph]n)(epepI)(yhyh,ep)C(σ)TThh2TT(yhyduhph)2+C(σ)TΩhTT([ph]n)2+CσTThh2TT|epepI|2+CσTΩhT|epepI|2+C||yhyh||0||ep||aC(σ)TThh2TT(yhyduhph)2+C(σ)TΩhTT([ph]n)2+C(σ)||yhyh||20+Cσ||ep||2a,

    where ||v||0,4C||v||1 is defined in the embedding theorem of [7], and the property ||v||1C. Then, let σ=c2C, we obtain

    ||phph||2aCTThh2TT(yhyduhph)2+CTΩhTT([ph]n)2+||yhyh||20. (2.20)

    Similarly, let ey=yhyh, and eyI be the average interpolation of ey. It follows from (2.11) and (2.15) that

    c||yhyh||2a((yhyh),ey)+(uh(yhyh),ey)=((yhyh),(eyeyI))+(uh(yhyh),eyeyI)=TThT(fuhyh)(eyeyI)TΩT([ph]n)(eyeyI)C(σ)TThh2TT(fuhyh)2+C(σ)TΩhTT([yh]n)2+Cσ||ey||2a.

    Thus, it holds that

    ||yhyh||2aCTThh2TT(fuhyh)2+CTΩhTT([yh]n)2. (2.21)

    Note that

    ||yyh||a||yhyh||a+||yyh||a, (2.22)
    ||pph||a||phph||+||pph||a, (2.23)
    ||pph||2a+||yyh||2aC||uuh||20. (2.24)

    Hence, (2.17) follows from (2.19)-(2.21) and (2.22)-(2.24).

    To derive the a posteriori lower error bounds for the optimal control problems governed by bilinear elliptic equations we use the standard bubble function technique (see [1,17,27]). Let bT be the standard third order polynomial bubble on T scaled with bT=λ1λ2λ3. We denote by {λ1,λ2,λ3} the barycentric coordinates of T. Due to yh is piecewise linear, we set

    wT=((yhyd)Tyhph)b2T,

    where c1=Tb2T|T| denotes a positive constant, here

    ||wT||20,T=Tc21((yhyd)Tuhph)2b4Tc1T((yhyd)Tuhph)2.

    Then there holds λ1λ2λ3=0 on T, and satisfies

    wT|TΩ=c1((yhyd)Tuhph)(λ1λ2λ3)2|T=0.

    Moreover, we obtain

    wT|TΩ=2c1((yhyd)Tuhph)(λ1λ2λ3)(λ1λ2λ3)|T=0,

    thus we can conclude that wTH20(T). Using the results from [1,27], we derive that

    |wT|20,2,TCh4TT|wT(T)|2.

    We set ˆT be a reference element, and ˆX=FT(X)=X+bT be an affine map from T onto ˆT, we also set ˆw=wF1T(ˆX). Thus we conclude that

    |wT|20,T=ˆT|ˆwT(ˆX)|2.

    Where wTH20(T), we obtain that ˆwTH20(ˆT). Using the Poincare's inequality, we deduce the results below

    ˆT|ˆwT(ˆX)|2C|a|=2ˆT|DaˆwT(ˆX)|2,

    such that

    |wT|20,TCh4T|wT|20,2,T.

    Thus we can obtain that

    ch2T||wT||20,T|wT|20,2,TCh2T||wT||20,T, TTh. (2.25)

    Then use of bubble function wT with (2.25) yields

    Th2T((yhyd)Tyhph)2=Th2T(yhyd)Tuhph)wTT(yhyduhph(yydup))wT+C(σ)T(yhyd(yhyd)T)2+Cσh2T||wT||20,T=T(php)wT+T(yhy)wT+T(uhphup)+C(σ)T(yhyd(yhyd)T)2+Cσh2T||wT||20,TC(σ)||php||2a,T+C(σ)||yhy||20,T+C(σ)||uhu||20,T+C(σ)Th2T(yhyd(yhyd)T)2+Cσh2T||wT||20,T,

    We set uhphup=uh(php)+(uhu)p, such that

    Th2T(yhyduhph)2CT((yhyd)Tuhph)2+CTh2T(yhyd(yhyd)T))2C||php||2a,T+C||yhy||20,T+C||uhu||20,T+CTh2T(yhyd(yhyd)T))2. (2.26)

    Then by using the Schwarz's inequality, it follows from (2.25) that

    ThT([ph]n)2=T([ph]n)wT=T([ph]n[p]n)wT=T(php)wT+T(yydup)wTC(σ)||php||2a,TΩ+T(yyh)wT+T(yhyduhph)wT+T(upuhph)wTC(σ)||php||2a,TΩ+C(σ)||yyh||20,TΩ+C(σ)||uuh||20,TΩ+C(σ)T(yhyduhph)2+Cσ(||wT||2a,TΩ+h2T||wT||20,TΩ),

    assume that upuhph=(uuh)p+uh(pph). Then there holds

    ThT([ph]n)2C||php||2a,TΩ+C||yyh||20,TΩ+C||uuh||20,TΩ+CT(yhyduhph)2. (2.27)

    Applying (2.26)-(2.27) and the Poincare's inequality, we obtain

    η23,Th(yh,ph,Th)=TThTh2T(yhyduhph)2+TΩThT([ph]n)2C||pph||2a+C||yyh||2a+C||uuh||20+Cosc2Th(yhyd,T). (2.28)

    Similarly, we also derive that

    η22,Th(uh,yh,Th)=TThTh2T(fuhph)2+TΩThT([yh]n)2C||pph||2a+C||yyh||2a+C||uuh||20+Cosc2Th(f,T). (2.29)

    Combining Lemma 3.6 of [17], we can derive the same result for bilinear elliptic equations

    ||phπhph||20,TC||uuh||20,T+C||pph||20,T.

    From Lemma 2.1, we infer that

    ||phπhph||20,TCh2T||ph||2a,T.

    Then using the inverse estimates to infer that

    η21,Th(ph,Th)=TThh2T||phπhph||20,TC||uuh||20+C||pph||2a, (2.30)

    and hence, (2.18) follows from (2.28)-(2.30).

    Theorem 2.1 gives reliable and efficient posteriori error estimates. Now we introduce an adaptive finite element algorithm to explain what we mainly investigate in this paper.

    Algorithm 2.1. Adaptive finite element algorithm for bilinear optimal control problems:

    (0) Given an initial mesh Th0 and construct finite element space Uh0ad and Vh0. Select marking parameter 0<θ1 and set k:=0.

    (1) Solve the discrete bilinear optimal control problem (2.11)-(2.13), then obtain an approximate solution (yhk,uhk,phk) with respect to Thk.

    (2) Compute the local error estimator ηThk(T) for all TThk.

    (3) Select a minimal subset Mhk of Thk such that

    η2Thk(Mhk)θη2Thk(Thk),

    where η2Thk(ω)=η21,Th(ph,ω)+η22,Th(uh,yh,ω)+η23,Th(yh,ph,ω) for all ωThk.

    (4) Refine Mhk by bisecting b1 times in passing from Thk to Thk+1 and generally additional elements are refined in the process in order to ensure that Thk+1 is conforming.

    (5) Solve the discrete bilinear optimal control problem (2.11)-(2.13), then obtain approximate solution (yhk+1,uhk+1,phk+1) with respect to Thk+1.

    (6) Set k=k+1 and go to step (2).

    In this section, we first introduce the relevant theorem of reference [24], where the authors construct a simple and efficient adaptive finite element methods to ensure a reduction rate of data oscillation, together with an error reduction based on a posteriori error estimators. In addition, we introduce the internal node properties and mark the data oscillations. To derive the residual-type posterior error estimates [21], we employ the lemmas as follows.

    Lemma 3.1. For all vH1(Ω), ThT,TTh, there exists

    ||v||0,TΩCh1/2T||v||0,T+Ch1/2T||v||1,T.

    Here we state the following local perturbation property.

    Lemma 3.2. For ThT, TTh, let uh1,uh2Uhad, yh1,yh2,ph1,ph2Vh, we have

    η1,Th(ph1,T)η1,Th(ph2,T)ChTph1ph2a,T, (3.1)
    η2,Th(uh1,yh1,T)η2,Th(uh2,yh2,T)C(hTuh1uh20,T+hTph1ph20,T+yh1yh2a,ωT), (3.2)
    η3,Th(yh1,ph1,T)η3,Th(yh2,ph2,T)C(hTuh1uh20,T+hTyh1yh20,T+ph1ph2a,ωT), (3.3)
    oscTh(yh1yd,T)oscTh(yh2yd,T)Ch2Tyh1yh2a,T. (3.4)

    Proof. We only give the proof of (3.1) because of the proofs of (3.1), (3.3)-(3.4) are similar. By the definition of η2,Th(uh,yh,T), we deduce that

    η2,Th(uh1,yh1,T)η2,Th(uh2,yh2,T)+h1/2T||[(yh1yh2)]n||0,TΩ+hT||uh1ph1uh2ph2||0,Tη2,Th(uh2,yh2,T)+h1/2T||[(yh1yh2)]n||0,TΩ+ChT||uh1uh2||0,T+ChT||ph1ph2||0,T. (3.5)

    With the help of inverse estimates and Lemma 3.1, we infer that

    ||[(yh1yh2)]n||0,TΩCh1/2T||yh1yh2||a,ωT. (3.6)

    Therefore (3.2) follows from (3.5)-(3.6).

    By using the similar method with [24], we prove the following result.

    Lemma 3.3. Let Th~Th for Th,˜ThT.MhTh denotes the set of elements which are marked from Th to ˜Th. Then for uhUhad, ˜uhU˜had, yh,phVh, ˜yh,˜phV˜h and any δ,δ1(0,1], we have

     η21,˜Th(˜ph,˜Th)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}C(1+δ11)h20ph˜ph2a, (3.7)

    and

     η22,˜Th(˜uh,˜yh,˜Th)(1+δ){η22,T(uh,yh,Th)λη22,Th(uh,yh,Mh)}C(1+δ1)(h20uh˜uh20+h20ph˜ph20+yh˜yh2a), (3.8)

    and

     η23,˜Th(˜yh,˜ph,˜Th)(1+δ){η23,Th(yh,ph,Th)λη23,Th(uh,yh,Mh)}C(1+δ1)(h20uh˜uh20+h20yh˜yh20+ph˜ph2a), (3.9)

    and

    osc2Th(yhyd,Th˜Th)2osc2˜Th(˜yhyd,Th˜Th)2Ch40yh˜yh2a, (3.10)

    where λ=12b2, h0=maxTTh0hT and Rh denotes the set of elements which are refined from Th to ˜Th.

    Proof. We only give the proof of (3.7)-(3.8) and (3.10) owing to the proof of (3.9) can be prove with (3.8) similarly. From (3.1) of Lemma 3.2, then combining with the Young's inequality yield

    η21,˜Th(˜ph,˜Th)Cη21,˜Th(ph,˜Th)+Ch2T||ph˜ph||2a,T+Cδ1η21,˜Th(ph,˜Th)+Cδ11h2T||p˜ph||2a,T. (3.11)

    Note for TRhTh that T will be bisected at least one time, it holds that

    TTη1,˜Th(ph,T)221/2η1,Th(p,T)2.

    For TThRh, we obtain

    η1,˜Th(p,T)2=η1,Th(p,T)2.

    Thus we can conclude that

    η1,˜Th(ph,˜Th)2=η1,˜Th(ph,Rh)+η1,˜Th(ph,ThRh)η1,Th(ph,Th)2(121/2)η1,Th(ph,Rh). (3.12)

    Hence, (3.7) follows from (3.11)-(3.12).

    From Lemma 3.2 and the Young's inequality with parameter δ, we deduce that

    η22,~Th(˜uh,˜yh,~Th)η22,~Th(uh,yh,~Th)+Ch2T(||˜uhuh||20+||˜phph||20)+Cδη22,~Th(uh,yh,~Th)+Cδ1h2T(||˜uhuh||20+||˜phph||20)+||˜yhyh||2a+Cδ1||˜yhyh||2a. (3.13)

    Given a marked element TMh, let ˜TT={T˜Th:TT}. For yhVhV˜h, we derive the jump [yh]=0 on the interior sides of ˜TT. Let b be the number of bisections, it implies hT=|T|2(2b|T|)1/22b/2hT, and then it holds that

    Th˜TTη22,˜Th(uh,yh,T)2b/2η22,Th(uh,yh,T).

    When TThMh, we check that

    η22,˜Th(uh,yh,T)η22,Th(uh,yh,T).

    Thus we conclude that

    η22,˜Th(uh,yh,˜Th)=η22,Th(uh,yh,Mh)+η22,˜Th(uh,yh,ThMh)η22,Th(uh,yh,Th)(12b/2)η22,Th(uh,yh,Mh). (3.14)

    And hence, (3.8) follows from (3.13)-(3.14).

    It is obviously that for TTh˜Th, such that

    oscTh(yhyd,T)=osc˜Th(yhyd,T). (3.15)

    From (3.4) of Lemma 3.2 and the Young's inequality, such that

    osc˜Th(˜yhyd,T)2osc˜Th(yhyd,T)+2h4T||˜yhyh||2a,T. (3.16)

    Brings (3.15) into (3.16), we conclude that (3.10) follows from summing over TTh˜Th for (3.16).

    One difficulty in the proof of the convergence is lack of the orthogonality, thus we have to prove the quasi-orthogonality. We introduce the following fundamental relationships for Thi, where Thi+1T, ThiThi+1, such that

    uuhi+120=uuhi20uhiuhi+1202(uuhi+1,uhi+1uhi), (3.17)
    yyhi+12a=yyhi2ayhiyhi+12a2a(yyhi+1,yhi+1yhi), (3.18)
    pphi+121=pphi2aphiphi+12a2a(pphi+1,phi+1phi), (3.19)

    where (y,u,p) are the solution of (2.6)-(2.8), (yhi,uhi,phi) and (yhi+1,uhi+1,phi+1) are the solution of (2.11)-(2.13) with respect to Thi and Thi+1, respectively.

    Lemma 3.4. For Thi,Thi+1T and ThiThi+1, such that

     (1δ)uuhi+120uuhi20+uhiuhi+120Cδ1(η21,Thi(phi,Rh)+J2(h0)(η22,Thi(uhi,yhi,Rh)+η23,Thi(yhi,phi,Rh))), (3.20)

    and

     (1δ)yyhi+12ayyhi2ayhiyhi+12aδuuhi+120δpphi+120Cδ1(η21,Thi(phi,Rh)+J2(h0)(η22,Thi(uhi,yhi,Rh)+η23,Thi(yhi,phi,Rh))), (3.21)

    and

     (1δ)pphi+12apphi2a+phiphi+12aδuuhi+120δyyhi+120Cδ1(η21,Thi(phi,Rh)+J2(h0)(η22,Thi(uhi,yhi,Rh)+η23,Thi(yhi,phi,Rh))), (3.22)

    we follow the idea of [28] to introduce the quantity J2(h) that

    J2(h)=supfL2(Ω),||f||0=1infvhVh||Sfvh||a.

    Proof. For Thi+1 and from (2.8), we obtain

    (αuhi+1yhi+1phi+1,vhuhi+1)0, vhUhi+1ad. (3.23)

    For UhiadUhi+1ad and from (3.23), such that

    α||uhi+1uhi||20=(αuhi+1,uhi+1uhi)(αuhi,uhi+1uhi)(yhi+1phi+1,uhi+1uhi)(αuhi,uhi+1uhi)=(yhi+1phi+1yhiphi,uhi+1uhi)+(αuhiyhiphi,uhiuhi+1)=(yhi+1(phi+1phi),uhi+1uhi)+(phi(yhi+1yhi),uhi+1uhi)+(αuhiyhiphi,uhiuhi+1)C(phi+1phi,uhi+1uhi)+C(yhi+1yhi,uhi+1uhi)+(αuhiyhiphi,uhiuhi+1). (3.24)

    Next, we divide (3.24) into three parts to prove. Firstly, Let πhi be the L2projection onto Uhiad, then for TThiThi+1, we obtain

    (πhiuhi+1uhi+1)|T=0.

    Thus we infer that

    (αuhiyhiphi,uhiuhi+1)(αuhiyhiphi,πhiuhi+1uhi+1)=(phiπhiphi,πhi(uhi+1uhi)(uhi+1uhi))Cη1,Thi(phi,Rh)||uhiuhi+1||0, (3.25)

    we denote Rh be the set of elements that are refined from Thi to Thi+1. It follows from the definition of Sh, there exists

    (phi+1phi,uhi+1uhi)=(Shi+1(Shi+1uhi+1yd)Shi(Shiuhiyd),uhi+1uhi)=(Shi+1(Shi+1uhi+1yd)Shi+1(Shi+1uhiyd),uhi+1uhi)+(Shi+1(Shi+1uhiyd)Shi(Shiuhiyd),uhi+1uhi)=(Shi+1(uhi+1uhi),Shi+1(uhi+1uhi))+(Shi+1(Shi+1uhiyd)Shi(Shiuhiyd),uhi+1uhi)(Shi+1(Shi+1uhiyd)Shi(Shiuhiyd),uhi+1uhi)+||yhi+1yhi||20. (3.26)

    Let ϕH10(Ω) be the solution of the following problem

    a(ϕh,q)=(Shi+1(Shi+1uhiyd)Shi(Shiuhiyd),q), qV.

    Thus we obtain that

    ||Shi+1(Shi+1uhiyd)Shi(Shiuhiyd)||20=a(ϕhϕhi,Shi+1(Shi+1uhiyd)Shi(Shiuhiyd))+(Shi+1uhiShiuhi,ϕhiϕh)+(Shi+1uhiShiuhi,ϕh), (3.27)

    we denote ϕhi the standard finite element estimate of ϕh with respect to Vhi. It follows from Proposition 2.1 of [18] that for VhiVhi+1

    a(ϕhϕhi,Shi+1(Shi+1uhiyd)Shi(Shiuhiyd))a(ϕhϕhi,Shi+1(Shi+1uhiyd)Shi+1(Shiuhiyd))+a(ϕhϕhi,Shi+1(Shiuhiyd)Shi(Shiuhiyd))CJ(h0)||Shi+1(Shi+1uhiyd)Shi(Shiuhiyd)||0(||Shi+1uhiShiuhi||a+||Shi+1(Shiuhiyd)Shi(Shiuhiyd)||a), (3.28)

    and

    (Shi+1uhiShiuhi,ϕhiϕh)CJ(h0)||Shi+1uhiShiuhi||0||Shi+1(Shi+1uhiyd)Shi(Shiuhiyd)||0, (3.29)

    and

    (Shi+1uhiShiuhi,ϕh)C||Shi+1uhiShiuhi||0||Shi+1(Shi+1uhiyd)Shi(Shiuhiyd)||0. (3.30)

    From Lemma 3.6 of [12], we obtain

    ||Shi+1uhiShiuhi||aCη2,Thi(uhi,yhi,Rh), (3.31)
    ||Shi+1(Shiuhiyd)Shi(Shiuhiyd)||aCη3,Thi(yhi,phi,Rh). (3.32)

    Then let φhH10(Ω) be the solution of the following problem

    a(q,φh)=(Shi+1uhiShiuhi,q), qV.

    With the help of the standard duality argument, we infer that

    ||Shi+1uhiShiuhi||20=a(Shi+1uhiShiuhi,φhφhi)CJ(h0)||Shi+1uhiShiuhi||a||Shi+1uhiShiuhi||0, (3.33)

    where φhi is the standard finite element estimate of φh with respect to VThi. Then combining with (3.26)-(3.33), there holds that

    (phi+1phi,uhi+1uhi)||yhi+1yhi||20+CJ(h0)(η2,Thi(uhi,yhi,Rh)+η3,Thi(yhi,phi,Rh))||uhiuhi+1||0. (3.34)

    One easily sees that

    ||yhi+1yhi||0=||Shi+1uhi+1Shiuhi||0||Shi+1uhi+1Shi+1uhi||0+||Shi+1uhiShiuhi||0C||uhiuhi+1||0+CJ(h0)(η2,Thi(uhi,yhi,Rh)+η3,Thi(yhi,phi,Rh)). (3.35)

    Hence, (3.20) follows from (3.17), (3.24), and (3.34)-(3.35).

    In order to prove (3.21), we have to provide the following

    2a(yyhi+1,yhi+1yhi)=2(upuhi+1phi+1,yhi+1yhi)=2(u(pphi+1),yhiyhi+1)+2(phi+1(uuhi+1),yhiyhi+1)δ||uuhi+1||20+δ||pphi+1||20+δ1||yhi+1yhi||20. (3.36)

    Thus we conclude that (3.21) follows from (3.18), (3.25), and (3.35)-(3.36).

    Now we will give the proof of (3.22).

    ||phi+1phi||0=||Shi+1(Shi+1uhi+1yd)Shi(Shiuhi+1yd)||0||Shi+1(Shi+1uhi+1yd)Shi+1(Shi+1uhiyd)||0+||Shi+1(Shi+1uhiyd)Shi(Shiuhi+1yd)||0C||uhiuhi+1||0+CJ(h0)(η2,Thi(uhi,yhi,Rh)+η3,Thi(yhi,phi,Rh)). (3.37)

    It is similar to (3.36) yields

    2a(pphi+1,phi+1phi)=2((yup)(yhi+1uhi+1phi+1),phi+1phi)=2(u(pphi+1),phi+1phi)+2(phi+1(uuhi+1),phi+1phi)+(yyhi+1,phi+1phi)δ||uuhi+1||20+δ||pphi+1||20+δ1||phi+1phi||20+δ||yyhi+1||20. (3.38)

    Hence, (3.22) follows from (3.19), (3.25), and (3.37)-(3.38).

    For ThiT, we denote Uhiad, Vhi and the solution (yhi,uhi,phi) of (2.11)-(2.13) with respect to Thi. Then we define the following notations that

    e2hi=uuhi20+yyhi2a+pphi2a,E2hi=uhiuhi+120+yhiyhi+12a+phiphi+12a,˜η2Thi(ω)=η22,Thi(uhi,yhi,ω)+η23,Thi(yhi,phi,ω),

    We now proceed to prove the contraction of the Algorithm 2.1.

    Theorem 3.1. Let (y,u,p)H10(Ω)×Uad×H10(Ω) be the solution of (2.5)-(2.7) and (yh,uh,ph)Vh×Uhad×Vh be the solution of (2.10)-(2.12) generated by the adaptive finite element algorithm 1. There exist γ1>0, γ2>0 and α(0,1] depending only on the shape of regularity of initial Th0, b, Ω and the marking parameter θ(0,1] such that

    e2hi+1+γ1η21,Thi+1(phi+1,Thi+1)+γ2˜η2Thi+1(Thi+1)α(e2hi+γ1η21,Thi(phi,Thi)+γ2˜η2Thi(Thi)), (3.39)

    where h0 sufficiently small, and h0< 1.

    Proof. By Theorem 2.1, Lemma 3.3 and Lemma 3.3 yields

    e2hiCη2Thi(Thi), (3.40)
    ˜η2Thi+1(Thi+1)(1+δ){˜η2Thi(Thi)λ˜η2Thi(Mhi)}+C(2+δ1)E2hi, (3.41)
    η21,Thi+1(phi+1,Thi+1)(1+δ1){η21,Thi(phi,Thi)(121/2)η21,Thi(phi,Rhi)}+C(1+δ11)h20||phiphi+1||20, (3.42)
    (12δ)e2hi+1e2hiE2hi+Cδ1(η21,Thi(phi,Rhi)+J2(h0)˜η2Thi(Thi)), (3.43)

    where Rhi denote the set of elements which are refined from Thi to Thi+1. Combining with the upper bound of Theorem 2.1, then choosing parameters ˜γ1,˜γ2, we obtain

    (12δ)e2hi+1+˜γ1η21,Thi+1(phi+1,Thi+1)+˜γ2˜η2Thi+1(Thi+1)e2hi+˜γ2(1+δ){˜η2Thi(Thi)λ˜η2Thi(Mhi)}+˜γ2(2+δ1)E2hiE2hi(˜γ1(1+δ1)(121/2)Cδ1)˜η2Thi(Rhi)+˜γ1(1+δ1)η21,Thi(phi,Thi)+Cδ1J2(h0)˜η2Thi(Thi)+˜γ1(1+δ1)h20(η21,Thi+1(phi+1,Thi+1)+˜η2Thi+1(Thi+1)+η21,Thi(phi,Thi)+˜η2Thi(Thi)).

    We choose

    ˜γ1(1+δ1)(121/2)Cδ1>0,˜γ2C(2+δ1)=1. (3.44)

    Then we find that

    (12δ)e2hi+1+˜γ1(1C(1+δ1)h20)η21,Thi+1(phi+1,Thi+1)+(˜γ2C(1+δ1)h20)˜η2Thi+1(Thi+1)e2hi+˜γ1((1+δ1)+C(1+δ1)h20)η21,Thi(phi+1,Thi)c(η21,Thi(phi,Mhi)+η2Thi(Mhi))+(˜γ2(1+δ)+˜γ1C(1+δ11)h20+Cδ1J2(h0))˜η2Thi(Thi),

    where c=min{˜γ1(1+δ1)(121/2)Cδ1,˜γ2C(2+δ1)}. Then choosing β(0,1) and using the marking strategy of Algorithm 2.1 and Theorem 2.1, we derive that

    (12δ)e2hi+1+˜γ1(1C(1+δ1)h20)η21,Thi+1(phi+1,Thi+1)+(˜γ2C(1+δ1)h20)˜η2Thi+1(Thi+1)(1Cθβ)e2hi+(˜γ1((1+δ1)+C(1+δ1)h20)cθ(1β))η21,Thi(phi,Thi)+(˜γ2(1+δ)+˜γ1C(1+δ11)h20+Cδ1J2(h0)cθ(1β))˜η2Thi(Thi).

    One easily sees that

    e2hi+1+γ1η21,Thi+1(phi+1,Thi+1)+γ2˜η2Thi+1(Thi+1)α1e2hi+α2γ1η21,Thi(phi,Thi)+α3γ2˜η2Thi(Thi),

    where

    γ1=˜γ1(1C(1+δ1)h20)12δ, (3.45)
    γ2=˜γ2C(1+δ1)h2012δ, (3.46)
    α1=1Cθβ12δ, (3.47)
    α2=˜γ1((1+δ1)+C(1+δ1)h20)cθ(1β)˜γ1(1C(1+δ1)h20), (3.48)
    α3=˜γ2(1+δ)+˜γ1C(1+δ11)h20+Cδ1J2(h0)cθ(1β)˜γ2C(1+δ1)h20. (3.49)

    By choosing sufficiently small δ<1 and β such that α1(0,1). As long as (3.48)-(3.49) are slightly deformed, and combining with (3.44), when h01 and δ1,δ small enough, we can conclude that

    α2(0,1)andα3(0,1).

    Choosing α=max{α1,α2,α3}, and then (3.39) complete the rest of the proof.

    In this section, we consider the quasi-optimality for the adaptive finite element method. Firstly we introduce some interpretation of the notations. For Th,Th1,Th2T, with the assumption that #Th be the number of elements in Th, we also set Th1Th2 be the smallest common conforming refinement of Th1 and Th2 [12,26], such that

    #(Th1Th2)#Th1+#Th2#Th0. (4.1)

    According to [20], a function approximation class is defined by

    As:={(y,u,p,yd,f)H10(Ω)×L2(Ω)×H10(Ω)×L2(Ω)×L2(Ω):|(y,u,p,yd,f)|s<+},

    where

    |(y,u,p,yd,f)|s:=supN>0NsinfThTNinf(yh,uh,ph)Vh×Uhad×Vh{uuh20+yyh2a+pph2a+osc2Th(f,Th))+osc2Th(yhyd,Th)}12,

    and

    TN0:={ThT:#Th#Th0N0}.

    A localized upper bound will be given in the following lemmas, which play an important role in proving the qusi-optimality for an adaptive finite element method.

    Lemma 4.1. For Th,˜ThT, and Th˜Th, let Rh be the set of refined elements from Th to ˜Th. Let (yh,uh,ph) and (˜yh,˜uh,˜ph) be the solutions of (2.11)(2.13) with respect to Th and ˜Th, respectively. Then there exists a constant C, depending on the shape regularity of initial grids Th0 and b such that

    uh˜uh20+yh˜yh2a+ph˜ph2aCη2Th(Rh), (4.2)

    where

    η2Th(Rh)=η21,Th(ph,Rh)+η22,Th(uh,yh,Rh)+η23,Th(yh,ph,Rh).

    Proof. From (3.26)-(3.27) and (3.34)-(3.35), we obtain

    ||uh˜uh||20Cη2Th(Rh). (4.3)

    By using the results of Lemma 3.6 (see [12]), we derive that

    ||yh˜yh||a=||ShuhS˜h˜uh||a||ShuhS˜hS˜huhS˜h||a+||S˜huhS˜hS˜h˜uh||aCη2,Th(uh,yh,Rh)+||uh˜uh||0. (4.4)

    Then, it holds that

    ||ph˜ph||a=||Sh(Shuhyd)S˜h(S˜h˜uhyd)||a||Sh(Shuhyd)S˜h(Shuhyd)||a+||S˜h(Shuhyd)S˜h(S˜h˜uhyd)||aCη3,Th(yh,ph,Rh)+||yh˜yh||a. (4.5)

    And thus we conclude that (4.2) follows from (4.3)-(4.5).

    In the following lemma, we list the error indicators on the coarse grids that must satisfy a Dörfler property on the refinement one.

    Lemma 4.2. Assume that the marking parameter θ(0,θ), where

    θ=C2C(1+h40)+1.

    For Th,˜ThT and Th˜Th, let (yh,uh,ph) and (˜yh,˜uh,˜ph) be the solutions of (2.11)-(2.13) with respect to Th and ˜Th, respectively. If

    e2˜Th+osc2˜Th(˜Th)μ[e2Th+osc2Th(Th)], (4.6)

    is satisfied for μ:=12(1θθ). Then, the set Rh of elements which are refined from Th to ˜Th satisfies the Döfler property

    η2Th(Rh)θη2Th(Th),

    where

    e2Th=uuh20+yyh2a+pph2a,osc2Th(ω)=osc2Th(f,ω)+osc2Th(yhyd,ω),

    for ωTh and e2˜Th, osc2˜Th(˜Th) similarly to define.

    Proof. From (4.6) and Theorem 2.1, we derive

    (12μ)Cη2Th(Th)(12μ)(e2Th+osc2Th(Th)e2Th2e2˜Th+osc2Th(Th)2osc2˜Th(˜Th).

    Applications of the triangle inequality yields

    ||uuh||202||u˜uh||20+2||uh˜uh||20,||yyh||2a2||y˜yh||2a+2||yh˜yh||2a,||pph||2a2||p˜ph||2a+2||ph˜ph||2a.

    Thus, from (4.2), we obtain

    e2Th2e2˜Th2Cη2Th(Rh). (4.7)

    From (3.10), for TTh˜Th, it is easy to see that

    osc2Th(yhyd,Th˜Th)2osc2˜Th(˜yhyd,Th˜Th)2Ch40η2Th(Rh). (4.8)

    According to Remark 2.1 of [12], there holds for TRh

    oscTh(T)ηTh(T). (4.9)

    By adopting (4.7)-(4.9), there holds

    (12μ)Cη2Th(Th)(2C(1+h20)+1)η2Th(Rh),(12μ)θη2Th(Th)η2Th(Rh),θη2Th(Th)η2Th(Rh),

    where

    θ=C(2C(1+h20)+1)andθ=(12μ)θ.

    The following Lemma is essential to prove the quasi-optimality.

    Lemma 4.3. Let (y,u,p) and (Thi,Uhiad,Vhi,yhi,uhi,phi) be the solution of (2.6)-(2.8) and the sequence of grids, finite element spaces and discrete solutions produced by Algorithm 2.1, respectively. Assume that the marking parameter θ satisfies the condition in Lemma 4.2, then the following estimate is valid

    #MhiC(C12s1|(uh,y,p,yd,f)|1ssμ12s(e2hi+osc2Thi(Thi))12s), (4.10)

    if (u,y,p,yd,f)As.

    Proof. Let ϵ:=μC11(η2Thi+osc2Thi(Thi)), where C1 shall be produced in (4.13). Due to (uh,y,p,yd,f)As, then there exists a ThϵT and (yϵ,uϵ,pϵ)Uhϵad×Vhϵ×Vhϵ such that

    #Thϵ#Th0C|(u,y,p,yd,f)|1/ssϵ1/s, (4.11)
    ||uuϵ||20+||yyϵ||2a+||ppϵ||2a+osc2Thϵ(f,Thϵ)+osc2Thϵ(yϵyd,Thϵ)ϵ2. (4.12)

    Let (y,u,p) be the solution of (2.6)-(2.8) with respect Th=ThϵThi which is the smallest common refinement of Thϵ and Thi. Next, we first prove the following estimate

    e2Th+osc2ThC1(e2Thϵ+osc2Thϵ), (4.13)

    where

    e2Thϵ=||uuϵ||20+||yyϵ||2a+||ppϵ||2a,osc2Thϵ(Thϵ)=osc2Thϵ(f,Thϵ)+osc2Thϵ(yϵyd,Thϵ).

    Combining with the Young's inequality yield

    (uu,uuϵ)=(uuϵ,uuϵ)(uuϵ,uuϵ)(uuϵ,uuϵ)||uuϵ||20+||uuϵ||20. (4.14)

    Note that there holds the similar result to a(yy,yyϵ) and a(pp,ppϵ). Thus combining with (3.17)-(3.19) and (4.14), we find that

    6(||uuϵ||20+||yyϵ||2a+||ppϵ||2a)||uu||20+||uuϵ||20+||yy||2a+||yyϵ||2a+||pp||2a+||ppϵ||2a. (4.15)

    From (3.10) with Th=˜Th=Th and yh=y, ˜yh=yϵ, such that

    osc2Th(yyd,Th)2osc2Thϵ(yϵyd,Thϵ)Ch40||yyϵ||2a. (4.16)

    For TThϵ, let TT:={TTh:TT}. There holds

    TTT||ffT||20,T=TTT(Tf2(Tf)2|T|)=Tf2TTT(Tf)2|T|Tf2TTT(Tf)2|T|CTf2(Tf)2|T|=C||ffT||20,T.

    Hence,

    oscTh+2(f,Th)CoscThϵ+2(f,Thϵ). (4.17)

    And thus we conclude that (4.13) follows from (4.15)-(4.17). Then combining with (4.12), we find that

    e2Th+osc2Th(Th)C1(e2Thϵ+osc2Thϵ(Thϵ))C1ϵ2=μ(e2Thi+osc2Thi(Thi)) (4.18)

    Note that Rh, which is the set of refinement elements from Thi to Th, it satisfies the marking property of Lemma 4.2 that

    #Mhi#Rh#Th#Thi#ThϵTh0. (4.19)

    Thus we find that (4.10) follows from (4.11) and (4.19).

    The following theorem describes the quasi-optimality of an adaptive finite element method.

    Theorem 4.1. Let (y,u,p) and (Thi,Uhiad,Vhi,yhi,uhi,phi) be the solution of (2.6)-(2.8). Where the sequence of grids, finite element spaces, and discrete solutions produced by Algorithm 2.1, respectively. Assume that Th0 satisfies the condition (b) of Section 4 (see [3]). Let (u,y,p,yd,f)As, then there holds

    #Thi#Th0C|(u,y,p,yd,f)|1ss(e2hi+osc2Thi(Thi))12s, (4.20)

    provided h01.

    Proof. According to Lemma 2.3 of [12], we derive that

    #Thi#Th0i1j=0Mhj. (4.21)

    Combining (4.21) with Lemma 4.3 leads to

    #Thi#Th0i1j=0MhjCC0i1j=0(e2hi+osc2Thi(Thi))12s, (4.22)

    where

    C0=C12s1|(u,y,p,yd,f)|1/ssμ12s.

    An application of Theorem 2.1, such that

    e2hi+γ1η21,Thi(phi,Thi)+γ2˜η2Thi(Thi)e2hi+osc2Thi(Thi). (4.23)

    Thus we conclude that (4.20) follows from (4.22)-(4.23).

    In this section, we are going to use the following iterations to solve the bilinear optimal control problem numerically.

    Algorithm 5.1. It follows from linear optimal control problems, there exist similar algorithms in the reference [20], we define the following Algorithm 5.1 based on the bilinear optimal control problem:

    a(ykh,wh)+(uk1hykh,wh)=(f,wh), whVh, (5.1)
    a(qh,pkh)+(uk1hpkh,qh)=(ykhyd,qh), qhVh, (5.2)
    (αukh+ykhpkh,vhukh)0, vkhUhad. (5.3)

    Given an initial control u0hUhad, then substitute it into (5.1) yields y1h. And then substitute y1h into (5.2) yields p1h. At last, substitute p1h into (5.3) yields u1h. Repeat the above step, we can solve (ykh,pkh,ukh), for k=1,2,, it holds that

    ukh=1α(Phpkh+max(0,ˉpkh)),

    where Ph is the L2-projection from L2(Ω) to Uh and ˉpkh=Ωpkh|Ω|.

    For ease of analysis the solution of bilinear optimal control problem we constructing examples as follows.

    Example 1. The first example is a bilinear optimal control problem, we set state equation on Ω=(0,1)2, and α=0.5. We take the exact solution as

    p=sinπx1sinπx2,u0=1αmax{ˉp,0}p,yd=Δp+upy,f=Δy+uy,y=2π2p.

    In Example 1, the state and co-state are approximated by the piecewise linear elements, while piecewise constant elements are used to approximate the control. We compute Example 1 on an adaptive grid and a uniform grid respectively.

    In Figure 1, we plot the profiles of the exact state and the numerical state on adaptive refinement grid with θ=0.3 and 21 adaptive loops. We plot the profiles of the co-exact state variables and the co-numerical state variables on adaptive refinement grid with θ=0.3 and 21 adaptive loops in Figure 2 as well. Connecting with Figure 1 and Figure 2, we find that although the solution of Example 1 is smooth, larger gradients can be observed in certain areas. Accordingly, the adaptive finite element method may obtain a much smaller error compared to the uniform refinement.

    Figure 1.  The profiles of the exact state (left) and the numerical state (right) on adaptively refined grid with θ=0.3 and 21 adaptive loops for Example 1 generated by Algorithm 2.1.
    Figure 2.  The profiles of the exact co-state variables (left) and the numerical co-state variables (right) on adaptively refined grid with θ=0.3 and 21 adaptive loops for Example 1 generated by Algorithm 2.1.

    We show the comparisons of convergence history of the errors on adaptively grids with θ=0.3 and uniformly refined grids with θ=1 in Figure 3. We can observe η2 is approximately parallel to the line slop 1 which is the optimal convergence rate we expected by using linear finite elements. This situation confirms the theoretic results in Section 4. But in this example, the solutions of the optimal control are quite smooth. Here is little difference in the convergence history of the errors compared to uniform refinement.

    Figure 3.  The comparisons of convergence history of the estimates on uniformly refined grids with θ=1 (left) and adaptively refined grids with θ=0.3 (right) for Example 1.

    Example 2. In the second example, we choose α=0.3 with the exact solutions on Ω=(0,1)2

    u=1α(p+max(0,ˉp)),p={sinπx1sinπx2,if s(x1,x2)<0,0,if s(x1,x2)0,yd={100(x11)2+(x21)2,if s(x1,x2)<0,0,if s(x1,x2)0,

    where s(x1,x2)=(x10.2)2+(x20.6)20.04.

    Similarly, the state and co-state are approximated by the piecewise linear elements, while piecewise constant elements are used to approximate the control. We compute Example 2 on an adaptive grid and a uniform grid respectively.

    In Figure 4, we plot the profiles of the numerical state and the numerical co-state on 21 adaptive loops with θ=0.3 for Example 2, both the numerical state and the numerical co-state can be seen in the singularities around the peak. Further, the grids are concentrated at the peak. In Figure 5, we show the convergence history of the estimators. We can see η2 is approximately parallel to slop 1. From the figure, it is easy to see that the second-order convergence for the reduction of error estimators for the adaptively refined grids. In Example ref{exm2}, the optimal control is not smooth so that there is much difference in using either the uniform or adaptive grids to approximate the control.

    Figure 4.  The profiles of the numerical state (left) and the numerical co-state (right) on adaptively refined grids with θ=0.3 for Example 2.
    Figure 5.  The comparisons of convergence history of the estimates on uniformly refined grids with θ=1 (left) and adaptively refined grids with θ=0.3 (right) for Example 2.

    This work is supported by National Science Foundation of China (11201510), National Social Science Fund of China (19BGL190), China Postdoctoral Science Foundation (2017T100155, 2015M580197), Innovation Team Building at Institutions of Higher Education in Chongqing (CXTDX201601035), Chongqing Research Program of Basic Research and Frontier Technology (cstc2019jcyj-msxmX0280) and Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJZD-K202001201), Youth Innovative Talents Project (Natural Science) of research on humanities and social sciences in Guangdong normal university (2017KQNCX265), Hunan Provincial Education Department of China (18C0196).

    The authors declare no conflict of interest.



    [1] M. Ainsworth, J. T. Oden, A posteriori error estimators in finite element analysis, Comput. Methods Appl. Mech. Engrg., 142 (1997), 1–88. doi: 10.1016/S0045-7825(96)01107-3
    [2] I. Babuška, W. C. Rheinboldt, Error estimates for adaptive finite computations, SIAM J. Numer. Anal., 15 (1978), 736–754. doi: 10.1137/0715049
    [3] P. Binev, W. Dahmen, R. Devore, Adaptive finite element approximation for distributed elliptic optimal control problems, SIAM J. Control Optim., 97 (2004), 219–268.
    [4] L. Zhang, Z. Zhou, Spectral galerkin approximation of optimal control problem governed by riesz fractional differential equation, Appl. numer. math., 143 (2019), 247–262. doi: 10.1016/j.apnum.2019.04.003
    [5] F. Wang, Z. Zhang, Z. Zhou, A spectral galerkin approximation of optimal control problem governed by fractional advection diffusion reaction equations, J. Comput. Appl. Math., 386 (2021), 113–129.
    [6] N. Du, H. Wang, W. B. Liu, A fast gradient projection method for a constrained fractional optimal control, J. Sci. Comput., 68 (2016), 1–20. doi: 10.1007/s10915-015-0125-1
    [7] P. G. Ciarlet, The Finite Element Method for Elliptic Problems, Amsterdam: North-Holland, 1978.
    [8] Z. Chen, J. Feng, An adaptive finite element algorithm with reliable and efficient error for linear parabolic problems, Math. Comput., 73 (2004), 1167–1193. doi: 10.1090/S0025-5718-04-01634-5
    [9] Y. Chen, Z. Lu, High Efficient and Accuracy Numerical Methods for Opyimal Control Problems, Science Press, Beijing, 2015.
    [10] Y. Chen, Z. Lu, Y. Huang, Superconvergence of triangular Raviart-Thomas mixed finite element methods for a bilinear constrained optimal control problem, Comput. Math. Appl., 66 (2013), 1498–1513. doi: 10.1016/j.camwa.2013.08.019
    [11] Y. Chen, Z. Lu, L. Liu, Numerical Methods for Partial Differential Equations, Science Press, Beijing, 2015.
    [12] J. M. Cascon, C. Kreuzer, R. H. Nochetto, K. G. Siebert, Qusi-optimal convergence rate for an adaptive finite element method, SIAM J. Numer. Anal., 46 (2008), 2524–2550. doi: 10.1137/07069047X
    [13] W. Dörfler, A convergent adaptive algorithm for Poisson equation, SIAM J. Numer. Anal., 33 (1996), 1106–1124. doi: 10.1137/0733054
    [14] A. Demlow, R. Stevenson, Convergence and quasi-optimality of an adaptive finite element method for controlling L2 errors, Numer. Math., 117 (2011), 185–218. doi: 10.1007/s00211-010-0349-9
    [15] A. Gaevskaya, R. H. W. Hoppe, Y. Iliash, M. Kieweg, Convergence anlysis of an adaptive finite element for distributed control problems with control constraints, Int. Serises Numer. Math., 155 (2007), 47–68. doi: 10.1007/978-3-7643-7721-2_3
    [16] L. Ge, W. Liu, D. Yang, Adaptive finite element approximation for a constrained optimal control problem via multi-meshes, J. Sci. Comput., 41 (2009), 238–255. doi: 10.1007/s10915-009-9296-y
    [17] L. Ge, W. Liu, D. Yang, L2 norm equivalent a posteriori error estimate for a constrained optimal control problem, Inter. J. Numer. Anal. Model., 6 (2009), 335–353.
    [18] W. Gong, N. Yan, Adaptive finite element method for elliptic optimal control problems: convergence and optimality, Numer. Math., 135 (2017), 1121–1170. doi: 10.1007/s00211-016-0827-9
    [19] L. He, A. Zhou, Comvergence and optimality of adaptive finite element methods for elliptic partial differential equations, Int. J. Numer. Anal. Model., 8 (2011), 1721–1743.
    [20] H. Leng, Y. Chen, Convergence and quasi-optimality of an adaptive finite element method for optimal control problems with integral control constraint, Adv. Comput. Math., 44 (2018), 1367–1394.
    [21] R. Li, W. Liu, H. Ma, T. Tang, Adaptive finite element methods with convergence rates, Numer. Math., 41 (2002), 1321–1349.
    [22] W. Liu, N. Yan, Adaptive Finite Element Methods for Optimal Control Governed by PDEs, Science Press, Beijing, 2008.
    [23] Z. Lu, S. Zhang, L-error estimates of rectangular mixed finite element methods for bilinear optimal control problem, Appl. Math. Comput., 300 (2017), 79–94.
    [24] P. Morin, R. H. Nochetto, K. G. Siebert, Data oscillation and convergence of adaptive FEM, SIAM J. Numer. Anal., 33 (1996), 1106–1124. doi: 10.1137/0733054
    [25] P. Morin, R. H. Nochetto, K. G. Siebert, Convergence of adaptive finite element methods, SIAM Reviews, 44 (2000), 466–488.
    [26] R. Stevenson, Optimality of a standard adaptive finite element method, Found Comput. Math., 7 (2007), 245–269. doi: 10.1007/s10208-005-0183-0
    [27] R. Verfurth, A Review of A Posteriori Error Estimation and Adaptive Mesh Refinement, Comput. Methods Appl. Mech. Engrg., Wiley-Teubner, London, 1996.
    [28] J. Xu, A. Zhou, Local and parallel finite element algorithms based on two-grid discretizations, Math. Comput., 69 (1996), 881–909.
  • Reader Comments
  • © 2021 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(2867) PDF downloads(93) Cited by(0)

Figures and Tables

Figures(5)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog