
This paper investigates the adaptive finite element method for an optimal control problem governed by a bilinear elliptic equation. We establish the finite element discrete scheme for the bilinear optimal control problem and use a dual argument, linearization method, bubble function, and new bubble function to obtain a posteriori error estimates. To prove the convergence and the quasi-optimality for adaptive finite element methods, we introduce the adaptive finite element algorithm, local perturbation, error reduction, discrete local upper bound, Dörfler property, dual argument method, and quasi orthogonality. A few numerical examples are given at the end of the paper to demonstrate our theoretical analysis.
Citation: Zuliang Lu, Xiankui Wu, Fei Huang, Fei Cai, Chunjuan Hou, Yin Yang. Convergence and quasi-optimality based on an adaptive finite element method for the bilinear optimal control problem[J]. AIMS Mathematics, 2021, 6(9): 9510-9535. doi: 10.3934/math.2021553
[1] | Zuliang Lu, Fei Cai, Ruixiang Xu, Lu Xing . Convergence and proposed optimality of adaptive finite element methods for nonlinear optimal control problems. AIMS Mathematics, 2022, 7(11): 19664-19695. doi: 10.3934/math.20221079 |
[2] | Zuliang Lu, Ruixiang Xu, Chunjuan Hou, Lu Xing . A priori error estimates of finite volume element method for bilinear parabolic optimal control problem. AIMS Mathematics, 2023, 8(8): 19374-19390. doi: 10.3934/math.2023988 |
[3] | Zuliang Lu, Fei Cai, Ruixiang Xu, Chunjuan Hou, Xiankui Wu, Yin Yang . A posteriori error estimates of hp spectral element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(4): 5220-5240. doi: 10.3934/math.2022291 |
[4] | Shengying Mu, Yanhui Zhou . An analysis of the isoparametric bilinear finite volume element method by applying the Simpson rule to quadrilateral meshes. AIMS Mathematics, 2023, 8(10): 22507-22537. doi: 10.3934/math.20231147 |
[5] | Zuliang Lu, Xiankui Wu, Fei Cai, Fei Huang, Shang Liu, Yin Yang . Error estimates in L2 and L∞ norms of finite volume method for the bilinear elliptic optimal control problem. AIMS Mathematics, 2021, 6(8): 8585-8599. doi: 10.3934/math.2021498 |
[6] | Changling Xu, Hongbo Chen . A two-grid P20-P1 mixed finite element scheme for semilinear elliptic optimal control problems. AIMS Mathematics, 2022, 7(4): 6153-6172. doi: 10.3934/math.2022342 |
[7] | Tiantian Zhang, Wenwen Xu, Xindong Li, Yan Wang . Multipoint flux mixed finite element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(9): 17461-17474. doi: 10.3934/math.2022962 |
[8] | Yuelong Tang . Error estimates of mixed finite elements combined with Crank-Nicolson scheme for parabolic control problems. AIMS Mathematics, 2023, 8(5): 12506-12519. doi: 10.3934/math.2023628 |
[9] | Jie Liu, Zhaojie Zhou . Finite element approximation of time fractional optimal control problem with integral state constraint. AIMS Mathematics, 2021, 6(1): 979-997. doi: 10.3934/math.2021059 |
[10] | Cagnur Corekli . The SIPG method of Dirichlet boundary optimal control problems with weakly imposed boundary conditions. AIMS Mathematics, 2022, 7(4): 6711-6742. doi: 10.3934/math.2022375 |
This paper investigates the adaptive finite element method for an optimal control problem governed by a bilinear elliptic equation. We establish the finite element discrete scheme for the bilinear optimal control problem and use a dual argument, linearization method, bubble function, and new bubble function to obtain a posteriori error estimates. To prove the convergence and the quasi-optimality for adaptive finite element methods, we introduce the adaptive finite element algorithm, local perturbation, error reduction, discrete local upper bound, Dörfler property, dual argument method, and quasi orthogonality. A few numerical examples are given at the end of the paper to demonstrate our theoretical analysis.
Bilinear optimal control problems are types of quintessential optimal control problems governed by a partial differential equation, which investigated in material mechanics, engineering mechanics, and engineering design, etc [9,11,22]. Effective numerical methods, such as in [1,12,21] for finite element methods, [4,5] for spectral Galerkin methods, and [6] for fast algorithms, are the key to the successful application of optimal control problems in the practical field. Accordingly, the study of the efficient numerical algorithm for the bilinear optimal control problem has far-reaching theoretical value and application prospects.
The finite element method is frequently applied to solve optimal control problems and uses the thought of mathematical approximation to model the actual physical system. It replaces complex problems with simpler ones by assuming that each element has a suitable, simple approximate solution and then derives a universal satisfaction condition for the solution domain [10,23]. Since the actual problem is replaced by a simpler one, this solution is not precise, but approximate. Given that the finite element method has high computational accuracy and can adapt to a variety of complex shapes, it has become an effective engineering analysis method.
Babuška first proposed the adaptive finite element method [2]. The adaptive finite element method is an accurate and efficient method of finite element discretization, which saves a lot of calculation time based on ensuring the specified accuracy. Appropriate grids can enormously reduce the errors resulting from the discretization of finite element approximation procedure in copying with an optimal control problem. As the case stands, the solutions for the optimal control problem of the nonlinear system are generally not available. In addition to the complexity and diversity of the nonlinear equations, it is practicable to solve the nonlinear equations with the thought of the adaptive finite element method.
Adaptive finite element methods had popularized for many years, but the theoretical analysis of the entire algorithm has just been actualized. The subject work in the early stage was made by Dörfler [13], in which he investigated an adaptive finite element for Poisson's equation by a reduction of energy errors under a mild assumption on the initial grid Th0. Later, Demlow, Morin, et-al. [14,24,25] proved the convergence of adaptive finite element methods without the mild assumption. Then another decisive issue of adaptive finite element methods has been investigated by Binev et al. [3]. In the last decades, much work has been done on the concept of total error and the sum of error energy plus oscillations [8,12,26]. Recently, Liu and Yan [22] found that adaptive finite element methods could be successfully applied in constrained optimal control problems. The later investigations could be found in [15,19]. Inspired by these works, Gong and Yan [18] investigated the convergence and quasi-optimality of an adaptive finite element for control constrained optimal control problems via applying variation control discretization. Later, Leng and Chen [20] studied the convergence of an adaptive finite element method for optimal control problems with integral control constraints.
To proceed, Section 2 next focuses on the bilinear optimal control problem with integral control constraints, which adopt piecewise constant discretization to deal with the control via. For the state and the co-state, by applying continuous piecewise linear discretization, respectively. Section 3 deduce a posteriori error estimate. For the convergence and the quasi-optimality, we prove them relying on quasi-orthogonality and discrete local upper bound. Section 4 based on the mild assumption to the initial grids, obtain the proof of convergence and quasi-optimality employing the solution operator of nonlinear elliptic equations. Section 5 then advances some numerical illustration.
Here are some notations that will be employed in this paper. Let Ω be a bounded Lipschitz domain on R2 and ∂Ω denote the boundary of Ω. We adopt the standard notation Wm,q(ω) for Sobolev space with norm ‖⋅‖m,q,ω and seminorm |⋅|m,q,ω. Where ω⊂Ω, and we will omit the sub description if ω=Ω. We denote Wm,2(Ω) by Hm(Ω) and set H10(Ω)={v∈H1(Ω):v=0 on ∂Ω}. For m=0 and q=2, we denote W0,2(ω)=L2(ω) and set ‖⋅‖0,2,ω=‖⋅‖0,ω. Let Th0 denote the initial partition of ˉΩ into disjoint triangles. By newest-vertex bisections for Th0, we obtain a class T of conforming partitions. For Th,˜Th∈T, we use Th⊂˜Th to indicate that ˜Th is a refinement of Th, where hT=|T|1/2. Denote the L2 inner product by (⋅,⋅), and let C be a constant independent of grids size.
In this paper, we focus our attention on the following bilinear convex optimal control problem:
minu∈Uad{12‖y−yd‖20+α2‖u‖20}, | (2.1) |
−Δy+uy=f,in Ω,y|∂Ω=0, | (2.2) |
where Ω and ΩU are bounded open sets in R2 with a Lipschitz boundary ∂Ω and ∂ΩU. Let f∈L2(Ω), and Uad be a closed convex set defined as follows
Uad={u∈L2(ΩU):∫ΩUu≥0}. |
We take the state space V=H10(Ω), the control space U=L2(ΩU), and H=L2(Ω) to determine the concept of finite element approximation for our discussion with the bilinear optimal control problems (2.1)-(2.2). We first give a weak formula for the state equation
a(y,v)=∫Ω∇y⋅∇vdx,∀ y,v∈V,(f1,f2)=∫Ωf1f2dx,∀ (f1,f2)∈H×H,(y,v)=∫ΩUyvdx,∀ (f1,f2)∈U×U. |
We set a norm ||v||a=√a(v,v), which is equivalent to ||v||1. Exist constants c and C that satisfies
a(v,v)≥c||v||2V,a(y,v)≤C||y||V||v||V,∀ y,v∈V. | (2.3) |
Then the bilinear optimal control problem (2.1)-(2.2) can be restated as
minu∈Uad{12‖y−yd‖20+α2‖u‖20}, | (2.4) |
a(y,v)+(uy,v)=(f,v),∀ v∈V. | (2.5) |
It is well known that the optimal control problem (2.4)-(2.5) has a solution (y,u) which was proved in the Theorem 2.2.4 of [22], and that if a pair (y,u) is the solution of (2.4)-(2.5), then there exist a costate p∈V such that the triplet (y,u,p) satisfies the following optimality conditions:
a(y,v)+(uy,v)=(f,v),∀v∈V, | (2.6) |
a(q,p)+(up,q)=(y−yd,q),∀q∈V, | (2.7) |
(αu−yp,v−u)U≥0,∀v∈Uad⊂U. | (2.8) |
For (2.6), since the coercivity of a(⋅,⋅), and we set a operator S:L2(Ω)→H10(Ω) such that Su=y. For (2.7), let S∗ be the adjoint operator of S, we assume that y−yd is given, such that S∗(y−yd)=p. Let Vh be the continuous piecewise linear finite element space for the partition Th∈T. We define Uh as the piecewise constant finite element space of Th. Set Uhad={vh∈Uh:∫Ωvh≥0}, then we derive the standard finite element discretization for the bilinear optimal control problem
minu∈Uad{12‖yh−yd‖20+α2‖uh‖20}, | (2.9) |
a(yh,v)+(uhyh,v)=(f,v),∀ v∈Vh. | (2.10) |
It is well known that the optimal control problem (2.9)-(2.10) has a solution (yh,uh) and that if a pair (yh,uh)∈Vh×Uhad is the solution of (2.9)-(2.10), then there is a co-state ph∈Vh such that the triplet (yh,uh,ph) satisfies the following optimality conditions
a(yh,v)+(uhyh,v)=(f,v),∀v∈Vh⊂V, | (2.11) |
a(q,ph)+(uhph,q)=(yh−yd,q),∀q∈Vh⊂V, | (2.12) |
(αuh−yhph,vh−uh)U≥0,∀v∈Uhad⊂U. | (2.13) |
Similarly, we set a operator Sh: Uhad→Vh such that Suh=yh. Let S∗h be the adjoint operator of Sh such that S∗h(yh−yd)=ph. Let us now introduce some error indicators which we will frequently use in this paper, namely the error η(⋅) and oscillation osc(⋅). For Th∈T, T∈Th, that are defined by:
η21,Th(ph,T)=h2T‖∇ph‖20,T,η22,Th(uh,yh,T)=h2T‖f−uhyh‖20,T+hT‖[∇yh]⋅n‖20,∂T∖∂Ω,η23,Th(yh,ph,T)=h2T‖yh−yd−uhph‖20,T+hT‖[∇ph]⋅n‖20,∂T∖∂Ω,osc2Th(f,T)=h2T‖f−fT‖20,T,osc2Th(yh−yd,T)=h2T‖(yh−yd)−(yh−yd)T‖20,T, |
where uh∈Uhad and yh,ph∈Vh, fT is L2-projection of f onto piecewise constant space on T and fT=∫Tf|T|. For ω⊂Th, such that
η21,Th(ph,ω)=∑T∈ωη21,Th(ph,T),osc2Th(f,ω)=∑T∈ωosc2Th(f,T), |
Similarly, η22,Th(uh,yh,ω), η23,Th(uh,yh,ω) and osc2Th(yh−yd,ω) can be denoted as well.
Before we discuss residual-based posteriori error estimates for the bilinear optimal control problem, we introduce the well-known result for the integral averaging operator [12].
Lemma 2.1. Let πah:L1(ΩU)→Uh⊂U be the integral averaging operator such that
(πahv)|T:=1|T|∫Tv,∀ T∈Th. |
Then for m=0,1 and 1≤q≤∞, here
||v−πahv||0,q,T≤ChmT|v|m,q,T,∀ v∈Wm,q(ΩU). | (2.14) |
We next discuss the residual-based posteriori error estimates for the bilinear optimal control problem. Moreover, we introduce the following auxiliary problems
a(yh,v)+(uhyh,v)=(f,v),∀ v∈V, | (2.15) |
a(q,ph)+(yhph,q)=(yh−yd,q),∀ v∈V. | (2.16) |
Here we state the theorem about a posteriori error estimates.
Theorem 2.1. Let (y,u,p)∈H10(Ω)×Uad×H10(Ω) be the solution of problems (2.6)-(2.8), and (yh,uh,ph)∈Vh×Uhad×Vh be the solution of problems (2.11)-(2.13). Then exist constants c and C holds
||u−uh||20+||y−yh||2a+||p−ph||2a≤c(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th)), | (2.17) |
and
C(η21,Th(ph,Th)+η22,Th(uh,yh,Th)+η23,Th(yh,ph,Th))≤||u−uh||20+||y−yh||2a+||p−ph||2a+osc2Th(f,Th)+osc2Th(yh−yd,Th). | (2.18) |
Proof. By applying Lemma 2.1, we infer that
∑T∈Th||ph−πhph||20,T≤Cη21,Th(ph,T), |
where πh is the L2−projection operator onto piecewise constant space on Th. Therefore, it follows from Theorem 3.1 of [16], we derive that
||u−uh||20,T≤C∑T∈Th||ph−πhph||20,T+Cη22,Th(uh,yh,T)+Cη23,Th(yh,ph,T)≤C(η21,Th(ph,T)+η22,Th(uh,yh,T)+η23,Th(yh,ph,T)). | (2.19) |
Assume that ep=ph−ph, and epI=ˆπhep, where ˆπh is the average interpolation operator defined in Lemma 3.2 of [17], then we obtain
c||ph−ph||2a≤(∇ep,ph−ph)+(uh(ph−ph),ep)=(∇(ep−epI),∇(ph−ph))+(uh(ph−ph),ep−epI)+(epI,ph−ph)+(uh(ph−ph),epI)=∑T∈Th∫T(yh−yd−uhph)(ep−epI)−∑∂T∖∂Ω∫∂T([∇ph]⋅n)(ep−epI)−(yh−yh,ep)≤C(σ)∑T∈Thh2T∫T(yh−yd−uhph)2+C(σ)∑∂T∖∂ΩhT∫∂T([∇ph]⋅n)2+Cσ∑T∈Thh2T∫T|ep−epI|2+Cσ∑∂T∖∂ΩhT|ep−epI|2+C||yh−yh||0||ep||a≤C(σ)∑T∈Thh2T∫T(yh−yd−uhph)2+C(σ)∑∂T∖∂ΩhT∫∂T([∇ph]⋅n)2+C(σ)||yh−yh||20+Cσ||ep||2a, |
where ||v||0,4≤C||v||1 is defined in the embedding theorem of [7], and the property ||v||1≤C. Then, let σ=c2C, we obtain
||ph−ph||2a≤C∑T∈Thh2T∫T(yh−yd−uhph)2+C∑∂T∖∂ΩhT∫∂T([∇ph]⋅n)2+||yh−yh||20. | (2.20) |
Similarly, let ey=yh−yh, and eyI be the average interpolation of ey. It follows from (2.11) and (2.15) that
c||yh−yh||2a≤(∇(yh−yh),∇ey)+(uh(yh−yh),ey)=(∇(yh−yh),∇(ey−eyI))+(uh(yh−yh),ey−eyI)=∑T∈Th∫T(f−uhyh)(ey−eyI)−∑∂T∖∂Ω∫∂T([∇ph]⋅n)(ey−eyI)≤C(σ)∑T∈Thh2T∫T(f−uhyh)2+C(σ)∑∂T∖∂ΩhT∫∂T([∇yh]⋅n)2+Cσ||ey||2a. |
Thus, it holds that
||yh−yh||2a≤C∑T∈Thh2T∫T(f−uhyh)2+C∑∂T∖∂ΩhT∫∂T([∇yh]⋅n)2. | (2.21) |
Note that
||y−yh||a≤||yh−yh||a+||y−yh||a, | (2.22) |
||p−ph||a≤||ph−ph||+||p−ph||a, | (2.23) |
||p−ph||2a+||y−yh||2a≤C||u−uh||20. | (2.24) |
Hence, (2.17) follows from (2.19)-(2.21) and (2.22)-(2.24).
To derive the a posteriori lower error bounds for the optimal control problems governed by bilinear elliptic equations we use the standard bubble function technique (see [1,17,27]). Let bT be the standard third order polynomial bubble on T scaled with bT=λ1λ2λ3. We denote by {λ1,λ2,λ3} the barycentric coordinates of T. Due to yh is piecewise linear, we set
wT=((yh−yd)T−yhph)b2T, |
where c1=∫Tb2T|T| denotes a positive constant, here
||wT||20,T=∫Tc21((yh−yd)T−uhph)2b4T≤c1∫T((yh−yd)T−uhph)2. |
Then there holds λ1λ2λ3=0 on ∂T, and satisfies
wT|∂T∖∂Ω=c1((yh−yd)T−uhph)(λ1λ2λ3)2|∂T=0. |
Moreover, we obtain
∇wT|∂T∖∂Ω=2c1((yh−yd)T−uhph)(λ1λ2λ3)∇(λ1λ2λ3)|∂T=0, |
thus we can conclude that wT∈H20(T). Using the results from [1,27], we derive that
|wT|20,2,T≤Ch−4T∫T|wT(T)|2. |
We set ˆT be a reference element, and ˆX=FT(X)=X+bT be an affine map from T onto ˆT, we also set ˆw=w∘F−1T(ˆX). Thus we conclude that
|wT|20,T=∫ˆT|ˆwT(ˆX)|2. |
Where wT∈H20(T), we obtain that ˆwT∈H20(ˆT). Using the Poincare's inequality, we deduce the results below
∫ˆT|ˆwT(ˆX)|2≤C∑|a|=2∫ˆT|DaˆwT(ˆX)|2, |
such that
|wT|20,T≤Ch4T|wT|20,2,T. |
Thus we can obtain that
ch−2T||wT||20,T≤|wT|20,2,T≤Ch−2T||wT||20,T,∀ T∈Th. | (2.25) |
Then use of bubble function wT with (2.25) yields
∫Th2T((yh−yd)T−yhph)2=∫Th2T(yh−yd)T−uhph)wT≤∫T(yh−yd−uhph−(y−yd−up))wT+C(σ)∫T(yh−yd−(yh−yd)T)2+Cσh−2T||wT||20,T=−∫T∇(ph−p)∇wT+∫T(yh−y)wT+∫T(uhph−up)+C(σ)∫T(yh−yd−(yh−yd)T)2+Cσh−2T||wT||20,T≤C(σ)||ph−p||2a,T+C(σ)||yh−y||20,T+C(σ)||uh−u||20,T+C(σ)∫Th2T(yh−yd−(yh−yd)T)2+Cσh−2T||wT||20,T, |
We set uhph−up=uh(ph−p)+(uh−u)p, such that
∫Th2T(yh−yd−uh−ph)2≤C∫T((yh−yd)T−uhph)2+C∫Th2T(yh−yd−(yh−yd)T))2≤C||ph−p||2a,T+C||yh−y||20,T+C||uh−u||20,T+C∫Th2T(yh−yd−(yh−yd)T))2. | (2.26) |
Then by using the Schwarz's inequality, it follows from (2.25) that
∫∂ThT([∇ph]⋅n)2=∫∂T([∇ph]⋅n)w∂T=∫∂T([∇ph]⋅n−[∇p]⋅n)w∂T=∫∂T∇(ph−p)∇w∂T+∫∂T(y−yd−up)w∂T≤C(σ)||ph−p||2a,∂T∖∂Ω+∫∂T(y−yh)w∂T+∫∂T(yh−yd−uhph)w∂T+∫∂T(up−uhph)w∂T≤C(σ)||ph−p||2a,∂T∖∂Ω+C(σ)||y−yh||20,∂T∖∂Ω+C(σ)||u−uh||20,∂T∖∂Ω+C(σ)∫∂T(yh−yd−uhph)2+Cσ(||w∂T||2a,∂T∖∂Ω+h−2T||w∂T||20,∂T∖∂Ω), |
assume that up−uhph=(u−uh)p+uh(p−ph). Then there holds
∫∂ThT([∇ph]⋅n)2≤C||ph−p||2a,∂T∖∂Ω+C||y−yh||20,∂T∖∂Ω+C||u−uh||20,∂T∖∂Ω+C∫∂T(yh−yd−uhph)2. | (2.27) |
Applying (2.26)-(2.27) and the Poincare's inequality, we obtain
η23,Th(yh,ph,Th)=∑T∈Th∫Th2T(yh−yd−uhph)2+∑∂T∖∂Ω∫∂ThT([∇ph]⋅n)2≤C||p−ph||2a+C||y−yh||2a+C||u−uh||20+Cosc2Th(yh−yd,T). | (2.28) |
Similarly, we also derive that
η22,Th(uh,yh,Th)=∑T∈Th∫Th2T(f−uhph)2+∑∂T∖∂Ω∫∂ThT([∇yh]⋅n)2≤C||p−ph||2a+C||y−yh||2a+C||u−uh||20+Cosc2Th(f,T). | (2.29) |
Combining Lemma 3.6 of [17], we can derive the same result for bilinear elliptic equations
||ph−πhph||20,T≤C||u−uh||20,T+C||p−ph||20,T. |
From Lemma 2.1, we infer that
||ph−πhph||20,T≤Ch2T||∇ph||2a,T. |
Then using the inverse estimates to infer that
η21,Th(ph,Th)=∑T∈Thh2T||ph−πhph||20,T≤C||u−uh||20+C||p−ph||2a, | (2.30) |
and hence, (2.18) follows from (2.28)-(2.30).
Theorem 2.1 gives reliable and efficient posteriori error estimates. Now we introduce an adaptive finite element algorithm to explain what we mainly investigate in this paper.
Algorithm 2.1. Adaptive finite element algorithm for bilinear optimal control problems:
(0) Given an initial mesh Th0 and construct finite element space Uh0ad and Vh0. Select marking parameter 0<θ≤1 and set k:=0.
(1) Solve the discrete bilinear optimal control problem (2.11)-(2.13), then obtain an approximate solution (yhk,uhk,phk) with respect to Thk.
(2) Compute the local error estimator ηThk(T) for all T∈Thk.
(3) Select a minimal subset Mhk of Thk such that
η2Thk(Mhk)≥θη2Thk(Thk), |
where η2Thk(ω)=η21,Th(ph,ω)+η22,Th(uh,yh,ω)+η23,Th(yh,ph,ω) for all ω⊂Thk.
(4) Refine Mhk by bisecting b≥1 times in passing from Thk to Thk+1 and generally additional elements are refined in the process in order to ensure that Thk+1 is conforming.
(5) Solve the discrete bilinear optimal control problem (2.11)-(2.13), then obtain approximate solution (yhk+1,uhk+1,phk+1) with respect to Thk+1.
(6) Set k=k+1 and go to step (2).
In this section, we first introduce the relevant theorem of reference [24], where the authors construct a simple and efficient adaptive finite element methods to ensure a reduction rate of data oscillation, together with an error reduction based on a posteriori error estimators. In addition, we introduce the internal node properties and mark the data oscillations. To derive the residual-type posterior error estimates [21], we employ the lemmas as follows.
Lemma 3.1. For all v∈H1(Ω), Th∈T,T∈Th, there exists
||v||0,∂T∖∂Ω≤Ch−1/2T||v||0,T+Ch1/2T||v||1,T. |
Here we state the following local perturbation property.
Lemma 3.2. For Th∈T, T∈Th, let uh1,uh2∈Uhad, yh1,yh2,ph1,ph2∈Vh, we have
η1,Th(ph1,T)−η1,Th(ph2,T)≤ChT‖ph1−ph2‖a,T, | (3.1) |
η2,Th(uh1,yh1,T)−η2,Th(uh2,yh2,T)≤C(hT‖uh1−uh2‖0,T+hT‖ph1−ph2‖0,T+‖yh1−yh2‖a,ωT), | (3.2) |
η3,Th(yh1,ph1,T)−η3,Th(yh2,ph2,T)≤C(hT‖uh1−uh2‖0,T+hT‖yh1−yh2‖0,T+‖ph1−ph2‖a,ωT), | (3.3) |
oscTh(yh1−yd,T)−oscTh(yh2−yd,T)≤Ch2T‖yh1−yh2‖a,T. | (3.4) |
Proof. We only give the proof of (3.1) because of the proofs of (3.1), (3.3)-(3.4) are similar. By the definition of η2,Th(uh,yh,T), we deduce that
η2,Th(uh1,yh1,T)≤η2,Th(uh2,yh2,T)+h1/2T||[∇(yh1−yh2)]⋅n||0,∂T∖∂Ω+hT||uh1ph1−uh2ph2||0,T≤η2,Th(uh2,yh2,T)+h1/2T||[∇(yh1−yh2)]⋅n||0,∂T∖∂Ω+ChT||uh1−uh2||0,T+ChT||ph1−ph2||0,T. | (3.5) |
With the help of inverse estimates and Lemma 3.1, we infer that
||[∇(yh1−yh2)]⋅n||0,∂T∖∂Ω≤Ch−1/2T||yh1−yh2||a,ωT. | (3.6) |
Therefore (3.2) follows from (3.5)-(3.6).
By using the similar method with [24], we prove the following result.
Lemma 3.3. Let Th⊂~Th for Th,˜Th∈T.Mh⊂Th denotes the set of elements which are marked from Th to ˜Th. Then for uh∈Uhad, ˜uh∈U˜had, yh,ph∈Vh, ˜yh,˜ph∈V˜h and any δ,δ1∈(0,1], we have
η21,˜Th(˜ph,˜Th)−(1+δ1){η21,Th(ph,Th)−(1−2−1/2)η1,Th(ph,Rh)}≤C(1+δ−11)h20‖ph−˜ph‖2a, | (3.7) |
and
η22,˜Th(˜uh,˜yh,˜Th)−(1+δ){η22,T(uh,yh,Th)−λη22,Th(uh,yh,Mh)}≤C(1+δ−1)(h20‖uh−˜uh‖20+h20‖ph−˜ph‖20+‖yh−˜yh‖2a), | (3.8) |
and
η23,˜Th(˜yh,˜ph,˜Th)−(1+δ){η23,Th(yh,ph,Th)−λη23,Th(uh,yh,Mh)}≤C(1+δ−1)(h20‖uh−˜uh‖20+h20‖yh−˜yh‖20+‖ph−˜ph‖2a), | (3.9) |
and
osc2Th(yh−yd,Th∩˜Th)−2osc2˜Th(˜yh−yd,Th∩˜Th)≤2Ch40‖yh−˜yh‖2a, | (3.10) |
where λ=1−2−b2, h0=maxT∈Th0hT and Rh denotes the set of elements which are refined from Th to ˜Th.
Proof. We only give the proof of (3.7)-(3.8) and (3.10) owing to the proof of (3.9) can be prove with (3.8) similarly. From (3.1) of Lemma 3.2, then combining with the Young's inequality yield
η21,˜Th(˜ph,˜Th)≤Cη21,˜Th(ph,˜Th)+Ch2T||ph−˜ph||2a,T+Cδ1η21,˜Th(ph,˜Th)+Cδ−11h2T||p−˜ph||2a,T. | (3.11) |
Note for T∈Rh⊂Th that T will be bisected at least one time, it holds that
∑T′∈Tη1,˜Th(ph,T′)2≤2−1/2η1,Th(p,T)2. |
For T∈Th∖Rh, we obtain
η1,˜Th(p,T)2=η1,Th(p,T)2. |
Thus we can conclude that
η1,˜Th(ph,˜Th)2=η1,˜Th(ph,Rh)+η1,˜Th(ph,Th∖Rh)≤η1,Th(ph,Th)2−(1−2−1/2)η1,Th(ph,Rh). | (3.12) |
Hence, (3.7) follows from (3.11)-(3.12).
From Lemma 3.2 and the Young's inequality with parameter δ, we deduce that
η22,~Th(˜uh,˜yh,~Th)≤η22,~Th(uh,yh,~Th)+Ch2T(||˜uh−uh||20+||˜ph−ph||20)+Cδη22,~Th(uh,yh,~Th)+Cδ−1h2T(||˜uh−uh||20+||˜ph−ph||20)+||˜yh−yh||2a+Cδ−1||˜yh−yh||2a. | (3.13) |
Given a marked element T′∈Mh, let ˜TT′={T∈˜Th:T⊂T′}. For yh∈Vh⊂V˜h, we derive the jump [∇yh]=0 on the interior sides of ∪˜TT′. Let b be the number of bisections, it implies hT=|T|2≤(2−b|T′|)1/2≤2−b/2hT′, and then it holds that
∑Th∈˜TT′η22,˜Th(uh,yh,T)≤2−b/2η22,Th(uh,yh,T′). |
When T∈Th∖Mh, we check that
η22,˜Th(uh,yh,T)≤η22,Th(uh,yh,T). |
Thus we conclude that
η22,˜Th(uh,yh,˜Th)=η22,Th(uh,yh,Mh)+η22,˜Th(uh,yh,Th∖Mh)≤η22,Th(uh,yh,Th)−(1−2−b/2)η22,Th(uh,yh,Mh). | (3.14) |
And hence, (3.8) follows from (3.13)-(3.14).
It is obviously that for T∈Th∩˜Th, such that
oscTh(yh−yd,T)=osc˜Th(yh−yd,T). | (3.15) |
From (3.4) of Lemma 3.2 and the Young's inequality, such that
osc˜Th(˜yh−yd,T)≤2osc˜Th(yh−yd,T)+2h4T||˜yh−yh||2a,T. | (3.16) |
Brings (3.15) into (3.16), we conclude that (3.10) follows from summing over T∈Th∩˜Th for (3.16).
One difficulty in the proof of the convergence is lack of the orthogonality, thus we have to prove the quasi-orthogonality. We introduce the following fundamental relationships for Thi, where Thi+1∈T, Thi⊂Thi+1, such that
‖u−uhi+1‖20=‖u−uhi‖20−‖uhi−uhi+1‖20−2(u−uhi+1,uhi+1−uhi), | (3.17) |
‖y−yhi+1‖2a=‖y−yhi‖2a−‖yhi−yhi+1‖2a−2a(y−yhi+1,yhi+1−yhi), | (3.18) |
‖p−phi+1‖21=‖p−phi‖2a−‖phi−phi+1‖2a−2a(p−phi+1,phi+1−phi), | (3.19) |
where (y,u,p) are the solution of (2.6)-(2.8), (yhi,uhi,phi) and (yhi+1,uhi+1,phi+1) are the solution of (2.11)-(2.13) with respect to Thi and Thi+1, respectively.
Lemma 3.4. For Thi,Thi+1∈T and Thi⊂Thi+1, such that
(1−δ)‖u−uhi+1‖20−‖u−uhi‖20+‖uhi−uhi+1‖20≤Cδ−1(η21,Thi(phi,Rh)+J2(h0)(η22,Thi(uhi,yhi,Rh)+η23,Thi(yhi,phi,Rh))), | (3.20) |
and
(1−δ)‖y−yhi+1‖2a−‖y−yhi‖2a−‖yhi−yhi+1‖2a−δ‖u−uhi+1‖20−δ‖p−phi+1‖20≤Cδ−1(η21,Thi(phi,Rh)+J2(h0)(η22,Thi(uhi,yhi,Rh)+η23,Thi(yhi,phi,Rh))), | (3.21) |
and
(1−δ)‖p−phi+1‖2a−‖p−phi‖2a+‖phi−phi+1‖2a−δ‖u−uhi+1‖20−δ‖y−yhi+1‖20≤Cδ−1(η21,Thi(phi,Rh)+J2(h0)(η22,Thi(uhi,yhi,Rh)+η23,Thi(yhi,phi,Rh))), | (3.22) |
we follow the idea of [28] to introduce the quantity J2(h) that
J2(h)=supf∈L2(Ω),||f||0=1infvh∈Vh||Sf−vh||a. |
Proof. For Thi+1 and from (2.8), we obtain
(αuhi+1−yhi+1phi+1,vh−uhi+1)≤0,∀ vh∈Uhi+1ad. | (3.23) |
For Uhiad⊂Uhi+1ad and from (3.23), such that
α||uhi+1−uhi||20=(αuhi+1,uhi+1−uhi)−(αuhi,uhi+1−uhi)≤(yhi+1phi+1,uhi+1−uhi)−(αuhi,uhi+1−uhi)=(yhi+1phi+1−yhiphi,uhi+1−uhi)+(αuhi−yhiphi,uhi−uhi+1)=(yhi+1(phi+1−phi),uhi+1−uhi)+(phi(yhi+1−yhi),uhi+1−uhi)+(αuhi−yhiphi,uhi−uhi+1)≤C(phi+1−phi,uhi+1−uhi)+C(yhi+1−yhi,uhi+1−uhi)+(αuhi−yhiphi,uhi−uhi+1). | (3.24) |
Next, we divide (3.24) into three parts to prove. Firstly, Let πhi be the L2−projection onto Uhiad, then for T∈Thi∩Thi+1, we obtain
(πhiuhi+1−uhi+1)|T=0. |
Thus we infer that
(αuhi−yhiphi,uhi−uhi+1)≤(αuhi−yhiphi,πhiuhi+1−uhi+1)=(phi−πhiphi,πhi(uhi+1−uhi)−(uhi+1−uhi))≤Cη1,Thi(phi,Rh)||uhi−uhi+1||0, | (3.25) |
we denote Rh be the set of elements that are refined from Thi to Thi+1. It follows from the definition of Sh, there exists
(phi+1−phi,uhi+1−uhi)=(S∗hi+1(Shi+1uhi+1−yd)−S∗hi(Shiuhi−yd),uhi+1−uhi)=(S∗hi+1(Shi+1uhi+1−yd)−S∗hi+1(Shi+1uhi−yd),uhi+1−uhi)+(S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd),uhi+1−uhi)=(Shi+1(uhi+1−uhi),Shi+1(uhi+1−uhi))+(S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd),uhi+1−uhi)≤(S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd),uhi+1−uhi)+||yhi+1−yhi||20. | (3.26) |
Let ϕ∈H10(Ω) be the solution of the following problem
a(ϕh,q)=(S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd),q),∀ q∈V. |
Thus we obtain that
||S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd)||20=a(ϕh−ϕhi,S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd))+(Shi+1uhi−Shiuhi,ϕhi−ϕh)+(Shi+1uhi−Shiuhi,ϕh), | (3.27) |
we denote ϕhi the standard finite element estimate of ϕh with respect to Vhi. It follows from Proposition 2.1 of [18] that for Vhi⊂Vhi+1
a(ϕh−ϕhi,S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd))≤a(ϕh−ϕhi,S∗hi+1(Shi+1uhi−yd)−S∗hi+1(Shiuhi−yd))+a(ϕh−ϕhi,S∗hi+1(Shiuhi−yd)−S∗hi(Shiuhi−yd))≤CJ(h0)||S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd)||0(||Shi+1uhi−Shiuhi||a+||S∗hi+1(Shiuhi−yd)−S∗hi(Shiuhi−yd)||a), | (3.28) |
and
(Shi+1uhi−Shiuhi,ϕhi−ϕh)≤CJ(h0)||Shi+1uhi−Shiuhi||0||S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd)||0, | (3.29) |
and
(Shi+1uhi−Shiuhi,ϕh)≤C||Shi+1uhi−Shiuhi||0||S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi−yd)||0. | (3.30) |
From Lemma 3.6 of [12], we obtain
||Shi+1uhi−Shiuhi||a≤Cη2,Thi(uhi,yhi,Rh), | (3.31) |
||S∗hi+1(Shiuhi−yd)−S∗hi(Shiuhi−yd)||a≤Cη3,Thi(yhi,phi,Rh). | (3.32) |
Then let φh∈H10(Ω) be the solution of the following problem
a(q,φh)=(Shi+1uhi−Shiuhi,q),∀ q∈V. |
With the help of the standard duality argument, we infer that
||Shi+1uhi−Shiuhi||20=a(Shi+1uhi−Shiuhi,φh−φhi)≤CJ(h0)||Shi+1uhi−Shiuhi||a||Shi+1uhi−Shiuhi||0, | (3.33) |
where φhi is the standard finite element estimate of φh with respect to VThi. Then combining with (3.26)-(3.33), there holds that
(phi+1−phi,uhi+1−uhi)≤||yhi+1−yhi||20+CJ(h0)(η2,Thi(uhi,yhi,Rh)+η3,Thi(yhi,phi,Rh))||uhi−uhi+1||0. | (3.34) |
One easily sees that
||yhi+1−yhi||0=||Shi+1uhi+1−Shiuhi||0≤||Shi+1uhi+1−Shi+1uhi||0+||Shi+1uhi−Shiuhi||0≤C||uhi−uhi+1||0+CJ(h0)(η2,Thi(uhi,yhi,Rh)+η3,Thi(yhi,phi,Rh)). | (3.35) |
Hence, (3.20) follows from (3.17), (3.24), and (3.34)-(3.35).
In order to prove (3.21), we have to provide the following
2a(y−yhi+1,yhi+1−yhi)=−2(up−uhi+1phi+1,yhi+1−yhi)=2(u(p−phi+1),yhi−yhi+1)+2(phi+1(u−uhi+1),yhi−yhi+1)≤δ||u−uhi+1||20+δ||p−phi+1||20+δ−1||yhi+1−yhi||20. | (3.36) |
Thus we conclude that (3.21) follows from (3.18), (3.25), and (3.35)-(3.36).
Now we will give the proof of (3.22).
||phi+1−phi||0=||S∗hi+1(Shi+1uhi+1−yd)−S∗hi(Shiuhi+1−yd)||0≤||S∗hi+1(Shi+1uhi+1−yd)−S∗hi+1(Shi+1uhi−yd)||0+||S∗hi+1(Shi+1uhi−yd)−S∗hi(Shiuhi+1−yd)||0≤C||uhi−uhi+1||0+CJ(h0)(η2,Thi(uhi,yhi,Rh)+η3,Thi(yhi,phi,Rh)). | (3.37) |
It is similar to (3.36) yields
2a(p−phi+1,phi+1−phi)=−2((y−up)−(yhi+1−uhi+1phi+1),phi+1−phi)=2(u(p−phi+1),phi+1−phi)+2(phi+1(u−uhi+1),phi+1−phi)+(y−yhi+1,phi+1−phi)≤δ||u−uhi+1||20+δ||p−phi+1||20+δ−1||phi+1−phi||20+δ||y−yhi+1||20. | (3.38) |
Hence, (3.22) follows from (3.19), (3.25), and (3.37)-(3.38).
For Thi∈T, we denote Uhiad, Vhi and the solution (yhi,uhi,phi) of (2.11)-(2.13) with respect to Thi. Then we define the following notations that
e2hi=‖u−uhi‖20+‖y−yhi‖2a+‖p−phi‖2a,E2hi=‖uhi−uhi+1‖20+‖yhi−yhi+1‖2a+‖phi−phi+1‖2a,˜η2Thi(ω)=η22,Thi(uhi,yhi,ω)+η23,Thi(yhi,phi,ω), |
We now proceed to prove the contraction of the Algorithm 2.1.
Theorem 3.1. Let (y,u,p)∈H10(Ω)×Uad×H10(Ω) be the solution of (2.5)-(2.7) and (yh,uh,ph)∈Vh×Uhad×Vh be the solution of (2.10)-(2.12) generated by the adaptive finite element algorithm 1. There exist γ1>0, γ2>0 and α∈(0,1] depending only on the shape of regularity of initial Th0, b, Ω and the marking parameter θ∈(0,1] such that
e2hi+1+γ1η21,Thi+1(phi+1,Thi+1)+γ2˜η2Thi+1(Thi+1)≤α(e2hi+γ1η21,Thi(phi,Thi)+γ2˜η2Thi(Thi)), | (3.39) |
where h0 sufficiently small, and h0< 1.
Proof. By Theorem 2.1, Lemma 3.3 and Lemma 3.3 yields
e2hi≤Cη2Thi(Thi), | (3.40) |
˜η2Thi+1(Thi+1)≤(1+δ){˜η2Thi(Thi)−λ˜η2Thi(Mhi)}+C(2+δ−1)E2hi, | (3.41) |
η21,Thi+1(phi+1,Thi+1)≤(1+δ1){η21,Thi(phi,Thi)−(1−2−1/2)η21,Thi(phi,Rhi)}+C(1+δ−11)h20||phi−phi+1||20, | (3.42) |
(1−2δ)e2hi+1≤e2hi−E2hi+Cδ−1(η21,Thi(phi,Rhi)+J2(h0)˜η2Thi(Thi)), | (3.43) |
where Rhi denote the set of elements which are refined from Thi to Thi+1. Combining with the upper bound of Theorem 2.1, then choosing parameters ˜γ1,˜γ2, we obtain
(1−2δ)e2hi+1+˜γ1η21,Thi+1(phi+1,Thi+1)+˜γ2˜η2Thi+1(Thi+1)≤e2hi+˜γ2(1+δ){˜η2Thi(Thi)−λ˜η2Thi(Mhi)}+˜γ2(2+δ−1)E2hi−E2hi−(˜γ1(1+δ1)(1−2−1/2)−Cδ−1)˜η2Thi(Rhi)+˜γ1(1+δ1)η21,Thi(phi,Thi)+Cδ−1J2(h0)˜η2Thi(Thi)+˜γ1(1+δ−1)h20(η21,Thi+1(phi+1,Thi+1)+˜η2Thi+1(Thi+1)+η21,Thi(phi,Thi)+˜η2Thi(Thi)). |
We choose
˜γ1(1+δ1)(1−2−1/2)−Cδ−1>0,˜γ2C(2+δ−1)=1. | (3.44) |
Then we find that
(1−2δ)e2hi+1+˜γ1(1−C(1+δ−1)h20)η21,Thi+1(phi+1,Thi+1)+(˜γ2−C(1+δ−1)h20)˜η2Thi+1(Thi+1)≤e2hi+˜γ1((1+δ1)+C(1+δ−1)h20)η21,Thi(phi+1,Thi)−c(η21,Thi(phi,Mhi)+η2Thi(Mhi))+(˜γ2(1+δ)+˜γ1C(1+δ−11)h20+Cδ−1J2(h0))˜η2Thi(Thi), |
where c=min{˜γ1(1+δ1)(1−2−1/2)−Cδ−1,˜γ2C(2+δ−1)}. Then choosing β∈(0,1) and using the marking strategy of Algorithm 2.1 and Theorem 2.1, we derive that
(1−2δ)e2hi+1+˜γ1(1−C(1+δ−1)h20)η21,Thi+1(phi+1,Thi+1)+(˜γ2−C(1+δ−1)h20)˜η2Thi+1(Thi+1)≤(1−Cθβ)e2hi+(˜γ1((1+δ1)+C(1+δ−1)h20)−cθ(1−β))η21,Thi(phi,Thi)+(˜γ2(1+δ)+˜γ1C(1+δ−11)h20+Cδ−1J2(h0)−cθ(1−β))˜η2Thi(Thi). |
One easily sees that
e2hi+1+γ1η21,Thi+1(phi+1,Thi+1)+γ2˜η2Thi+1(Thi+1)≤α1e2hi+α2γ1η21,Thi(phi,Thi)+α3γ2˜η2Thi(Thi), |
where
γ1=˜γ1(1−C(1+δ−1)h20)1−2δ, | (3.45) |
γ2=˜γ2−C(1+δ−1)h201−2δ, | (3.46) |
α1=1−Cθβ1−2δ, | (3.47) |
α2=˜γ1((1+δ1)+C(1+δ−1)h20)−cθ(1−β)˜γ1(1−C(1+δ−1)h20), | (3.48) |
α3=˜γ2(1+δ)+˜γ1C(1+δ−11)h20+Cδ−1J2(h0)−cθ(1−β)˜γ2−C(1+δ−1)h20. | (3.49) |
By choosing sufficiently small δ<1 and β such that α1∈(0,1). As long as (3.48)-(3.49) are slightly deformed, and combining with (3.44), when h0≪1 and δ1,δ small enough, we can conclude that
α2∈(0,1)andα3∈(0,1). |
Choosing α=max{α1,α2,α3}, and then (3.39) complete the rest of the proof.
In this section, we consider the quasi-optimality for the adaptive finite element method. Firstly we introduce some interpretation of the notations. For Th,Th1,Th2∈T, with the assumption that #Th be the number of elements in Th, we also set Th1⊕Th2 be the smallest common conforming refinement of Th1 and Th2 [12,26], such that
#(Th1⊕Th2)≤#Th1+#Th2−#Th0. | (4.1) |
According to [20], a function approximation class is defined by
As:={(y,u,p,yd,f)∈H10(Ω)×L2(Ω)×H10(Ω)×L2(Ω)×L2(Ω):|(y,u,p,yd,f)|s<+∞}, |
where
|(y,u,p,yd,f)|s:=supN>0NsinfTh∈TNinf(yh,uh,ph)∈Vh×Uhad×Vh{‖u−uh‖20+‖y−yh‖2a+‖p−ph‖2a+osc2Th(f,Th))+osc2Th(yh−yd,Th)}12, |
and
TN0:={Th∈T:#Th−#Th0≤N0}. |
A localized upper bound will be given in the following lemmas, which play an important role in proving the qusi-optimality for an adaptive finite element method.
Lemma 4.1. For Th,˜Th∈T, and Th⊂˜Th, let Rh be the set of refined elements from Th to ˜Th. Let (yh,uh,ph) and (˜yh,˜uh,˜ph) be the solutions of (2.11)−(2.13) with respect to Th and ˜Th, respectively. Then there exists a constant C, depending on the shape regularity of initial grids Th0 and b such that
‖uh−˜uh‖20+‖yh−˜yh‖2a+‖ph−˜ph‖2a≤Cη2Th(Rh), | (4.2) |
where
η2Th(Rh)=η21,Th(ph,Rh)+η22,Th(uh,yh,Rh)+η23,Th(yh,ph,Rh). |
Proof. From (3.26)-(3.27) and (3.34)-(3.35), we obtain
||uh−˜uh||20≤Cη2Th(Rh). | (4.3) |
By using the results of Lemma 3.6 (see [12]), we derive that
||yh−˜yh||a=||Shuh−S˜h˜uh||a≤||Shuh−S˜h−S˜huh−S˜h||a+||S˜huh−S˜h−S˜h˜uh||a≤Cη2,Th(uh,yh,Rh)+||uh−˜uh||0. | (4.4) |
Then, it holds that
||ph−˜ph||a=||S∗h(Shuh−yd)−S∗˜h(S˜h˜uh−yd)||a≤||S∗h(Shuh−yd)−S∗˜h(Shuh−yd)||a+||S∗˜h(Shuh−yd)−S∗˜h(S˜h˜uh−yd)||a≤Cη3,Th(yh,ph,Rh)+||yh−˜yh||a. | (4.5) |
And thus we conclude that (4.2) follows from (4.3)-(4.5).
In the following lemma, we list the error indicators on the coarse grids that must satisfy a Dörfler property on the refinement one.
Lemma 4.2. Assume that the marking parameter θ∈(0,θ∗), where
θ∗=C2C(1+h40)+1. |
For Th,˜Th∈T and Th⊂˜Th, let (yh,uh,ph) and (˜yh,˜uh,˜ph) be the solutions of (2.11)-(2.13) with respect to Th and ˜Th, respectively. If
e2˜Th+osc2˜Th(˜Th)≤μ[e2Th+osc2Th(Th)], | (4.6) |
is satisfied for μ:=12(1−θθ∗). Then, the set Rh of elements which are refined from Th to ˜Th satisfies the Döfler property
η2Th(Rh)≥θη2Th(Th), |
where
e2Th=‖u−uh‖20+‖y−yh‖2a+‖p−ph‖2a,osc2Th(ω)=osc2Th(f,ω)+osc2Th(yh−yd,ω), |
for ω⊂Th and e2˜Th, osc2˜Th(˜Th) similarly to define.
Proof. From (4.6) and Theorem 2.1, we derive
(1−2μ)Cη2Th(Th)≤(1−2μ)(e2Th+osc2Th(Th)≤e2Th−2e2˜Th+osc2Th(Th)−2osc2˜Th(˜Th). |
Applications of the triangle inequality yields
||u−uh||20≤2||u−˜uh||20+2||uh−˜uh||20,||y−yh||2a≤2||y−˜yh||2a+2||yh−˜yh||2a,||p−ph||2a≤2||p−˜ph||2a+2||ph−˜ph||2a. |
Thus, from (4.2), we obtain
e2Th−2e2˜Th≤2Cη2Th(Rh). | (4.7) |
From (3.10), for T∈Th∩˜Th, it is easy to see that
osc2Th(yh−yd,Th∩˜Th)−2osc2˜Th(˜yh−yd,Th∩˜Th)≤2Ch40η2Th(Rh). | (4.8) |
According to Remark 2.1 of [12], there holds for T∈Rh
oscTh(T)≤ηTh(T). | (4.9) |
By adopting (4.7)-(4.9), there holds
(1−2μ)Cη2Th(Th)≤(2C(1+h20)+1)η2Th(Rh),(1−2μ)θ∗η2Th(Th)≤η2Th(Rh),θη2Th(Th)≤η2Th(Rh), |
where
θ∗=C(2C(1+h20)+1)andθ=(1−2μ)θ∗. |
The following Lemma is essential to prove the quasi-optimality.
Lemma 4.3. Let (y,u,p) and (Thi,Uhiad,Vhi,yhi,uhi,phi) be the solution of (2.6)-(2.8) and the sequence of grids, finite element spaces and discrete solutions produced by Algorithm 2.1, respectively. Assume that the marking parameter θ satisfies the condition in Lemma 4.2, then the following estimate is valid
#Mhi≤C(C12s1|(uh,y,p,yd,f)|1ssμ−12s(e2hi+osc2Thi(Thi))−12s), | (4.10) |
if (u,y,p,yd,f)∈As.
Proof. Let ϵ:=μC−11(η2Thi+osc2Thi(Thi)), where C1 shall be produced in (4.13). Due to (uh,y,p,yd,f)∈As, then there exists a Thϵ∈T and (yϵ,uϵ,pϵ)∈Uhϵad×Vhϵ×Vhϵ such that
#Thϵ−#Th0≤C|(u,y,p,yd,f)|1/ssϵ−1/s, | (4.11) |
||u−uϵ||20+||y−yϵ||2a+||p−pϵ||2a+osc2Thϵ(f,Thϵ)+osc2Thϵ(yϵ−yd,Thϵ)≤ϵ2. | (4.12) |
Let (y∗,u∗,p∗) be the solution of (2.6)-(2.8) with respect Th∗=Thϵ⊕Thi which is the smallest common refinement of Thϵ and Thi. Next, we first prove the following estimate
e2Th∗+osc2Th∗≤C1(e2Thϵ+osc2Thϵ), | (4.13) |
where
e2Thϵ=||u−uϵ||20+||y−yϵ||2a+||p−pϵ||2a,osc2Thϵ(Thϵ)=osc2Thϵ(f,Thϵ)+osc2Thϵ(yϵ−yd,Thϵ). |
Combining with the Young's inequality yield
(u−u∗,u∗−uϵ)=(u−uϵ,u∗−uϵ)−(u∗−uϵ,u∗−uϵ)≤(u−uϵ,u∗−uϵ)≤||u−uϵ||20+||u∗−uϵ||20. | (4.14) |
Note that there holds the similar result to a(y−y∗,y∗−yϵ) and a(p−p∗,p∗−pϵ). Thus combining with (3.17)-(3.19) and (4.14), we find that
6(||u−uϵ||20+||y−yϵ||2a+||p−pϵ||2a)≥||u−u∗||20+||u∗−uϵ||20+||y−y∗||2a+||y∗−yϵ||2a+||p−p∗||2a+||p∗−pϵ||2a. | (4.15) |
From (3.10) with Th=˜Th=Th∗ and yh=y∗, ˜yh=yϵ, such that
osc2Th∗(y∗−yd,Th∗)−2osc2Thϵ(yϵ−yd,Thϵ)≤Ch40||y∗−yϵ||2a. | (4.16) |
For T′∈Thϵ, let TT′:={T∈Th∗:T∈T′}. There holds
∑T∈TT′||f−fT||20,T=∑T∈TT′(∫Tf2−(∫Tf)2|T|)=∫T′f2−∑T∈TT′(∫Tf)2|T|≤∫T′f2−∑T∈TT′(∫Tf)2|T′|≤C∫T′f2−(∫T′f)2|T′|=C||f−fT′||20,T′. |
Hence,
oscTh∗+2(f,Th∗)≤CoscThϵ+2(f,Thϵ). | (4.17) |
And thus we conclude that (4.13) follows from (4.15)-(4.17). Then combining with (4.12), we find that
e2Th∗+osc2Th∗(Th∗)≤C1(e2Thϵ+osc2Thϵ(Thϵ))≤C1ϵ2=μ(e2Thi+osc2Thi(Thi)) | (4.18) |
Note that Rh, which is the set of refinement elements from Thi to Th∗, it satisfies the marking property of Lemma 4.2 that
#Mhi≤#Rh≤#Th∗−#Thi≤#Thϵ−Th0. | (4.19) |
Thus we find that (4.10) follows from (4.11) and (4.19).
The following theorem describes the quasi-optimality of an adaptive finite element method.
Theorem 4.1. Let (y,u,p) and (Thi,Uhiad,Vhi,yhi,uhi,phi) be the solution of (2.6)-(2.8). Where the sequence of grids, finite element spaces, and discrete solutions produced by Algorithm 2.1, respectively. Assume that Th0 satisfies the condition (b) of Section 4 (see [3]). Let (u,y,p,yd,f)∈As, then there holds
#Thi−#Th0≤C|(u,y,p,yd,f)|1ss(e2hi+osc2Thi(Thi))−12s, | (4.20) |
provided h0≪1.
Proof. According to Lemma 2.3 of [12], we derive that
#Thi−#Th0≤i−1∑j=0Mhj. | (4.21) |
Combining (4.21) with Lemma 4.3 leads to
#Thi−#Th0≤i−1∑j=0MhjCC0i−1∑j=0(e2hi+osc2Thi(Thi))−12s, | (4.22) |
where
C0=C12s1|(u,y,p,yd,f)|1/ssμ−12s. |
An application of Theorem 2.1, such that
e2hi+γ1η21,Thi(phi,Thi)+γ2˜η2Thi(Thi)≈e2hi+osc2Thi(Thi). | (4.23) |
Thus we conclude that (4.20) follows from (4.22)-(4.23).
In this section, we are going to use the following iterations to solve the bilinear optimal control problem numerically.
Algorithm 5.1. It follows from linear optimal control problems, there exist similar algorithms in the reference [20], we define the following Algorithm 5.1 based on the bilinear optimal control problem:
a(ykh,wh)+(uk−1hykh,wh)=(f,wh),∀ wh∈Vh, | (5.1) |
a(qh,pkh)+(uk−1hpkh,qh)=(ykh−yd,qh),∀ qh∈Vh, | (5.2) |
(αukh+ykhpkh,vh−ukh)≥0,∀ vkh∈Uhad. | (5.3) |
Given an initial control u0h∈Uhad, then substitute it into (5.1) yields y1h. And then substitute y1h into (5.2) yields p1h. At last, substitute p1h into (5.3) yields u1h. Repeat the above step, we can solve (ykh,pkh,ukh), for k=1,2,⋯, it holds that
ukh=1α(−Phpkh+max(0,ˉpkh)), |
where Ph is the L2-projection from L2(Ω) to Uh and ˉpkh=∫Ωpkh|Ω|.
For ease of analysis the solution of bilinear optimal control problem we constructing examples as follows.
Example 1. The first example is a bilinear optimal control problem, we set state equation on Ω=(0,1)2, and α=0.5. We take the exact solution as
p=sinπx1sinπx2,u0=1αmax{ˉp,0}−p,yd=−Δp+up−y,f=−Δy+uy,y=2π2p. |
In Example 1, the state and co-state are approximated by the piecewise linear elements, while piecewise constant elements are used to approximate the control. We compute Example 1 on an adaptive grid and a uniform grid respectively.
In Figure 1, we plot the profiles of the exact state and the numerical state on adaptive refinement grid with θ=0.3 and 21 adaptive loops. We plot the profiles of the co-exact state variables and the co-numerical state variables on adaptive refinement grid with θ=0.3 and 21 adaptive loops in Figure 2 as well. Connecting with Figure 1 and Figure 2, we find that although the solution of Example 1 is smooth, larger gradients can be observed in certain areas. Accordingly, the adaptive finite element method may obtain a much smaller error compared to the uniform refinement.
We show the comparisons of convergence history of the errors on adaptively grids with θ=0.3 and uniformly refined grids with θ=1 in Figure 3. We can observe η2 is approximately parallel to the line slop −1 which is the optimal convergence rate we expected by using linear finite elements. This situation confirms the theoretic results in Section 4. But in this example, the solutions of the optimal control are quite smooth. Here is little difference in the convergence history of the errors compared to uniform refinement.
Example 2. In the second example, we choose α=0.3 with the exact solutions on Ω=(0,1)2
u=1α(−p+max(0,ˉp)),p={sinπx1sinπx2,if s(x1,x2)<0,0,if s(x1,x2)≥0,yd={100√(x1−1)2+(x2−1)2,if s(x1,x2)<0,0,if s(x1,x2)≥0, |
where s(x1,x2)=(x1−0.2)2+(x2−0.6)2−0.04.
Similarly, the state and co-state are approximated by the piecewise linear elements, while piecewise constant elements are used to approximate the control. We compute Example 2 on an adaptive grid and a uniform grid respectively.
In Figure 4, we plot the profiles of the numerical state and the numerical co-state on 21 adaptive loops with θ=0.3 for Example 2, both the numerical state and the numerical co-state can be seen in the singularities around the peak. Further, the grids are concentrated at the peak. In Figure 5, we show the convergence history of the estimators. We can see η2 is approximately parallel to slop −1. From the figure, it is easy to see that the second-order convergence for the reduction of error estimators for the adaptively refined grids. In Example ref{exm2}, the optimal control is not smooth so that there is much difference in using either the uniform or adaptive grids to approximate the control.
This work is supported by National Science Foundation of China (11201510), National Social Science Fund of China (19BGL190), China Postdoctoral Science Foundation (2017T100155, 2015M580197), Innovation Team Building at Institutions of Higher Education in Chongqing (CXTDX201601035), Chongqing Research Program of Basic Research and Frontier Technology (cstc2019jcyj-msxmX0280) and Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJZD-K202001201), Youth Innovative Talents Project (Natural Science) of research on humanities and social sciences in Guangdong normal university (2017KQNCX265), Hunan Provincial Education Department of China (18C0196).
The authors declare no conflict of interest.
[1] |
M. Ainsworth, J. T. Oden, A posteriori error estimators in finite element analysis, Comput. Methods Appl. Mech. Engrg., 142 (1997), 1–88. doi: 10.1016/S0045-7825(96)01107-3
![]() |
[2] |
I. Babuška, W. C. Rheinboldt, Error estimates for adaptive finite computations, SIAM J. Numer. Anal., 15 (1978), 736–754. doi: 10.1137/0715049
![]() |
[3] | P. Binev, W. Dahmen, R. Devore, Adaptive finite element approximation for distributed elliptic optimal control problems, SIAM J. Control Optim., 97 (2004), 219–268. |
[4] |
L. Zhang, Z. Zhou, Spectral galerkin approximation of optimal control problem governed by riesz fractional differential equation, Appl. numer. math., 143 (2019), 247–262. doi: 10.1016/j.apnum.2019.04.003
![]() |
[5] | F. Wang, Z. Zhang, Z. Zhou, A spectral galerkin approximation of optimal control problem governed by fractional advection diffusion reaction equations, J. Comput. Appl. Math., 386 (2021), 113–129. |
[6] |
N. Du, H. Wang, W. B. Liu, A fast gradient projection method for a constrained fractional optimal control, J. Sci. Comput., 68 (2016), 1–20. doi: 10.1007/s10915-015-0125-1
![]() |
[7] | P. G. Ciarlet, The Finite Element Method for Elliptic Problems, Amsterdam: North-Holland, 1978. |
[8] |
Z. Chen, J. Feng, An adaptive finite element algorithm with reliable and efficient error for linear parabolic problems, Math. Comput., 73 (2004), 1167–1193. doi: 10.1090/S0025-5718-04-01634-5
![]() |
[9] | Y. Chen, Z. Lu, High Efficient and Accuracy Numerical Methods for Opyimal Control Problems, Science Press, Beijing, 2015. |
[10] |
Y. Chen, Z. Lu, Y. Huang, Superconvergence of triangular Raviart-Thomas mixed finite element methods for a bilinear constrained optimal control problem, Comput. Math. Appl., 66 (2013), 1498–1513. doi: 10.1016/j.camwa.2013.08.019
![]() |
[11] | Y. Chen, Z. Lu, L. Liu, Numerical Methods for Partial Differential Equations, Science Press, Beijing, 2015. |
[12] |
J. M. Cascon, C. Kreuzer, R. H. Nochetto, K. G. Siebert, Qusi-optimal convergence rate for an adaptive finite element method, SIAM J. Numer. Anal., 46 (2008), 2524–2550. doi: 10.1137/07069047X
![]() |
[13] |
W. Dörfler, A convergent adaptive algorithm for Poisson equation, SIAM J. Numer. Anal., 33 (1996), 1106–1124. doi: 10.1137/0733054
![]() |
[14] |
A. Demlow, R. Stevenson, Convergence and quasi-optimality of an adaptive finite element method for controlling L2 errors, Numer. Math., 117 (2011), 185–218. doi: 10.1007/s00211-010-0349-9
![]() |
[15] |
A. Gaevskaya, R. H. W. Hoppe, Y. Iliash, M. Kieweg, Convergence anlysis of an adaptive finite element for distributed control problems with control constraints, Int. Serises Numer. Math., 155 (2007), 47–68. doi: 10.1007/978-3-7643-7721-2_3
![]() |
[16] |
L. Ge, W. Liu, D. Yang, Adaptive finite element approximation for a constrained optimal control problem via multi-meshes, J. Sci. Comput., 41 (2009), 238–255. doi: 10.1007/s10915-009-9296-y
![]() |
[17] | L. Ge, W. Liu, D. Yang, L2 norm equivalent a posteriori error estimate for a constrained optimal control problem, Inter. J. Numer. Anal. Model., 6 (2009), 335–353. |
[18] |
W. Gong, N. Yan, Adaptive finite element method for elliptic optimal control problems: convergence and optimality, Numer. Math., 135 (2017), 1121–1170. doi: 10.1007/s00211-016-0827-9
![]() |
[19] | L. He, A. Zhou, Comvergence and optimality of adaptive finite element methods for elliptic partial differential equations, Int. J. Numer. Anal. Model., 8 (2011), 1721–1743. |
[20] | H. Leng, Y. Chen, Convergence and quasi-optimality of an adaptive finite element method for optimal control problems with integral control constraint, Adv. Comput. Math., 44 (2018), 1367–1394. |
[21] | R. Li, W. Liu, H. Ma, T. Tang, Adaptive finite element methods with convergence rates, Numer. Math., 41 (2002), 1321–1349. |
[22] | W. Liu, N. Yan, Adaptive Finite Element Methods for Optimal Control Governed by PDEs, Science Press, Beijing, 2008. |
[23] | Z. Lu, S. Zhang, L∞-error estimates of rectangular mixed finite element methods for bilinear optimal control problem, Appl. Math. Comput., 300 (2017), 79–94. |
[24] |
P. Morin, R. H. Nochetto, K. G. Siebert, Data oscillation and convergence of adaptive FEM, SIAM J. Numer. Anal., 33 (1996), 1106–1124. doi: 10.1137/0733054
![]() |
[25] | P. Morin, R. H. Nochetto, K. G. Siebert, Convergence of adaptive finite element methods, SIAM Reviews, 44 (2000), 466–488. |
[26] |
R. Stevenson, Optimality of a standard adaptive finite element method, Found Comput. Math., 7 (2007), 245–269. doi: 10.1007/s10208-005-0183-0
![]() |
[27] | R. Verfurth, A Review of A Posteriori Error Estimation and Adaptive Mesh Refinement, Comput. Methods Appl. Mech. Engrg., Wiley-Teubner, London, 1996. |
[28] | J. Xu, A. Zhou, Local and parallel finite element algorithms based on two-grid discretizations, Math. Comput., 69 (1996), 881–909. |