Research article

Construction optical solitons of generalized nonlinear Schrödinger equation with quintuple power-law nonlinearity using Exp-function, projective Riccati, and new generalized methods

  • Received: 24 December 2024 Revised: 26 January 2025 Accepted: 12 February 2025 Published: 21 February 2025
  • MSC : 35C05, 35C07, 35C08, 47J35

  • This work investigates the generalized nonlinear Schrödinger equation (NLSE), which imitates the wave transmission along optical fibers. This model incorporates a quintuple power-law of non-linearity and nonlinear chromatic dispersion. To demonstrate the significance and motivation for this work, a review of the prior research is presented in the literature. Three integration strategies are applied during the study process in order to produce a variety of novel solutions. These techniques include the modified exp-function approach, the general projective Riccati method (GPRM), and the new generalized method. The extracted solutions include bright solitons, singular solitons, dark solitons, and trigonometric solutions.

    Citation: Islam Samir, Hamdy M. Ahmed, Wafaa Rabie, W. Abbas, Ola Mostafa. Construction optical solitons of generalized nonlinear Schrödinger equation with quintuple power-law nonlinearity using Exp-function, projective Riccati, and new generalized methods[J]. AIMS Mathematics, 2025, 10(2): 3392-3407. doi: 10.3934/math.2025157

    Related Papers:

    [1] Zuliang Lu, Xiankui Wu, Fei Huang, Fei Cai, Chunjuan Hou, Yin Yang . Convergence and quasi-optimality based on an adaptive finite element method for the bilinear optimal control problem. AIMS Mathematics, 2021, 6(9): 9510-9535. doi: 10.3934/math.2021553
    [2] Zuliang Lu, Fei Cai, Ruixiang Xu, Chunjuan Hou, Xiankui Wu, Yin Yang . A posteriori error estimates of hp spectral element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(4): 5220-5240. doi: 10.3934/math.2022291
    [3] Xin Zhao, Xin Liu, Jian Li . Convergence analysis and error estimate of finite element method of a nonlinear fluid-structure interaction problem. AIMS Mathematics, 2020, 5(5): 5240-5260. doi: 10.3934/math.2020337
    [4] Zuliang Lu, Ruixiang Xu, Chunjuan Hou, Lu Xing . A priori error estimates of finite volume element method for bilinear parabolic optimal control problem. AIMS Mathematics, 2023, 8(8): 19374-19390. doi: 10.3934/math.2023988
    [5] Changling Xu, Hongbo Chen . A two-grid P20-P1 mixed finite element scheme for semilinear elliptic optimal control problems. AIMS Mathematics, 2022, 7(4): 6153-6172. doi: 10.3934/math.2022342
    [6] Tiantian Zhang, Wenwen Xu, Xindong Li, Yan Wang . Multipoint flux mixed finite element method for parabolic optimal control problems. AIMS Mathematics, 2022, 7(9): 17461-17474. doi: 10.3934/math.2022962
    [7] Yuelong Tang . Error estimates of mixed finite elements combined with Crank-Nicolson scheme for parabolic control problems. AIMS Mathematics, 2023, 8(5): 12506-12519. doi: 10.3934/math.2023628
    [8] Miao Xiao, Zhe Lin, Qian Jiang, Dingcheng Yang, Xiongfeng Deng . Neural network-based adaptive finite-time tracking control for multiple inputs uncertain nonlinear systems with positive odd integer powers and unknown multiple faults. AIMS Mathematics, 2025, 10(3): 4819-4841. doi: 10.3934/math.2025221
    [9] Jie Liu, Zhaojie Zhou . Finite element approximation of time fractional optimal control problem with integral state constraint. AIMS Mathematics, 2021, 6(1): 979-997. doi: 10.3934/math.2021059
    [10] Cagnur Corekli . The SIPG method of Dirichlet boundary optimal control problems with weakly imposed boundary conditions. AIMS Mathematics, 2022, 7(4): 6711-6742. doi: 10.3934/math.2022375



    The finite element method has developed alongside computer technology since the early 1960s. Research on the reliability and validity of finite element analysis has driven the development of the method [3,4,5,25,27]. The errors that arise when finite element methods are applied have attracted considerable attention. One of the main sources of error is the discretisation of the model, and researchers have examined many aspects of finite element analysis from this viewpoint. The generation of the finite element mesh is a further concern. In early conventional finite element analysis (FEA), meshes were usually generated by experience, intuition, or even guesswork, and one simply judged whether the approximation results were reasonable. If they were not, the mesh had to be redesigned, at a cost to both the efficiency of the analysis and the reliability of the results. The adaptive finite element method arose to automate this step: the computer examines the current solution and, using the error information obtained during the computation, decides whether the solution is accurate enough or whether the mesh must be adjusted further.

    The adaptive finite element method is a numerical method that automatically adjusts the discretisation to improve the solution process [2]. An appropriate mesh can greatly reduce the errors that arise from the discretisation in the finite element approximation. Exact solutions of optimal control problems for nonlinear systems are usually not available. Given the complexity and diversity of nonlinear equations, the adaptive finite element method is a practical tool for solving them.

    Adaptive finite element methods have been applied widely and successfully to linear optimal control problems. For example, Eriksson and Johnson proposed an adaptive finite element algorithm that produces a sequence of successively refined meshes in which the final mesh satisfies a given error tolerance [11]. Gaevaskaya and Hope et al. analysed an adaptive finite element method for a class of distributed optimal control problems with control constraints, using the reliability and discrete local efficiency of the a posteriori estimator and the quasi-orthogonality property as basic tools [13]. Braess and Carstensen et al. investigated the residual jump contributions of a standard explicit residual-based a posteriori error estimator for an adaptive finite element method [1]. Hu and Xu et al. studied the convergence and optimality of the adaptive nonconforming linear element method for the Stokes problem [17].

    However, the literature shows that research on adaptive finite element methods for nonlinear optimal control problems is still far from mature.

    Recently, for instance, Wriggers and Scherf proposed an adaptive finite element technique for nonlinear contact problems [29]. Eriksson and Johnson extended adaptive finite element methods for parabolic problems to a class of nonlinear scalar problems, obtaining a posteriori error estimates and designing the corresponding adaptive algorithms [12]. Nonlinear optimal control problems, like big data, are now a focus of scholars worldwide. Hence it is worth investigating the adaptive finite element method for such nonlinear problems.

    Many scholars have also studied a priori error estimates for bilinear optimal control problems. For example, Yang, Demlow, and Dobrowolski et al. investigated a priori error estimates and superconvergence for optimal control problems governed by bilinear models and gave optimal $L^2$-norm error estimates and almost optimal $L^\infty$-norm estimates for the state and co-state variables [7,8,31]. Lu investigated second-order parabolic bilinear optimal control problems and provided a priori error estimates for the finite element solutions of the state equations describing the system [24]. Shen and Yang et al. investigated a quadratic optimal control problem governed by a linear hyperbolic integro-differential equation and its finite element approximation; a priori estimates were derived using standard functional analysis techniques, and the existence and regularity of the solution were established from these estimates. At the same time, some scholars have analysed a posteriori error estimates of the finite element method for bilinear optimal control problems [15,26]. Lu, Chen, and Leng et al. discussed the Raviart-Thomas mixed finite element discretisation of general bilinear optimal control problems and derived a posteriori error estimates for both the coupled state and the control solutions [18,23]. Although bilinear optimal control problems are frequently met in applications, they are much more difficult to handle than linear or nonlinear cases. There is little work on nonlinear optimal control problems.

    In this paper, we focus on a nonlinear optimal control problem with integral control constraints. The control is discretised by piecewise constant functions, while the state and the co-state are discretised by continuous piecewise linear functions. We then derive a posteriori error estimates. Convergence and quasi-optimality are proved by means of quasi-orthogonality and a discrete local upper bound. Under a mild assumption on the initial grid, the proofs of convergence and quasi-optimality rely on the solution operator of nonlinear elliptic equations. Finally, some numerical simulations are provided to support our theoretical analysis.

    We now fix some notation used throughout the paper. Let $\Omega$ be a bounded Lipschitz domain in $\mathbb{R}^2$ and let $\partial\Omega$ denote the boundary of $\Omega$. We use the standard notation $W^{m,q}(\omega)$, with norm $\|\cdot\|_{m,q,\omega}$ and seminorm $|\cdot|_{m,q,\omega}$, for the standard Sobolev space on $\omega\subset\Omega$. We omit the subscript if $\omega=\Omega$. For $q=2$, we denote $W^{m,2}(\Omega)$ by $H^m(\Omega)$ and $\|\cdot\|_m=\|\cdot\|_{m,2}$. For $m=0$ and $q=2$, we write $W^{0,2}(\omega)=L^2(\omega)$ and $\|\cdot\|_{0,2,\omega}=\|\cdot\|_{0,\omega}$. Additionally, $H_0^1(\Omega)=\{v\in H^1(\Omega): v=0 \text{ on } \partial\Omega\}$. Let $\mathcal{T}_{h_0}$ be the initial partition of $\bar\Omega$ into disjoint triangles. By newest-vertex bisection of $\mathcal{T}_{h_0}$ we obtain a class $\mathbb{T}$ of conforming partitions. For $\mathcal{T}_h,\tilde{\mathcal{T}}_h\in\mathbb{T}$, we write $\mathcal{T}_h\subset\tilde{\mathcal{T}}_h$ to indicate that $\tilde{\mathcal{T}}_h$ is a refinement of $\mathcal{T}_h$, and $h_T=|T|^{1/2}$ is taken as the diameter of the element $T$. In addition, $(\cdot,\cdot)$ denotes the $L^2$ inner product. Finally, $C$ denotes a constant independent of the mesh size, and we write $A\approx B$ to mean $cA\le B\le CA$.

    The rest of the paper is organized as follows. In Section 2, we introduce the optimal control problem of our interest and obtain a posteriori error estimates. The relevant algorithms are introduced in Section 3. In Section 4, we use quasi-orthogonality and discrete local upper bounds to prove the convergence of the adaptive finite element method, as well as quasi-optimality in Section 5. We provide an adaptive finite element algorithm and some numerical simulations to verify our theoretical analysis in Section 6. Finally, we summarize the results of this paper and develop a plan for future work.

    In this paper we consider the following nonlinear optimal control problem governed by a nonlinear elliptic equation:

    $$\min_{u\in U_{ad}}\Big\{\frac12\|y-y_d\|_0^2+\frac{\alpha}{2}\|u\|_0^2\Big\},\qquad -\Delta y+\phi(y)=f+u\ \text{ in }\Omega,\qquad y=0\ \text{ on }\partial\Omega,$$

    where $y$ is the state variable, $u$ is the control variable, $f$ is a function of the control variable, $\alpha$ is a constant greater than zero, $y_d\in L^2(\Omega)$, $U_{ad}=\{v: v\in L^2(\Omega),\ \int_\Omega v\,dx\ge0\}$ is a closed convex subset of $U=L^2(\Omega)$, and $\phi(\cdot)\in W^{2,\infty}(-R,R)$ for any $R>0$, $\phi'(y)\in L^2(\Omega)$ for any $y\in H^1(\Omega)$, $\phi'\ge0$. Let $V=H_0^1(\Omega)$. The weak formulation of the state equation reads: find $y\in V$ such that

    $$a(y,v)+(\phi(y),v)=(f+u,v),\quad \forall v\in V,$$

    where

    $$a(y,v)=\int_\Omega\nabla y\cdot\nabla v\,dx.$$

    Then the nonlinear optimal control problem can be restated as follows:

    $$\min_{u\in U_{ad}}\Big\{\frac12\|y-y_d\|_0^2+\frac{\alpha}{2}\|u\|_0^2\Big\}, \tag{2.1}$$
    $$a(y,v)+(\phi(y),v)=(f+u,v),\quad \forall v\in V. \tag{2.2}$$

    It is well known [14,20] that the nonlinear optimal control problem has at least one solution $(y,u)$, and that if a pair $(y,u)$ is a solution of the nonlinear optimal control problem, then there is a co-state $p\in V$ such that the triplet $(y,p,u)$ satisfies the following optimality conditions:

    $$a(y,v)+(\phi(y),v)=(f+u,v),\quad \forall v\in V, \tag{2.3}$$
    $$a(q,p)+(\phi'(y)p,q)=(y-y_d,q),\quad \forall q\in V, \tag{2.4}$$
    $$(\alpha u+p,v-u)\ge0,\quad \forall v\in U_{ad}. \tag{2.5}$$

    Due to the coercivity of $a(\cdot,\cdot)$, we define a linear operator $S:L^2(\Omega)\to H_0^1(\Omega)$ such that $S(f+u)=y$, and let $S^*$ be the adjoint of $S$ such that $S^*(y-y_d)=p$. Let $V_h$ be the continuous piecewise linear finite element space with respect to the partition $\mathcal{T}_h\in\mathbb{T}$. For $\mathcal{T}_h\in\mathbb{T}$, we define $U_h$ as the piecewise constant finite element space with respect to $\mathcal{T}_h$. Let $U_h^{ad}=\{v_h\in U_h:\int_\Omega v_h\,dx\ge0\}$. Then the standard finite element discretization of the nonlinear optimal control problem reads:

    $$\min_{u_h\in U_h^{ad}}\Big\{\frac12\|y_h-y_d\|_0^2+\frac{\alpha}{2}\|u_h\|_0^2\Big\}, \tag{2.6}$$
    $$a(y_h,v)+(\phi(y_h),v)=(f+u_h,v),\quad \forall v\in V_h. \tag{2.7}$$

    Similarly, the discrete problem (2.6)–(2.7) has a solution $(y_h,u_h)$, and if a pair $(y_h,u_h)\in V_h\times U_h$ is a solution of (2.6)–(2.7), then there is a co-state $p_h\in V_h$ such that the triplet $(y_h,p_h,u_h)$ satisfies the following optimality conditions:

    $$a(y_h,v)+(\phi(y_h),v)=(f+u_h,v),\quad \forall v\in V_h, \tag{2.8}$$
    $$a(q,p_h)+(\phi'(y_h)p_h,q)=(y_h-y_d,q),\quad \forall q\in V_h, \tag{2.9}$$
    $$(\alpha u_h+p_h,v_h-u_h)\ge0,\quad \forall v_h\in U_h^{ad}. \tag{2.10}$$

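    We now define the error indicators used throughout this paper, where $\eta(\cdot)$ denotes an error indicator and $\mathrm{osc}(\cdot)$ denotes a data oscillation. For $\mathcal{T}_h\in\mathbb{T}$ and $T\in\mathcal{T}_h$, we define

    $$\begin{aligned}
    \eta_{1,\mathcal{T}_h}^2(p_h,T)&=h_T^2\|\nabla p_h\|_{0,T}^2,\\
    \eta_{2,\mathcal{T}_h}^2(u_h,y_h,T)&=h_T^2\|f+u_h-\phi(y_h)\|_{0,T}^2+h_T\|[\nabla y_h]\cdot n\|_{0,\partial T\setminus\partial\Omega}^2,\\
    \eta_{3,\mathcal{T}_h}^2(y_h,p_h,T)&=h_T^2\|y_h-y_d-\phi'(y_h)p_h\|_{0,T}^2+h_T\|[\nabla p_h]\cdot n\|_{0,\partial T\setminus\partial\Omega}^2,\\
    \mathrm{osc}_{\mathcal{T}_h}^2(f,T)&=h_T^2\|f-f_T\|_{0,T}^2,\\
    \mathrm{osc}_{\mathcal{T}_h}^2(y_h-y_d,T)&=h_T^2\|(y_h-y_d)-(y_h-y_d)_T\|_{0,T}^2,
    \end{aligned}$$

    where $u_h\in U_h^{ad}$, $y_h,p_h\in V_h$, and $f_T$ is the $L^2$-projection of $f$ onto the piecewise constant space on $T$, that is, $f_T=\frac{\int_T f}{|T|}$. For $\omega\subset\mathcal{T}_h$, we set

    $$\eta_{1,\mathcal{T}_h}^2(p_h,\omega)=\sum_{T\in\omega}\eta_{1,\mathcal{T}_h}^2(p_h,T),\qquad \mathrm{osc}_{\mathcal{T}_h}^2(f,\omega)=\sum_{T\in\omega}\mathrm{osc}_{\mathcal{T}_h}^2(f,T),$$

    and $\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\omega)$, $\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\omega)$ and $\mathrm{osc}_{\mathcal{T}_h}^2(y_h-y_d,\omega)$ are defined similarly.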
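    As an illustration of the oscillation term, the following Python sketch evaluates $h_T^2\|f-f_T\|_{0,T}^2$ on a single triangle with $h_T=|T|^{1/2}$. The function names and the choice of the three-point edge-midpoint quadrature rule are assumptions made for this example and are not taken from the paper; the rule is exact for quadratic integrands, so the result is exact when $f$ is linear on $T$ and only approximate otherwise.

```python
import numpy as np

def triangle_area(verts):
    """Area of a triangle given a (3, 2) array of vertex coordinates."""
    (x0, y0), (x1, y1), (x2, y2) = verts
    return 0.5 * abs((x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0))

def local_oscillation_sq(f, verts):
    """Approximate osc^2(f, T) = h_T^2 * ||f - f_T||_{0,T}^2 on one triangle.

    Assumption: integrals are approximated by the edge-midpoint rule
    (weights |T|/3), which is exact for polynomials of degree <= 2.
    """
    verts = np.asarray(verts, dtype=float)
    area = triangle_area(verts)
    h_T_sq = area  # h_T = |T|^{1/2}, so h_T^2 = |T|
    # Midpoints of the three edges.
    mids = 0.5 * (verts + np.roll(verts, -1, axis=0))
    vals = np.array([f(x, y) for x, y in mids])
    f_T = vals.mean()  # quadrature approximation of the mean of f over T
    integral = area / 3.0 * np.sum((vals - f_T) ** 2)
    return h_T_sq * integral

if __name__ == "__main__":
    reference = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
    print(local_oscillation_sq(lambda x, y: x, reference))
```

    A constant $f$ gives zero oscillation, which matches the role of $\mathrm{osc}$ as a measure of how far the data are from piecewise constants.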

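    A reliable and efficient a posteriori error estimate for the nonlinear optimal control problem (2.1)–(2.2) is presented in the next theorem.

    Theorem 2.1. For $\mathcal{T}_h\in\mathbb{T}$, let $(y,p,u)$ be the exact solution of problem (2.3)–(2.5) and let $(y_h,p_h,u_h)$ be the solution of problem (2.8)–(2.10) with respect to $\mathcal{T}_h$. Then there exist constants $c$ and $C$ such that

    $$\|u-u_h\|_0^2+\|y-y_h\|_1^2+\|p-p_h\|_1^2\le C\big(\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)+\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)+\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)\big), \tag{2.11}$$

    and

    $$c\big(\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)+\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)+\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)\big)\le\|u-u_h\|_0^2+\|y-y_h\|_1^2+\|p-p_h\|_1^2+\mathrm{osc}_{\mathcal{T}_h}^2(f,\mathcal{T}_h)+\mathrm{osc}_{\mathcal{T}_h}^2(y_h-y_d,\mathcal{T}_h). \tag{2.12}$$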

    Proof. Step 1. We seek functions $y^h,p^h\in V$ satisfying the following auxiliary problems:

    $$a(y^h,v)+(\phi(y^h),v)=(f+u_h,v),\quad \forall v\in V, \tag{2.13}$$
    $$a(q,p^h)+(\phi'(y^h)p^h,q)=(y^h-y_d,q),\quad \forall q\in V. \tag{2.14}$$

    It follows that

    uuh20C(αu,uuh)C(αuh,uuh)C(αuh,uuh)=C(αuh,uhu)+C(αuhαuh,uuh).

    With the help of the proof of Lemma 3.4 in [14] and the lemma 3.4 in [21], we have

    (αuh,uhu)=(αuh+ph,uhu)C(TThphπhph20,T+uuh20)C(σ)η21,Th(ph,Th)+Cσuuh20, (2.15)

    where η21,Th(ph,Th)=TThη21,Th(ph,T), and

    (αuhαuh,uuh)=(phph,uuh)C(σ)phph20+Cσuuh20, (2.16)

    where $\pi_h$ is the $L^2$-projection operator onto the piecewise constant space on $\mathcal{T}_h$, $C(\sigma)$ is a universal constant depending on $\sigma$, $\sigma$ is an arbitrary positive number, and $C$ is a generic universal constant, which can include $C(\sigma)$. From (2.15) and (2.16) it follows that

    $$\|u-u_h\|_0^2\le C\big(\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)+\|p^h-p_h\|_0^2\big).$$
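    The absorption step behind this estimate can be spelled out as follows. This is a sketch under two assumptions that are not stated in the original: that $\|u-u_h\|_0^2$ is bounded by the sum of the two inner products estimated in (2.15) and (2.16), and that both estimates carry the same generic constant, labelled $C_0$ here for illustration.

```latex
% Young's inequality, valid for any sigma > 0:
(a,b)\le\|a\|_0\,\|b\|_0\le\frac{1}{4\sigma}\|a\|_0^2+\sigma\|b\|_0^2 .

% Adding (2.15) and (2.16), each contributing C_0*sigma*||u-u_h||_0^2:
\|u-u_h\|_0^2
  \le C(\sigma)\Big(\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)+\|p^h-p_h\|_0^2\Big)
     +2C_0\sigma\,\|u-u_h\|_0^2 .

% Choosing sigma = 1/(4 C_0) moves the last term to the left-hand side:
\tfrac12\,\|u-u_h\|_0^2
  \le C(\sigma)\Big(\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)+\|p^h-p_h\|_0^2\Big).
```

    The same device, with a different choice of $\sigma$, is what allows the terms $C\sigma\|e\|_1^2$ to be absorbed in the estimates for the state and co-state.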

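    Let $e^y=y^h-y_h$ and $e^y_I=\hat\pi_he^y$, where $\hat\pi_h$ is the average interpolation operator defined in Lemma 3.2 of [14]. Then we obtain

    $$\begin{aligned}
    c\|e^y\|_1^2&\le(\nabla(y^h-y_h),\nabla e^y)+(\phi(y^h)-\phi(y_h),e^y)\\
    &=(\nabla(y^h-y_h),\nabla(e^y-e^y_I))+(\phi(y^h)-\phi(y_h),e^y-e^y_I)\\
    &=\sum_{T\in\mathcal{T}_h}\int_T(f+u_h-\phi(y_h))(e^y-e^y_I)\,dx-\sum_{T\in\mathcal{T}_h}\int_{\partial T}([\nabla y_h]\cdot n)(e^y-e^y_I)\,ds\\
    &\le C(\sigma)\sum_{T\in\mathcal{T}_h}h_T^2\int_T(f+u_h-\phi(y_h))^2\,dx+C(\sigma)\sum_{\partial T\setminus\partial\Omega}h_T\int_{\partial T}([\nabla y_h]\cdot n)^2\,ds+C\sigma\|e^y\|_1^2\\
    &=C(\sigma)\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)+C\sigma\|e^y\|_1^2,
    \end{aligned}$$

    where $\sigma$ is an arbitrary positive number and $\eta_2$ is the indicator defined above. Taking $\sigma=\frac{c}{2C}$, we have

    $$\|y^h-y_h\|_1^2\le C\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h).$$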

    Similarly, let $e^p=p^h-p_h$ and let $e^p_I$ be the average interpolation of $e^p$. Then we obtain

    cep21(ep,(phph))+((ϕ(yh)(phph),ep)=(ep,(phph))+(ϕ(yh)phϕ(yh)ph,ep)+((ϕ(yh)ϕ(yh))ph,ep)=((epepI),(phph))+(ϕ(yh)phϕ(yh)ph,epepI)+(epI,(phph))+(ϕ(yh)phϕ(yh)ph,epI)+((ϕ(yh)ϕ(yh))ph,ep)=TThT(yhydϕ(yh)ph)(epepI)dxTΩT([ph]n)(epepI)(yhyh,epI)+((ϕ(yh)ϕ(yh))ph,ep)dxC(σ)TThh2TT(yhydϕ(yh)ph)2dx+C(σ)TΩhTT([ph]n)2dx+CσTThh2TT|ep|2dx+Cyhyh0ep0+Cϕ(yh)ϕ(yh)0ph0,4ep0,4C(σ)η23,Th(yh,ph,Th)+C(σ)yhyh20+C(σ)ph21yhyh20+Cσphph21)C(σ)η23,Th(yh,ph,Th)+Cyhyh20+Cσphph21,

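    in which we apply the embedding theorem $\|v\|_{0,4}\le C\|v\|_1$ (see [6]) and the property $\|p_h\|_1\le C$, so that $C(\sigma)\|y^h-y_h\|_0^2+C(\sigma)\|p_h\|_1^2\|y^h-y_h\|_0^2\le C(\sigma)\|y^h-y_h\|_0^2+C(\sigma)C\|y^h-y_h\|_0^2\le C\|y^h-y_h\|_0^2$. Here $\eta_3$ is the indicator defined above. It follows that

    $$\|p^h-p_h\|_1^2\le C\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)+C\|y^h-y_h\|_0^2.$$

    Hence we obtain

    $$\|p^h-p_h\|_0^2\le C\|p^h-p_h\|_1^2+C\|y^h-y_h\|_1^2\le C\big(\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)+\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)\big).$$

    By the triangle inequality we obtain

    $$\begin{aligned}
    \|y-y_h\|_1&\le\|y-y^h\|_1+\|y^h-y_h\|_1\le C\big(\|u-u_h\|_0+\|y^h-y_h\|_1\big),\\
    \|p-p_h\|_1&\le\|p-p^h\|_1+\|p^h-p_h\|_1\le C\big(\|y-y^h\|_1+\|p^h-p_h\|_1\big).
    \end{aligned}$$

    Combining the estimates above, we have

    $$\|u-u_h\|_0^2+\|p-p_h\|_1^2+\|y-y_h\|_1^2\le C\big(\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)+\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)+\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)\big).$$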

    Step 2. We now turn to the lower bound. Following the proofs of Lemma 3.6 in [14] and Lemma 3.4 in [21], we conclude that

    $$\sum_{T\in\mathcal{T}_h}h_T^2\|\nabla p_h\|_{0,T}^2\le C\sum_{T\in\mathcal{T}_h}\|p_h-\pi_hp_h\|_{0,T}^2\le C\big(\|u-u_h\|_0^2+\|p-p_h\|_1^2\big). \tag{2.17}$$

    Proof. It is easily seen that

    TThphπhph20,T=TThphπhph0,Tphp+pπhp+πhpπhph0,TTThphπhph0,Tpπhp+13TThphπhph20,T+Cphp21.

    Since u+p=max(0,ˉp)=const, hence

    πh(u+p)=u+p,

    such that

    TThphπhph0,Tpπhp0,T=TThphπhph0,Tp+uπh(p+u)+πhpu0,T=TThphπhph0,Tπh(uuh)(uuh)0,T13TThphπhph20,T+Cuhu21.

    This completes the proof.

    According to [28], we employ the standard bubble function technique to estimate the error indicators $\eta_{2,\mathcal{T}_h}(u_h,y_h,\mathcal{T}_h)$ and $\eta_{3,\mathcal{T}_h}(y_h,p_h,\mathcal{T}_h)$. Similar to Chapter 7.2 in [22], there exist polynomials wTH10(T) and wTH10(TΩ) such that

    ThT([ph]n)2dx=T([ph]n)wTdx, (2.18)
    Th2T((yhyd)Tϕ(yh)ph)2dx=T((yhyd)Tϕ(yh)ph)wTdx, (2.19)

    and apparently

    wT21,TΩCThT([ph]n)2dx, (2.20)
    h2TwT20,TΩCThT([ph]n)2dx, (2.21)
    wT21,TChh2T((yhyd)Tϕ(yh)ph)2dx, (2.22)
    h2TwT20,TChh2T((yhyd)Tϕ(yh)ph)2dx. (2.23)

    By using the Schwarz inequality, it follows from (2.18), (2.20), and (2.21) that

    ThT([ph]n)2dx=T([ph]n)wTdx=T([ph]n[p]n)wTdx=T(php)wTdx+div(Δ(php))wT=T(php)wTdx+T(yyd)dxϕ(y)p)wTdx=T(php)wTdx+T(yhydϕ(yh)ph)wTdx+T((yyd)(yhyd))wTdx+T(ϕ(y)pϕ(yh)ph)wTdxC(σ)php21,TΩ+C(σ)yyh20,TΩ+Tϕ(y)(pph)wTdx+T(ϕ(y)ϕ(yh))phwTdx+C(σ)T(yhydϕ(yh)ph)2dx+Cσ(wT21,TΩ+h2TwT20,TΩ)C(σ)php21,TΩ+C(σ)yyh20,TΩ+C(σ)ϕ(y)20,TΩ(pph)20,TΩ+T˜ϕ(yh)(yyh)phwTdx+C(σ)T(yhydϕ(yh)ph)2dx+Cσ(wT21,TΩ+h2TwT20,TΩ)C(σ)php21,TΩ+C(σ)yyh20,TΩ+C(σ)˜ϕ(yh)20,TΩyyh20,TΩph20,TΩ+C(σ)T(yhydϕ(yh)ph)2dx+Cσ(wT21,TΩ+h2TwT20,TΩ)C(σ)php21,TΩ+C(σ)yyh20,TΩ+C(σ)T(yhydϕ(yh)ph)2dx+CσThT([ph]n)2dx,

    where σ is an arbitrary positive number and ϕ()W2,(Ω) has been used. Then let σ=12C and we have

    ThT([ph]n)2dxCphp21,TΩ+Cyyh20,TΩ+CT(yhydϕ(yh)ph)2dx. (2.24)

    Next, it follows from (2.19), (2.22), and (2.23) that

    Th2T((yhyd)Tϕ(yh)ph)2dx=T((yhyd)Tϕ(yh)ph)wTdx=T(yhydϕ(yh)ph)wTdx+T((yhyd)(yhyd)T)wTdxT(yhydϕ(yh)ph(yyd)+ϕ(y)p)wTdx+C(σ)Th2T((yhyd)(yhyd)T)2dx+Cσh2TwT20,T=T(php)wTdx+T((yhyd)(yyd))wTdx+T(ϕ(yh)phϕ(y)p)wTdx+C(σ)Th2T((yhyd)(yhyd)T)2dx+Cσh2TwT20,TC(σ)php21,T+C(σ)(yyd)(yhyd)20,T+Tϕ(yh)(php)wTdx+T(ϕ(yh)ϕ(y))pwTdx+C(σ)Th2T((yhyd)(yhyd)T)2dx+Cσ(wT21,T+h2TwT20,T)C(σ)php21,T+C(σ)yyh20,T+C(σ)Th2T((yhyd)(yhyd)T)2dx+C(σ)ϕ(yh)20,Tphp20+C(σ)˜ϕ(y)20,Tyhy20,Tp20,T+Cσ(wT21,T+h2TwT20,T)C(σ)php21,T+C(σ)yyh20,T+C(σ)Th2T((yhyd)(yhyd)T)2dx+CσTh2T((yhyd)Tϕ(yh)ph)2dx,

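    where $\phi(\cdot)\in W^{2,\infty}(\Omega)$ has been used. Consequently, we deduce that

    $$\int_Th_T^2\big((y_h-y_d)_T-\phi'(y_h)p_h\big)^2dx\le C\|p_h-p\|_{1,T}^2+C\|y-y_h\|_{0,T}^2+C\int_Th_T^2\big((y_h-y_d)-(y_h-y_d)_T\big)^2dx.$$

    Then we have

    $$\begin{aligned}
    \int_Th_T^2\big(y_h-y_d-\phi'(y_h)p_h\big)^2dx&\le C\int_Th_T^2\big((y_h-y_d)_T-\phi'(y_h)p_h\big)^2dx+C\int_Th_T^2\big((y_h-y_d)-(y_h-y_d)_T\big)^2dx\\
    &\le C\|p_h-p\|_{1,T}^2+C\|y-y_h\|_{0,T}^2+C\int_Th_T^2\big((y_h-y_d)-(y_h-y_d)_T\big)^2dx.
    \end{aligned} \tag{2.25}$$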

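    Combining (2.24) and (2.25), we obtain

    $$\begin{aligned}
    \eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)&=\sum_{T\in\mathcal{T}_h}\int_Th_T^2\big(y_h-y_d-\phi'(y_h)p_h\big)^2dx+\sum_{\partial T\setminus\partial\Omega}\int_{\partial T}h_T([\nabla p_h]\cdot n)^2ds\\
    &\le C\|p_h-p\|_1^2+C\|y-y_h\|_0^2+C\sum_{T\in\mathcal{T}_h}\int_Th_T^2\big((y_h-y_d)-(y_h-y_d)_T\big)^2dx\\
    &\le C\|p_h-p\|_1^2+C\|y-y_h\|_1^2+C\,\mathrm{osc}_{\mathcal{T}_h}^2(y_h-y_d,\mathcal{T}_h).
    \end{aligned}$$

    In the same way,

    $$\begin{aligned}
    \eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)&=\sum_{T\in\mathcal{T}_h}\int_Th_T^2\big(f+u_h-\phi(y_h)\big)^2dx+\sum_{\partial T\setminus\partial\Omega}\int_{\partial T}h_T([\nabla y_h]\cdot n)^2ds\\
    &\le C\|y_h-y\|_1^2+C\|u-u_h\|_0^2+C\sum_{T\in\mathcal{T}_h}\int_Th_T^2(f-f_T)^2dx\\
    &\le C\|y_h-y\|_1^2+C\|u-u_h\|_0^2+C\,\mathrm{osc}_{\mathcal{T}_h}^2(f,\mathcal{T}_h).
    \end{aligned}$$

    This completes the proof of Theorem 2.1.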

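    In this section, we introduce two related algorithms.

    Algorithm 3.1. Adaptive finite element algorithm for nonlinear optimal control problems:

    (0) Given an initial grid $\mathcal{T}_{h_0}$, construct the finite element spaces $U_{h_0}^{ad}$ and $V_{h_0}$. Select a marking parameter $0<\theta\le1$ and set $k:=0$.

    (1) Solve the discrete nonlinear optimal control problem (2.8)–(2.10) to obtain the approximate solution $(u_{h_k},y_{h_k},p_{h_k})$ with respect to $\mathcal{T}_{h_k}$.

    (2) Compute the local error estimator $\eta_{\mathcal{T}_{h_k}}(T)$ for all $T\in\mathcal{T}_{h_k}$.

    (3) Select a minimal subset $\mathcal{M}_{h_k}$ of $\mathcal{T}_{h_k}$ such that

    $$\eta_{\mathcal{T}_{h_k}}^2(\mathcal{M}_{h_k})\ge\theta\,\eta_{\mathcal{T}_{h_k}}^2(\mathcal{T}_{h_k}),$$

    where $\eta_{\mathcal{T}_{h_k}}^2(\omega)=\eta_{1,\mathcal{T}_{h_k}}^2(p_{h_k},\omega)+\eta_{2,\mathcal{T}_{h_k}}^2(u_{h_k},y_{h_k},\omega)+\eta_{3,\mathcal{T}_{h_k}}^2(y_{h_k},p_{h_k},\omega)$ for all $\omega\subset\mathcal{T}_{h_k}$.

    (4) Refine $\mathcal{M}_{h_k}$ by bisecting $b\ge1$ times in passing from $\mathcal{T}_{h_k}$ to $\mathcal{T}_{h_{k+1}}$; in general, additional elements are refined in the process to ensure that $\mathcal{T}_{h_{k+1}}$ is conforming.

    (5) Solve the discrete nonlinear optimal control problem (2.8)–(2.10) to obtain the approximate solution $(u_{h_{k+1}},y_{h_{k+1}},p_{h_{k+1}})$ with respect to $\mathcal{T}_{h_{k+1}}$.

    (6) Set $k=k+1$ and go to step (2).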
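    The marking step (3) of Algorithm 3.1 can be sketched in Python as follows. This is a minimal illustration, assuming the squared local indicators are already available as an array; the function name `dorfler_mark` and the sort-and-accumulate strategy are choices made for this example, not details given in the paper. Sorting the indicators in decreasing order and taking the shortest prefix that reaches the fraction $\theta$ of the total yields a marked set of minimal cardinality.

```python
import numpy as np

def dorfler_mark(eta_sq, theta):
    """Return indices of a minimal set M with sum(eta_sq[M]) >= theta * sum(eta_sq).

    eta_sq : squared local error indicators, one entry per element (assumed given).
    theta  : marking parameter with 0 < theta <= 1.
    """
    eta_sq = np.asarray(eta_sq, dtype=float)
    if not 0.0 < theta <= 1.0:
        raise ValueError("theta must satisfy 0 < theta <= 1")
    # Largest indicators first, so the shortest prefix gives a minimal set.
    order = np.argsort(eta_sq)[::-1]
    cumulative = np.cumsum(eta_sq[order])
    target = theta * eta_sq.sum()
    # First position where the running sum reaches the target.
    n_marked = int(np.searchsorted(cumulative, target, side="left")) + 1
    n_marked = min(n_marked, eta_sq.size)  # guard against round-off when theta = 1
    return order[:n_marked]

if __name__ == "__main__":
    print(dorfler_mark([1.0, 4.0, 2.0, 3.0], theta=0.5))
```

    In a full adaptive loop the returned indices would be handed to the bisection routine of step (4); that routine and the solver of steps (1) and (5) are outside the scope of this sketch.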

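    Algorithm 3.2. Given an initial control $u_h^0\in U_h^{ad}$, seek $(y_h^k,p_h^k,u_h^k)$ such that

    $$\begin{aligned}
    a(y_h^k,w_h)+(\phi(y_h^k),w_h)&=(f+u_h^{k-1},w_h),\quad \forall w_h\in V_h,\\
    a(q_h,p_h^{k-1})+(\phi'(y_h^k)p_h^{k-1},q_h)&=(y_h^k-y_d,q_h),\quad \forall q_h\in V_h,\\
    (\alpha u_h^k+p_h^{k-1},v_h-u_h^k)&\ge0,\quad \forall v_h\in U_h^{ad},
    \end{aligned}$$

    for $k=1,2,\ldots$, and

    $$u_h^k=\frac1\alpha\big(-P_hp_h^k+\max(0,\bar p_h^k)\big),$$

    where $P_h$ is the $L^2$-projection from $L^2(\Omega)$ to $U_h$ and $\bar p_h^k=\frac{\int_\Omega p_h^k}{|\Omega|}$.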
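    The explicit control formula at the end of Algorithm 3.2 can be illustrated with a short Python sketch. It assumes the formula reads $u=\frac1\alpha(-P_hp+\max(0,\bar p))$, with the minus sign inferred from the variational inequality and the constraint $\int_\Omega v\,dx\ge0$, and that the projection $P_hp$ is supplied as one mean value per element together with the element areas. The function name `control_update` and its arguments are hypothetical.

```python
import numpy as np

def control_update(p_elem, areas, alpha):
    """Piecewise constant control u = (-P_h p + max(0, p_bar)) / alpha.

    p_elem : element-wise mean values of the co-state (the projection P_h p).
    areas  : element areas |T|, in the same order as p_elem.
    alpha  : regularisation parameter, alpha > 0.
    """
    p_elem = np.asarray(p_elem, dtype=float)
    areas = np.asarray(areas, dtype=float)
    if alpha <= 0.0:
        raise ValueError("alpha must be positive")
    # Mean of p over the domain; the projection preserves the integral of p.
    p_bar = np.dot(areas, p_elem) / areas.sum()
    # The constant shift max(0, p_bar) enforces the integral constraint.
    return (-p_elem + max(0.0, p_bar)) / alpha

if __name__ == "__main__":
    u = control_update([1.0, 3.0], areas=[1.0, 1.0], alpha=2.0)
    print(u, np.dot([1.0, 1.0], u))
```

    When $\bar p\le0$ the unconstrained minimiser $-P_hp/\alpha$ already has a non-negative integral and is returned unchanged; otherwise the shift makes the integral of the control exactly zero.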

    In this section we first consider the following nonlinear elliptic equation:

    $$\begin{cases}-\Delta y+\phi(y)=f,&\text{in }\Omega,\\ y=0,&\text{on }\partial\Omega,\end{cases} \tag{4.1}$$

    where $f\in L^2(\Omega)$. Following the idea in [30], we introduce the quantity $J(h)$:

    $$J(h):=\sup_{f\in L^2(\Omega),\ \|f\|_{0,\Omega}=1}\ \inf_{v_h\in V_{\mathcal{T}_h}}\|Sf-v_h\|_1,$$

    where $S$ is the solution operator for nonlinear elliptic equations. Obviously, $J(h)\ll1$ for $h\in(0,h_0)$ if $h_0\ll1$. The following lemma on this quantity follows [10,30].

    Lemma 4.1. For each $f\in L^2(\Omega)$, there exists a constant $C$ such that

    $$\|Sf-S_hf\|_1\le CJ(h)\|f\|_0, \tag{4.2}$$

    and

    $$\|Sf-S_hf\|_0\le CJ(h)\|Sf-S_hf\|_1, \tag{4.3}$$

    where $S_h$ is the discrete solution operator for nonlinear elliptic equations.

    Here, Lemma 4.1 is a preparation for Lemma 4.4.

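    According to [19], the local perturbation property plays an important role in the proof of convergence. It allows us to combine the error estimates into a single sum.

    Lemma 4.2. For $\mathcal{T}_h\in\mathbb{T}$ and $T\in\mathcal{T}_h$, let $u_{h1},u_{h2}\in U_h^{ad}$ and $y_{h1},y_{h2},p_{h1},p_{h2}\in V_h$. Then

    $$\eta_{1,\mathcal{T}_h}(p_{h1},T)-\eta_{1,\mathcal{T}_h}(p_{h2},T)\le Ch_T\|p_{h1}-p_{h2}\|_{1,T}, \tag{4.4}$$
    $$\eta_{2,\mathcal{T}_h}(u_{h1},y_{h1},T)-\eta_{2,\mathcal{T}_h}(u_{h2},y_{h2},T)\le C\big(h_T\|u_{h1}-u_{h2}\|_{0,T}+\|y_{h1}-y_{h2}\|_{1,T}\big), \tag{4.5}$$
    $$\eta_{3,\mathcal{T}_h}(y_{h1},p_{h1},T)-\eta_{3,\mathcal{T}_h}(y_{h2},p_{h2},T)\le C\big(h_T\|y_{h1}-y_{h2}\|_{0,T}+\|p_{h1}-p_{h2}\|_{1,T}\big), \tag{4.6}$$
    $$\mathrm{osc}_{\mathcal{T}_h}(y_{h1}-y_d,T)-\mathrm{osc}_{\mathcal{T}_h}(y_{h2}-y_d,T)\le Ch_T^2\|y_{h1}-y_{h2}\|_{1,T}. \tag{4.7}$$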

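    Proof. Step 1. According to [16,21] we have

    $$\|v\|_{0,\partial T\setminus\partial\Omega}\le C\big(h_T^{-1/2}\|v\|_{0,T}+h_T^{1/2}\|v\|_{1,T}\big). \tag{4.8}$$

    By the inverse estimates and (4.8) we obtain

    $$\|[\nabla(y_{h1}-y_{h2})]\cdot n\|_{0,\partial T\setminus\partial\Omega}\le Ch_T^{-1/2}\|y_{h1}-y_{h2}\|_{1,\omega_T}, \tag{4.9}$$
    $$\|[\nabla(p_{h1}-p_{h2})]\cdot n\|_{0,\partial T\setminus\partial\Omega}\le Ch_T^{-1/2}\|p_{h1}-p_{h2}\|_{1,\omega_T}, \tag{4.10}$$

    where $\omega_T$ denotes the patch of elements that share an edge with $T$. By the definition of $\eta_{1,\mathcal{T}_h}(p_h,T)$ and (4.10) we deduce that

    $$\eta_{1,\mathcal{T}_h}(p_{h1},T)\le\eta_{1,\mathcal{T}_h}(p_{h2},T)+Ch_T\|[\nabla(p_{h1}-p_{h2})]\cdot n\|_{0,\partial T\setminus\partial\Omega}.$$

    Then we have

    $$\eta_{1,\mathcal{T}_h}(p_{h1},T)-\eta_{1,\mathcal{T}_h}(p_{h2},T)\le Ch_T\|[\nabla(p_{h1}-p_{h2})]\cdot n\|_{0,\partial T\setminus\partial\Omega}\le Ch_T\|p_{h1}-p_{h2}\|_{1,T},$$

    which proves (4.4).

    Step 2. By the definition of $\eta_{2,\mathcal{T}_h}(u_h,y_h,T)$ and (4.9), we deduce that

    $$\begin{aligned}
    \eta_{2,\mathcal{T}_h}(u_{h1},y_{h1},T)-\eta_{2,\mathcal{T}_h}(u_{h2},y_{h2},T)&\le h_T\|u_{h1}-u_{h2}\|_{0,T}+h_T^{1/2}\|[\nabla(y_{h1}-y_{h2})]\cdot n\|_{0,\partial T\setminus\partial\Omega}+h_T\|\phi(y_{h1})-\phi(y_{h2})\|_{0,T}\\
    &\le h_T\|u_{h1}-u_{h2}\|_{0,T}+C\|y_{h1}-y_{h2}\|_{1,\omega_T}+Ch_T\|\tilde\phi'(y_{h1})\|_{0,T}\|y_{h1}-y_{h2}\|_{0,T}\\
    &\le C\big(h_T\|u_{h1}-u_{h2}\|_{0,T}+\|y_{h1}-y_{h2}\|_{1,T}\big),
    \end{aligned}$$

    where $\phi(\cdot)\in W^{2,\infty}(\Omega)$ has been used. This proves (4.5).

    Step 3. By the definition of $\eta_{3,\mathcal{T}_h}(y_h,p_h,T)$ and (4.10) we derive that

    $$\begin{aligned}
    \eta_{3,\mathcal{T}_h}(y_{h1},p_{h1},T)-\eta_{3,\mathcal{T}_h}(y_{h2},p_{h2},T)&\le h_T\|y_{h1}-y_{h2}\|_{0,T}+h_T^{1/2}\|[\nabla(p_{h1}-p_{h2})]\cdot n\|_{0,\partial T\setminus\partial\Omega}+h_T\|\phi'(y_{h1})p_{h1}-\phi'(y_{h2})p_{h2}\|_{0,T}\\
    &\le h_T\|y_{h1}-y_{h2}\|_{0,T}+C\|p_{h1}-p_{h2}\|_{1,\omega_T}+h_T\|\phi'(y_{h1})\|_{0,T}\|p_{h1}-p_{h2}\|_{0,T}+h_T\|\tilde\phi''(y_{h1})\|_{0,T}\|y_{h1}-y_{h2}\|_{0,T}\|p_{h2}\|_{0,T}\\
    &\le h_T\|y_{h1}-y_{h2}\|_{0,T}+C\|p_{h1}-p_{h2}\|_{1,\omega_T}+Ch_T\|p_{h1}-p_{h2}\|_{0,T}\\
    &\le C\big(h_T\|y_{h1}-y_{h2}\|_{0,T}+\|p_{h1}-p_{h2}\|_{1,T}\big),
    \end{aligned}$$

    where $\phi(\cdot)\in W^{2,\infty}(\Omega)$ has been used. This proves (4.6).

    Step 4. Similarly, we have

    $$\mathrm{osc}_{\mathcal{T}_h}(y_{h1}-y_d,T)-\mathrm{osc}_{\mathcal{T}_h}(y_{h2}-y_d,T)\le h_T^2\|y_{h1}-y_{h2}\|_{0,T}.$$

    This completes the proof of Lemma 4.2.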

    The authors in [25] demonstrate an error reduction provided the current errors are larger than the desired errors; that is, the errors may not be reduced when coarse grids are refined unless a node of the refined grid is introduced inside each marked element. Dörfler proves a similar result under a similar assumption [9].

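    Lemma 4.3. Let $\mathcal{T}_h\subset\tilde{\mathcal{T}}_h$ for $\mathcal{T}_h,\tilde{\mathcal{T}}_h\in\mathbb{T}$, and let $\mathcal{M}_h\subset\mathcal{T}_h$ denote the set of elements which are marked in passing from $\mathcal{T}_h$ to $\tilde{\mathcal{T}}_h$. Then for $u_h\in U_h^{ad}$, $\tilde u_h\in U_{\tilde h}^{ad}$, $y_h,p_h\in V_h$, $\tilde y_h,\tilde p_h\in V_{\tilde h}$ and any $\delta,\delta_1\in(0,1]$, we have

    $$\eta_{1,\tilde{\mathcal{T}}_h}^2(\tilde p_h,\tilde{\mathcal{T}}_h)-(1+\delta_1)\big\{\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{T}_h)-(1-2^{-1/2})\eta_{1,\mathcal{T}_h}^2(p_h,\mathcal{R}_h)\big\}\le C(1+\delta_1^{-1})h_0^2\|p_h-\tilde p_h\|_1^2, \tag{4.11}$$

    and

    $$\eta_{2,\tilde{\mathcal{T}}_h}^2(\tilde u_h,\tilde y_h,\tilde{\mathcal{T}}_h)-(1+\delta)\big\{\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{T}_h)-\lambda\eta_{2,\mathcal{T}_h}^2(u_h,y_h,\mathcal{M}_h)\big\}\le C(1+\delta^{-1})\big(h_0^2\|u_h-\tilde u_h\|_0^2+\|y_h-\tilde y_h\|_1^2\big), \tag{4.12}$$

    and

    $$\eta_{3,\tilde{\mathcal{T}}_h}^2(\tilde y_h,\tilde p_h,\tilde{\mathcal{T}}_h)-(1+\delta)\big\{\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)-\lambda\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{M}_h)\big\}\le C(1+\delta^{-1})\big(h_0^2\|y_h-\tilde y_h\|_0^2+\|p_h-\tilde p_h\|_1^2\big), \tag{4.13}$$

    and

    $$\mathrm{osc}_{\mathcal{T}_h}^2(y_h-y_d,\mathcal{T}_h\cap\tilde{\mathcal{T}}_h)-2\,\mathrm{osc}_{\tilde{\mathcal{T}}_h}^2(\tilde y_h-y_d,\mathcal{T}_h\cap\tilde{\mathcal{T}}_h)\le2Ch_0^4\|y_h-\tilde y_h\|_1^2, \tag{4.14}$$

    where $\lambda=1-2^{-b/2}$, $h_0=\max_{T\in\mathcal{T}_{h_0}}h_T$, and $\mathcal{R}_h$ denotes the set of elements which are refined in passing from $\mathcal{T}_h$ to $\tilde{\mathcal{T}}_h$.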

    Proof. Step 1. Applying the Young's inequality with parameter δ1 and (4.4) we get

    η21,~Th(˜ph,~Th)η21,~Th(ph,~Th)Ch20ph˜ph21+2η1,~Th(˜ph,~Th)η1,~Th(ph,~Th)C(1+δ11)h20ph˜ph21+δ1η21,~Th(ph,~Th). (4.15)

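    Note that each element $T\in\mathcal{R}_h\subset\mathcal{T}_h$ is bisected at least once, so that

    $$\sum_{T'\subset T}\eta_{1,\tilde{\mathcal{T}}_h}^2(p_h,T')\le2^{-1/2}\eta_{1,\mathcal{T}_h}^2(p_h,T).$$

    For $T\in\mathcal{T}_h\setminus\mathcal{R}_h$, we have

    $$\eta_{1,\tilde{\mathcal{T}}_h}^2(p_h,T)=\eta_{1,\mathcal{T}_h}^2(p_h,T).$$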

    In connection with the above estimates we demonstrate that

    η21,˜Th(˜ph,~Th)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}=η21,~Th(ph,Rh)+η21,~Th(ph,ThRh)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}21/2η21,~Th(ph,Rh)+η21,~Th(ph,ThRh)(1+δ1){η21,Th(ph,Th)(121/2)η1,Th(ph,Rh)}C(1+δ11)h20ph˜ph21,

    which illustrates (4.11) has been proved.

    Step 2. Employing the Young's inequality with parameter δ and (4.5) we obtain

    η23,~Th(˜yh,˜ph,~Th)η23,~Th(yh,ph,~Th)C(1+δ1)h2T(TThhT˜yhyh20,T+˜phph21)+δη23,~Th(yh,ph,~Th),

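    which is obtained in the same way as (4.15). Next, let $\tilde{\mathcal{T}}_{h,T}=\{T'\in\tilde{\mathcal{T}}_h:T'\subset T\}$, where $T\in\mathcal{M}_h$ is a marked element. For arbitrary $p_h\in V_{\mathcal{T}_h}\subset V_{\tilde{\mathcal{T}}_h}$, the jump $[\nabla p_h]=0$ on the interior sides of $\tilde{\mathcal{T}}_{h,T}$. If $b$ is the number of bisections, then

    $$h_{T'}=|T'|^{1/2}\le(2^{-b}|T|)^{1/2}\le2^{-b/2}h_T,$$

    due to refinement by bisection, and we obtain

    $$\sum_{T'\in\tilde{\mathcal{T}}_{h,T}}\eta_{3,\tilde{\mathcal{T}}_h}^2(y_h,p_h,T')\le2^{-b/2}\eta_{3,\mathcal{T}_h}^2(y_h,p_h,T).$$

    Moreover,

    $$\eta_{3,\tilde{\mathcal{T}}_h}^2(y_h,p_h,T)\le\eta_{3,\mathcal{T}_h}^2(y_h,p_h,T),$$

    for any $T\in\mathcal{T}_h\setminus\mathcal{M}_h$. Combining the above estimates gives

    $$\begin{aligned}
    \eta_{3,\tilde{\mathcal{T}}_h}^2(y_h,p_h,\tilde{\mathcal{T}}_h)&=\eta_{3,\tilde{\mathcal{T}}_h}^2(y_h,p_h,\mathcal{M}_h)+\eta_{3,\tilde{\mathcal{T}}_h}^2(y_h,p_h,\mathcal{T}_h\setminus\mathcal{M}_h)\\
    &\le2^{-b/2}\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{M}_h)+\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h\setminus\mathcal{M}_h)\\
    &=\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{T}_h)-(1-2^{-b/2})\eta_{3,\mathcal{T}_h}^2(y_h,p_h,\mathcal{M}_h).
    \end{aligned}$$

    Together with the estimates above, this yields (4.13). The proof of (4.12) is similar.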
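    The reduction factor appearing in this step can be checked numerically. The Python sketch below uses only the facts stated in the text, namely that $h_T=|T|^{1/2}$ and that each bisection halves the area of an element; the function names are illustrative and not taken from the paper.

```python
def bisection_reduction(b):
    """Return (2**(-b/2), lam) with lam = 1 - 2**(-b/2) for b >= 1 bisections."""
    if b < 1:
        raise ValueError("b must be at least 1")
    factor = 2.0 ** (-b / 2.0)
    return factor, 1.0 - factor

def child_mesh_size(area, b):
    """Mesh size h = |T'|^{1/2} of a descendant obtained by b bisections of T."""
    # Each bisection halves the area, so |T'| = |T| / 2**b.
    return (area / 2 ** b) ** 0.5

if __name__ == "__main__":
    for b in (1, 2, 3):
        factor, lam = bisection_reduction(b)
        print(b, factor, lam, child_mesh_size(1.0, b))
```

    For example, $b=2$ halves $h_T$ and gives $\lambda=1/2$, while a single bisection gives only $\lambda=1-2^{-1/2}$, the value that appears in (4.11).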

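    Step 3. For arbitrary $T\in\mathcal{T}_h\cap\tilde{\mathcal{T}}_h$, using (4.7) and Young's inequality we obtain

    $$\mathrm{osc}_{\mathcal{T}_h}(y_h-y_d,T)=\mathrm{osc}_{\tilde{\mathcal{T}}_h}(y_h-y_d,T),\qquad \mathrm{osc}_{\mathcal{T}_h}^2(y_h-y_d,T)-2\,\mathrm{osc}_{\tilde{\mathcal{T}}_h}^2(\tilde y_h-y_d,T)\le2Ch_T^4\|\tilde y_h-y_h\|_{1,T}^2.$$

    Summing this inequality over $T\in\mathcal{T}_h\cap\tilde{\mathcal{T}}_h$ gives (4.14). This completes the proof of Lemma 4.3.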

    One of the main obstacles in the convergence proof is the lack of orthogonality, which is vital for proving convergence. We therefore fall back on quasi-orthogonality, a tool widely used in the analysis of adaptive mixed and nonconforming adaptive finite element methods [19]. The following basic identities hold for $\mathcal{T}_{h_k},\mathcal{T}_{h_{k+1}}\in\mathbb{T}$ with $\mathcal{T}_{h_k}\subset\mathcal{T}_{h_{k+1}}$:

    $$\|u-u_{h_{k+1}}\|_0^2=\|u-u_{h_k}\|_0^2-\|u_{h_k}-u_{h_{k+1}}\|_0^2-2(u-u_{h_{k+1}},u_{h_{k+1}}-u_{h_k}), \tag{4.16}$$
    $$\|y-y_{h_{k+1}}\|_1^2=\|y-y_{h_k}\|_1^2-\|y_{h_k}-y_{h_{k+1}}\|_1^2-2a(y-y_{h_{k+1}},y_{h_{k+1}}-y_{h_k}), \tag{4.17}$$
    $$\|p-p_{h_{k+1}}\|_1^2=\|p-p_{h_k}\|_1^2-\|p_{h_k}-p_{h_{k+1}}\|_1^2-2a(p-p_{h_{k+1}},p_{h_{k+1}}-p_{h_k}), \tag{4.18}$$

    where $(u,y,p)$ is the solution of (2.3)–(2.5), and $(u_{h_k},y_{h_k},p_{h_k})$ and $(u_{h_{k+1}},y_{h_{k+1}},p_{h_{k+1}})$ are the solutions of (2.8)–(2.10) with respect to $\mathcal{T}_{h_k}$ and $\mathcal{T}_{h_{k+1}}$, respectively. Accordingly, we have the following quasi-orthogonality.

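    Lemma 4.4. For $\mathcal{T}_{h_k},\mathcal{T}_{h_{k+1}}\in\mathbb{T}$ with $\mathcal{T}_{h_k}\subset\mathcal{T}_{h_{k+1}}$, we have

    $$(1-\delta)\|u-u_{h_{k+1}}\|_0^2-\|u-u_{h_k}\|_0^2+\|u_{h_k}-u_{h_{k+1}}\|_0^2\le C\delta^{-1}\Big(\eta_{1,\mathcal{T}_{h_k}}^2(p_{h_k},\mathcal{R}_h)+J^2(h_0)\big(\eta_{2,\mathcal{T}_{h_k}}^2(u_{h_k},y_{h_k},\mathcal{R}_h)+\eta_{3,\mathcal{T}_{h_k}}^2(y_{h_k},p_{h_k},\mathcal{R}_h)\big)\Big), \tag{4.19}$$

    and

    $$(1-\delta)\|y-y_{h_{k+1}}\|_1^2-\|y-y_{h_k}\|_1^2+\|y_{h_k}-y_{h_{k+1}}\|_1^2-\delta\|u-u_{h_{k+1}}\|_0^2\le C\delta^{-1}\Big(\eta_{1,\mathcal{T}_{h_k}}^2(p_{h_k},\mathcal{R}_h)+J^2(h_0)\big(\eta_{2,\mathcal{T}_{h_k}}^2(u_{h_k},y_{h_k},\mathcal{R}_h)+\eta_{3,\mathcal{T}_{h_k}}^2(y_{h_k},p_{h_k},\mathcal{R}_h)\big)\Big), \tag{4.20}$$

    and

    $$(1-\delta)\|p-p_{h_{k+1}}\|_1^2-\|p-p_{h_k}\|_1^2+\|p_{h_k}-p_{h_{k+1}}\|_1^2-\delta\|y-y_{h_{k+1}}\|_1^2\le C\delta^{-1}\Big(\eta_{1,\mathcal{T}_{h_k}}^2(p_{h_k},\mathcal{R}_h)+J^2(h_0)\big(\eta_{2,\mathcal{T}_{h_k}}^2(u_{h_k},y_{h_k},\mathcal{R}_h)+\eta_{3,\mathcal{T}_{h_k}}^2(y_{h_k},p_{h_k},\mathcal{R}_h)\big)\Big). \tag{4.21}$$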

    Proof. Step 1. It follows from Lemma 4.3 in [19] that for UhkadUhk+1ad we have

    αuhk+1uhk20(phk+1phk,uhkuhk+1)+(αuhk+phk,uhkuhk+1)(Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd),uhkuhk+1)+Cη1,Thk(phk,Rh)uhkuhk+10, (4.22)

    where Rh is the set of elements which are refined from Thk to Thk+1.

    For the right-hand first item of (4.22) we let ζhH10(Ω) be the solution of the following problem based on Lemma 4.1

    a(ζh,q)=(Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd),q), qH10(Ω).

    Hence we can get

    Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd)20=a(ζh,Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd))=a(ζhζhk,Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd))+(Shk+1(f+uhk)Shk(f+uhk),ζhkζh)+(Shk+1(f+uhk)Shk(f+uhk),ζh).

    From the proof of Lemma 3.3 in [19], we infer that

    a(ζhζhk,Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd))J(h0)Shk+1(Shk+1(f+uhk)yd)Shk(Shk(f+uhk)yd)0(Shk+1(f+uhk)Shk(f+uhk)1+Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)1),

    and

    (Shk+1(f+uhk)Shk(f+uhk),ζhkζh)CJ(h0)Shk+1(f+uhk)Shk(f+uhk)0Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0,

    and

    (Shk+1(f+uhk)Shk(f+uhk),ζ)Shk+1(f+uhk)Shk(f+uhk)0Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0.

    Similar to Lemma 4.6 in [4], we derive that

    Shk+1(f+uhk)Shk(f+uhk)1Cη2,Thk(uhk,yhk,Rh),Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)1Cη3,Thk(yhk,phk,Rh).

    In connection with the above estimates we conclude that

     Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0C(J(h0)(η2,Thk(uhk,yhk,Rh)+η3,Thk(yhk,phk,Rh))+Shk+1(f+uhk)Shk(f+uhk)0).

    For the third term at the right end of the above inequality, we let φhH10(Ω) be the solution of the following problem

    a(q,φh)=(Shk+1(f+uhk)Shk(f+uhk),q), qH10(Ω).

    According to the standard duality theory, we can deduce that

    Shk+1(f+uhk)Shk(f+uhk)20=a(Shk+1(f+uhk)Shk(f+uhk),φhφhk)CJ(h0)Shk+1(f+uhk)Shk(f+uhk)1Shk+1(f+uhk)Shk(f+uhk)0,

    where φhk is the standard finite element estimate of φh with respect to VThk. So we have

    Shk+1(Shk(f+uhk)yd)Shk(Shk(f+uhk)yd)0C(J(h0)(η2,Thk(uk,yk,Rh)+η3,Thk(yhk,phk,Rh))).

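    From the above, we obtain

    $$(p_{h_{k+1}}-p_{h_k},u_{h_k}-u_{h_{k+1}})\le C\Big(J(h_0)\big(\eta_{2,\mathcal{T}_{h_k}}(u_{h_k},y_{h_k},\mathcal{R}_h)+\eta_{3,\mathcal{T}_{h_k}}(y_{h_k},p_{h_k},\mathcal{R}_h)\big)\|u_{h_k}-u_{h_{k+1}}\|_0\Big).$$

    Combining (4.22) with the above inequality, we deduce that

    $$\|u_{h_k}-u_{h_{k+1}}\|_0\le C\Big(\eta_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{R}_h)+J(h_0)\big(\eta_{2,\mathcal{T}_{h_k}}(u_{h_k},y_{h_k},\mathcal{R}_h)+\eta_{3,\mathcal{T}_{h_k}}(y_{h_k},p_{h_k},\mathcal{R}_h)\big)\Big). \tag{4.23}$$

    The desired result (4.19) now follows from (4.16) and (4.23).

    Step 2. We now prove (4.20); the proof of (4.21) is similar. We have

    $$\begin{aligned}
    \|y_{h_{k+1}}-y_{h_k}\|_0&=\|S_{h_{k+1}}(f+u_{h_{k+1}})-S_{h_k}(f+u_{h_k})\|_0\\
    &\le\|S_{h_{k+1}}(f+u_{h_{k+1}})-S_{h_{k+1}}(f+u_{h_k})\|_0+\|S_{h_{k+1}}(f+u_{h_k})-S_{h_k}(f+u_{h_k})\|_0\\
    &\le C\big(\|u_{h_k}-u_{h_{k+1}}\|_0+J(h_0)\eta_{2,\mathcal{T}_{h_k}}(u_{h_k},y_{h_k},\mathcal{R}_h)\big).
    \end{aligned} \tag{4.24}$$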

    By using the Cauchy inequality, we obtain

    2a(yyhk+1,yhk+1yhk)=2(uuhk+1,yhk+1yhk)2(ϕ(y)ϕ(yhk+1),yhk+1yhk)δuuhk+120+1δyhk+1yhk20+(˜ϕ(y)(yyhk+1),yhk+1yhk)δuuhk+120+1δyhk+1yhk20+˜ϕ(y)0yyhk+10yhk+1yhk0δuuhk+120+1δyhk+1yhk20+C(δyyhk+120+1δyhk+1yhk20). (4.25)

    It is easy to derive the desired result (4.20) with the assistance of (4.17) and (4.23)–(4.25).

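    For $\mathcal{T}_{h_k}\in\mathbb{T}$, we denote $U_{\mathcal{T}_{h_k}}^{ad}$, $V_{\mathcal{T}_{h_k}}$ and the solution $(u_{\mathcal{T}_{h_k}},y_{\mathcal{T}_{h_k}},p_{\mathcal{T}_{h_k}})$ of (2.8)–(2.10) with respect to $\mathcal{T}_{h_k}$ by $U_{h_k}^{ad}$, $V_{h_k}$ and $(u_{h_k},y_{h_k},p_{h_k})$, respectively. Before proving the convergence of Algorithm 3.1, we introduce the following notation:

    $$\begin{aligned}
    e_{h_k}^2&=\|u-u_{h_k}\|_0^2+\|y-y_{h_k}\|_1^2+\|p-p_{h_k}\|_1^2,\\
    E_{h_k}^2&=\|u_{h_k}-u_{h_{k+1}}\|_0^2+\|y_{h_k}-y_{h_{k+1}}\|_1^2+\|p_{h_k}-p_{h_{k+1}}\|_1^2,\\
    \tilde\eta_{\mathcal{T}_{h_k}}^2(\omega)&=\eta_{2,\mathcal{T}_{h_k}}^2(u_{h_k},y_{h_k},\omega)+\eta_{3,\mathcal{T}_{h_k}}^2(y_{h_k},p_{h_k},\omega),
    \end{aligned}$$

    for $\omega\subset\Omega$.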

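    Theorem 4.1. Let $(\mathcal{T}_{h_k},U^{h_k}_{ad},V_{h_k},u_{h_k},y_{h_k},p_{h_k})$ be the sequence of grids, finite element spaces and discrete solutions produced by Algorithm 3.1. Then there exist constants $\gamma_1>0$, $\gamma_2>0$ and $\alpha\in(0,1)$, depending only on the shape regularity of the initial grid $\mathcal{T}_{h_0}$, $b$, and the marking parameter $\theta\in(0,1]$, such that

    $$e_{h_{k+1}}^2+\gamma_1\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\gamma_2\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})\le\alpha\Big(e_{h_k}^2+\gamma_1\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})+\gamma_2\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\Big), \tag{4.26}$$

    provided $h_0\ll1$.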

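    Proof. From Theorem 2.1, Lemma 4.3 and Lemma 4.4, we obtain

    $$e_{h_k}^2\le C\eta^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k}), \tag{4.27}$$
    $$\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})\le(1+\delta)\big\{\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})-\lambda\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{M}_{h_k})\big\}+C\Big(1+\frac{1}{\delta}\Big)E_{h_k}^2, \tag{4.28}$$
    $$\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})\le(1+\delta_1)\big\{\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})-(1-2^{-1/2})\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{R}_{h_k})\big\}+C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\|p_{h_k}-p_{h_{k+1}}\|_1^2, \tag{4.29}$$
    $$(1-2\delta)e_{h_{k+1}}^2\le e_{h_k}^2-E_{h_k}^2+C\frac{1}{\delta}\Big(\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{R}_{h_k})+J(h_0)\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\Big), \tag{4.30}$$

    where $\mathcal{R}_{h_k}$ is the set of elements that are refined from $\mathcal{T}_{h_k}$ to $\mathcal{T}_{h_{k+1}}$. Applying the upper bound in Theorem 2.1, (4.29) can be rewritten as

    $$\begin{aligned}
    \eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})&\le(1+\delta_1)\big\{\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})-(1-2^{-1/2})\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{R}_{h_k})\big\}\\
    &\quad+C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big(\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})+\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})+\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\Big).
    \end{aligned} \tag{4.31}$$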

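    Multiplying (4.28) and (4.31) by $\tilde{\gamma}_2$ and $\tilde{\gamma}_1$, respectively, and adding the results to (4.30) yields

    $$\begin{aligned}
    &(1-2\delta)e_{h_{k+1}}^2+\tilde{\gamma}_1\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\tilde{\gamma}_2\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})\\
    &\quad\le e_{h_k}^2+\tilde{\gamma}_2(1+\delta)\big\{\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})-\lambda\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{M}_{h_k})\big\}+\tilde{\gamma}_2C\Big(1+\frac{1}{\delta}\Big)E_{h_k}^2-E_{h_k}^2\\
    &\qquad-\Big(\tilde{\gamma}_1(1+\delta_1)(1-2^{-1/2})-C\frac{1}{\delta}\Big)\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{R}_{h_k})+\tilde{\gamma}_1(1+\delta_1)\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})+C\frac{1}{\delta}J^2(h_0)\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\\
    &\qquad+\tilde{\gamma}_1C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big(\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})+\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})+\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\Big).
    \end{aligned}$$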

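    If $\tilde{\gamma}_1$ is such that

    $$\tilde{\gamma}_1(1+\delta_1)(1-2^{-1/2})-C\frac{1}{\delta}>0,$$

    and one chooses $\tilde{\gamma}_2$ such that

    $$\tilde{\gamma}_2C\Big(1+\frac{1}{\delta}\Big)=1, \tag{4.32}$$

    then we have

    $$\begin{aligned}
    &(1-2\delta)e_{h_{k+1}}^2+\tilde{\gamma}_1\Big(1-C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big)\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\Big(\tilde{\gamma}_2-\tilde{\gamma}_1C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big)\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})\\
    &\quad\le e_{h_k}^2+\tilde{\gamma}_1\Big((1+\delta_1)+C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big)\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})\\
    &\qquad+\Big(\tilde{\gamma}_2(1+\delta)+\tilde{\gamma}_1C\Big(1+\frac{1}{\delta_1}\Big)h_0^2+C\frac{1}{\delta}J^2(h_0)\Big)\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})-c\Big(\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{M}_{h_k})+\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{M}_{h_k})\Big),
    \end{aligned}$$

    where

    $$c=\min\Big\{\tilde{\gamma}_2(1+\delta)\lambda,\ \tilde{\gamma}_1(1+\delta_1)(1-2^{-1/2})-C\frac{1}{\delta}\Big\}.$$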

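    Using the marking strategy in Algorithm 3.1 and the upper bound in Theorem 2.1, we arrive at

    $$\begin{aligned}
    &(1-2\delta)e_{h_{k+1}}^2+\tilde{\gamma}_1\Big(1-C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big)\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\Big(\tilde{\gamma}_2-\tilde{\gamma}_1C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big)\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})\\
    &\quad\le(1-C\theta\beta)e_{h_k}^2+\Big(\tilde{\gamma}_1\Big((1+\delta_1)+C\Big(1+\frac{1}{\delta_1}\Big)h_0^2\Big)-c\theta(1-\beta)\Big)\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})\\
    &\qquad+\Big(\tilde{\gamma}_2(1+\delta)+\tilde{\gamma}_1C\Big(1+\frac{1}{\delta_1}\Big)h_0^2+C\frac{1}{\delta}J^2(h_0)-c\theta(1-\beta)\Big)\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k}),
    \end{aligned}$$

    where $\beta\in(0,1)$. Then we deduce that

    $$e_{h_{k+1}}^2+\gamma_1\eta^2_{1,\mathcal{T}_{h_{k+1}}}(p_{h_{k+1}},\mathcal{T}_{h_{k+1}})+\gamma_2\tilde{\eta}^2_{\mathcal{T}_{h_{k+1}}}(\mathcal{T}_{h_{k+1}})\le\alpha_1e_{h_k}^2+\alpha_2\gamma_1\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})+\alpha_3\gamma_2\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k}),$$

    where

    $$\begin{aligned}
    \alpha_1&=\frac{1-C\theta\beta}{1-2\delta},\qquad
    \gamma_1=\frac{\tilde{\gamma}_1\big(1-C(1+\delta_1^{-1})h_0^2\big)}{1-2\delta},\qquad
    \gamma_2=\frac{\tilde{\gamma}_2-\tilde{\gamma}_1C(1+\delta_1^{-1})h_0^2}{1-2\delta},\\
    \alpha_2&=\frac{\tilde{\gamma}_1\big((1+\delta_1)+C(1+\delta_1^{-1})h_0^2\big)-c\theta(1-\beta)}{\tilde{\gamma}_1\big(1-C(1+\delta_1^{-1})h_0^2\big)},\\
    \alpha_3&=\frac{\tilde{\gamma}_2(1+\delta)+\tilde{\gamma}_1C(1+\delta_1^{-1})h_0^2+C\delta^{-1}J^2(h_0)-c\theta(1-\beta)}{\tilde{\gamma}_2-\tilde{\gamma}_1C(1+\delta_1^{-1})h_0^2}.
    \end{aligned}$$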

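    As long as $\delta<1$ and $\beta$ is small enough, $\alpha_1\in(0,1)$ can be guaranteed. To make the remaining bounds easier to check, we rewrite $\alpha_2$ and $\alpha_3$ as

    $$\begin{aligned}
    \alpha_2&=\frac{\big(1-C(1+\delta_1^{-1})h_0^2\big)+\delta_1+2C(1+\delta_1^{-1})h_0^2-\dfrac{c\theta(1-\beta)}{\tilde{\gamma}_1}}{1-C(1+\delta_1^{-1})h_0^2},\\
    \alpha_3&=\frac{\tilde{\gamma}_2(1+\delta)-\tilde{\gamma}_1C(1+\delta_1^{-1})h_0^2+2\tilde{\gamma}_1C(1+\delta_1^{-1})h_0^2+C\delta^{-1}J^2(h_0)-c\theta(1-\beta)}{\tilde{\gamma}_2-\tilde{\gamma}_1C(1+\delta_1^{-1})h_0^2}.
    \end{aligned}$$

    It is clear that

    $$\alpha_2\in(0,1),$$

    if $h_0\ll1$ and $\delta_1$ is sufficiently small. Next, we use (4.32) to deduce that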

    $$\gamma_2=\frac{\delta}{2C(1+\delta)},$$

    which implies

    $$\alpha_3\in(0,1),$$

    if $h_0\ll1$ and $\delta$ is sufficiently small. Therefore, choosing $\alpha=\max\{\alpha_1,\alpha_2,\alpha_3\}$, we obtain the desired result.

    Theorem 4.2. Let $(\mathcal{T}_{h_k},U^{h_k}_{ad},V_{h_k},u_{h_k},y_{h_k},p_{h_k})$ be the sequence of grids, finite element spaces and discrete solutions produced by Algorithm 3.1, and let the conditions of Theorem 4.1 hold. Then we have

    $$\|u-u_{h_k}\|_0^2+\|y-y_{h_k}\|_1^2+\|p-p_{h_k}\|_1^2\to0\quad\text{as }k\to\infty.$$

    Proof. This follows directly from Theorem 2.1 and Theorem 4.1.
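For completeness, the step from the contraction (4.26) to the limit can be sketched as follows. The shorthand $Q_k$ for the quasi-error is introduced here only for illustration and does not appear in the original text; the argument uses only $\alpha\in(0,1)$ and $\gamma_1,\gamma_2>0$ from Theorem 4.1.

```latex
Q_k := e_{h_k}^2
      + \gamma_1\,\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})
      + \gamma_2\,\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k}),
\qquad
e_{h_k}^2 \;\le\; Q_k \;\le\; \alpha\,Q_{k-1} \;\le\; \dots \;\le\; \alpha^{k}\,Q_0
\;\longrightarrow\; 0 \quad (k\to\infty).
```

The first inequality holds because the two estimator terms are non-negative, and the remaining ones are repeated applications of (4.26).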

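    In this section we consider the quasi-optimality of the adaptive finite element method. We first fix some notation. For $\mathcal{T}_h,\mathcal{T}_{h_1},\mathcal{T}_{h_2}\in\mathbb{T}$, let $\#\mathcal{T}_h$ be the number of elements in $\mathcal{T}_h$, and let $\mathcal{T}_{h_1}\oplus\mathcal{T}_{h_2}$ be the smallest common conforming refinement of $\mathcal{T}_{h_1}$ and $\mathcal{T}_{h_2}$, which satisfies [4,27]

    $$\#(\mathcal{T}_{h_1}\oplus\mathcal{T}_{h_2})\le\#\mathcal{T}_{h_1}+\#\mathcal{T}_{h_2}-\#\mathcal{T}_{h_0}. \tag{5.1}$$

    Following [19], we define the approximation class

    $$\mathcal{A}^s:=\big\{(u,y,p,y_d,f)\in L^2(\Omega)\times H_0^1(\Omega)\times H_0^1(\Omega)\times L^2(\Omega)\times L^2(\Omega):|(u,y,p,y_d,f)|_s<+\infty\big\},$$

    where

    $$|(u,y,p,y_d,f)|_s:=\sup_{N>0}N^s\inf_{\mathcal{T}_h\in\mathbb{T}_N}\ \inf_{(u_h,y_h,p_h)\in U^h_{ad}\times V_h\times V_h}\Big\{\|u-u_h\|_0^2+\|y-y_h\|_1^2+\|p-p_h\|_1^2+\mathrm{osc}^2_{\mathcal{T}_h}(f,\mathcal{T}_h)+\mathrm{osc}^2_{\mathcal{T}_h}(y_h-y_d,\mathcal{T}_h)\Big\}^{\frac12},$$

    and

    $$\mathbb{T}_N:=\{\mathcal{T}_h\in\mathbb{T}:\#\mathcal{T}_h-\#\mathcal{T}_{h_0}\le N\}.$$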

    To illustrate the quasi-optimality of the adaptive finite element method, we need a local upper bound on the distance between nested solutions [4], since the error of this method can only be estimated by using the indicators of refined elements without a buffer layer.

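    Lemma 5.1. For $\mathcal{T}_h,\tilde{\mathcal{T}}_h\in\mathbb{T}$ with $\mathcal{T}_h\subset\tilde{\mathcal{T}}_h$, let $\mathcal{R}_h$ be the set of refined elements from $\mathcal{T}_h$ to $\tilde{\mathcal{T}}_h$. Let $(u_h,y_h,p_h)$ and $(\tilde{u},\tilde{y},\tilde{p})$ be the solutions of (2.8)–(2.10) with respect to $\mathcal{T}_h$ and $\tilde{\mathcal{T}}_h$, respectively. Then there exists a constant $C$, depending on the shape regularity of the initial grid $\mathcal{T}_{h_0}$ and $b$, such that

    $$\|u_h-\tilde{u}\|_0^2+\|y_h-\tilde{y}\|_1^2+\|p_h-\tilde{p}\|_1^2\le C\eta^2_{\mathcal{T}_h}(\mathcal{R}_h), \tag{5.2}$$

    where

    $$\eta^2_{\mathcal{T}_h}(\mathcal{R}_h)=\eta^2_{1,\mathcal{T}_h}(p_h,\mathcal{R}_h)+\eta^2_{2,\mathcal{T}_h}(u_h,y_h,\mathcal{R}_h)+\eta^2_{3,\mathcal{T}_h}(y_h,p_h,\mathcal{R}_h).$$

    Proof. From (4.23) of Lemma 4.4, we have

    $$\|u_h-\tilde{u}\|_0^2\lesssim\eta^2_{\mathcal{T}_h}(\mathcal{R}_h). \tag{5.3}$$

    By Lemma 4.6 in [4], we deduce that

    $$\begin{aligned}
    \|y_h-\tilde{y}\|_1&=\|S_h(f+u_h)-S_{\tilde{h}}(f+\tilde{u})\|_1\\
    &\le\|S_h(f+u_h)-S_{\tilde{h}}(f+u_h)\|_1+\|S_{\tilde{h}}(f+u_h)-S_{\tilde{h}}(f+\tilde{u})\|_1\\
    &\le C\big(\eta_{2,\mathcal{T}_h}(u_h,y_h,\mathcal{R}_h)+\|u_h-\tilde{u}\|_0\big).
    \end{aligned} \tag{5.4}$$

    Similarly, we obtain

    $$\|p_h-\tilde{p}\|_1=\|S_h(S_h(f+u_h)-y_d)-S_{\tilde{h}}(S_{\tilde{h}}(f+\tilde{u})-y_d)\|_1\le C\big(\eta_{3,\mathcal{T}_h}(y_h,p_h,\mathcal{R}_h)+\|y_h-\tilde{y}\|_1\big). \tag{5.5}$$

    Combining (5.3), (5.4) and (5.5) yields (5.2).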

    Dörfler introduced a crucial marking strategy and proved strict energy error reduction for the Laplacian, provided the initial grid $\mathcal{T}_{h_0}$ satisfies a mild assumption [9]. If the sum of the errors satisfies a suitable error reduction, then the error indicators on the coarse grid must satisfy a Dörfler property on the set of refined elements [19].
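The Dörfler property asks that the marked (or refined) elements carry at least a fraction $\theta$ of the total squared estimator. A minimal sketch of such a marking step is given below, assuming the squared element indicators are available as a plain list. The function name `dorfler_mark` and the greedy largest-first selection are illustrative assumptions and are not taken from Algorithm 3.1.

```python
def dorfler_mark(eta_sq, theta):
    """Greedy Doerfler marking (illustrative sketch).

    eta_sq : list of squared element indicators, one per element
    theta  : marking parameter in (0, 1]
    Returns indices M such that sum(eta_sq[i] for i in M) >= theta * sum(eta_sq).
    """
    if not 0.0 < theta <= 1.0:
        raise ValueError("theta must lie in (0, 1]")
    total = sum(eta_sq)
    # Visit elements from the largest indicator to the smallest, which keeps
    # the marked set small.
    order = sorted(range(len(eta_sq)), key=lambda i: eta_sq[i], reverse=True)
    marked, acc = [], 0.0
    for i in order:
        if acc >= theta * total:
            break
        marked.append(i)
        acc += eta_sq[i]
    return marked


if __name__ == "__main__":
    # Hypothetical indicator values, chosen only to exercise the function.
    print(dorfler_mark([4.0, 1.0, 3.0, 2.0], theta=0.5))
```

With $\theta=1$ every element with a nonzero indicator is marked, which corresponds to the uniform refinement used for comparison in the numerical examples.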

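    Lemma 5.2. Assume that the marking parameter $\theta\in(0,\theta_*)$, where

    $$\theta_*=\frac{C}{2C(1+h_0^4)+1}.$$

    For $\mathcal{T}_h,\tilde{\mathcal{T}}_h\in\mathbb{T}$ with $\mathcal{T}_h\subset\tilde{\mathcal{T}}_h$, let $(u_h,y_h,p_h)$ and $(\tilde{u},\tilde{y},\tilde{p})$ be the solutions of (2.8)–(2.10) with respect to $\mathcal{T}_h$ and $\tilde{\mathcal{T}}_h$, respectively. If

    $$e^2_{\tilde{\mathcal{T}}_h}+\mathrm{osc}^2_{\tilde{\mathcal{T}}_h}(\tilde{\mathcal{T}}_h)\le\mu\big[e^2_{\mathcal{T}_h}+\mathrm{osc}^2_{\mathcal{T}_h}(\mathcal{T}_h)\big] \tag{5.6}$$

    is satisfied for $\mu:=\frac12\big(1-\frac{\theta}{\theta_*}\big)$, then the set $\mathcal{R}_h$ of elements that are refined from $\mathcal{T}_h$ to $\tilde{\mathcal{T}}_h$ satisfies the Dörfler property

    $$\eta^2_{\mathcal{T}_h}(\mathcal{R}_h)\ge\theta\,\eta^2_{\mathcal{T}_h}(\mathcal{T}_h),$$

    where

    $$e^2_{\mathcal{T}_h}=\|u-u_h\|_0^2+\|y-y_h\|_1^2+\|p-p_h\|_1^2,\qquad\mathrm{osc}^2_{\mathcal{T}_h}(\omega)=\mathrm{osc}^2_{\mathcal{T}_h}(f,\omega)+\mathrm{osc}^2_{\mathcal{T}_h}(y_h-y_d,\omega),$$

    for $\omega\subset\mathcal{T}_h$, and $e^2_{\tilde{\mathcal{T}}_h}$, $\mathrm{osc}^2_{\tilde{\mathcal{T}}_h}(\tilde{\mathcal{T}}_h)$ are defined similarly.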

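    Proof. By the lower bound in Theorem 2.1 and (5.6), we obtain

    $$(1-2\mu)C\eta^2_{\mathcal{T}_h}(\mathcal{T}_h)\le(1-2\mu)\big(e^2_{\mathcal{T}_h}+\mathrm{osc}^2_{\mathcal{T}_h}(\mathcal{T}_h)\big)\le e^2_{\mathcal{T}_h}-2e^2_{\tilde{\mathcal{T}}_h}+\mathrm{osc}^2_{\mathcal{T}_h}(\mathcal{T}_h)-2\,\mathrm{osc}^2_{\tilde{\mathcal{T}}_h}(\tilde{\mathcal{T}}_h).$$

    The following elementary inequalities hold:

    $$\begin{aligned}
    \|u-u_h\|_0^2&\le2\|u-\tilde{u}\|_0^2+2\|u_h-\tilde{u}\|_0^2,\\
    \|y-y_h\|_1^2&\le2\|y-\tilde{y}\|_1^2+2\|y_h-\tilde{y}\|_1^2,\\
    \|p-p_h\|_1^2&\le2\|p-\tilde{p}\|_1^2+2\|p_h-\tilde{p}\|_1^2.
    \end{aligned}$$

    Hence, from Lemma 5.1 we get

    $$e^2_{\mathcal{T}_h}-2e^2_{\tilde{\mathcal{T}}_h}\le2C\eta^2_{\mathcal{T}_h}(\mathcal{R}_h). \tag{5.7}$$

    For $T\in\mathcal{T}_h\cap\tilde{\mathcal{T}}_h$, we get the following result from (4.14) of Lemma 4.3:

    $$\mathrm{osc}^2_{\mathcal{T}_h}(y_h-y_d,\mathcal{T}_h\cap\tilde{\mathcal{T}}_h)-2\,\mathrm{osc}^2_{\tilde{\mathcal{T}}_h}(\tilde{y}-y_d,\mathcal{T}_h\cap\tilde{\mathcal{T}}_h)\le2C\big(h_0^4\eta^2_{\mathcal{T}_h}(\mathcal{R}_h)\big). \tag{5.8}$$

    According to Remark 2.1 in [4], the indicator $\eta_{\mathcal{T}_h}(T)$ dominates the oscillation $\mathrm{osc}_{\mathcal{T}_h}(T)$, that is,

    $$\mathrm{osc}^2_{\mathcal{T}_h}(T)\le\eta^2_{\mathcal{T}_h}(T), \tag{5.9}$$

    for all $T\in\mathcal{R}_h$. Combining (5.7), (5.8) and (5.9), one obtains

    $$\begin{aligned}
    (1-2\mu)C\eta^2_{\mathcal{T}_h}(\mathcal{T}_h)&\le\big(2C(1+h_0^4)+1\big)\eta^2_{\mathcal{T}_h}(\mathcal{R}_h),\\
    (1-2\mu)\theta_*\eta^2_{\mathcal{T}_h}(\mathcal{T}_h)&\le\eta^2_{\mathcal{T}_h}(\mathcal{R}_h),\\
    \theta\,\eta^2_{\mathcal{T}_h}(\mathcal{T}_h)&\le\eta^2_{\mathcal{T}_h}(\mathcal{R}_h),
    \end{aligned}$$

    where

    $$\theta_*=\frac{C}{2C(1+h_0^4)+1},\quad\text{and}\quad\theta=(1-2\mu)\theta_*.$$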

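    Lemma 5.3. Let $(u,y,p)$ be the solution of (2.3)–(2.5) and let $(\mathcal{T}_{h_k},U^{h_k}_{ad},V_{h_k},u_{h_k},y_{h_k},p_{h_k})$ be the sequence of grids, finite element spaces and discrete solutions produced by Algorithm 3.1. Assume that the marking parameter $\theta$ satisfies the condition in Lemma 5.2. Then the following estimate is valid:

    $$\#\mathcal{M}_{h_k}\le C\Big(N^{\frac{1}{2s}}|(u,y,p,y_d,f)|_s^{\frac1s}\mu^{-\frac{1}{2s}}\big(e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\big)^{-\frac{1}{2s}}\Big), \tag{5.10}$$

    if $(u,y,p,y_d,f)\in\mathcal{A}^s$.

    Proof. Let $\epsilon^2:=\mu N^{-1}\big(e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\big)$, where the constant $N$ is determined in the proof of (5.13) and $\mu$ is defined in Lemma 5.2. Because $(u,y,p,y_d,f)\in\mathcal{A}^s$, there exist a grid $\mathcal{T}_{h_\epsilon}\in\mathbb{T}$ and a triple $(u_{h_\epsilon},y_{h_\epsilon},p_{h_\epsilon})\in U^{h_\epsilon}_{ad}\times V_{h_\epsilon}\times V_{h_\epsilon}$ such that

    $$\#\mathcal{T}_{h_\epsilon}-\#\mathcal{T}_{h_0}\le|(u,y,p,y_d,f)|_s^{1/s}\epsilon^{-1/s}, \tag{5.11}$$

    and

    $$\|u-u_{h_\epsilon}\|_0^2+\|y-y_{h_\epsilon}\|_1^2+\|p-p_{h_\epsilon}\|_1^2+\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(f,\mathcal{T}_{h_\epsilon})+\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(y_{h_\epsilon}-y_d,\mathcal{T}_{h_\epsilon})\le\epsilon^2. \tag{5.12}$$

    Let $(u_{h_*},y_{h_*},p_{h_*})$ be the solution of (2.8)–(2.10) with respect to $\mathcal{T}_{h_*}$, where $\mathcal{T}_{h_*}=\mathcal{T}_{h_\epsilon}\oplus\mathcal{T}_{h_k}$ is the smallest common refinement of $\mathcal{T}_{h_\epsilon}$ and $\mathcal{T}_{h_k}$. We first prove the inequality

    $$e_{h_*}^2+\mathrm{osc}^2_{\mathcal{T}_{h_*}}(\mathcal{T}_{h_*})\le N\big(e^2_{\mathcal{T}_{h_\epsilon}}+\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(\mathcal{T}_{h_\epsilon})\big), \tag{5.13}$$

    where

    $$e^2_{\mathcal{T}_{h_\epsilon}}:=\|u-u_{h_\epsilon}\|_0^2+\|y-y_{h_\epsilon}\|_1^2+\|p-p_{h_\epsilon}\|_1^2,\qquad\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(\mathcal{T}_{h_\epsilon}):=\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(f,\mathcal{T}_{h_\epsilon})+\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(y_{h_\epsilon}-y_d,\mathcal{T}_{h_\epsilon}).$$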

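    Adding and subtracting the solution on $\mathcal{T}_{h_*}$, we have

    $$\begin{aligned}
    \|u-u_{h_\epsilon}\|_0^2&=\|u-u_{h_*}\|_0^2+\|u_{h_\epsilon}-u_{h_*}\|_0^2+2(u-u_{h_*},u_{h_*}-u_{h_\epsilon}),\\
    \|y-y_{h_\epsilon}\|_1^2&=\|y-y_{h_*}\|_1^2+\|y_{h_\epsilon}-y_{h_*}\|_1^2+2a(y-y_{h_*},y_{h_*}-y_{h_\epsilon}),\\
    \|p-p_{h_\epsilon}\|_1^2&=\|p-p_{h_*}\|_1^2+\|p_{h_\epsilon}-p_{h_*}\|_1^2+2a(p-p_{h_*},p_{h_*}-p_{h_\epsilon}).
    \end{aligned}$$

    With the help of Young's inequality, one obtains

    $$\begin{aligned}
    (u-u_{h_*},u_{h_*}-u_{h_\epsilon})&=(u-u_{h_\epsilon},u_{h_*}-u_{h_\epsilon})-(u_{h_*}-u_{h_\epsilon},u_{h_*}-u_{h_\epsilon})\\
    &\le(u-u_{h_\epsilon},u_{h_*}-u_{h_\epsilon})\le\|u-u_{h_\epsilon}\|_0^2+\frac14\|u_{h_*}-u_{h_\epsilon}\|_0^2,
    \end{aligned}$$

    and the same holds for $a(y-y_{h_*},y_{h_*}-y_{h_\epsilon})$ and $a(p-p_{h_*},p_{h_*}-p_{h_\epsilon})$. Combining these estimates, we deduce that

    $$\|u-u_{h_*}\|_0^2+\|u_{h_*}-u_{h_\epsilon}\|_0^2+\|y-y_{h_*}\|_1^2+\|y_{h_*}-y_{h_\epsilon}\|_1^2+\|p-p_{h_*}\|_1^2+\|p_{h_*}-p_{h_\epsilon}\|_1^2\le6\big(\|u-u_{h_\epsilon}\|_0^2+\|y-y_{h_\epsilon}\|_1^2+\|p-p_{h_\epsilon}\|_1^2\big). \tag{5.14}$$

    From Remark 2.1 in [4] and (4.14) in Lemma 4.3 with $\mathcal{T}_h=\tilde{\mathcal{T}}_h=\mathcal{T}_{h_*}$, $y_h=y_{h_*}$ and $\tilde{y}=y_{h_\epsilon}$, we obtain that

    $$\begin{aligned}
    \mathrm{osc}^2_{\mathcal{T}_{h_*}}(y_{h_*}-y_d,\mathcal{T}_{h_*})-2\,\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(y_{h_\epsilon}-y_d,\mathcal{T}_{h_\epsilon})&\le\mathrm{osc}^2_{\mathcal{T}_{h_*}}(y_{h_*}-y_d,\mathcal{T}_{h_*})-2\,\mathrm{osc}^2_{\mathcal{T}_{h_*}}(y_{h_\epsilon}-y_d,\mathcal{T}_{h_*})\\
    &\le2Nh_0^4\|y_{h_*}-y_{h_\epsilon}\|_1^2.
    \end{aligned} \tag{5.15}$$

    For any $T\in\mathcal{T}_{h_\epsilon}$, let $\mathcal{T}^T_{h_*}:=\{T'\in\mathcal{T}_{h_*}:T'\subset T\}$. From the proof of Lemma 4.3 in [19], we derive that

    $$\sum_{T'\in\mathcal{T}^T_{h_*}}\|f-f_{T'}\|^2_{0,T'}\le N\|f-f_T\|^2_{0,T}.$$

    It follows that

    $$\mathrm{osc}^2_{\mathcal{T}_{h_*}}(f,\mathcal{T}_{h_*})\le N\big(\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(f,\mathcal{T}_{h_\epsilon})\big). \tag{5.16}$$

    Combining (5.14)–(5.16) gives (5.13). Using (5.12) and the definition of $\epsilon^2$, we then have

    $$e_{h_*}^2+\mathrm{osc}^2_{\mathcal{T}_{h_*}}(\mathcal{T}_{h_*})\le N\big(e^2_{\mathcal{T}_{h_\epsilon}}+\mathrm{osc}^2_{\mathcal{T}_{h_\epsilon}}(\mathcal{T}_{h_\epsilon})\big)\le N\epsilon^2=\mu\big(e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\big).$$

    Lemma 5.2 then yields

    $$\#\mathcal{M}_{h_k}\le\#\mathcal{R}_h\le\#\mathcal{T}_{h_*}-\#\mathcal{T}_{h_k}\le\#\mathcal{T}_{h_\epsilon}-\#\mathcal{T}_{h_0}. \tag{5.17}$$

    Combining (5.11), (5.17) and the definition of $\epsilon^2$, we derive the desired result (5.10).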

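    The following theorem is a consequence of the previous estimates: it bounds the number of elements in terms of the error. Namely, if the exact solution can be approximated at a certain convergence rate, then the sequence of grids constructed by the adaptive method achieves this rate up to a constant factor.

    Theorem 5.1. Let $(u,y,p)$ be the solution of (2.3)–(2.5) and let $(\mathcal{T}_{h_k},U^{h_k}_{ad},V_{h_k},u_{h_k},y_{h_k},p_{h_k})$ be the sequence of grids, finite element spaces and discrete solutions produced by Algorithm 3.1. Assume that $\mathcal{T}_{h_0}$ satisfies the condition (b) of Section 4 in [27]. Let $(u,y,p,y_d,f)\in\mathcal{A}^s$. Then we have

    $$\#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\le C|(u,y,p,y_d,f)|_s^{\frac1s}\big(e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\big)^{-\frac{1}{2s}}, \tag{5.18}$$

    provided $h_0\ll1$.

    Proof. It follows from Theorem 2.1 and Lemma 5.3 that

    $$e_{h_k}^2+\gamma_1\eta^2_{1,\mathcal{T}_{h_k}}(p_{h_k},\mathcal{T}_{h_k})+\gamma_2\tilde{\eta}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\approx e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k}). \tag{5.19}$$

    From Lemma 2.3 in [4], we have

    $$\#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\le C\sum_{i=0}^{k-1}\#\mathcal{M}_{h_i}. \tag{5.20}$$

    Using Lemma 5.3 and (5.20), we obtain

    $$\#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\le C\sum_{i=0}^{k-1}\#\mathcal{M}_{h_i}\le C\Big(M\sum_{i=0}^{k-1}\big(e_{h_i}^2+\mathrm{osc}^2_{\mathcal{T}_{h_i}}(\mathcal{T}_{h_i})\big)^{-\frac{1}{2s}}\Big), \tag{5.21}$$

    where

    $$M=N^{\frac{1}{2s}}|(u,y,p,y_d,f)|_s^{\frac1s}\alpha^{-\frac{1}{2s}}.$$

    Then it follows from (5.19), (5.21) and Theorem 4.1 that

    $$\#\mathcal{T}_{h_k}-\#\mathcal{T}_{h_0}\le C\Big(M\big(e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\big)^{-\frac{1}{2s}}\sum_{i=1}^{k}\alpha^{\frac{i}{2s}}\Big)\le C|(u,y,p,y_d,f)|_s^{\frac1s}\big(e_{h_k}^2+\mathrm{osc}^2_{\mathcal{T}_{h_k}}(\mathcal{T}_{h_k})\big)^{-\frac{1}{2s}},$$

    which completes the proof of Theorem 5.1.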
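The final estimate in the proof rests on a geometric series argument, which can be sketched as follows. The shorthand $Q_i$ for the quasi-error at level $i$ is introduced here only for illustration; the sketch assumes the contraction (4.26) with $\alpha\in(0,1)$ and the equivalence (5.19).

```latex
Q_i := e_{h_i}^2
      + \gamma_1\,\eta^2_{1,\mathcal{T}_{h_i}}(p_{h_i},\mathcal{T}_{h_i})
      + \gamma_2\,\tilde{\eta}^2_{\mathcal{T}_{h_i}}(\mathcal{T}_{h_i}),
\qquad
Q_k \le \alpha^{\,k-i} Q_i
\;\Longrightarrow\;
Q_i^{-\frac{1}{2s}} \le \alpha^{\frac{k-i}{2s}}\, Q_k^{-\frac{1}{2s}},

\sum_{i=0}^{k-1} Q_i^{-\frac{1}{2s}}
\;\le\; Q_k^{-\frac{1}{2s}} \sum_{j=1}^{k} \alpha^{\frac{j}{2s}}
\;\le\; \frac{\alpha^{\frac{1}{2s}}}{1-\alpha^{\frac{1}{2s}}}\; Q_k^{-\frac{1}{2s}} .
```

The sum is therefore bounded independently of $k$, and the resulting factor can be absorbed into the constant $C$.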

    In this section, we present numerical experiments with the adaptive finite element method and the adaptive iteration of Algorithm 3.1 in order to support our theoretical results.

    Example 1. We consider the nonlinear optimal control problem governed by the nonlinear elliptic state and adjoint equations

    $$-\Delta y+y^3=f+u,\qquad-\Delta p+3y^2p=y-y_d,$$

    where we choose $\alpha=1$ and $\Omega=[0,1]\times[0,1]$, with the exact solution

    $$u=\frac{1}{\alpha}\big(\max(0,\bar{p})-p\big),\qquad y=\sin(\pi x_1)+\sin(\pi x_2),\qquad p=y.$$

    A simple calculation gives $\int_\Omega p\,dx=\frac{4}{\pi}$, which guarantees $u\in U_{ad}$.
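The value of the integral can be checked numerically. The sketch below assumes $p=y=\sin(\pi x_1)+\sin(\pi x_2)$ on the unit square, exactly as printed above, and uses a composite midpoint rule; the function name `integral_of_adjoint` and the grid size are illustrative choices.

```python
import math


def integral_of_adjoint(n=200):
    """Midpoint-rule approximation of the integral of
    p(x1, x2) = sin(pi*x1) + sin(pi*x2) over the unit square [0,1] x [0,1]."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x1 = (i + 0.5) * h  # midpoint of the i-th cell in x1
        for j in range(n):
            x2 = (j + 0.5) * h  # midpoint of the j-th cell in x2
            total += math.sin(math.pi * x1) + math.sin(math.pi * x2)
    return total * h * h


if __name__ == "__main__":
    approx = integral_of_adjoint()
    print(approx, 4.0 / math.pi)  # the two values should agree to several digits
```

Each of the two sine terms contributes $2/\pi$, which is where the value $4/\pi$ comes from.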

    We run 15 adaptive loops for Example 1 and plot the profiles of the exact state and the numerical state on adaptively refined grids with $\theta=0.5$ in Figure 1. The solution is smooth but has larger gradients in some regions; hence, compared with uniform refinement, the adaptive finite element method yields a smaller error. In Figure 2, we show the triangular grids after 6 and 12 adaptive iterations of Algorithm 3.1.

    Figure 1.  The exact state (left) and the numerical state (right) for Example 1.
    Figure 2.  The adaptive grids after 6 steps (left) and 12 steps (right) for Example 1.

    In Figure 3, we plot the convergence history of the errors for adaptive refinement ($\theta=0.5$, left) and uniform refinement ($\theta=1$, right). Compared against the reference line of slope $-1$, which is the optimal convergence rate for linear finite elements, the error reduction is clearly visible.

    Figure 3.  Comparison of the error estimates on adaptively (left) and uniformly (right) refined grids for Example 1.
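A convergence rate of this kind is usually read off as the slope of the error curve in a log-log plot against the number of degrees of freedom. The following sketch estimates that slope by a least-squares fit. The function name `loglog_slope` is hypothetical, and the data in the usage example are synthetic numbers constructed to decay like $N^{-1}$; they are not results from the paper.

```python
import math


def loglog_slope(dofs, errors):
    """Least-squares slope of log(error) against log(dofs)."""
    xs = [math.log(n) for n in dofs]
    ys = [math.log(e) for e in errors]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


if __name__ == "__main__":
    # Synthetic data only: errors proportional to 1/N, so the fitted slope is -1.
    dofs = [100, 400, 1600, 6400]
    errors = [3.0 / n for n in dofs]
    print(loglog_slope(dofs, errors))
```

A fitted slope close to $-1$ on the adaptive grids, and a flatter slope on uniform grids, would correspond to the behaviour described for the figures.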

    Example 2. We consider the same nonlinear optimal control problem as in Example 1 with $\alpha=0.1$, $\Omega=(-1,1)\times(0,1)\cup(-1,0)\times(-1,0]$ and the exact solution

    $$p=\begin{cases}5\times10^{10}e^{\frac1m},&m<0,\\0,&m\ge0,\end{cases}\qquad y=p,\qquad m=(x_1-0.2)^2+(x_2-0.6)^2-0.04,$$

    where $u\in U_{ad}$ can be guaranteed.

    As for Example 1, we provide plots for Example 2, here obtained with 21 adaptive loops and $\theta=0.5$. Figure 4 shows that the solution is smooth while large gradients occur in some regions, so that adaptive refinement yields smaller errors than uniform refinement; the error plots confirm this. In Figure 5, the left plot shows the error estimates for adaptive refinement ($\theta=0.5$) and the right plot shows the error estimates for uniform refinement ($\theta=1$). With slope $-1$ being the expected optimal convergence rate, the error reduction is visible in Figure 5. Moreover, the curves of the total error and of the error estimators approach straight lines of slope $-1$ and are roughly parallel, which shows that the a posteriori error estimates obtained in Section 2 are reliable.

    Figure 4.  The exact state (upper-middle), the numerical state (left) and the adjoint state (right) for Example 2.
    Figure 5.  Comparison of the error estimates on adaptively (left) and uniformly (right) refined grids for Example 2.

    In Figure 6, we show the adaptive grids after 9 and 19 adaptive iterations for Example 2 (out of 21 adaptive loops with $\theta=0.5$). The grids are concentrated in the regions where the solutions have larger gradients. We also note that reduced convergence orders are observed for uniform refinement because of the singularity of the solutions.

    Figure 6.  The adaptive grids after 9 steps (left) and 19 steps (right) for Example 2.
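To get a feel for the exact solution of Example 2, the sketch below evaluates the bump function as printed above, that is, $5\times10^{10}e^{1/m}$ for $m<0$ and $0$ otherwise, with $m=(x_1-0.2)^2+(x_2-0.6)^2-0.04$. The function name `adjoint_bump` is hypothetical, and the sign convention is taken from the formula as it appears in the text.

```python
import math


def adjoint_bump(x1, x2):
    """Evaluate the Example 2 bump: 5e10 * exp(1/m) inside the disc m < 0, else 0."""
    m = (x1 - 0.2) ** 2 + (x2 - 0.6) ** 2 - 0.04
    if m < 0.0:
        # Inside the disc of radius 0.2 around (0.2, 0.6); 1/m <= -25 here,
        # so the exponential is tiny and the prefactor rescales it to O(1).
        return 5.0e10 * math.exp(1.0 / m)
    return 0.0


if __name__ == "__main__":
    print(adjoint_bump(0.2, 0.6))  # value at the centre of the disc
    print(adjoint_bump(0.5, 0.6))  # outside the disc, so the value is 0.0
```

The function is supported on a disc of radius 0.2 and decays to zero smoothly at its boundary, so the steep variation is confined to a small part of the domain, which is the region where the adaptive grids are expected to concentrate.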

    In this paper, we study the adaptive finite element method for nonlinear optimal control problems and give the corresponding adaptive algorithm. To analyze the method, we derive a posteriori error estimates with upper and lower bounds for the nonlinear elliptic equations; these bounds are the key ingredients for establishing convergence and optimality. We then prove that the sum of the errors of the control, state and adjoint state variables converges, as shown in Theorem 4.2. Based on the local upper bound, we prove the quasi-optimality of the proposed adaptive algorithm, see Section 5. Finally, we provide numerical simulations to verify our theoretical analysis. Previous research papers studied finite element methods for linear optimal control problems; our contribution is to extend these methods to a class of nonlinear optimal control problems.

    Many problems remain open, such as $L^2$–$L^2$ a posteriori error estimates for nonlinear elliptic equations, as well as convergence and quasi-optimality for nonlinear parabolic equations. Furthermore, we note that the analysis in this paper can be generalized to common nonlinear parabolic problems and boundary problems, and we will work on these problems in the future.

    This work is supported by National Science Foundation of China (11201510), National Social Science Fund of China (19BGL190), Chongqing Research Program of Basic Research and Frontier Technology (cstc2019jcyj-msxmX0280), General project of Chongqing Natural Science Foundation (cstc2021jcyj-msxmX0949), Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJZD-K20200120), Chongqing Key Laboratory of Water Environment Evolution and Pollution Control in Three Gorges Reservoir Area (WEPKL2018YB-04), Science and technology research project of Chongqing Education Commission (KJQN202101212), Research Center for Sustainable Development of Three Gorges Reservoir Area (2019sxxyjd07), Guangdong Basic and Applied Basic Research Foundation of Joint Fund Project (2021A1515111048), and Guangdong Province Characteristic Innovation Project (2021WTSCX120).

    The authors declare that they have no competing interests.



    [1] Y. Sun, Z. Hu, H. Triki, M. Mirzazadeh, W. Liu, A. Biswas, et al., Analytical study of three-soliton interactions with different phases in nonlinear optics, Nonlinear Dyn., 111 (2023), 18391–18400. https://doi.org/10.1007/s11071-023-08786-z doi: 10.1007/s11071-023-08786-z
    [2] Y. Zhong, K. Yu, Y. Sun, H. Triki, Q. Zhou, Stability of solitons in Bose–Einstein condensates with cubic–quintic–septic nonlinearity and non-PT-symmetric complex potentials, Eur. Phys. J. Plus, 139 (2024), 119. https://doi.org/10.1140/epjp/s13360-024-04930-9 doi: 10.1140/epjp/s13360-024-04930-9
    [3] Q. Li, W. Zou, Normalized ground states for Sobolev critical nonlinear Schrödinger equation in the L2-supercritical case, DCDS, 44 (2024), 205–227. https://doi.org/10.3934/dcds.2023101 doi: 10.3934/dcds.2023101
    [4] S. Malik, S. Kumar, Pure-cubic optical soliton perturbation with full nonlinearity by a new generalized approach, Optik, 258 (2022), 168865. https://doi.org/10.1016/j.ijleo.2022.168865 doi: 10.1016/j.ijleo.2022.168865
    [5] Q. Li, J. Nie, W. Wang, J. Zhou, Normalized solutions for Sobolev critical fractional Schrödinger equation, Adv. Nonlinear Anal., 13 (2024), 20240027. https://doi.org/10.1515/anona-2024-0027 doi: 10.1515/anona-2024-0027
    [6] W. Zhang, J. Zhang, V. Rădulescu, Semiclassical states for the pseudo-relativistic Schrödinger equation with competing potentials, Commun. Math. Sci., 23 (2025), 465–507.
    [7] S. Shen, Z. J. Yang, Z. G. Pang, Y. R. Ge, The complex-valued astigmatic cosine-Gaussian soliton solution of the nonlocal nonlinear Schrödinger equation and its transmission characteristics, Appl. Math. Lett., 125 (2022), 107755. https://doi.org/10.1016/j.aml.2021.107755 doi: 10.1016/j.aml.2021.107755
    [8] M. A. E. Abdelrahman, A. Alharbi, M. B. Almatrafi, Fundamental solutions for the generalised third-order nonlinear Schrödinger equation, Int. J. Appl. Comput. Math., 6 (2020), 160. https://doi.org/10.1007/s40819-020-00906-2 doi: 10.1007/s40819-020-00906-2
    [9] W. B. Rabie, H. M. Ahmed, Construction cubic-quartic solitons in optical metamaterials for the perturbed twin-core couplers with Kudryashov's sextic power law using extended F-expansion method, Chaos Soliton Fract., 160 (2022), 112289. https://doi.org/10.1016/j.chaos.2022.112289 doi: 10.1016/j.chaos.2022.112289
    [10] I. Onder, A. Secer, M. Ozisik, M. Bayram, Investigation of optical soliton solutions for the perturbed Gerdjikov-Ivanov equation with full-nonlinearity, Heliyon, 9 (2023), e13519. https://doi.org/10.1016/j.heliyon.2023.e13519 doi: 10.1016/j.heliyon.2023.e13519
    [11] I. Samir, H. M. Ahmed, S. Alkhatib, E. M. Mohamed, Construction of wave solutions for stochastic Radhakrishnan–Kundu–Lakshmanan equation using modified extended direct algebraic technique, Results Phys., 55 (2023), 107191. https://doi.org/10.1016/j.rinp.2023.107191 doi: 10.1016/j.rinp.2023.107191
    [12] R. Kumar, R. Kumar, A. Bansal, A. Biswas, Y. Yildirim, S. P. Moshokoa, et al., Optical solitons and group invariants for Chen-Lee-Liu equation with time-dependent chromatic dispersion and nonlinearity by Lie symmetry, Ukr. J. Phys. Opt., 2023.
    [13] M. M. Khatun, M. A. Akbar, New optical soliton solutions to the space-time fractional perturbed Chen-Lee-Liu equation, Results Phys., 46 (2023), 106306. https://doi.org/10.1016/j.rinp.2023.106306 doi: 10.1016/j.rinp.2023.106306
    [14] M. Sadaf, G. Akram, S. Arshed, K. Farooq, A study of fractional complex Ginzburg–Landau model with three kinds of fractional operators, Chaos Soliton. Fract., 166 (2023), 112976. https://doi.org/10.1016/j.chaos.2022.112976 doi: 10.1016/j.chaos.2022.112976
    [15] K. J. Wang, J. Si, Diverse optical solitons to the complex Ginzburg–Landau equation with Kerr law nonlinearity in the nonlinear optical fiber, Eur. Phys. J. Plus, 138 (2023), 187. https://doi.org/10.1140/epjp/s13360-023-03804-w doi: 10.1140/epjp/s13360-023-03804-w
    [16] W. Chen, J. Manafian, K. H. Mahmoud, A. S. Alsubaie, A. Aldurayhim, A. Alkader, Cutting-edge analytical and numerical approaches to the Gilson–Pickering equation with plenty of soliton solutions, Mathematics, 11 (2023), 3454. https://doi.org/10.3390/math11163454 doi: 10.3390/math11163454
    [17] A. Yokuş, H. Durur, K. A. Abro, D. Kaya, Role of Gilson–Pickering equation for the different types of soliton solutions: a nonlinear analysis, Eur. Phys. J. Plus, 135 (2020), 657. https://doi.org/10.1140/epjp/s13360-020-00646-8 doi: 10.1140/epjp/s13360-020-00646-8
    [18] N. A. Kudryashov, The Lakshmanan–Porsezian–Daniel model with arbitrary refractive index and its solution, Optik, 241 (2021), 167043. https://doi.org/10.1016/j.ijleo.2021.167043 doi: 10.1016/j.ijleo.2021.167043
    [19] G. Liang, H. Zhang, L. Fang, Q. Shou, W. Hu, Q. Guo, Influence of transverse cross-phases on propagations of optical beams in linear and nonlinear regimes, Laser Photonics Rev., 14 (2020), 2000141. https://doi.org/10.1002/lpor.202000141 doi: 10.1002/lpor.202000141
    [20] I. Samir, H. M. Ahmed, A. Darwish, H. H. Hussein, Dynamical behaviors of solitons for NLSE with Kudryashov's sextic power-law of nonlinear refractive index using improved modified extended tanh-function method, Ain Shams Eng. J., 15 (2024), 102267. https://doi.org/10.1016/j.asej.2023.102267 doi: 10.1016/j.asej.2023.102267
    [21] O. El-shamy, R. El-barkoki, H. M. Ahmed, W. Abbas, I. Samir, Exploration of new solitons in optical medium with higher-order dispersive and nonlinear effects via improved modified extended tanh function method, Alex. Eng. J., 68 (2023), 611–618. https://doi.org/10.1016/j.aej.2023.01.053 doi: 10.1016/j.aej.2023.01.053
    [22] I. Samir, A. Abd-Elmonem, H. M. Ahmed, General solitons for eighth-order dispersive nonlinear Schrödinger equation with ninth-power law nonlinearity using improved modified extended tanh method, Opt. Quant. Electron., 55 (2023), 470. https://doi.org/10.1007/s11082-023-04753-5 doi: 10.1007/s11082-023-04753-5
    [23] Y. Shang, The extended hyperbolic function method and exact solutions of the long–short wave resonance equations, Chaos Soliton. Fract., 36 (2008), 762–771. https://doi.org/10.1016/j.chaos.2006.07.007 doi: 10.1016/j.chaos.2006.07.007
    [24] L. Akinyemi, Two improved techniques for the perturbed nonlinear Biswas–Milovic equation and its optical solitons, Optik, 243 (2021), 167477. https://doi.org/10.1016/j.ijleo.2021.167477 doi: 10.1016/j.ijleo.2021.167477
    [25] N. Savaissou, B. Gambo, H. Rezazadeh, A. Bekir, S. Y. Doka, Exact optical solitons to the perturbed nonlinear Schrödinger equation with dual-power law of nonlinearity, Opt. Quant. Electron., 52 (2020), 318. https://doi.org/10.1007/s11082-020-02412-7 doi: 10.1007/s11082-020-02412-7
    [26] E. A. Az-Zo'bi, W. A. Alzoubi, L. Akinyemi, M. Şenol, B. S. Masaedeh, A variety of wave amplitudes for the conformable fractional (2+1)-dimensional Ito equation, Mod. Phys. Lett. B, 35 (2021), 2150254. https://doi.org/10.1142/S0217984921502547 doi: 10.1142/S0217984921502547
    [27] W. B. Rabie, H. M. Ahmed, I. Samir, M. Alnahhass, Optical solitons and stability analysis for NLSE with nonlocal nonlinearity, nonlinear chromatic dispersion and Kudryashov's generalized quintuple-power nonlinearity, Results Phys., 59 (2024), 107589. https://doi.org/10.1016/j.rinp.2024.107589 doi: 10.1016/j.rinp.2024.107589
    [28] A. Biswas, M. Ekici, A. Sonmezoglu, Stationary optical solitons with Kudryashov's quintuple power–law of refractive index having nonlinear chromatic dispersion, Phys. Lett. A, 426 (2022), 127885. https://doi.org/10.1016/j.physleta.2021.127885 doi: 10.1016/j.physleta.2021.127885
    [29] M. Ekici, Stationary optical solitons with complex Ginzburg–Landau equation having nonlinear chromatic dispersion and Kudryashov's refractive index structures, Phys. Lett. A, 440 (2022), 128146. https://doi.org/10.1016/j.physleta.2022.128146 doi: 10.1016/j.physleta.2022.128146
    [30] A. Sonmezoglu, Stationary optical solitons having Kudryashov's quintuple power law nonlinearity by extended G/G–expansion, Optik, 253 (2022), 168521. https://doi.org/10.1016/j.ijleo.2021.168521 doi: 10.1016/j.ijleo.2021.168521
    [31] M. G. Hafez, M. A. Akbar, An exponential expansion method and its application to the strain wave equation in microstructured solids, Ain Shams Eng. J., 6 (2015), 683–690. https://doi.org/10.1016/j.asej.2014.11.011 doi: 10.1016/j.asej.2014.11.011
    [32] G. Akram, M. Sadaf, S. Arshed, F. Sameen, Bright, dark, kink, singular and periodic soliton solutions of Lakshmanan–Porsezian–Daniel model by generalized projective Riccati equations method, Optik, 241 (2021), 167051. https://doi.org/10.1016/j.ijleo.2021.167051 doi: 10.1016/j.ijleo.2021.167051
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)