Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Efficient numerical methods for solving Poisson equation constraint optimal control problems with random coefficient are discussed in this paper. By applying the finite element method and the Monte Carlo approximation, the original optimal control problem is discretized and transformed into an optimization problem. Taking advantage of the separable structures, Algorithm 1 is proposed for solving the problem, where an alternating direction method of multiplier is used. Both computational and storage costs of this algorithm are very high. In order to reduce the computational cost, Algorithm 2 is proposed, where the multi-modes expansion is introduced and applied. Further, for reducing the storage cost, we propose Algorithm 3 based on Algorithm 2. The main idea is that the random term is shifted to the objective functional, which could be computed in advance. Therefore, we only need to solve a deterministic optimization problem, which could reduce all the costs significantly. Moreover, the convergence analyses of the proposed algorithms are established, and numerical simulations are carried out to test the performances of them.
Citation: Xiaowei Pang, Haiming Song, Xiaoshen Wang, Jiachuan Zhang. Efficient numerical methods for elliptic optimal control problems with random coefficient[J]. Electronic Research Archive, 2020, 28(2): 1001-1022. doi: 10.3934/era.2020053
[1] | Xiaowei Pang, Haiming Song, Xiaoshen Wang, Jiachuan Zhang . Efficient numerical methods for elliptic optimal control problems with random coefficient. Electronic Research Archive, 2020, 28(2): 1001-1022. doi: 10.3934/era.2020053 |
[2] | Jingshi Li, Jiachuan Zhang, Guoliang Ju, Juntao You . A multi-mode expansion method for boundary optimal control problems constrained by random Poisson equations. Electronic Research Archive, 2020, 28(2): 977-1000. doi: 10.3934/era.2020052 |
[3] | Guanrong Li, Yanping Chen, Yunqing Huang . A hybridized weak Galerkin finite element scheme for general second-order elliptic problems. Electronic Research Archive, 2020, 28(2): 821-836. doi: 10.3934/era.2020042 |
[4] | Ziqiang Wang, Chunyu Cen, Junying Cao . Topological optimization algorithm for mechanical-electrical coupling of periodic composite materials. Electronic Research Archive, 2023, 31(5): 2689-2707. doi: 10.3934/era.2023136 |
[5] |
Zuliang Lu, Fei Huang, Xiankui Wu, Lin Li, Shang Liu .
Convergence and quasi-optimality of |
[6] | Changling Xu, Huilai Li . Two-grid methods of finite element approximation for parabolic integro-differential optimal control problems. Electronic Research Archive, 2023, 31(8): 4818-4842. doi: 10.3934/era.2023247 |
[7] | Meixin Xiong, Liuhong Chen, Ju Ming, Jaemin Shin . Accelerating the Bayesian inference of inverse problems by using data-driven compressive sensing method based on proper orthogonal decomposition. Electronic Research Archive, 2021, 29(5): 3383-3403. doi: 10.3934/era.2021044 |
[8] | Mingtao Cui, Wennan Cui, Wang Li, Xiaobo Wang . A polygonal topology optimization method based on the alternating active-phase algorithm. Electronic Research Archive, 2024, 32(2): 1191-1226. doi: 10.3934/era.2024057 |
[9] | Kun Wang, Lin Mu . Numerical investigation of a new cut finite element method for Stokes interface equations. Electronic Research Archive, 2025, 33(4): 2503-2524. doi: 10.3934/era.2025111 |
[10] | Jye Ying Sia, Yong Kheng Goh, How Hui Liew, Yun Fah Chang . Constructing hidden differential equations using a data-driven approach with the alternating direction method of multipliers (ADMM). Electronic Research Archive, 2025, 33(2): 890-906. doi: 10.3934/era.2025040 |
Efficient numerical methods for solving Poisson equation constraint optimal control problems with random coefficient are discussed in this paper. By applying the finite element method and the Monte Carlo approximation, the original optimal control problem is discretized and transformed into an optimization problem. Taking advantage of the separable structures, Algorithm 1 is proposed for solving the problem, where an alternating direction method of multiplier is used. Both computational and storage costs of this algorithm are very high. In order to reduce the computational cost, Algorithm 2 is proposed, where the multi-modes expansion is introduced and applied. Further, for reducing the storage cost, we propose Algorithm 3 based on Algorithm 2. The main idea is that the random term is shifted to the objective functional, which could be computed in advance. Therefore, we only need to solve a deterministic optimization problem, which could reduce all the costs significantly. Moreover, the convergence analyses of the proposed algorithms are established, and numerical simulations are carried out to test the performances of them.
It is well known that many physical and engineering problems could be described by the optimal control problems (OCPs) with partial differential equation (PDE) or random/stochastic partial differential equation (RPDE/SPDE) constraints, such as the optimal heat source problem, the optimal design of the aircraft, and the weather forecasting problem etc. For the deterministic PDE optimal control problem, there exists plenty research results, and we refer to [15,21,26] for more details. However, only few works have been developed for the RPDE/SPDE optimal control problem.
Generally speaking, the numerical methods for RPDE/SPDE constraint optimal control problems are divided into two categories: sample methods and projection methods. For the former, Cao et al. proposed an improved Monte Carlo method using compression variance technique to deal with optimal control problems under random Burger constraints in 2003 [7]. Based on the gradient error, Kouri et al. combined the trust region method and the stochastic collocation method to propose an adaptive collocation method, which is superior to the traditional Newtonian conjugate gradient method [17]. For the latter, Xiu developed a polynomial chaos expansion for the target functional, which transforms the original SPDE constraint optimal control problem to a nonlinear problem with finite coefficients [25]. However, no theoretical guarantee was given there. In 2011, Gunzburger and his collaborators transformed the SPDE constraint optimal control problem into a KKT nonlinear system, and then proved the existence of the solution of the KKT system. By using Karhunen Loeve (KL) expansion or appropriate orthogonal decomposition, they obtained a series of deterministic optimization systems numerically [13]. In 2013, Kunoth and Schwab used generalized polynomial chaos expansion (gPC) to handle the RPDE/SPDE optimal control problems [18]. Recently, Taylor approximation and variance reduction method was proposed by Chen et al. for PDE-constrained optimal control under uncertainty [8].
In this paper, we mainly concentrate on the efficient numerical methods for the optimal control problem constrained by the Poisson equation with random coefficient. The major challenges for dealing with the randomness are: (Ⅰ) How to deal with the uncertainties appeared in the constraint and the objective functional, and formulate a discretized optimization problem. (Ⅱ) How to solve the resulted optimization problem, and propose an efficient algorithm. We will analyze the challenges above one by one, and present the corresponding solutions.
Monte Carlo (MC) method [5,7], the stochastic collocation (SC) method [1,2], the stochastic Galerkin (SG) method [22,23], the polynomial chaos (PC) expansion [16], and so on are some of the commonly used methods for dealing with the first issue. Based on the generality and parallelizability of MC method, we will apply it to deal with the random sample space. Further, using the finite element method (FEM) to approximate the physical space, we shall obtain a discretized optimization problem. This is the most natural idea, but it will face huge difficulties in computation and storage when the high accuracy solution is required. The KL expansion is a spectral representation of the covariance function, which consists of the eigenvalues and eigenfunctions. This method is widely used in RPDE/SPDE fields when the eigenvalues decay exponentially or non-Gaussian spatially-dependent random with large correlation length, otherwise it will run into the curse of dimensionality [3]. As an improvement, the multi-modes expansion (MME) [11] will be considered to approximate the state variable firstly, and then carry out MC and FEM subsequently. By exploiting the relationships among the terms of MME expansion, we can transform the discretized optimization problem into a quadratic programming problem with deterministic linear constraints. In this paper, we will follow this approach to resolve the first challenge.
The second challenge is closed related to the first one. If the first challenge can be handled very well, the latter one will be much less difficult. For solving the resulted optimization problem, there exist fruitful algorithms, such as the gradient descent method, the conjugate gradient method, the Newton's method, the regularization method, and the alternating direction method of multipliers (ADMM) etc., the interested readers are referred to [13,21] and references therein for the rich literature. Because of the separability of state and control of the resulted optimization problem, ADMM is applied in this paper, which has been used in the optimal control problem with deterministic PDE constraints, and displayed satisfactory results [14,26]. Moreover, to the best of our knowledge, there is no complete convergence analysis for elliptic RPDE-constrained optimization problems. The existing results merely present the difference between the exact solution and the discretized solution [13,19], while the error for solving the optimization problem is ignored. Here, based on the convergence analysis of ADMM in [6] and the techniques for dealing with the former challenge, the global error estimate in terms of the deviation between objective functionals will be established for our proposed algorithms later.
For the clarity of the structure, the efficient algorithm proposed in this paper is developed by three steps. Firstly, the MC, the FEM, and the ADMM are used sequentially to formulate a primitive algorithm, Algorithm 1, which needs to compute
The outline of this paper is as follows. In Section 2, the model problem, the well-posedness of the solution, and the first order optimal condition of the model problem are introduced. Algorithm 1, 2, 3, and their convergence analyses are presented in Section 3, 4, and 5, respectively. In Section 6, numerical simulations are carried out to test the performance of the proposed algorithms. Finally, some conclusion remarks are given in Section 7.
The model problem, the well-posedness of the studied model, and its corresponding first order optimal condition will be presented in this section.
For easy of description, most of the notations and definitions used in this paper are adopted from the reference [9]. We consider the model problem on a polygon
‖v‖Lp(Θ,X)=(∫Θ‖v(ξ,⋅)‖pXdP(ξ))1p<∞,1≤p<∞,‖v‖L∞(Θ,X)=esssupξ∈Θ‖v(ξ,⋅)‖X<∞, |
where
Using the notions above, the Poisson equation constraint optimal control problem with random coefficient and Dirichlet boundary condition could be written as
(OCP){miny∈Y,u∈UF(y(ξ,x),u(x))subject to:{−∇⋅(α(ξ,x)∇y(ξ,x))=u(x),inD,y(ξ,x)=g(x),on∂D, |
with
F(y(ξ,x),u(x)):=E[12∫D|y(ξ,x)−yd|2dx]+γ2∫D|u(x)−ud|2dx, | (1) |
where
The corresponding variational formulation of the Poisson equation with random coefficient is given by: Finding
E[a(y,v)]=E[(u,v)L2(D)],∀v∈L2(Θ,H10(D)), | (2) |
with
Assumption 1. ([3]) The random diffusion coefficient
0<α1≤essinfDα(ξ,x)≤‖α(ξ,⋅)‖L∞(D)≤α2<∞. |
Lemma 2.1. ([2,4,12]) If Assumption 1 holds, then for any given right-hand-side function
‖y(ξ,⋅)‖H1(D)≤1α1‖u‖H−1(D)≤1α1‖u‖L2(D), | (3) |
and the global
E[‖y‖2L2(D)]≤C(α)‖u‖2L2(D), | (4) |
where
For simplicity, the diffusion coefficient
α(ξ,x)=1+εσ(ξ,x), | (5) |
other cases could be considered similarly. Here,
P{ξ∈Θ;‖σ(ξ,x)‖L∞(D)≤1}=1, |
and
y=T(u), | (6) |
where
E[‖T(u)‖jH1(D)]=E[‖y(u)‖jH1(D)]≤C1‖u‖jL2(D), | (7) |
where
Therefore, if the state variable space is
(P){miny∈Y,u∈UF(y(ξ,x),u(x))subject toy(ξ,x)=T(u(x)), |
which can also be reformulated as
minu∈UJ(u)=minu∈UF(T(u),u)=minu∈U{E[12∫D|T(u)−yd|2dx]+γ2∫D|u−ud|2dx}. | (8) |
Here,
Finally, let
J′(u∗)=E[T∗(T(u∗)−yd)]+γ(u∗−ud)=0, | (9) |
where
Now, the optimal control problem (P), and its first order optimal condition have all been presented. In the remainder of the paper, we will propose three algorithms for solving the optimal control problem (P) incrementally, and compare them with each other.
The first algorithm studied in this paper is presented in this section. The motivation comes from the finite element method (FEM), the Monte Carlo approximation (MC), and the alternating directions method of multipliers (ADMM). We will carry out these techniques sequentially to solve the optimal control problem (P), and present an global error estimate in the form of the deviation between the objective functionals.
Let
Vh={vh∈H10(D)|vh|K∈P1(K),∀K∈Kh}, |
where
yh(ξ,x)=N∑i=1yi(ξ)ϕi(x)=Ry(ξ), | (10) |
which belongs to
Based on above notations, for a fixed sample
a(yh(ξ,⋅),vh)=(u,vh)L2(D),∀vh∈Vh. | (11) |
This means there exists an operator
Lemma 3.1. The pathwise FEM discretization (11) satisfies the following stability estimate: for any fixed sample
‖Th(u)‖H1(D)=‖yh(ξ,⋅)‖H1(D)≤c(α)‖u‖L2(D), | (12) |
where
The proof, which will be omitted here, follows from the coerciveness and continuity of the bilinear form
Now, we generalize the stability estimate of the deterministic problem to the random coefficient case. It is well known that the FEM approximation of (2) is: Finding
E[a(yh,vh)]=E[(u,vh)L2(D)],∀vh∈L2(Θ,Vh). | (13) |
By applying the Assumption 1 and the estimate (12), we can derive:
Lemma 3.2. For any control variable
E[‖Thu‖jH1(D)]=E[‖yh‖jH1(D)]≤C2‖u‖jL2(D), | (14) |
where
By using the estimates (4), (7), (14), the Theorem 4.3 in [10], and the triangle inequality, we can obtain the following lemma.
Lemma 3.3. ([11,19]) If the control variable
(E[‖(T−Th)u‖jL2(D)])1j≤C3h2‖u‖L2(D), | (15) |
where
Similar to (10), the control variable
uh=Ru∈Uh=Vh, | (16) |
where
(P1h){minyh∈Yh,uh∈UhF(yh(ξ,x),uh(x))subject toyh(ξ,x)=Th(uh(x)). |
In this subsection, we mainly apply the standard MC method to approximate the expectation
EM[yh(ξ,x)]=1MM∑j=1yh(ξj,x), | (17) |
which satisfies the error estimate (cf. [4])
(E[‖E[yh]−EM[yh]‖2L2(D)])12≤1√M(E[‖yh‖2L2(D)])12. | (18) |
Substituting the MC approximation (17) into (P
(PM1h){minyh∈Yh,uh∈UhFM,h(yh(ξ,x),uh(x))subject toyh(ξj,x)=Th(ξj,uh(x)),j=1,⋯,M, |
where
FM,h(yh,uh)=EM[12∫D∣yh−yd∣2dx]+γ2∫D∣uh−ud∣2dx. | (19) |
Similar to (P), the optimization problem (P
E[EM[T∗h(Th(u∗M,h)−yd)]]+γ(u∗M,h−Qhud)=0, | (20) |
where
Using the first order optimal conditions (9) and (20), we can get
Lemma 3.4. ([19]) Let
‖u∗−u∗M,h‖L2(D)≤K1√M+K2h2 | (21) |
with
K1=Cγ4√8(‖yd‖L2(D)+‖u∗‖L2(D)),K2=Cγ(‖yd‖L2(D)+‖u∗‖L2(D)+‖ud‖L2(D)), |
where the constant
Further, applying the definitions of the functionals
Theorem 3.5. Let
E[|F(y∗,u∗)−FM,h(y∗M,h,u∗M,h)|]≤ˉK1√M+ˉK2h2, | (22) |
where
ˉK1=√82(C1‖u∗‖L2(D)+‖yd‖L2(D))2+γ(‖u∗‖L2(D)+‖ud‖L2(D))K1+(K1C1C2‖u∗‖L2(D)+K1C2‖yd‖L2(D))+2√2(C2K1(C1‖u∗‖L2(D)+‖yd‖L2(D)))+2√2((C1C3‖u∗‖L2(D)+C3‖yd‖L2(D)+C1C2K2)‖u∗‖L2(D)+C2K2‖yd‖L2(D))+4√2(C32‖u∗‖4+C22K22+2C22K21),ˉK2=((C1C3‖u∗‖L2(D)+C3‖yd‖L2(D)+C1C2K2)‖u∗‖L2(D)+C2‖yd‖L2(D)K2)+γ(‖u∗‖L2(D)+‖ud‖L2(D))K2+4√2(C32‖u∗‖4L2(D)+C22K22). |
For the sake of structural consideration, we put the proof in Appendix.
In this subsection, we will introduce the ADMM to solve the optimization problem (P
(˜PM1h){miny(ξj),u∈RNFM,h(y(ξ),u)subject toA1(ξj)y(ξj)=B1u,j=1,⋯,M, |
where
A1(ξj)=(α(ξj,x)∇ϕm,∇ϕn)L2(D)∈RN×N,B1=(ϕm,ϕn)L2(D)∈RN×N. |
The augmented Lagrangian functional of the optimization problem
L1β(y(ξ),u,λ(ξ))=FM,h(y(ξ),u)−EM[(λ(ξ),A1(ξ)y(ξ)−B1u)]+EM[β2‖A1(ξ)y(ξ)−B1u‖2], |
where
miny(ξ),umaxλ(ξ)L1β(y(ξ),u,λ(ξ)). | (23) |
For the saddle-point problem (23), the ADMM is an efficient algorithm. We will apply ADMM to solve this problem, and the whole process can be described by
Notice that every iteration of the ADMM has the closed-form solution
yt+1(ξj)=(B1+βA1(ξj)TA1(ξj))−1(c+βA1(ξj)TB1ut+A1(ξj)Tλt(ξj)),ut+1=(γB1+βBT1B1)−1EM[γd+βBT1A1(ξ)yt+1(ξ)−BT1λt(ξ)],λt+1(ξj)=λt(ξj)−β(A1(ξj)yt+1(ξj)−B1ut+1), | (24) |
with
Now, we consider the convergence of the ADMM. It is easy to find that the stiffness matrixs
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Lemma 3.6. For the problem (23), after
E[|FM,h(yt,ut)−FM,h(y∗M,u∗M)|]≤ˉK3t, | (25) |
where
ˉK3=12E[EM[‖u0−u∗Mλ0−λ∗‖˜Hβ+1β‖λ0‖]],˜Hβ=(βBT1B10012βI). |
Further, applying the Theorem 3.5, the identity
FM,h(y∗M,h,u∗M,h)−FM,h(Ryt,Rut)=FM,h(y∗M,u∗M)−FM,h(yt,ut), |
the Lemma 3.6, and the triangle inequality, we can get the global error estimate.
Theorem 3.7. A global error estimate for Algorithm 1 is as follows:
E[|F(y∗,u∗)−FM,h(Ryt,Rut)|]≤ˉK1√M+ˉK2h2+ˉK3t. | (26) |
From Algorithm 1, we can see that the stiffness matrix
To reduce the computational cost in Algorithm 1, we shall introduce the multi-modes expansion (MME), and propose a new algorithm. The main idea is to approximate the state variable by MME technique, and then transform the original constraint in Algorithm 1 and thus shift the randomness to the right hand side mass matrix. The advantage is that we can calculate the expectation of the right hand side random matrix in advance, then compute and store the inverse of the matrix only once, which could reduce the computational cost significantly.
The multi-modes expansion for the state variable
y=∞∑q=0εqyq, | (27) |
where
−Δy0=u,inD,−Δyq=∇⋅(σ(ξ,⋅)∇yq−1),inD,∀q≥1,yq=0,on∂D,∀q≥0. | (28) |
It is apparent that the relationship between
Lemma 4.1. The equations (28) has a unique solution
E[‖yq‖2H2(D)]≤c(α)q+1‖u‖2L2(D), | (29) |
where
For the purpose of numerical solution, we truncate the expression
yQ=Q−1∑q=0εqyq. | (30) |
Lemma 4.2. ([11]) If the perturbation magnitude
limQ→∞E‖y−yQ‖L2(D)=0. |
Lemma 4.1 shows that the problem (28) is well-posed, and Lemma 4.2 shows that the expression (30) is an approximation of the state variable
yQ=TQ(u). | (31) |
Feng et al. [11] showed that the truncation error of MME after
Lemma 4.3. Under the assumptions of Lemma 4.1 and Lemma 4.2, it holds that
(E[‖(T−TQ)u‖jL2(D)])1j=(E[‖y−yQ‖jL2(D)])1j≤C4εQ‖u‖L2(D), | (32) |
where
In this subsection, we will apply FEM to discretize
For any fixed
yq,h(ξ,x)=N∑i=1yq,i(ξ)ϕi(x)=Ryq(ξ),q≥0, | (33) |
where
yQh(ξ,x)=Q−1∑q=0εqyq,h(ξ,x)=RQ−1∑q=0εqyq(ξ)=RyQ(ξ). | (34) |
And the variational formulation of the constraint equation in the truncated MME form is given by: For each
(∇y0,h(x),∇vh(x))L2(D)=(u(x),vh(x))L2(D),(∇yq,h(ξ,x),∇vh(x))L2(D)=(σ(ξ,x)∇yq−1,h(ξ,x),∇vh(x))L2(D), | (35) |
for any
yQh=TQh(u). | (36) |
Lemma 4.4. (cf. [11]) Under the assumption of Lemma 4.2, for
E[‖TQhu‖jL2(D)]=E[‖yQh(u)‖jL2(D)]≤C5‖u‖jL2(D), | (37) |
and the error estimate
(E[‖(TQ−TQh)u‖jL2(D)])1j=(E[‖yQ−yQh‖jL2(D)])1j≤C6h2‖u‖L2(D), | (38) |
where
Further, applying the triangle inequality, the estimates (32) and (38), we have the following error estimate.
Lemma 4.5. Under the conditions of Lemma 4.1 and Lemma 4.4, we have the following estimate
(E[‖(T−TQh)u‖jL2(D)])1j≤C(εQ+h2)‖u‖L2(D), | (39) |
where
By arguments similar to the ones used to get the optimization problem (19), applying the expressions (16), (34) and (36), the original problem (P) could be approximated by
(PQ2h){minyQh(ξ)∈Yh,uh∈UhFM,h(yQh(ξ,x),uh)subject toyQh(ξj,x)=(TQh(uh))(ξ,xj)j=1,⋯,M, |
where
E[EM[TQ∗h(TQhuQ∗M,h−yd)]]+γ(uQ∗M,h−Qhud)=0. | (40) |
Using the first order optimal conditions (9) and (40), we have
Lemma 4.6. (cf. [19]) Under the conditions of Lemma 4.1-4.5, the following error estimate
‖u∗−uQ∗M,h‖L2(D)≤˜C1√M+˜C2εQ+˜C3h2, | (41) |
holds, where
˜C1=4√8γC5(C5‖u∗‖L2(D)+‖yd‖L2(D)),˜C2=CC4γ(‖yd‖L2(D)+(C1+C5)‖u∗‖L2(D)),˜C3=C‖u0‖L2(D)+1γCC5(‖yd‖L2(D)+(C1+C5)‖u∗‖L2(D)). |
Similar to Theorem 3.5, we can get the error estimate on the difference of the objective functionals, as follows
Theorem 4.7. Let
E[|F(y∗,u∗)−FM,h(yQ∗M,h,uQ∗M,h)|]≤ˉC1√M+ˉC2εQ+ˉC3h2, | (42) |
where
ˉC1=γ(‖u∗‖L2(D)+‖ud‖L2(D))˜C1+√82(C1‖u∗‖L2(D)+‖yd‖L2(D))2+(C1C5‖u∗‖L2(D)+C5‖yd‖L2(D))˜C1+2√2(C1‖u∗‖L2(D)+‖yd‖L2(D))C2˜C1,ˉC2=γ(‖u∗‖L2(D)+‖ud‖L2(D))˜C2+((C1C‖u∗‖L2(D)+C3‖yd‖L2(D))‖u∗‖L2(D)+2√2(C1‖u∗‖L2(D)+‖yd‖L2(D))C2˜C2+(C1C5‖u∗‖L2(D)+C5‖yd‖L2(D))˜C2),ˉC3=γ(‖u∗‖L2(D)+‖ud‖L2(D))˜C3+((C1C‖u∗‖L2(D)+C3‖yd‖L2(D))‖u∗‖L2(D)+(C1C5‖u∗‖L2(D)+C5‖yd‖L2(D))˜C3)+4√2˜C23. |
The proof is similar to Theorem 3.5, we omit it here.
In this subsection, the ADMM algorithm for solving the optimization problem (P
Substituting the control variable
A2y0=B1u,A2yq(ξ)=B0(ξ)yq−1(ξ),∀q≥1, | (43) |
with
A2=(∇ϕm,∇ϕn)L2(D),B1=(ϕm,ϕn)L2(D),andB0(ξ)=(σ(ξ)∇ϕm,∇ϕn)L2(D). |
Applying the definition of
A2yQ(ξ)=Q−1∑q=0εqA2yq(ξ). |
Further, combining above equality with the equations (43), the relation between state variable
A2yQ(ξ)=B2(ξ)u, |
where
B2(ξ)={B1,Q=1,(I+εB0(ξ)A−12)B1,Q=2,(I+(εB0(ξ))(I+εA−12B0(ξ))A−12)B1,Q=3. |
Therefore, the optimization problem (P
(˜PQ2h){minyQ(ξ),u∈RNFM,h(yQ(ξ),u)subject toA2yQ(ξj)=B2(ξj)u,j=1,⋯,M, |
where the functional
The optimization problem (
minyQ(ξ),umaxλ(ξ)L2β(yQ(ξ),u,λ(ξ)), | (44) |
with the augmented Lagrangian function
L2β(yQ(ξ),u,λ(ξ))=FM,h(yQ(ξ),u)−EM[(λ(ξ),A2yQ(ξ)−B2(ξ)u)]+EM[β2‖A2yQ(ξ)−B2(ξ)u‖2], |
where
Now, the ADMM algorithm for solving the saddle point problem (44) could be described by
The ADMM process has the following closed form solutions
yQ,t+1(ξj)=(B1+βAT2A2)−1(c+βAT2B2(ξj)ut+AT2λt(ξj)),ut+1=(γB1+βEM[B2(ξ)TB2(ξ)])−1EM[γd+B2(ξ)T(βA2yQ,t+1(ξ)−λt(ξ))],λt+1(ξj)=λt(ξj)−β(A2yQ,t+1(ξj)−B2(ξ)ut+1), | (45) |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
with
E[|FM,h(yQ,∗M,h,uQ,∗M,h)−FM,h(RyQ,t,Rut)|]=E[|FM,h(yQ,∗M,uQ,∗M)−FM,h(yQ,t,ut)|]≤˜K31t. |
Furthermore, applying the estimate of Theorem 4.7 and triangle inequality, we obtain the following estimate:
Theorem 4.8. Let
E[|F(y∗,u∗)−FM,h(RyQ,t,Rut)|]≤ˉC1√M+ˉC2εQ+ˉC3h2+˜K31t, | (46) |
where
From the iterations (45), we can see that all the matrices are deterministic except the matrix
In this section, a new algorithm is proposed to overcome the disadvantages of the Algorithm 2, which can reduce the storage requirement significantly.
From the variational formulation (35), we know that
yQ(ξ)=Q−1∑q=0εqyq(ξ)=P(ξ)y0, | (47) |
where
P(ξ)={I,Q=1,I+εA−12B0(ξ),Q=2,I+εA−12B0(ξ)+ε2(A−12B0(ξ))2,Q=3. |
The relationship (47) implies that
(˜PQ3h){miny0,u∈RNFM,h(P(ξ)y0,u)subject toA3y0−B3u=0, |
where the functional
miny0,umaxλL3β(y0,u,λ), | (48) |
with the augmented Lagrangian function
L3β(y0,u,λ)=FM,h(P(ξ)y0,u)−(λ,A3y0−B3u)+β2‖A3y0−B3u‖2, |
where
Now, we present the ADMM algorithm to solve the saddle point problem (48).
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
By straightforward calculations, we can get
yt+10=(EM[P(ξ)TB3P(ξ)]+βAT3A3)−1(EM[P(ξ)Tc]+βAT3B3ut+AT3λt),ut+1=(γB3+βBT3B3)−1(γd+βBT3A3yt+10−BT3λt),λt+1=λt−β(A3yt+10−B3ut+1). | (49) |
It is not hard to see that Algorithm 3 has the same convergence property as Algorithm 2 (see Theorem 4.8). From the iterations (49), we can see that we don't need to store the information for all the samples. The only matrices needing to be remembered are
In this section, results of numerical experiments, which are carried out to test the performance of the proposed algorithms, will be reported. The convergence order of the FEM for all algorithms is verified firstly. And then, the ADMM convergence order in the form of the objective function is shown. Finally, to illustrate the advantage of Algorithm 3, we present the storage costs of the three algorithms. All the simulations are conducted using Matlab on a high-performance computer with
We consider the optimal control problem on the spatial domain
yd=sin(πx)sin(πy),ud=2π2sin(πx)sin(πy). |
It is not difficult to find that
To verify the convergence order of the FEM, we fix other parameters firstly. For simplicity, the sample number of the MC, the iteration steps of the ADMM, the magnitude of the perturbation defined in (5), and the truncated length of the MME are set to be
Now, we will test the performance of ADMM. For this purpose, other parameters in the proposed algorithms need to be fixed in advance. Without loss of generality, we set
Finally, we analyze the CPU memory cost of the proposed algorithms, which is one of the main concerns in this paper. For the fixed MC parameter
Algorithm 1 | Algorithm 2 | Algorithm 3 | |||||
| | | | | | ||
| 1.271E-1 | 1.009E-1 | 1.195E-1 | 1.221E-1 | 1.967E-3 | 1.007E-3 | 2.006E-3 |
| 1.655E0 | 1.277E0 | 1.676E0 | 1.685E0 | 8.136E-3 | 2.723E-3 | 5.859E-3 |
| 1.807E1 | 1.792E1 | 2.481E1 | 2.487E1 | 4.856E-2 | 8.097E-2 | 7.375E-2 |
| out of memory | 2.677E2 | out of memory | out of memory | 7.546E-1 | 1.161E0 | 1.626E0 |
In order to observe the storage changes of the three proposed algorithms with respect to the sample number
| Algorithm 1 | Algorithm 2 | Algorithm 3 | ||||
| | | | | | ||
| 6.143E-2 | 5.456E-2 | 8.195E-2 | 7.839E-2 | 4.514E-2 | 6.814E-2 | 5.881E-2 |
| 2.498E-1 | 2.511E-1 | 2.964E-1 | 2.976E-1 | 4.568E-2 | 6.785E-2 | 5.901E-2 |
| 1.927E0 | 1.929E0 | 2.511E0 | 2.513E0 | 4.603E-2 | 6.887E-2 | 6.074E-2 |
| 1.807E1 | 1.792E1 | 2.481E1 | 2.487E1 | 4.856E-2 | 6.931E-2 | 7.375E-2 |
In this paper, three algorithms are proposed for the elliptic equation constraint optimal control problem with random coefficient. The Algorithm 1 is based on the FEM discretization, the MC approximation, and the ADMM iteration, which has global convergence. However, the computational cost and the storage cost are both very high. As an improvement, the Algorithm 2 is proposed, which reduce the computational cost by using the MME technique, but memory cost remain high. To overcome the disadvantage of Algorithm 2, Algorithm 3 is proposed, which reduce the memory cost significantly by transforming the constraint with random term to a deterministic one. Numerical simulations are carried out to verify the theoretical analysis, which illustrate the efficiency of the Algorithm 3 further. The applications of the proposed algorithms to other random optimal control problems will be discussed in our future works.
The work of H. Song was supported by the NSF of China under the grant No.11701210, the science and technology department project of Jilin Province under the grant No. 20190103029JH and 20200201269JC. the education department project of Jilin Province under the grant No. JJKH20180113KJ, and the fundamental research funds for the Central Universities. The work of X. Pang was supported by the NSF of China under the grant No. 11871245, 11771179, and 11726102. The work of J. Zhang was supported by the NSF of China under the grant No. 11971221 and 11731006, the Shenzhen Sci-Tech Fund under the grant No. JCYJ20170818153840322 and JCYJ20190809150413261, and Guangdong Provincial Key Laboratory of Computational Science and Material Design under the grant No. 2019B030301001. The authors also wish to thank the High Performance Computing Center of Jilin University, the Computing Center of Jilin Province, and the Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education for essential computing support.
Proof. Using the definitions of the
E[|F(y∗,u∗)−FM,h(y∗M,h,u∗M,h)|]≤E[|E[12∫D|y∗−yd|2dx]−EM[12∫D∣y∗M,h−yd∣2dx]|]+|γ2∫D|u∗−ud|2dx−γ2∫D∣u∗M,h−ud∣2dx|=12(I+II). | (50) |
For the first term
I=E[|E[‖y∗−yd‖2L2(D)]−EM[‖y∗M,h−yd‖2L2(D)]|]≤E[|(E−EM)[‖Tu∗−yd‖2L2(D)]|]+E[|EM[‖Tu∗−yd‖2L2(D)−‖Thu∗M,h−yd‖2L2(D)]|]=I1+I2. |
By using the MC error estimate (18), the triangle inequality, the estimate (7) with
I1=E[|(E−EM)[‖Tu∗−yd‖2L2(D)]|]≤(E[((E−EM)[‖Tu∗−yd‖2L2(D)])2])12≤(E[(E−EM)[‖Tu∗−yd‖4L2(D)]])12≤1√M(E[‖Tu∗−yd‖4L2(D)])14⋅2≤1√M(E[(‖Tu∗‖L2(D)+‖yd‖L2(D))4])14⋅2≤1√M((E[8‖Tu∗‖4L2(D)])14+(E[8‖yd‖4L2(D)])14)2≤1√M√8(C1‖u∗‖L2(D)+‖yd‖L2(D))2. | (51) |
Applying the Cauchy-Schwarz inequality, the triangle inequality, the MC error estimate (18), and the inequality
(E[(m+n)4])14≤(E[8(m4+n4)])14≤(E[8m4])14+(E[8n4])14,∀m,n≥0, | (52) |
we can get
I2=E[|EM[−‖Thu∗M,h−Tu∗‖2L2(D)+2(Tu∗−Thu∗M,h,Tu∗−yd)]|]≤2E[EM[‖Tu∗−Thu∗M,h‖L2(D)‖Tu∗−yd‖L2(D)]]+E[EM‖Thu∗M,h−Tu∗‖2L2(D)]=I21+I22, |
and
I21≤2E[(EM−E)[‖Tu∗−Thu∗M,h‖L2(D)(‖Tu∗‖L2(D)+‖yd‖L2(D))]]+2E[‖Tu∗−Thu∗M,h‖L2(D)(‖Tu∗‖L2(D)+‖yd‖L2(D))]≤2√M(E[(‖Tu∗‖L2(D)+‖yd‖L2(D))2‖Tu∗−Thu∗M,h‖2L2(D)])12+2(E[‖Tu∗−Thu∗‖L2(D)‖Tu∗‖L2(D)]+E[‖Tu∗−Thu∗‖L2(D)‖yd‖L2(D)])+2(E[‖Th(u∗−u∗M,h)‖L2(D)‖Tu∗‖L2(D)]+E[‖Th(u∗−u∗M,h)‖L2(D)‖yd‖L2(D)])≤2√M(E[(‖Tu∗‖L2(D)+‖yd‖L2(D))4])14(E[‖Tu∗−Thu∗M,h‖4L2(D)])14+2(E[‖Tu∗‖2L2(D)])12(E[‖(T−Th)u∗‖2L2(D)])12+2(E[‖yd‖2L2(D)])12(E[‖(T−Th)u∗‖2L2(D)])12+2(E[‖Tu∗‖2L2(D)])12(E[‖Th(u∗−u∗M,h)‖2L2(D)])12+2(E[‖yd‖2L2(D)])12(E[‖Th(u∗−u∗M,h)‖2L2(D)])12. |
Taking
I21≤2√M((E[8‖Tu∗‖4L2(D)])14+(E[8‖yd‖4L2(D)])14)⋅((E[8‖(T−Th)u∗‖4L2(D)])14+(E[8‖Th(u∗−u∗M,h)‖4L2(D)])14)+2(C1C3h2‖u∗‖2L2(D)+‖yd‖L2(D)C3h2‖u∗‖L2(D))+2(C1C2‖u∗−u∗M,h‖L2(D)+C2‖yd‖L2(D)‖u∗−u∗M,h‖L2(D))=2√M(4√8(C1‖u∗‖L2(D)+‖yd‖L2(D)))⋅(4√8(C3h2‖u∗‖L2(D)+C2‖u∗−u∗M,h‖L2(D)))+2((C1C3‖u∗‖2L2(D)+C3‖yd‖L2(D)‖u∗‖L2(D))h2)+2((C1C2‖u∗‖L2(D)+C2‖yd‖L2(D))‖u∗−u∗M,h‖L2(D)). | (53) |
Similarly, we obtain
(54) |
When
(55) |
Next, we deal with the second term, applying the Cauchy-Schwarz inequality, triangle inequality, and the estimate (21), we obtain
Substituting
[1] |
A stochastic collocation method for elliptic partial differential equations with random input data. SIAM Rev (2010) 52: 317-355. ![]() |
[2] |
A stochastic collocation method for elliptic partial differential equations with random input data. SIAM J. Numer. Anal. (2007) 45: 1005-1034. ![]() |
[3] |
Solving elliptic boundary value problems with uncertain coefficients by the finite element method: The stochastic formulation. Comput. Methods. Appl. Mech. Engrg. (2005) 194: 1251-1294. ![]() |
[4] |
Multi-level Monte Carlo finite element method for elliptic PDEs with stochastic coefficients. Numer. Math. (2011) 119: 123-161. ![]() |
[5] |
R. E. Caflisch, Monte-Carlo and quasi-Monte Carlo methods, in Acta Numerica, Acta Numer., 7, Cambridge University Press, Cambridge, 1998, 1–49. doi: 10.1017/S0962492900002804
![]() |
[6] |
complexity analysis of the alternating direction method of multipliers. Sci. China Math. (2019) 62: 795-808. ![]() |
[7] |
An efficient Monte Carlo method for optimal control problems with uncertainty. Comput. Optim. Appl. (2003) 26: 219-230. ![]() |
[8] |
Taylor approximation and variance reduction for PDE-constrained optimal control under uncertainty. J. Comput. Phys. (2019) 385: 163-186. ![]() |
[9] | P. G. Ciarlet, The Finite Element Method for Elliptic Problems, Studies in Mathematics and its Applications, 4, North-Holland Publishing Co., Amsterdam-New York-Oxford, 1978. |
[10] |
An efficient numerical method for acoustic wave scattering in random media. SIAM/ASA J. Uncertain. Quantif. (2015) 3: 790-822. ![]() |
[11] |
A multimodes Monte Carlo finite element method for elliptic partial differential equations with random coefficients. Int. J. Uncertain. Quantif. (2016) 6: 429-443. ![]() |
[12] |
D. Gilbarg and N. S. Trudinger, Elliptic Partial Differential Equations of Second Order, Classics in Mathematics, Springer-Verlag, Berlin, 2001. doi: 10.1007/978-3-642-61798-0
![]() |
[13] |
Error estimates of stochastic optimal Neumann boundary control problems. SIAM J. Numer. Anal. (2011) 49: 1532-1552. ![]() |
[14] |
An alternating direction method of multipliers for the optimization problem constrained with a stationary Maxwell system. Commun. Comput. Phys. (2018) 24: 1435-1454. ![]() |
[15] |
M. Hinze, R. Pinnau, M. Ulbrich and S. Ulbrich, Optimization with PDE Constraints, Mathematical Modelling: Theory and Applications, 23, Springer, New York, 2009. doi: 10.1007/978-1-4020-8839-1
![]() |
[16] |
Finite element approximations of stochastic optimal control problems constrained by stochastic elliptic PDEs. J. Math. Anal. Appl. (2011) 384: 87-103. ![]() |
[17] |
D. P. Kouri, M. Heinkenschloss, D. Ridzal and B. G. van Bloemen Waanders, A trust-region algorithm with adaptive stochastic collocation for PDE optimization under uncertainty, SIAM J. Sci. Comput., 35 (2013), A1847–A1879. doi: 10.1137/120892362
![]() |
[18] |
Analytic regularity and GPC approximation for control problems constrained by linear parametric elliptic and parabolic PDEs. SIAM J. Control Optim. (2013) 51: 2442-2471. ![]() |
[19] |
An efficient alternating direction method of multipliers for optimal control problems constrained by random Helmholtz equations. Numer. Algorithms (2018) 78: 161-191. ![]() |
[20] |
On the Schrödinger equation and the eigenvalue problem. Comm. Math. Phys. (1983) 88: 309-318. ![]() |
[21] |
J. C. De los Reyes, Numerical PDE-Constrained Optimization, SpringerBriefs in Optimization, Springer, Cham, 2015. doi: 10.1007/978-3-319-13395-9
![]() |
[22] |
Stochastic Galerkin method for elliptic SPDEs: A white noise approach. Discrete Contin. Dyn. Syst. Ser. B (2006) 6: 941-955. ![]() |
[23] | Stochastic Galerkin method for constrained optimal control problem governed by an elliptic integro-differential PDE with stochastic coefficients. Int. J. Numer. Anal. Model. (2015) 12: 593-616. |
[24] |
Lower bounds for higher eigenvalues by finite difference methods. Pacific J. Math. (1958) 8: 339-368. ![]() |
[25] |
Fast numerical methods for robust optimal design. Eng. Optim. (2008) 40: 489-504. ![]() |
[26] |
An alternating direction method of multipliers for elliptic equation constrained optimization problem. Sci. China Math. (2017) 60: 361-378. ![]() |
1. | Sida Lin, Lixia Meng, Jinlong Yuan, Changzhi Wu, An Li, Chongyang Liu, Jun Xie, Sequential adaptive switching time optimization technique for maximum hands-off control problems, 2024, 32, 2688-1594, 2229, 10.3934/era.2024101 |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Algorithm 1 | Algorithm 2 | Algorithm 3 | |||||
| | | | | | ||
| 1.271E-1 | 1.009E-1 | 1.195E-1 | 1.221E-1 | 1.967E-3 | 1.007E-3 | 2.006E-3 |
| 1.655E0 | 1.277E0 | 1.676E0 | 1.685E0 | 8.136E-3 | 2.723E-3 | 5.859E-3 |
| 1.807E1 | 1.792E1 | 2.481E1 | 2.487E1 | 4.856E-2 | 8.097E-2 | 7.375E-2 |
| out of memory | 2.677E2 | out of memory | out of memory | 7.546E-1 | 1.161E0 | 1.626E0 |
| Algorithm 1 | Algorithm 2 | Algorithm 3 | ||||
| | | | | | ||
| 6.143E-2 | 5.456E-2 | 8.195E-2 | 7.839E-2 | 4.514E-2 | 6.814E-2 | 5.881E-2 |
| 2.498E-1 | 2.511E-1 | 2.964E-1 | 2.976E-1 | 4.568E-2 | 6.785E-2 | 5.901E-2 |
| 1.927E0 | 1.929E0 | 2.511E0 | 2.513E0 | 4.603E-2 | 6.887E-2 | 6.074E-2 |
| 1.807E1 | 1.792E1 | 2.481E1 | 2.487E1 | 4.856E-2 | 6.931E-2 | 7.375E-2 |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Algorithm 3 SMME-FEM-MC-ADMM. |
Begin
Let Compute the vectors for Generate the matrices end for while Increament Update end while End |
Algorithm 1 | Algorithm 2 | Algorithm 3 | |||||
| | | | | | ||
| 1.271E-1 | 1.009E-1 | 1.195E-1 | 1.221E-1 | 1.967E-3 | 1.007E-3 | 2.006E-3 |
| 1.655E0 | 1.277E0 | 1.676E0 | 1.685E0 | 8.136E-3 | 2.723E-3 | 5.859E-3 |
| 1.807E1 | 1.792E1 | 2.481E1 | 2.487E1 | 4.856E-2 | 8.097E-2 | 7.375E-2 |
| out of memory | 2.677E2 | out of memory | out of memory | 7.546E-1 | 1.161E0 | 1.626E0 |
| Algorithm 1 | Algorithm 2 | Algorithm 3 | ||||
| | | | | | ||
| 6.143E-2 | 5.456E-2 | 8.195E-2 | 7.839E-2 | 4.514E-2 | 6.814E-2 | 5.881E-2 |
| 2.498E-1 | 2.511E-1 | 2.964E-1 | 2.976E-1 | 4.568E-2 | 6.785E-2 | 5.901E-2 |
| 1.927E0 | 1.929E0 | 2.511E0 | 2.513E0 | 4.603E-2 | 6.887E-2 | 6.074E-2 |
| 1.807E1 | 1.792E1 | 2.481E1 | 2.487E1 | 4.856E-2 | 6.931E-2 | 7.375E-2 |