
This paper studies mean-field linear-quadratic-Gaussian (LQG) games with a major agent and a large number of minor agents, where each agent's state process is driven by a Poisson random measure and an independent Brownian motion. The major and minor agents are coupled through both their state dynamics and their individual cost functionals. By the Nash certainty equivalence (NCE) methodology, two limiting control problems are constructed and the decentralized strategies are derived through a consistency condition. The ϵ-Nash equilibrium property of the obtained decentralized strategies is established for the finite-N population system, where ϵ=O(1/√N). A numerical example is presented to illustrate the consistency of the mean-field estimation and the impact of the population's collective behavior.
Citation: Ruimin Xu, Kaiyue Dong, Jingyu Zhang, Ying Zhou. Linear-quadratic-Gaussian mean-field games driven by Poisson jumps with major and minor agents[J]. AIMS Mathematics, 2025, 10(5): 11086-11110. doi: 10.3934/math.2025503
Mean-field games for large-population systems have attracted consistent and intense attention in recent years (see, e.g., [1,2,3,4,5,6,7,8,9,10]) due to their wide applicability in fields such as finance, economics, engineering, biological science, and social science. The agents in mean-field games are individually insignificant, while their aggregated behavior has a substantial effect on each agent. This collective influence can be captured by mean-field couplings in the individual dynamics and/or individual cost functionals. In such games, it is unrealistic for a given agent to collect detailed state information of all agents because of the highly complex interactions among its peers. To tackle the resulting dimensionality difficulty, Huang, Caines, and Malhamé [11], Huang [12], and Nourian and Caines [13] developed a powerful approach, the Nash certainty equivalence (NCE) methodology. Its key idea is to establish a consistency relationship between the individual strategies and the mass effect (i.e., the asymptotic limit of the state average) as the population size goes to infinity. Based on this analytical tool, one can construct a set of decentralized strategies for the agents in the mean-field game and verify their asymptotic Nash equilibrium property (namely, the ϵ-Nash equilibrium property), where the individual optimality loss level ϵ depends on the population size N. A closely related method for solving mean-field games was independently developed by Lasry and Lions [14,15,16]. For a comprehensive survey of mean-field game theory and its applications, one is referred to [11,12,14,16,17,18,19,20,21] and the references therein.
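To make the consistency idea concrete, the following is a minimal numerical sketch of the NCE fixed-point loop for a toy scalar LQ mean-field game without jumps; the coefficients a, b, q, r, m_T, the initial mean, and the damping factor are hypothetical placeholders, not the model studied in this paper.

```python
import numpy as np

# Toy NCE loop: freeze the mass trajectory z, solve the individual LQ tracking
# problem, propagate the closed-loop mean, and iterate until z reproduces itself.
a, b, q, r, m_T = 1.0, 1.0, 1.0, 1.0, 1.0   # dx = (a x + b u) dt, cost int q(x-z)^2 + r u^2 dt + m_T x(T)^2
T, K = 1.0, 200
dt = T / K

def closed_loop_mean(z):
    """Given a frozen mass trajectory z, solve the Riccati equation P and the
    offset s backward, then propagate the closed-loop mean m forward under
    the feedback u = -(b/r)(P m + s)."""
    P = np.zeros(K + 1); s = np.zeros(K + 1)
    P[-1] = m_T                              # terminal conditions P(T) = m_T, s(T) = 0
    for k in range(K, 0, -1):                # backward Euler sweeps
        P[k - 1] = P[k] + (2 * a * P[k] - (b ** 2 / r) * P[k] ** 2 + q) * dt
        s[k - 1] = s[k] + ((a - (b ** 2 / r) * P[k]) * s[k] - q * z[k]) * dt
    m = np.zeros(K + 1); m[0] = 0.5          # initial population mean (placeholder)
    for k in range(K):                       # forward closed-loop mean
        u = -(b / r) * (P[k] * m[k] + s[k])
        m[k + 1] = m[k] + (a * m[k] + b * u) * dt
    return m

z = np.zeros(K + 1)                          # initial guess for the mass effect
for it in range(100):                        # damped fixed-point (consistency) iteration
    m = closed_loop_mean(z)
    gap = np.max(np.abs(m - z))
    z = 0.5 * z + 0.5 * m
    if gap < 1e-8:
        break
print(f"consistency gap {gap:.2e} after {it + 1} iterations")
```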
The consideration of major-minor agent game problems in a large-population framework has been well studied in [3,12,13,21,22]. Huang [12] investigated a class of stochastic dynamic linear-quadratic-Gaussian mean-field game models involving a major agent interacting with a large number of minor agents. The major agent has a significant influence on the minor agents, while each minor agent individually has a negligible impact on others; their collective behavior, however, imposes a significant impact on all agents through mean-field coupling terms in the individual dynamics and costs. Applications of this type of mean-field game appear in many socio-economic problems, such as economic and social opinion models with an influential leader (e.g., [23]) and the charging control of plug-in electric vehicles [24]. Xu and Wu [21] studied large-population dynamic games for an LQG system with an exponential cost functional, whose parameter describes an investor's risk attitude; the game again involves a major agent and a population of N minor agents with N very large. Wang and Xu [22] investigated a time-inconsistent linear-quadratic game involving a major agent as well as numerous minor agents.
Motivated by the absence of relevant theory and by some practical applications, this paper studies mean-field LQG games with random jumps involving a major agent and a large number of minor agents. Specifically, we consider mean-field games with agents of the following mixed types: (i) a major agent and (ii) a population of N minor agents, where N is very large. The dynamics of each agent follow a linear stochastic differential equation driven by both Brownian motions and Poisson random measures. Moreover, the present study considers mean-field LQG mixed games in which the diffusion term depends on the major agent's and the minor agents' states as well as the individual control strategy. Stochastic processes with random jumps can be used to model fluctuations in financial markets, both for option pricing purposes and for risk management (see [20,25,26,27]). As for mean-field LQG games with random jumps, Benazzoli, Campi, and Di Persio [1] studied a symmetric n-player nonzero-sum stochastic differential game with jump-diffusion dynamics and mean-field-type interaction among the players, and constructed an approximate Nash equilibrium for the n-player game with n sufficiently large. Xu and Shi [20] investigated LQG games of a stochastic large-population system with jump-diffusion processes. It is worth noting that in the existing research on mean-field games driven by jump-diffusion processes, all agents are comparably small and may be regarded as peers.
To obtain the asymptotic Nash equilibrium property (i.e., the ϵ-Nash equilibrium property) for the original mean-field game, we apply the NCE approach to establish a consistency relationship between all minor agents and the mass effect. First, we construct two auxiliary stochastic control problems governed by stochastic differential equations with Poisson jumps (SDEPs), which depict the states of the major agent and of a generic minor agent, and we obtain the corresponding optimal controls in feedback form. Next, to devise the decentralized strategies of the individual agents, we formulate a fully coupled forward-backward stochastic differential equation driven by Poisson jumps, called the consistency condition (CC) system. Then, a set of decentralized strategies is constructed using the solution of the CC system and is shown to be an ϵ-Nash equilibrium.
The main contributions of this paper can be summarized as follows:
● A new class of LQG mean-field games involving major and minor agents is investigated. The dynamics of each agent follow a linear stochastic differential equation driven by both Brownian motions and Poisson random measures, in which the diffusion terms of the major and minor agents depend on their states and control strategies.
● The average state of all minor agents, $x^{(N)}(\cdot)$, appears in the drift and diffusion terms of the state equations of both the major agent and all the minor agents, as well as in their cost functionals.
● The consistency condition system, called the NCE equation, is represented through a fully coupled two-point boundary value problem; based on this equation, we design a set of decentralized feedback control strategies for the $N+1$ agents by means of two limiting control systems.
● By the approximation relationship between the closed-loop mean-field game system and the limiting systems, the set of NCE-based decentralized control strategies is shown to be an ϵ-Nash equilibrium for the finite $(N+1)$-agent population system, where $\epsilon=O(1/\sqrt{N})$.
This paper is organized as follows. In Section 2, we formulate the LQG mean-field games driven by Poisson random jumps involving a major agent and many minor agents. Section 3 introduces two auxiliary optimization problems for the major agent and each minor agent, respectively, and the consistency condition system is derived. Section 4 aims to present the ϵ-Nash equilibrium property of the decentralized control strategies. A numerical example is given in Section 5. Finally, Section 6 concludes the paper.
Throughout this paper, we denote by $\mathbb{R}^n$ the $n$-dimensional Euclidean space. For a given Euclidean space, we denote by $|\cdot|$ (respectively, $\langle\cdot,\cdot\rangle$) the standard Euclidean norm (respectively, inner product). The transpose of a matrix (or vector) $X$ is denoted by $X^\top$. Let $(\Omega,\mathcal{F},\{\mathcal{F}_t\}_{0\le t\le T},P)$ be a complete filtered probability space with a fixed time horizon $T>0$, and let $N$ denote the population size of the minor agents. Denote by $\mathcal{N}$ the index set $\{1,2,\cdots,N\}$. Let $\mathcal{F}_t$ be the filtration generated by the following mutually independent processes:
(i) $(N+1)$ independent one-dimensional standard Brownian motions $\{W_i(t),\ i=0,1,\cdots,N\}_{0\le t\le T}$;
(ii) $(N+1)$ independent Poisson random measures $\{\tilde G_i,\ i=0,1,\cdots,N\}$ on $E_i\times\mathbb{R}_+$, where $E_i\subset\mathbb{R}$ is a nonempty open set equipped with its Borel field $\mathcal{B}(E_i)$, with compensator $\hat G_i(de\,dt)=\pi_i(de)\,dt$, such that $G_i(S\times[0,t])=(\tilde G_i-\hat G_i)(S\times[0,t])$, $t\ge 0$, is a martingale for all $S\in\mathcal{B}(E_i)$. Here $\pi_i$ is a $\sigma$-finite measure on $(E_i,\mathcal{B}(E_i))$, called the characteristic measure. Moreover, for all $S\in\mathcal{B}(E_i)$, $C_0:=\sup_{0\le i\le N}\pi_i(S)<+\infty$ is a positive constant independent of the population size $N$.
We also set
$$
\begin{aligned}
\mathcal{F}_t^0 &:= \sigma\{W_0(s),\,0\le s\le t\}\vee\sigma\{G_0(S_0\times[0,s]),\,0\le s\le t,\ \forall S_0\in\mathcal{B}(E_0)\},\\
\mathcal{F}_t^i &:= \sigma\{W_i(s),\,0\le s\le t\}\vee\sigma\{G_i(S_i\times[0,s]),\,0\le s\le t,\ \forall S_i\in\mathcal{B}(E_i)\},\\
\mathcal{F}_t^{0,i} &:= \sigma\{W_0(s),W_i(s),\,0\le s\le t\}\vee\sigma\{G_0(S_0\times[0,s]),G_i(S_i\times[0,s]),\,0\le s\le t,\ \forall S_0\in\mathcal{B}(E_0),\,S_i\in\mathcal{B}(E_i)\},
\end{aligned}
$$
where $\bigvee_{\alpha}\mathcal{F}_{\alpha}:=\sigma\big(\bigcup_{\alpha}\mathcal{F}_{\alpha}\big)$. Here, $\{\mathcal{F}_t^0\}_{0\le t\le T}$ represents the information of the major agent, whereas for given $i\in\mathcal{N}$, $\{\mathcal{F}_t^i\}_{0\le t\le T}$ stands for the individual information of the $i$th minor agent.
Denote by $\mathbb{S}^n$ the set of symmetric $n\times n$ matrices with real elements. If $M\in\mathbb{S}^n$ is positive (semi-)definite, we write $M>0$ ($M\ge 0$). We also introduce the following spaces:
$$
\begin{aligned}
L^2_{\mathcal{G}}(\mathbb{R}^n) &:= \Big\{\zeta:\Omega\to\mathbb{R}^n \ \Big|\ \zeta \text{ is } \mathcal{G}\text{-measurable and } E[|\zeta|^2]<+\infty\Big\};\\
S^2_{\mathcal{G}}([0,T];\mathbb{R}^n) &:= \Big\{\phi(\cdot):[0,T]\times\Omega\to\mathbb{R}^n \ \Big|\ \phi(\cdot) \text{ is } \mathcal{G}_t\text{-adapted and } E\big[\sup_{0\le t\le T}|\phi(t)|^2\big]<+\infty\Big\};\\
L^2_{\mathcal{G}}([0,T];\mathbb{R}^n) &:= \Big\{\phi(\cdot):[0,T]\times\Omega\to\mathbb{R}^n \ \Big|\ \phi(\cdot) \text{ is a } \mathcal{G}_t\text{-progressively measurable process and } E\Big[\int_0^T|\phi(t)|^2dt\Big]<+\infty\Big\}.
\end{aligned}
$$
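Since every state equation below integrates against the compensated measures $G_i$, it may help to record how a Poisson random measure with finite characteristic measure can be simulated. The sketch below assumes a Merton-type measure $\pi(de)=\lambda\,\mathcal{N}(\mu,\sigma^2)(de)$, matching the specification used for the minor agents in the numerical example of Section 5; it samples the atoms of $\tilde G$ on $[0,T]$ and checks empirically that the compensated integral is centered.

```python
import numpy as np

rng = np.random.default_rng(0)

# Sampling a Poisson random measure with characteristic measure
# pi(de) = lam * N(mu, sig^2)(de) on E = R (finite total mass lam), and an
# empirical check of the martingale (zero-mean) property of the compensated measure.
lam, mu, sig, T = 5.0, 1.0, 0.05, 1.0

def compensated_integral(f, f_mean):
    """One sample of int_0^T int_E f(e) (G~ - pi(de)dt); f_mean = int f d(pi/lam)."""
    n_jumps = rng.poisson(lam * T)             # number of atoms on [0, T]
    marks = rng.normal(mu, sig, size=n_jumps)  # iid marks e_k ~ pi / lam
    return f(marks).sum() - lam * T * f_mean   # raw integral minus compensator

samples = np.array([compensated_integral(lambda e: e, mu) for _ in range(20000)])
print("sample mean of compensated integral:", samples.mean())  # close to 0
```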
Let us consider an LQG mean-field game involving a major agent $\mathcal{A}_0$ and a population of $N$ minor agents $\{\mathcal{A}_i,\ i=1,2,\cdots,N\}$. For the major agent $\mathcal{A}_0$, $\mathcal{U}^{c,0}_{ad}:=\{u(\cdot)\mid u(\cdot)\in L^2_{\mathcal{F}}([0,T];\mathbb{R}^k)\}$ denotes the centralized admissible control set, and $\mathcal{U}^0_{ad}:=\{u(\cdot)\mid u(\cdot)\in L^2_{\mathcal{F}^0}([0,T];\mathbb{R}^k)\}$ the corresponding decentralized admissible control set. For each $i\in\mathcal{N}$, the centralized admissible control set of minor agent $\mathcal{A}_i$ is $\mathcal{U}^{c,i}_{ad}:=\{u_i(\cdot)\mid u_i(\cdot)\in L^2_{\mathcal{F}}([0,T];\mathbb{R}^k)\}$, while the corresponding decentralized admissible control set is $\mathcal{U}^i_{ad}:=\{u_i(\cdot)\mid u_i(\cdot)\in L^2_{\mathcal{F}^{0,i}}([0,T];\mathbb{R}^k)\}$. Note that $\mathcal{U}^i_{ad}\subset\mathcal{U}^{c,i}_{ad}$ for $i=0,1,\cdots,N$.
The dynamics of the major agent $\mathcal{A}_0$ are given as follows:
$$
\begin{cases}
dx_0(t) = [A_0x_0(t)+B_0u_0(t)+b_0x^{(N)}(t)+f_0(t)]\,dt + [C_0x_0(t)+D_0u_0(t)+l_0x^{(N)}(t)+\sigma_0(t)]\,dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
x_0(0) = a_0\in\mathbb{R}^n,
\end{cases}\tag{1}
$$
and the state of the minor agent $\mathcal{A}_i$ is described by
$$
\begin{cases}
dx_i(t) = [Ax_i(t)+Bu_i(t)+b_1x^{(N)}(t)+f(t)]\,dt + [Cx_i(t)+Du_i(t)+b_2x^{(N)}(t)+Hx_0(t)+\sigma(t)]\,dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
x_i(0) = a_i\in\mathbb{R}^n,\quad i=1,\cdots,N,
\end{cases}\tag{2}
$$
where $x^{(N)}(t)=\frac{1}{N}\sum_{j=1}^{N}x_j(t)$ represents the average state of all minor agents. Here, $A_0\in\mathbb{R}^{n\times n}$, $B_0\in\mathbb{R}^{n\times k}$, $C_0\in\mathbb{R}^{n\times n}$, $D_0\in\mathbb{R}^{n\times k}$, $b_0\in\mathbb{R}^{n\times n}$, $l_0\in\mathbb{R}^{n\times n}$, $F_0\in\mathbb{R}^n$, $A\in\mathbb{R}^{n\times n}$, $B\in\mathbb{R}^{n\times k}$, $C\in\mathbb{R}^{n\times n}$, $D\in\mathbb{R}^{n\times k}$, $b_1\in\mathbb{R}^{n\times n}$, $b_2\in\mathbb{R}^{n\times n}$, $H\in\mathbb{R}^{n\times n}$, and $F\in\mathbb{R}^n$ are given constant matrices (vectors), and $f_0(\cdot)$, $\sigma_0(\cdot)$, $f(\cdot)$, and $\sigma(\cdot)$ are given deterministic $\mathbb{R}^n$-valued functions. For given admissible controls $u_0$ and $u_i$, the systems (1) and (2) admit unique solutions $x_0(\cdot),x_i(\cdot)\in S^2_{\mathcal{F}}([0,T];\mathbb{R}^n)$.
Let $u=(u_0,u_1,\ldots,u_i,\ldots,u_N)$ denote the set of control strategies of all $N+1$ agents, and $u_{-i}=(u_0,u_1,\ldots,u_{i-1},u_{i+1},\ldots,u_N)$ for $i=0,1,\cdots,N$. The cost functional for the major agent $\mathcal{A}_0$ is
$$
J_0(u_0,u_{-0}) = \frac{1}{2}E\Big\{\int_0^T\big[\big\langle Q_0\big(x_0(t)-\beta_0x^{(N)}(t)\big),\,x_0(t)-\beta_0x^{(N)}(t)\big\rangle + \langle R_0u_0(t),u_0(t)\rangle\big]dt + \langle M_0x_0(T),x_0(T)\rangle\Big\}.\tag{3}
$$
The cost functional for minor agent $\mathcal{A}_i$, $1\le i\le N$, is
$$
J_i(u_i,u_{-i}) = \frac{1}{2}E\Big\{\int_0^T\big[\big\langle Q\big(x_i(t)-\beta_1x^{(N)}(t)-\beta_2x_0(t)\big),\,x_i(t)-\beta_1x^{(N)}(t)-\beta_2x_0(t)\big\rangle + \langle Ru_i(t),u_i(t)\rangle\big]dt + \langle Mx_i(T),x_i(T)\rangle\Big\}.\tag{4}
$$
The coefficients of the cost functionals satisfy $Q_0,Q\in\mathbb{S}^n$, $Q_0\ge 0$, $Q\ge 0$, $\beta_0,\beta_1,\beta_2\in\mathbb{R}^{n\times n}$, $R_0,R\in\mathbb{S}^k$, $R_0>0$, $R>0$, and $M_0,M\in\mathbb{S}^n$, $M_0\ge 0$, $M\ge 0$.
Parallel to (2), the cost functional (4) contains the term $\beta_2x_0(t)$ to capture the strong influence of the major agent. Note that the state dynamics (1) and (2) and the cost functionals (3) and (4) indicate that the major agent $\mathcal{A}_0$ has a significant influence on the minor agents, while each minor agent $\mathcal{A}_i$, $i\in\mathcal{N}$, has a negligible impact on the other agents in a large-$N$ population system.
Now, we propose the following LQG mean-field game.
Problem (LP): Find an admissible strategy $\bar u=(\bar u_0,\bar u_1,\ldots,\bar u_i,\ldots,\bar u_N)$, where $\bar u_i(\cdot)\in\mathcal{U}^{c,i}_{ad}$, $i=0,1,\cdots,N$, such that
$$
J_i(\bar u_i,\bar u_{-i}) = \inf_{u_i(\cdot)\in\,\mathcal{U}^{c,i}_{ad}}J_i(u_i,\bar u_{-i}),\quad i=0,1,\cdots,N.
$$
We call $\bar u$ a Nash equilibrium strategy for Problem (LP).
Remark 2.1. It should be noted that this paper only addresses the existence of Nash equilibrium strategies and does not treat their uniqueness. The study of the uniqueness of Nash equilibrium strategies is also an active research topic; the variational inequality approach proposed in He and Wang [28] provides a feasible methodology for studying it.
In this section, we first construct two auxiliary stochastic optimal control problems, which are called limiting systems, for the major and a generic minor agent in Sections 3.1 and 3.2, respectively. Then we present the approximations between the limiting systems and the corresponding mean-field system in Section 3.3.
For any $v_0(\cdot)\in\mathcal{U}^0_{ad}$, the state $y_0(\cdot)$ of agent $\mathcal{A}_0$ satisfies the following stochastic differential equation:
$$
\begin{cases}
dy_0(t) = [A_0y_0(t)+B_0v_0(t)+b_0x^{(0)}(t)+f_0(t)]\,dt + [C_0y_0(t)+D_0v_0(t)+l_0x^{(0)}(t)+\sigma_0(t)]\,dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
y_0(0) = a_0,
\end{cases}\tag{5}
$$
where the function $x^{(0)}(\cdot)$ will be specified later.
The corresponding cost functional is given by
$$
\tilde J_0(v_0) = \frac{1}{2}E\Big\{\int_0^T\big[\big\langle Q_0\big(y_0(t)-\beta_0x^{(0)}(t)\big),\,y_0(t)-\beta_0x^{(0)}(t)\big\rangle + \langle R_0v_0(t),v_0(t)\rangle\big]dt + \langle M_0y_0(T),y_0(T)\rangle\Big\}.
$$
Problem (LM1): The objective is to find $\bar v_0(\cdot)\in\mathcal{U}^0_{ad}$ such that
$$
\tilde J_0(\bar v_0) = \inf_{v_0\in\,\mathcal{U}^0_{ad}}\tilde J_0(v_0).
$$
Let $P_0(\cdot)$ be the solution of the following Riccati equation:
$$
\begin{cases}
-\dot P_0(t) = P_0(t)A_0 + A_0^\top P_0(t) + C_0^\top P_0(t)C_0 + Q_0 - \big(B_0^\top P_0(t)+D_0^\top P_0(t)C_0\big)^\top\big(R_0+D_0^\top P_0(t)D_0\big)^{-1}\big(B_0^\top P_0(t)+D_0^\top P_0(t)C_0\big),\\
R_0+D_0^\top P_0(t)D_0 \ge 0,\\
P_0(T) = M_0.
\end{cases}
$$
Let $\eta_0(\cdot)$ denote the solution of
$$
\begin{cases}
\dot\eta_0(t) = -\Big\{\big[A_0-B_0\big(R_0+D_0^\top P_0(t)D_0\big)^{-1}\big(B_0^\top P_0(t)+D_0^\top P_0(t)C_0\big)\big]^\top\eta_0(t) + \big[C_0-D_0\big(R_0+D_0^\top P_0(t)D_0\big)^{-1}\big(B_0^\top P_0(t)+D_0^\top P_0(t)C_0\big)\big]^\top P_0(t)\big(l_0x^{(0)}(t)+\sigma_0(t)\big)\\
\qquad\qquad + \big[P_0(t)\big(b_0x^{(0)}(t)+f_0(t)\big)-\beta_0Q_0x^{(0)}(t)\big]\Big\},\\
\eta_0(T) = 0.
\end{cases}
$$
The following result presents the optimal control of Problem (LM1).
Theorem 3.1. Define
$$
\begin{cases}
\Lambda_0(t) := -\big(R_0+D_0^\top P_0(t)D_0\big)^{-1}\big(B_0^\top P_0(t)+D_0^\top P_0(t)C_0\big),\\
\Theta_0(t) := -\big(R_0+D_0^\top P_0(t)D_0\big)^{-1}\big[B_0^\top\eta_0(t)+D_0^\top P_0(t)\big(l_0x^{(0)}(t)+\sigma_0(t)\big)\big].
\end{cases}
$$
Then the optimal control strategy of Problem (LM1) is
$$
\bar v_0(t) = \Lambda_0(t)\bar y_0(t)+\Theta_0(t),
$$
where ˉy0(⋅) satisfies
$$
\begin{cases}
d\bar y_0(t) = \big[(A_0+B_0\Lambda_0(t))\bar y_0(t)+B_0\Theta_0(t)+b_0x^{(0)}(t)+f_0(t)\big]dt + \big[(C_0+D_0\Lambda_0(t))\bar y_0(t)+D_0\Theta_0(t)+l_0x^{(0)}(t)+\sigma_0(t)\big]dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
\bar y_0(0) = a_0.
\end{cases}\tag{6}
$$
Proof. Let $\hat b(t):=b_0x^{(0)}(t)+f_0(t)$ and $\hat\sigma(t):=l_0x^{(0)}(t)+\sigma_0(t)$. Then the state equation (5) can be written as
$$
\begin{cases}
dy_0(t) = [A_0y_0(t)+B_0v_0(t)+\hat b(t)]\,dt + [C_0y_0(t)+D_0v_0(t)+\hat\sigma(t)]\,dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
y_0(0) = a_0.
\end{cases}
$$
For simplicity, we denote $\hat R_0(t):=R_0+D_0^\top P_0(t)D_0$ and $\hat B_0(t):=B_0^\top P_0(t)+D_0^\top P_0(t)C_0$. Applying Itô's formula to $\frac12 y_0^\top(t)P_0(t)y_0(t)+y_0^\top(t)\eta_0(t)$, we obtain
$$
\begin{aligned}
&E\Big\{\tfrac12 y_0^\top(T)P_0(T)y_0(T)-\tfrac12 y_0^\top(0)P_0(0)y_0(0)+y_0^\top(T)\eta_0(T)-y_0^\top(0)\eta_0(0)\Big\}\\
=\;& E\Big\{\tfrac12 M_0y_0^2(T)-\tfrac12 y_0^\top(0)P_0(0)y_0(0)-y_0^\top(0)\eta_0(0)\Big\}\\
=\;& E\int_0^T\Big[-\tfrac12 Q_0y_0^2-\tfrac12 y_0^2\hat B_0^2\hat R_0^{-1}+P_0y_0v_0^\top B_0^\top+P_0C_0y_0v_0^\top D_0^\top+\eta_0v_0^\top B_0^\top+\eta_0\hat b^\top\Big]dt + E\int_0^T\Big(\tfrac12 P_0D_0^2v_0^2+P_0D_0v_0\hat\sigma+\tfrac12 P_0\hat\sigma^2\Big)dt\\
&+ E\int_0^T\big[B_0\hat R_0^{-1}\big(B_0^\top P_0+D_0^\top P_0C_0\big)\big]^\top\eta_0y_0^\top dt + E\int_0^T\Big\{\big[D_0\hat R_0^{-1}\hat B_0\big]^\top P_0\hat\sigma y_0^\top+\beta_0Q_0x^{(0)}(t)y_0^\top\Big\}dt + \tfrac12 P_0F_0^2\int_0^T\!\int_{E_0}\pi_0(de)\,dt.
\end{aligned}
$$
Combining the above equation with the definition of $\tilde J_0(v_0)$, it follows that
$$
\begin{aligned}
\tilde J_0(v_0) =\;& E\Big\{\int_0^T\Big(\tfrac12 Q_0\big(y_0-\beta_0x^{(0)}(t)\big)^2+\tfrac12 R_0v_0^2\Big)dt+\tfrac12 M_0y_0^2(T)\Big\}\\
=\;& E\Big\{\int_0^T\Big[-\beta_0Q_0x^{(0)}(t)y_0+\tfrac12 Q_0\big(\beta_0x^{(0)}(t)\big)^2+\tfrac12 R_0v_0^2+\tfrac12 P_0D_0^2v_0^2+y_0v_0^\top P_0B_0^\top+y_0v_0^\top P_0C_0D_0^\top+\beta_0Q_0x^{(0)}(t)y_0^\top\\
&\quad+\tfrac12 y_0^2\hat B_0^2\hat R_0^{-1}+P_0D_0v_0\hat\sigma+\eta_0v_0^\top B_0^\top+\big[B_0\hat R_0^{-1}\hat B_0\big]^\top\eta_0y_0^\top+\big[D_0\hat R_0^{-1}\hat B_0\big]^\top P_0\hat\sigma y_0^\top+\tfrac12 P_0\hat\sigma^2+\eta_0\hat b^\top\Big]dt\Big\}\\
&+\tfrac12 P_0F_0^2\int_0^T\!\int_{E_0}\pi_0(de)\,dt+\tfrac12 a_0^2P_0(0)+a_0\eta_0(0)\\
=\;& E\Big\{\int_0^T\Big[\tfrac12\hat R_0^{-1}\Big\{\big[\hat R_0v_0+\hat B_0y_0\big]^2+2\big(B_0^\top\eta_0+D_0^\top P_0\hat\sigma\big)\big(\hat R_0v_0+\hat B_0y_0\big)\Big\}+\tfrac12 P_0\hat\sigma^2+\eta_0\hat b^\top\Big]dt\Big\}\\
&+\tfrac12 P_0F_0^2\int_0^T\!\int_{E_0}\pi_0(de)\,dt+\tfrac12 a_0^2P_0(0)+a_0\eta_0(0)\\
=\;& E\Big\{\int_0^T\Big[\tfrac12\hat R_0^{-1}\big\|\hat R_0v_0+\hat B_0y_0+\big(B_0^\top\eta_0+D_0^\top P_0\hat\sigma\big)\big\|^2-\tfrac12\hat R_0^{-1}\big(B_0^\top\eta_0+D_0^\top P_0\hat\sigma\big)^2+\tfrac12 P_0\hat\sigma^2+\eta_0\hat b^\top\Big]dt\Big\}\\
&+\tfrac12 P_0F_0^2\int_0^T\!\int_{E_0}\pi_0(de)\,dt+\tfrac12 a_0^2P_0(0)+a_0\eta_0(0).
\end{aligned}
$$
Hence we obtain the optimal control
$$
\bar v_0(t) = -\hat R_0^{-1}(t)\hat B_0(t)\bar y_0(t)-\hat R_0^{-1}(t)\big(B_0^\top\eta_0(t)+D_0^\top P_0(t)\hat\sigma(t)\big) = \Lambda_0(t)\bar y_0(t)+\Theta_0(t).
$$
The proof is therefore complete.
For any $i\in\mathcal{N}$, the limiting state of minor agent $\mathcal{A}_i$ is
$$
\begin{cases}
dy_i(t) = [Ay_i(t)+Bv_i(t)+b_1x^{(0)}(t)+f(t)]\,dt + [Cy_i(t)+Dv_i(t)+b_2x^{(0)}(t)+Hy_0(t)+\sigma(t)]\,dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
y_i(0) = a_i.
\end{cases}
$$
The limiting cost functional is given by
$$
\tilde J_i(v_i) = \frac12 E\Big\{\int_0^T\big[\big\langle Q\big(y_i(t)-\beta_1x^{(0)}(t)-\beta_2y_0(t)\big),\,y_i(t)-\beta_1x^{(0)}(t)-\beta_2y_0(t)\big\rangle + \langle Rv_i(t),v_i(t)\rangle\big]dt + \langle My_i(T),y_i(T)\rangle\Big\}.
$$
Problem (LM2): Find a control strategy $\bar v_i(\cdot)\in\mathcal{U}^i_{ad}$, $1\le i\le N$, such that
$$
\tilde J_i(\bar v_i) = \inf_{v_i\in\,\mathcal{U}^i_{ad}}\tilde J_i(v_i).
$$
Let $P_1(\cdot)$ be the solution of the following Riccati equation:
$$
\begin{cases}
-\dot P_1(t) = P_1(t)A + A^\top P_1(t) + C^\top P_1(t)C + Q - \big(B^\top P_1(t)+D^\top P_1(t)C\big)^\top\big(R+D^\top P_1(t)D\big)^{-1}\big(B^\top P_1(t)+D^\top P_1(t)C\big),\\
R+D^\top P_1(t)D \ge 0,\\
P_1(T) = M.
\end{cases}
$$
$\eta_1(\cdot)$ satisfies
$$
\begin{cases}
\dot\eta_1(t) = -\Big\{\big[A-B\big(R+D^\top P_1(t)D\big)^{-1}\big(B^\top P_1(t)+D^\top P_1(t)C\big)\big]^\top\eta_1(t) + \big[C-D\big(R+D^\top P_1(t)D\big)^{-1}\big(B^\top P_1(t)+D^\top P_1(t)C\big)\big]^\top P_1(t)\big(b_2x^{(0)}(t)+Hy_0(t)+\sigma(t)\big)\\
\qquad\qquad + \big[P_1(t)\big(b_1x^{(0)}(t)+f(t)\big)-\beta_1Qx^{(0)}(t)-\beta_2Qy_0(t)\big]\Big\},\\
\eta_1(T) = 0.
\end{cases}
$$
Denote
$$
\begin{cases}
\Lambda_1(t) := -\big(R+D^\top P_1(t)D\big)^{-1}\big(B^\top P_1(t)+D^\top P_1(t)C\big),\\
\Theta_1(t) := -\big(R+D^\top P_1(t)D\big)^{-1}\big[B^\top\eta_1(t)+D^\top P_1(t)\big(b_2x^{(0)}(t)+Hy_0(t)+\sigma(t)\big)\big],\\
\bar\Theta_1(t) := -\big(R+D^\top P_1(t)D\big)^{-1}\big[B^\top\eta_1(t)+D^\top P_1(t)\big(b_2x^{(0)}(t)+H\bar y_0(t)+\sigma(t)\big)\big].
\end{cases}
$$
Using a proof similar to that of Theorem 3.1, we have the following result.
Theorem 3.2. The optimal control strategy of Problem (LM2) is
$$
\bar v_i(t) = \Lambda_1(t)\bar y_i(t)+\bar\Theta_1(t),
$$
where ˉyi(⋅) satisfies
$$
\begin{cases}
d\bar y_i(t) = \big[(A+B\Lambda_1(t))\bar y_i(t)+B\bar\Theta_1(t)+b_1x^{(0)}(t)+f(t)\big]dt + \big[(C+D\Lambda_1(t))\bar y_i(t)+D\bar\Theta_1(t)+b_2x^{(0)}(t)+H\bar y_0(t)+\sigma(t)\big]dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
\bar y_i(0) = a_i.
\end{cases}\tag{7}
$$
In this subsection, we design a closed-loop mean-field system, and show the approximations between the limiting system and the corresponding closed-loop system.
Based on the feedback formulation of the optimal controls for the major agent $\mathcal{A}_0$ and the minor agents $\mathcal{A}_i$, $1\le i\le N$, we obtain
$$
\begin{cases}
d\bar x_0(t) = \big[(A_0+B_0\Lambda_0(t))\bar x_0(t)+B_0\Theta_0(t)+b_0\bar x^{(N)}(t)+f_0(t)\big]dt + \big[(C_0+D_0\Lambda_0(t))\bar x_0(t)+D_0\Theta_0(t)+l_0\bar x^{(N)}(t)+\sigma_0(t)\big]dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
\bar x_0(0) = a_0,
\end{cases}\tag{8}
$$
and
$$
\begin{cases}
d\bar x_i(t) = \big[(A+B\Lambda_1(t))\bar x_i(t)+B\bar\Theta_1(t)+b_1\bar x^{(N)}(t)+f(t)\big]dt + \big[(C+D\Lambda_1(t))\bar x_i(t)+D\bar\Theta_1(t)+b_2\bar x^{(N)}(t)+H\bar x_0(t)+\sigma(t)\big]dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
\bar x_i(0) = a_i.
\end{cases}\tag{9}
$$
By $\bar x^{(N)}(t)=\frac1N\sum_{j=1}^N\bar x_j(t)$, the function $x^{(0)}(t)$ fulfills
$$
\begin{cases}
dx^{(0)}(t) = \big[(A+B\Lambda_1(t)+b_1)x^{(0)}(t)+B\bar\Theta_1(t)+f(t)\big]dt,\\
x^{(0)}(0) = \frac1N\sum_{j=1}^N a_j.
\end{cases}\tag{10}
$$
Now, we introduce the following NCE equation:
$$
\begin{cases}
d\bar y_0(t) = \big[(A_0+B_0\Lambda_0(t))\bar y_0(t)+B_0\Theta_0(t)+b_0x^{(0)}(t)+f_0(t)\big]dt + \big[(C_0+D_0\Lambda_0(t))\bar y_0(t)+D_0\Theta_0(t)+l_0x^{(0)}(t)+\sigma_0(t)\big]dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
\dot x^{(0)}(t) = (A+B\Lambda_1(t)+b_1)x^{(0)}(t) - B\big(R+D^\top P_1(t)D\big)^{-1}\big[B^\top\eta_1(t)+D^\top P_1(t)\big(b_2x^{(0)}(t)+H\bar y_0(t)+\sigma(t)\big)\big] + f(t),\\
-\dot\eta_1(t) = [A+B\Lambda_1(t)]^\top\eta_1(t) + [C+D\Lambda_1(t)]^\top P_1(t)\big[b_2x^{(0)}(t)+H\bar y_0(t)+\sigma(t)\big] + P_1(t)\big(b_1x^{(0)}(t)+f(t)\big) - \beta_1Qx^{(0)}(t) - \beta_2Q\bar y_0(t),\\
-\dot\eta_0(t) = [A_0+B_0\Lambda_0(t)]^\top\eta_0(t) + [C_0+D_0\Lambda_0(t)]^\top P_0(t)\big(l_0x^{(0)}(t)+\sigma_0(t)\big) + P_0(t)\big(b_0x^{(0)}(t)+f_0(t)\big) - \beta_0Q_0x^{(0)}(t),\\
\bar y_0(0)=a_0,\quad \eta_0(T)=\eta_1(T)=0,\quad x^{(0)}(0)=\frac1N\sum_{j=1}^N a_j,
\end{cases}
$$
which can be written as
$$
\begin{cases}
d\bar y_0(t) = \big[\hat A_0(t)\bar y_0(t)+G_0(t)x^{(0)}(t)-B_0\hat R_0^{-1}(t)B_0^\top\eta_0(t)+\hat G_0(t)\big]dt + \big[C_0(t)\bar y_0(t)+H_0(t)x^{(0)}(t)-D_0\hat R_0^{-1}(t)B_0^\top\eta_0(t)+\hat H_0(t)\big]dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
\dot x^{(0)}(t) = G_1(t)x^{(0)}(t) - B\hat R^{-1}(t)\big[B^\top\eta_1(t)+D^\top P_1(t)H\bar y_0(t)+D^\top P_1(t)\sigma(t)\big] + f(t),\\
-\dot\eta_1(t) = \hat A^\top(t)\eta_1(t) + L_1(t)x^{(0)}(t) + H_1(t)\bar y_0(t) + K_1(t),\\
-\dot\eta_0(t) = \hat A_0^\top(t)\eta_0(t) + L_0(t)x^{(0)}(t) + K_0(t),\\
\bar y_0(0)=a_0,\quad \eta_0(T)=\eta_1(T)=0,\quad x^{(0)}(0)=\frac1N\sum_{j=1}^N a_j,
\end{cases}\tag{11}
$$
where
$$
\begin{aligned}
&\hat A_0(t):=A_0+B_0\Lambda_0(t), &&G_0(t):=-B_0\hat R_0^{-1}(t)D_0^\top P_0(t)l_0+b_0,\\
&\hat R_0(t):=R_0+D_0^\top P_0(t)D_0, &&\hat G_0(t):=f_0(t)-B_0\hat R_0^{-1}(t)D_0^\top P_0(t)\sigma_0(t),\\
&C_0(t):=C_0+D_0\Lambda_0(t), &&H_0(t):=-D_0\hat R_0^{-1}(t)D_0^\top P_0(t)l_0+l_0,\\
&\hat H_0(t):=\sigma_0(t)-D_0\hat R_0^{-1}(t)D_0^\top P_0(t)\sigma_0(t), &&G_1(t):=\hat A(t)+b_1-B\hat R^{-1}(t)D^\top P_1(t)b_2,\\
&\hat A(t):=A+B\Lambda_1(t), &&L_1(t):=[C+D\Lambda_1(t)]^\top P_1(t)b_2+P_1(t)b_1-\beta_1Q,\\
&\hat R(t):=R+D^\top P_1(t)D, &&H_1(t):=[C+D\Lambda_1(t)]^\top P_1(t)H-\beta_2Q,\\
&K_1(t):=[C+D\Lambda_1(t)]^\top P_1(t)\sigma(t)+P_1(t)f(t), &&K_0(t):=[C_0+D_0\Lambda_0(t)]^\top P_0(t)\sigma_0(t)+P_0(t)f_0(t),\\
&L_0(t):=[C_0+D_0\Lambda_0(t)]^\top P_0(t)l_0+P_0(t)b_0-\beta_0Q_0.
\end{aligned}
$$
The above NCE equation is a coupled two-point boundary value problem; its well-posedness can be established as in Theorem 4.2 of Hu et al. [3] under suitable monotonicity assumptions, which we do not repeat here.
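Although the CC system here is stochastic, the mechanics of solving a coupled forward-backward two-point boundary value problem can be illustrated on a deterministic toy analogue with one forward component $x$ and one backward component $\eta$. In the sketch below, the coefficients g1, beta, alpha, l1 and the initial value are hypothetical placeholders (not the paper's $G_1$, $L_1$, etc.), and the forward-backward Picard sweep is only one of several possible schemes.

```python
import numpy as np

# Forward-backward sweep for a toy two-point boundary value problem:
#   dx/dt = g1 x + beta eta,  x(0) = x0     (forward)
#   -d(eta)/dt = alpha eta + l1 x, eta(T) = 0   (backward)
K, T = 400, 1.0
dt = T / K
g1, beta, alpha, l1 = -1.0, 0.5, -1.0, 0.3
x0 = 1.0

x = np.zeros(K + 1)
eta = np.zeros(K + 1)                  # initial guess for the backward component
for sweep in range(200):
    x[0] = x0
    for k in range(K):                 # forward sweep with frozen eta
        x[k + 1] = x[k] + (g1 * x[k] + beta * eta[k]) * dt
    eta_new = np.zeros(K + 1)          # terminal condition eta(T) = 0
    for k in range(K, 0, -1):          # backward sweep with frozen x
        eta_new[k - 1] = eta_new[k] + (alpha * eta_new[k] + l1 * x[k]) * dt
    gap = np.max(np.abs(eta_new - eta))
    eta = eta_new
    if gap < 1e-10:
        break
print(f"sweeps: {sweep + 1}, residual: {gap:.1e}")
```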
Next, we establish the approximation relationship between the closed-loop mean-field game system and the limiting system.
Proposition 3.3. The following estimates hold:
$$
\begin{aligned}
&(i)\ \sup_{0\le t\le T}E\big|\bar x^{(N)}(t)-x^{(0)}(t)\big|^2 = O\Big(\tfrac1N\Big), &&(ii)\ \sup_{0\le t\le T}E\Big|\,|\bar x^{(N)}(t)|^2-|x^{(0)}(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big),\\
&(iii)\ \sup_{0\le t\le T}E\big|\bar x_0(t)-\bar y_0(t)\big|^2 = O\Big(\tfrac1N\Big), &&(iv)\ \sup_{0\le t\le T}E\Big|\,|\bar x_0(t)|^2-|\bar y_0(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big),\\
&(v)\ \sup_{0\le t\le T}E\big|\bar x_i(t)-\bar y_i(t)\big|^2 = O\Big(\tfrac1N\Big),\ 1\le i\le N, &&(vi)\ \sup_{0\le t\le T}E\Big|\,|\bar x_i(t)|^2-|\bar y_i(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big),\ 1\le i\le N.
\end{aligned}
$$
Proof. Let $\bar z(t):=\bar x^{(N)}(t)-x^{(0)}(t)$, $\bar z_0(t):=\bar x_0(t)-\bar y_0(t)$, and $\bar z_i(t):=\bar x_i(t)-\bar y_i(t)$, $1\le i\le N$. Combining (9) with (10), we derive
$$
\begin{cases}
d\bar z(t) = (A+B\Lambda_1(t)+b_1)\bar z(t)\,dt + \frac1N\displaystyle\sum_{j=1}^N\big[(C+D\Lambda_1(t))\bar x_j(t)+D\bar\Theta_1(t)+b_2\bar x^{(N)}(t)+H\bar x_0(t)+\sigma(t)\big]dW_j(t) + \frac1N\displaystyle\sum_{j=1}^N F\int_{E_j}G_j(de\,dt),\\
\bar z(0)=0.
\end{cases}
$$
Define $\chi(t):=b_2\bar x^{(N)}(t)+H\bar x_0(t)+\sigma(t)$. Applying Itô's formula to $\bar z^2(t)$, we obtain
$$
\begin{aligned}
E[\bar z^2(t)] =\;& 2\int_0^t(A+B\Lambda_1(s)+b_1)E[\bar z^2(s)]\,ds + \frac{1}{N^2}\sum_{j=1}^N E\int_0^t\big[(C+D\Lambda_1(s))\bar x_j(s)+D\bar\Theta_1(s)+\chi(s)\big]^2ds + \frac{F^2}{N^2}\sum_{j=1}^N E\int_0^t\!\int_{E_j}\pi_j(de)\,ds\\
\le\;& 2\sup_{0\le t\le T}\big(A+B\Lambda_1(t)+b_1\big)\int_0^t E[\bar z^2(s)]\,ds + \frac TN\max_{1\le j\le N}\sup_{0\le t\le T}E\big[(C+D\Lambda_1(t))\bar x_j(t)+D\bar\Theta_1(t)+\chi(t)\big]^2 + \frac{F^2}{N}\max_{1\le j\le N}E\int_0^t\!\int_{E_j}\pi_j(de)\,ds.
\end{aligned}
$$
According to Gronwall's inequality, it follows that
$$
\sup_{0\le t\le T}E\big|\bar x^{(N)}(t)-x^{(0)}(t)\big|^2 = O\Big(\frac1N\Big).\tag{12}
$$
For (ii), according to Hölder's inequality, we have
$$
E\Big|\,|\bar x^{(N)}(t)|^2-|x^{(0)}(t)|^2\,\Big| = E\Big|\,|\bar x^{(N)}(t)-x^{(0)}(t)|^2+2x^{(0)}(t)\big(\bar x^{(N)}(t)-x^{(0)}(t)\big)\Big| \le E\big[|\bar x^{(N)}(t)-x^{(0)}(t)|^2\big] + 2\,|x^{(0)}(t)|\Big(E\big[|\bar x^{(N)}(t)-x^{(0)}(t)|^2\big]\Big)^{\frac12}.
$$
By (12) and the boundedness of |x(0)(t)|, one has
$$
\sup_{0\le t\le T}E\Big|\,|\bar x^{(N)}(t)|^2-|x^{(0)}(t)|^2\,\Big| = O\Big(\frac{1}{\sqrt N}\Big).
$$
We now prove (iii). According to (6) and (8), it follows that
$$
\begin{cases}
d\bar z_0(t) = \big[(A_0+B_0\Lambda_0(t))\bar z_0(t)+b_0\bar z(t)\big]dt + \big[(C_0+D_0\Lambda_0(t))\bar z_0(t)+l_0\bar z(t)\big]dW_0(t),\\
\bar z_0(0)=0.
\end{cases}
$$
Applying Itô's formula to $\bar z_0^2(t)$, we obtain
$$
\begin{aligned}
E[\bar z_0^2(t)] =\;& 2\int_0^t E\big[(A_0+B_0\Lambda_0(s))\bar z_0^2(s)+b_0\bar z(s)\bar z_0(s)\big]ds + \int_0^t E\big[(C_0+D_0\Lambda_0(s))\bar z_0(s)+l_0\bar z(s)\big]^2ds\\
\le\;& 2\int_0^t\big[(A_0+B_0\Lambda_0(s))+(C_0+D_0\Lambda_0(s))^2+b_0^2\big]E\bar z_0^2(s)\,ds + \int_0^t\Big(\tfrac12+2l_0^2\Big)E\bar z^2(s)\,ds.
\end{aligned}
$$
By (12) and Gronwall's inequality, we have
$$
\sup_{0\le t\le T}E|\bar x_0(t)-\bar y_0(t)|^2 = O\Big(\frac1N\Big).\tag{13}
$$
Note that
$$
E\Big|\,|\bar x_0(t)|^2-|\bar y_0(t)|^2\,\Big| = E\Big|\,|\bar x_0(t)-\bar y_0(t)|^2+2\bar y_0(t)\big(\bar x_0(t)-\bar y_0(t)\big)\Big| \le E\big[|\bar x_0(t)-\bar y_0(t)|^2\big] + 2\big(E[|\bar y_0(t)|^2]\big)^{\frac12}\big(E[|\bar x_0(t)-\bar y_0(t)|^2]\big)^{\frac12}.
$$
According to (13) and the boundedness of $E|\bar y_0(t)|^2$, we obtain
$$
\sup_{0\le t\le T}E\Big|\,|\bar x_0(t)|^2-|\bar y_0(t)|^2\,\Big| = O\Big(\frac{1}{\sqrt N}\Big).
$$
Next, we prove (v). Combining (7) with (9), we have
$$
\begin{cases}
d\bar z_i(t) = \big[(A+B\Lambda_1(t))\bar z_i(t)+b_1\bar z(t)\big]dt + \big[(C+D\Lambda_1(t))\bar z_i(t)+b_2\bar z(t)+H\bar z_0(t)\big]dW_i(t),\\
\bar z_i(0)=0.
\end{cases}
$$
Applying Itô's formula to $\bar z_i^2(t)$, we obtain
$$
\begin{aligned}
E[\bar z_i^2(t)] =\;& 2\int_0^t E\big[(A+B\Lambda_1(s))\bar z_i^2(s)+b_1\bar z(s)\bar z_i(s)\big]ds + \int_0^t E\big[(C+D\Lambda_1(s))\bar z_i(s)+b_2\bar z(s)+H\bar z_0(s)\big]^2ds\\
\le\;& \int_0^t\big[2(A+B\Lambda_1(s))+b_1^2+3(C+D\Lambda_1(s))^2\big]E\bar z_i^2(s)\,ds + \int_0^t\big(1+3b_2^2\big)E\bar z^2(s)\,ds + 3H^2\int_0^t E\bar z_0^2(s)\,ds.
\end{aligned}
$$
By Gronwall's inequality and the estimates (12) and (13), we obtain
$$
\sup_{0\le t\le T}E|\bar x_i(t)-\bar y_i(t)|^2 = O\Big(\frac1N\Big).\tag{14}
$$
Finally, we prove (vi). Note that
$$
E\Big|\,|\bar x_i(t)|^2-|\bar y_i(t)|^2\,\Big| \le E\big[|\bar x_i(t)-\bar y_i(t)|^2\big] + 2E\big[|\bar y_i(t)||\bar x_i(t)-\bar y_i(t)|\big] \le E\big[|\bar x_i(t)-\bar y_i(t)|^2\big] + 2\big(E[|\bar y_i(t)|^2]\big)^{\frac12}\big(E[|\bar x_i(t)-\bar y_i(t)|^2]\big)^{\frac12}.
$$
According to (14) and the boundedness of $E|\bar y_i(t)|^2$, we get
$$
\sup_{0\le t\le T}E\Big|\,|\bar x_i(t)|^2-|\bar y_i(t)|^2\,\Big| = O\Big(\frac{1}{\sqrt N}\Big).
$$
The proof is then complete.
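The mechanism behind these $O(1/N)$ rates is the $L^2$ law of large numbers applied to the averaged martingale terms. The following Monte Carlo sketch illustrates the rate on a toy scalar population with hypothetical coefficients (not the closed-loop system (8) and (9)): each agent follows an independent jump-diffusion whose mean solves a deterministic ODE, and the worst-case squared gap between the empirical average and the limit roughly halves as $N$ doubles.

```python
import numpy as np

rng = np.random.default_rng(1)

# Empirical check of the O(1/N) rate in Proposition 3.3 (i) for a toy scalar
# population: dx_i = a x_i dt + c x_i dW_i + f dJ_i (compensated jumps), whose
# mean-field limit x0 solves dx0/dt = a x0. All coefficients are placeholders.
a, c, f_jump, lam, jump_mark = -0.5, 0.4, 0.2, 2.0, 0.3
T, K, n_mc = 1.0, 200, 200
dt = T / K

def worst_gap(N):
    gaps = np.zeros(K + 1)
    for _ in range(n_mc):
        x = np.ones(N)
        x0 = 1.0
        for k in range(K):
            dW = rng.normal(0.0, np.sqrt(dt), size=N)
            dJ = rng.poisson(lam * dt, size=N) * jump_mark - lam * jump_mark * dt
            x = x + a * x * dt + c * x * dW + f_jump * dJ
            x0 = x0 + a * x0 * dt
            gaps[k + 1] += (x.mean() - x0) ** 2
    return gaps.max() / n_mc   # estimate of sup_t E|xbar^(N)(t) - x^(0)(t)|^2

for N in (50, 100, 200, 400):
    print(N, worst_gap(N))     # gap roughly halves as N doubles
```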
Define the control strategy for the major agent as
$$
\bar u_0(t) = \Lambda_0(t)\bar x_0(t)+\Theta_0(t),\tag{15}
$$
and the control strategy for minor agents as
$$
\bar u_i(t) = \Lambda_1(t)\bar x_i(t)+\bar\Theta_1(t).\tag{16}
$$
Based on the approximation relationship between the closed-loop mean-field system and the limiting system, the following approximation relationship between the cost functionals can be derived.
Proposition 3.4. For any i=0,1,⋯,N, we have
$$
\big|J_i(\bar u_i,\bar u_{-i})-\tilde J_i(\bar v_i)\big| = O\Big(\frac{1}{\sqrt N}\Big).
$$
Proof. Based on the definitions of the cost functionals, we obtain
$$
\begin{aligned}
&\big|J_i(\bar u_i,\bar u_{-i})-\tilde J_i(\bar v_i)\big|\\
=\;&\Big|\tfrac12 E\int_0^T\Big\{Q\big[\big(\bar x_i(t)-\beta_1\bar x^{(N)}(t)-\beta_2\bar x_0(t)\big)^2-\big(\bar y_i(t)-\beta_1x^{(0)}(t)-\beta_2\bar y_0(t)\big)^2\big]+R\big[\bar u_i^2(t)-\bar v_i^2(t)\big]\Big\}dt+\tfrac12 E\big[M\bar x_i^2(T)-M\bar y_i^2(T)\big]\Big|\\
=\;&\Big|\tfrac12 E\int_0^T\Big\{Q\big[\big(\bar x_i(t)-\beta_1\bar x^{(N)}(t)-\beta_2\bar x_0(t)\big)+\big(\bar y_i(t)-\beta_1x^{(0)}(t)-\beta_2\bar y_0(t)\big)\big]\big[\big(\bar x_i(t)-\beta_1\bar x^{(N)}(t)-\beta_2\bar x_0(t)\big)-\big(\bar y_i(t)-\beta_1x^{(0)}(t)-\beta_2\bar y_0(t)\big)\big]\\
&\quad+R\big[\big(\Lambda_1(t)\bar x_i(t)+\bar\Theta_1(t)\big)^2-\big(\Lambda_1(t)\bar y_i(t)+\bar\Theta_1(t)\big)^2\big]\Big\}dt+\tfrac12 E\big[M\bar x_i^2(T)-M\bar y_i^2(T)\big]\Big|\\
=\;&\Big|\tfrac12 E\int_0^T\Big\{Q\big[\big(2\bar x_i(t)-2\beta_1\bar x^{(N)}(t)-2\beta_2\bar x_0(t)\big)-L(t)\big]L(t)+R\big[\Lambda_1^2(t)\big(\bar x_i^2(t)-\bar y_i^2(t)\big)+2\Lambda_1(t)\bar\Theta_1(t)\big(\bar x_i(t)-\bar y_i(t)\big)\big]\Big\}dt+\tfrac12 E\big[M\bar x_i^2(T)-M\bar y_i^2(T)\big]\Big|\\
\le\;&\tfrac12\int_0^T\Big\{Q\,E\big[\big|\big(2\bar x_i(t)-2\beta_1\bar x^{(N)}(t)-2\beta_2\bar x_0(t)\big)L(t)\big|\big]+Q\,E\big[|L(t)|^2\big]+R\Lambda_1^2(t)E\big[|\bar x_i^2(t)-\bar y_i^2(t)|\big]+2R\,E\big[|\Lambda_1(t)\bar\Theta_1(t)||\bar x_i(t)-\bar y_i(t)|\big]\Big\}dt+\tfrac12 M\,E\big[|\bar x_i^2(T)-\bar y_i^2(T)|\big]\\
\le\;&\tfrac12 QT\sup_{0\le t\le T}E\big[\big|\big(2\bar x_i(t)-2\beta_1\bar x^{(N)}(t)-2\beta_2\bar x_0(t)\big)L(t)\big|\big]+\tfrac12 QT\sup_{0\le t\le T}E\big[|L(t)|^2\big]+\tfrac12 RT\sup_{0\le t\le T}\Lambda_1^2(t)\sup_{0\le t\le T}E\big[|\bar x_i^2(t)-\bar y_i^2(t)|\big]\\
&\quad+RT\sup_{0\le t\le T}E\big[|\Lambda_1(t)\bar\Theta_1(t)||\bar x_i(t)-\bar y_i(t)|\big]+\tfrac12 M\,E\big[|\bar x_i^2(T)-\bar y_i^2(T)|\big],
\end{aligned}
$$
where $L(t):=(\bar x_i(t)-\bar y_i(t))-\beta_1(\bar x^{(N)}(t)-x^{(0)}(t))-\beta_2(\bar x_0(t)-\bar y_0(t))$. According to Proposition 3.3, we have $E[|L(t)|^2]=O(1/N)$. Therefore, it follows that $|J_i(\bar u_i,\bar u_{-i})-\tilde J_i(\bar v_i)|=O(1/\sqrt N)$. The proof is then complete.
This section verifies the asymptotic Nash equilibrium property of the decentralized control strategies $\bar u=(\bar u_0,\bar u_1,\cdots,\bar u_N)$ specified by (15) and (16).
Let the major agent $\mathcal{A}_0$ take an alternative control strategy $u_0$, and let each minor agent $\mathcal{A}_i$ take the control law (16). Then the state system with the major agent's perturbation is
$$
\begin{cases}
d\tilde x_0(t) = [A_0\tilde x_0(t)+B_0u_0(t)+b_0\tilde x^{(N)}(t)+f_0(t)]\,dt + [C_0\tilde x_0(t)+D_0u_0(t)+l_0\tilde x^{(N)}(t)+\sigma_0(t)]\,dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
d\tilde x_i(t) = \big[(A+B\Lambda_1(t))\tilde x_i(t)+B\bar\Theta_1(t)+b_1\tilde x^{(N)}(t)+f(t)\big]dt + \big[(C+D\Lambda_1(t))\tilde x_i(t)+D\bar\Theta_1(t)+b_2\tilde x^{(N)}(t)+H\tilde x_0(t)+\sigma(t)\big]dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
\tilde x_0(0)=a_0,\quad \tilde x_i(0)=a_i,\ i=1,\cdots,N,
\end{cases}\tag{17}
$$
where $\tilde x^{(N)}(t)=\frac1N\sum_{k=1}^N\tilde x_k(t)$. The cost functional for the major agent $\mathcal{A}_0$ is
$$
J_0(u_0,\bar u_{-0}) = \frac12 E\Big\{\int_0^T\big[\big\langle Q_0\big(\tilde x_0(t)-\beta_0\tilde x^{(N)}(t)\big),\,\tilde x_0(t)-\beta_0\tilde x^{(N)}(t)\big\rangle + \langle R_0u_0(t),u_0(t)\rangle\big]dt + \langle M_0\tilde x_0(T),\tilde x_0(T)\rangle\Big\}.
$$
The corresponding limiting state equation with the major agent's perturbation control is
$$
\begin{cases}
d\tilde y_0(t) = [A_0\tilde y_0(t)+B_0u_0(t)+b_0x^{(0)}(t)+f_0(t)]\,dt + [C_0\tilde y_0(t)+D_0u_0(t)+l_0x^{(0)}(t)+\sigma_0(t)]\,dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
d\tilde y_i(t) = \big[(A+B\Lambda_1(t))\tilde y_i(t)+B\bar\Theta_1(t)+b_1x^{(0)}(t)+f(t)\big]dt + \big[(C+D\Lambda_1(t))\tilde y_i(t)+D\bar\Theta_1(t)+b_2x^{(0)}(t)+H\tilde y_0(t)+\sigma(t)\big]dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
\tilde y_0(0)=a_0,\quad \tilde y_i(0)=a_i,\ i=1,\cdots,N.
\end{cases}
$$
The cost functional is
$$
\tilde J_0(u_0) = \frac12 E\Big\{\int_0^T\big[\big\langle Q_0\big(\tilde y_0(t)-\beta_0x^{(0)}(t)\big),\,\tilde y_0(t)-\beta_0x^{(0)}(t)\big\rangle + \langle R_0u_0(t),u_0(t)\rangle\big]dt + \langle M_0\tilde y_0(T),\tilde y_0(T)\rangle\Big\}.
$$
The following result presents an approximation relationship between two perturbation systems.
Proposition 4.1. We have the following conclusion:
$$
\begin{aligned}
&(i)\ \sup_{0\le t\le T}E\big|\tilde x^{(N)}(t)-x^{(0)}(t)\big|^2 = O\Big(\tfrac1N\Big), &&(ii)\ \sup_{0\le t\le T}E\Big|\,|\tilde x^{(N)}(t)|^2-|x^{(0)}(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big),\\
&(iii)\ \sup_{0\le t\le T}E\big|\tilde x_0(t)-\tilde y_0(t)\big|^2 = O\Big(\tfrac1N\Big), &&(iv)\ \sup_{0\le t\le T}E\Big|\,|\tilde x_0(t)|^2-|\tilde y_0(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big).
\end{aligned}
$$
Proof. We only need to prove the first estimate; the other three can be obtained by a proof similar to that of Proposition 3.3.
Define $\Phi(t):=\tilde x^{(N)}(t)-x^{(0)}(t)$. Combining (10) with (17), we have
$$
\begin{cases}
d\Phi(t) = (A+B\Lambda_1(t)+b_1)\Phi(t)\,dt + \frac1N\displaystyle\sum_{k=1}^N\big[(C+D\Lambda_1(t))\tilde x_k(t)+D\bar\Theta_1(t)+b_2\tilde x^{(N)}(t)+H\tilde x_0(t)+\sigma(t)\big]dW_k(t) + \frac1N\displaystyle\sum_{k=1}^N F\int_{E_k}G_k(de\,dt),\\
\Phi(0)=0.
\end{cases}
$$
Define $L_k(t):=(C+D\Lambda_1(t))\tilde x_k(t)+D\bar\Theta_1(t)+b_2\tilde x^{(N)}(t)+H\tilde x_0(t)+\sigma(t)$. Therefore,
$$
\begin{aligned}
E\int_0^t|L_k(s)|^2ds =\;& E\int_0^t\big[(C+D\Lambda_1(s))\tilde x_k(s)+D\bar\Theta_1(s)+b_2\big(\tilde x^{(N)}(s)-x^{(0)}(s)\big)+b_2x^{(0)}(s)+H\tilde x_0(s)+\sigma(s)\big]^2ds\\
\le\;& C\,E\int_0^t\big[|\tilde x_k(s)|^2+1+|\tilde x^{(N)}(s)-x^{(0)}(s)|^2+|x^{(0)}(s)|^2+|\tilde x_0(s)|^2+|\sigma(s)|^2\big]ds\\
\le\;& C\,E\int_0^t\big|\tilde x^{(N)}(s)-x^{(0)}(s)\big|^2ds + C_1,
\end{aligned}
$$
where
$$
C:=\max\Big\{\sup_{t\in[0,T]}|C+D\Lambda_1(t)|,\ \sup_{t\in[0,T]}|D\bar\Theta_1(t)|,\ |b_2|,\ |H|,\ 1\Big\},\qquad C_1:=C\,E\int_0^T\big[|\tilde x_k(s)|^2+1+|x^{(0)}(s)|^2+|\tilde x_0(s)|^2+|\sigma(s)|^2\big]ds
$$
are constants independent of N.
Furthermore,
$$
\begin{aligned}
E\Phi^2(t) \le\;& 2E\Big\{\int_0^t(A+B\Lambda_1(s)+b_1)\Phi(s)\,ds\Big\}^2 + \frac{2}{N^2}E\Big\{\int_0^t\sum_{j=1}^N L_j^2(s)\,ds\Big\} + 2E\Big\{\int_0^t\frac1N\sum_{k=1}^N F\int_{E_k}G_k(de\,ds)\Big\}^2\\
\le\;& 2E\int_0^T\Big[T\big|(A+B\Lambda_1(s)+b_1)\Phi(s)\big|^2 + \frac1N\max_{1\le k\le N}|L_k(s)|^2\Big]ds + \frac{2}{N^2}E\int_0^t\sum_{k=1}^N\int_{E_k}|F|^2\,\pi_k(de)\,ds.
\end{aligned}
$$
By Gronwall's inequality, we have
$$
\sup_{0\le t\le T}E\big|\tilde x^{(N)}(t)-x^{(0)}(t)\big|^2 = O\Big(\frac1N\Big).
$$
Then, the proof is complete.
Similarly to the proof of Proposition 3.4, we can obtain the following result.
Proposition 4.2. For any $u_0(\cdot)\in\mathcal{U}^{c,0}_{ad}$, we have
$$
\big|J_0(u_0,\bar u_{-0})-\tilde J_0(u_0)\big| = O\Big(\frac{1}{\sqrt N}\Big).
$$
Now, let us consider the following case: a given minor agent $\mathcal{A}_i$ takes an alternative control strategy $u_i(\cdot)\in\mathcal{U}^{c,i}_{ad}$, the major agent uses the control strategy $\bar u_0(\cdot)$ defined by (15), and the other minor agents $\mathcal{A}_j$, $j\ne i$, $1\le j\le N$, take the control strategies $\bar u_j(\cdot)$ defined by (16). Then the dynamics of the agents with the given minor agent's perturbation can be written as
$$
\begin{cases}
d\hat x_0(t) = \big[(A_0+B_0\Lambda_0(t))\hat x_0(t)+B_0\Theta_0(t)+b_0\hat x^{(N)}(t)+f_0(t)\big]dt + \big[(C_0+D_0\Lambda_0(t))\hat x_0(t)+D_0\Theta_0(t)+l_0\hat x^{(N)}(t)+\sigma_0(t)\big]dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
d\hat x_i(t) = [A\hat x_i(t)+Bu_i(t)+b_1\hat x^{(N)}(t)+f(t)]\,dt + [C\hat x_i(t)+Du_i(t)+b_2\hat x^{(N)}(t)+H\hat x_0(t)+\sigma(t)]\,dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
d\hat x_j(t) = \big[(A+B\Lambda_1(t))\hat x_j(t)+B\bar\Theta_1(t)+b_1\hat x^{(N)}(t)+f(t)\big]dt + \big[(C+D\Lambda_1(t))\hat x_j(t)+D\bar\Theta_1(t)+b_2\hat x^{(N)}(t)+H\hat x_0(t)+\sigma(t)\big]dW_j(t) + F\displaystyle\int_{E_j}G_j(de\,dt),\\
\hat x_0(0)=a_0,\quad \hat x_i(0)=a_i,\quad \hat x_j(0)=a_j,\ j=1,2,\cdots,N,\ j\ne i,
\end{cases}\tag{18}
$$
where $\hat x^{(N)}(t)=\frac1N\sum_{k=1}^N\hat x_k(t)$. The cost functional is
$$
J_i(u_i,\bar u_{-i}) = \frac12 E\Big\{\int_0^T\big[\big\langle Q\big(\hat x_i(t)-\beta_1\hat x^{(N)}(t)-\beta_2\hat x_0(t)\big),\,\hat x_i(t)-\beta_1\hat x^{(N)}(t)-\beta_2\hat x_0(t)\big\rangle + \langle Ru_i(t),u_i(t)\rangle\big]dt + \langle M\hat x_i(T),\hat x_i(T)\rangle\Big\}.
$$
The corresponding limiting system with the minor agent's perturbation strategy is
$$
\begin{cases}
d\hat y_0(t) = \big[(A_0+B_0\Lambda_0(t))\hat y_0(t)+B_0\Theta_0(t)+b_0x^{(0)}(t)+f_0(t)\big]dt + \big[(C_0+D_0\Lambda_0(t))\hat y_0(t)+D_0\Theta_0(t)+l_0x^{(0)}(t)+\sigma_0(t)\big]dW_0(t) + F_0\displaystyle\int_{E_0}G_0(de\,dt),\\
d\hat y_i(t) = [A\hat y_i(t)+Bu_i(t)+b_1x^{(0)}(t)+f(t)]\,dt + [C\hat y_i(t)+Du_i(t)+b_2x^{(0)}(t)+H\hat y_0(t)+\sigma(t)]\,dW_i(t) + F\displaystyle\int_{E_i}G_i(de\,dt),\\
d\hat y_j(t) = \big[(A+B\Lambda_1(t))\hat y_j(t)+B\bar\Theta_1(t)+b_1x^{(0)}(t)+f(t)\big]dt + \big[(C+D\Lambda_1(t))\hat y_j(t)+D\bar\Theta_1(t)+b_2x^{(0)}(t)+H\hat y_0(t)+\sigma(t)\big]dW_j(t) + F\displaystyle\int_{E_j}G_j(de\,dt),\\
\hat y_0(0)=a_0,\quad \hat y_i(0)=a_i,\quad \hat y_j(0)=a_j,\ j=1,2,\cdots,N,\ j\ne i.
\end{cases}
$$
The cost functional is
$$
\tilde J_i(u_i) = \frac12 E\Big\{\int_0^T\big[\big\langle Q\big(\hat y_i(t)-\beta_1x^{(0)}(t)-\beta_2\hat y_0(t)\big),\,\hat y_i(t)-\beta_1x^{(0)}(t)-\beta_2\hat y_0(t)\big\rangle + \langle Ru_i(t),u_i(t)\rangle\big]dt + \langle M\hat y_i(T),\hat y_i(T)\rangle\Big\}.
$$
Now, we are in a position to state the following approximation results.
Proposition 4.3. For the fixed $i$, we have
$$
\begin{aligned}
&(i)\ \sup_{0\le t\le T}E\big|\hat x^{(N)}(t)-x^{(0)}(t)\big|^2 = O\Big(\tfrac1N\Big), &&(ii)\ \sup_{0\le t\le T}E\Big|\,|\hat x^{(N)}(t)|^2-|x^{(0)}(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big),\\
&(iii)\ \sup_{0\le t\le T}E\big|\hat x_i(t)-\hat y_i(t)\big|^2 = O\Big(\tfrac1N\Big), &&(iv)\ \sup_{0\le t\le T}E\Big|\,|\hat x_i(t)|^2-|\hat y_i(t)|^2\,\Big| = O\Big(\tfrac{1}{\sqrt N}\Big).
\end{aligned}
$$
Proof. We prove only the first estimate; the other three can be obtained by a proof similar to that of Proposition 3.3.
Define $\tilde z(t):=\hat x^{(N)}(t)-x^{(0)}(t)$. According to (10) and (18), we get
$$
\begin{cases}
d\tilde z(t) = (A+B\Lambda_1(t)+b_1)\tilde z(t)\,dt + S(t)\,dt + dL(t) + \frac1N\displaystyle\sum_{k=1}^N F\int_{E_k}G_k(de\,dt),\\
\tilde z(0)=0,
\end{cases}
$$
where
$$
\begin{aligned}
S(t) &= \frac BN\big[u_i(t)-\Lambda_1(t)\hat x_i(t)-\bar\Theta_1(t)\big],\\
L(t) &= \frac1N\sum_{k=1,k\ne i}^N\int_0^t\big[(C+D\Lambda_1(r))\hat x_k(r)+D\bar\Theta_1(r)+b_2\hat x^{(N)}(r)+H\hat x_0(r)+\sigma(r)\big]dW_k(r) + \frac1N\int_0^t\big[C\hat x_i(r)+Du_i(r)+b_2\hat x^{(N)}(r)+H\hat x_0(r)+\sigma(r)\big]dW_i(r).
\end{aligned}
$$
Since
$$
\int_0^t E|S(r)|^2dr \le \frac{3B^2}{N^2}\Big(\int_0^t E[u_i^2(r)]\,dr + \int_0^t E[\Lambda_1^2(r)\hat x_i^2(r)]\,dr + \int_0^t E[\bar\Theta_1^2(r)]\,dr\Big),
$$
we get
$$
\int_0^t E|S(r)|^2dr = O\Big(\frac{1}{N^2}\Big).\tag{19}
$$
Note that
$$
\begin{aligned}
V(t) :=\;& E\int_0^t(dL(r))^2\\
=\;& \frac{1}{N^2}\sum_{k=1,k\ne i}^N\int_0^t E\big|(C+D\Lambda_1(r))\hat x_k(r)+D\bar\Theta_1(r)+b_2\hat x^{(N)}(r)+H\hat x_0(r)+\sigma(r)\big|^2dr + \frac{1}{N^2}\int_0^t E\big|C\hat x_i(r)+Du_i(r)+b_2\hat x^{(N)}(r)+H\hat x_0(r)+\sigma(r)\big|^2dr\\
\le\;& \frac TN\max_{1\le k\le N}\sup_{0\le t\le T}E\big|(C+D\Lambda_1(t))\hat x_k(t)+D\bar\Theta_1(t)+b_2\hat x^{(N)}(t)+H\hat x_0(t)+\sigma(t)\big|^2 + \frac{T}{N^2}\sup_{0\le t\le T}E\big|C\hat x_i(t)+Du_i(t)+b_2\hat x^{(N)}(t)+H\hat x_0(t)+\sigma(t)\big|^2.
\end{aligned}
$$
Thus
$$
V(t) = O\Big(\frac1N\Big).\tag{20}
$$
Applying Itô's formula to $\tilde z^2(t)$, we obtain
$$
\begin{aligned}
E[\tilde z^2(t)] =\;& 2\int_0^t(A+B\Lambda_1(r)+b_1)E[\tilde z^2(r)]\,dr + 2\int_0^t E[\tilde z(r)S(r)]\,dr + V(t) + \frac{F^2}{N^2}\sum_{i=1}^N E\int_0^t\!\int_{E_i}\pi_i(de)\,dr\\
\le\;& \sup_{0\le t\le T}\big(|2A+2B\Lambda_1(t)+2b_1|+1\big)\int_0^t E[\tilde z^2(r)]\,dr + \int_0^t E[S^2(r)]\,dr + V(t) + \frac{F^2}{N}\max_{1\le i\le N}E\int_0^t\!\int_{E_i}\pi_i(de)\,dr.
\end{aligned}
$$
Combining (19) and (20) with Gronwall's inequality, we get
$$
\sup_{0\le t\le T}E\big|\hat x^{(N)}(t)-x^{(0)}(t)\big|^2 = O\Big(\frac1N\Big).
$$
This completes the proof.
By using similar arguments as in Proposition 3.4, we can obtain the following conclusion.
Proposition 4.4. For any $u_i(\cdot)\in\mathcal{U}^{c,i}_{ad}$, $1\le i\le N$, one has
$$
\big|J_i(u_i,\bar u_{-i})-\tilde J_i(u_i)\big| = O\Big(\frac{1}{\sqrt N}\Big).\tag{21}
$$
In this subsection, we will verify the ϵ-Nash equilibrium property of the decentralized control strategies (15) and (16).
Before presenting the main result, we give the definition of ϵ-Nash equilibrium in the following manner.
Definition 4.5. A set of control strategies $\bar u=(\bar u_0,\bar u_1,\cdots,\bar u_N)$, where $\bar u_i(\cdot)\in\mathcal{U}^{c,i}_{ad}$, $i=0,1,\cdots,N$, is called an ϵ-Nash equilibrium with respect to the costs $J_i$, $i=0,1,\cdots,N$, if there exists an $\epsilon\ge 0$ such that for any $i=0,1,\cdots,N$, we have
$$
J_i(\bar u_i,\bar u_{-i}) \le J_i(u_i,\bar u_{-i})+\epsilon\tag{22}
$$
whenever an alternative strategy $u_i(\cdot)\in\mathcal{U}^{c,i}_{ad}$ is applied by agent $\mathcal{A}_i$.
Based on the above results, we obtain the following main result.
Theorem 4.6. Suppose that $\bar x_i(\cdot)$, $i=0,1,\cdots,N$, is the solution of the systems (8) and (9). Then the set of control strategies $\bar u=(\bar u_0,\bar u_1,\cdots,\bar u_N)$ defined by (15) and (16) is an ϵ-Nash equilibrium of Problem (LP), where $\epsilon=O(1/\sqrt N)\to 0$ as $N\to+\infty$.
Proof. Combining Propositions 3.4 and 4.2 with Proposition 4.4, and using the optimality of $\bar v_i$ for the limiting problems (LM1) and (LM2) (Theorems 3.1 and 3.2), we obtain
$$
J_i(\bar u_i,\bar u_{-i}) = \tilde J_i(\bar v_i)+O\Big(\tfrac{1}{\sqrt N}\Big) \le \tilde J_i(u_i)+O\Big(\tfrac{1}{\sqrt N}\Big) \le J_i(u_i,\bar u_{-i})+O\Big(\tfrac{1}{\sqrt N}\Big),\quad i=0,1,\cdots,N.
$$
Therefore, the conclusion holds with $\epsilon=O(1/\sqrt N)$.
This section demonstrates the consistency of the mean-field estimation as well as the influence of the population's collective behavior $\bar x^{(N)}(\cdot)$ on the state trajectories of the agents through a numerical example.
Consider a mean-field game system with one major agent and $N=500$ minor agents. For any $u_j\in\mathcal{U}^{c,j}_{ad}$, $j=0,1,\cdots,N$, the dynamics of the major agent and the minor agents are given by
$$
\begin{cases}
dx_0(t) = \big(\tfrac12 x_0(t)+u_0(t)+x^{(N)}(t)\big)dt + \big(x_0(t)+u_0(t)+x^{(N)}(t)\big)dW_0(t) + 2\displaystyle\int_{E_0}G_0(de\,dt),\\
dx_i(t) = \big(3x_i(t)+5u_i(t)+x^{(N)}(t)\big)dt + \big(2x_i(t)+u_i(t)+x^{(N)}(t)+x_0(t)\big)dW_i(t) + \displaystyle\int_{E_i}G_i(de\,dt),\\
x_0(0)=5,\quad x_i(0)=a_i,\ i=1,\cdots,N,
\end{cases}\tag{23}
$$
where $t\in[0,T]$ with $T=1$. The initial states $\{a_i,\ i=1,\cdots,N\}$ of the minor agents are independent and identically distributed random variables with normal distribution $\mathcal{N}(-5,1)$.
The cost functional of the major agent A0 is
$$
J_0(u_0,u_{-0}) = \frac12 E\Big\{\int_0^T\big[3\big(x_0(t)-x^{(500)}(t)\big)^2+u_0^2(t)\big]dt + 3x_0^2(1)\Big\},\tag{24}
$$
and the cost functional of the minor agent $\mathcal{A}_i$, $i=1,\cdots,500$, is
$$
J_i(u_i,u_{-i}) = \frac12 E\Big\{\int_0^T\big[2\big(x_i(t)-x^{(500)}(t)-x_0(t)\big)^2+u_i^2(t)\big]dt + x_i^2(1)\Big\}.\tag{25}
$$
It is easy to check that $P_0(t)\equiv 3$, $t\in[0,1]$, is the unique solution of the following Riccati equation:
$$
\begin{cases}
\dot P_0(t)+2P_0(t)-4P_0^2(t)\big(1+P_0(t)\big)^{-1}+3=0,\\
P_0(1)=3.
\end{cases}
$$
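Indeed, substituting $P_0(t)\equiv 3$ into the left-hand side gives
$$
\dot P_0(t)+2P_0(t)-\frac{4P_0^2(t)}{1+P_0(t)}+3 = 0+6-\frac{36}{4}+3 = 0,
$$
and the terminal condition $P_0(1)=3$ holds trivially.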
Suppose that $P_1(\cdot)$ fulfills
$$
\begin{cases}
\dot P_1(t)+10P_1(t)-49P_1^2(t)\big(1+P_1(t)\big)^{-1}+2=0,\\
P_1(1)=1.
\end{cases}
$$
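Unlike $P_0$, $P_1$ admits no obvious constant solution. As a numerical sketch (assuming SciPy is available), one can integrate the equation backward from the terminal condition $P_1(1)=1$:

```python
import numpy as np
from scipy.integrate import solve_ivp

# Backward integration of the Riccati equation for P1:
#   dP1/dt = -10 P1 + 49 P1^2 / (1 + P1) - 2,   P1(1) = 1,
# integrated from t = 1 down to t = 0.
def rhs(t, p):
    return -10.0 * p + 49.0 * p ** 2 / (1.0 + p) - 2.0

sol = solve_ivp(rhs, t_span=(1.0, 0.0), y0=[1.0], dense_output=True,
                rtol=1e-10, atol=1e-12)
ts = np.linspace(0.0, 1.0, 5)
print(np.column_stack([ts, sol.sol(ts)[0]]))   # P1 stays positive on [0, 1]
```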
Then the NCE equation (11) becomes
$$
\begin{cases}
d\bar y_0(t) = \big[-\bar y_0(t)+\tfrac14 x^{(0)}(t)\big]dt + \big[-\tfrac12\bar y_0(t)+\tfrac14 x^{(0)}(t)\big]dW_0(t) + 2\displaystyle\int_{E_0}G_0(de\,dt),\\
\dot x^{(0)}(t) = \big(4+40P(t)\big)x^{(0)}(t) - 25\big(1+P_1(t)\big)^{-1}\eta_1(t) + 5P(t)\bar y_0(t),\\
-\dot\eta_1(t) = \big(3+35P(t)\big)\eta_1(t) + P_1(t)x^{(0)}(t) + \big(2P_1(t)+7P(t)P_1(t)-2\big)\big(\bar y_0(t)+x^{(0)}(t)\big),\\
\bar y_0(0)=a_0,\quad \eta_1(T)=0,\quad x^{(0)}(0)=\frac1N\sum_{j=1}^N a_j,\quad \eta_0(t)\equiv 0,\ t\in[0,T],
\end{cases}\tag{26}
$$
where $P(t)=-\big(1+P_1(t)\big)^{-1}P_1(t)$.
According to Theorem 4.6, the set of control strategies $\bar u=(\bar u_0,\bar u_1,\cdots,\bar u_N)$ defined by
$$
\begin{aligned}
\bar u_0(t) &= -\tfrac32\bar x_0(t)-\tfrac34 x^{(0)}(t),\\
\bar u_i(t) &= P(t)\big(7\bar x_i(t)+x^{(0)}(t)+\bar y_0(t)\big) - 5\big(1+P_1(t)\big)^{-1}\eta_1(t),\quad i=1,2,\cdots,N,
\end{aligned}
$$
is an ϵ-Nash equilibrium for the game given by the dynamics (23) and the cost functionals (24) and (25), where $\bar x_0(\cdot)$ and $\bar x_i(\cdot)$ satisfy
$$
\begin{cases}
d\bar x_0(t) = \big(-\bar x_0(t)-\tfrac34 x^{(0)}(t)+\bar x^{(500)}(t)\big)dt + \big(-\tfrac12\bar x_0(t)-\tfrac34 x^{(0)}(t)+\bar x^{(500)}(t)\big)dW_0(t) + 2\displaystyle\int_{E_0}G_0(de\,dt),\\
d\bar x_i(t) = \big[\big(3+35P(t)\big)\bar x_i(t)+5P(t)x^{(0)}(t)+5P(t)\bar y_0(t)+\bar x^{(N)}(t)-25\big(1+P_1(t)\big)^{-1}\eta_1(t)\big]dt\\
\qquad\quad + \big[\big(2+7P(t)\big)\bar x_i(t)+P(t)x^{(0)}(t)+P(t)\bar y_0(t)+\bar x^{(N)}(t)+\bar x_0(t)-5\big(1+P_1(t)\big)^{-1}\eta_1(t)\big]dW_i(t) + \displaystyle\int_{E_i}G_i(de\,dt),\\
\bar x_0(0)=a_0,\quad \bar x_i(0)=a_i,\ i=1,\cdots,N,
\end{cases}\tag{27}
$$
where $\bar x^{(500)}(t)=\frac{1}{500}\sum_{j=1}^{500}\bar x_j(t)$.
In this example, Merton's jump model (see Merton [29], as well as Platen and Bruti-Liberati [30, pg. 37]) is applied to describe the jump-diffusion processes. Assume that $\int_{E_0}G_0(de\,dt)=Q_0(\mu_0,\sigma_0)\,d\Pi_0(\lambda_0)$, where the jump size $Q_0(\mu_0,\sigma_0)$ is normally distributed with mean $\mu_0\sim\mathcal{N}(2,1)$ and standard deviation $\sigma_0=0.1$, and the Poisson process $\Pi_0(\lambda_0)$ has jump intensity $\lambda_0=2$. For agent $\mathcal{A}_i$, $i=1,\cdots,500$, let $\int_{E_i}G_i(de\,dt)=Q_i(\mu_1,\sigma_1)\,d\Pi_i(\lambda)$, where the jump size $Q_i(\mu_1,\sigma_1)$ is normally distributed with mean $\mu_1\sim\mathcal{N}(1,1)$ and standard deviation $\sigma_1=0.05$, and the Poisson process $\Pi_i(\lambda)$ has jump intensity $\lambda=5$.
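As a sketch of how such a jump term can be simulated, the following Euler-Maruyama loop generates one path of a Merton-type jump-diffusion with the major agent's jump specification ($\lambda_0=2$, jump sizes $\mathcal{N}(2,0.1^2)$). The drift and diffusion coefficients alpha and beta below are placeholder values, not the closed-loop coefficients of (27); the compensated version of the jump integral would subtract $\lambda_0\mu_0\,dt$ at each step.

```python
import numpy as np

rng = np.random.default_rng(2)

# Euler-Maruyama for a Merton-type jump-diffusion on [0, 1]:
#   dx = alpha x dt + beta x dW + 2 dJ,  J = compound Poisson(lam0) with N(mu0, sig0^2) marks.
T, K = 1.0, 1000
dt = T / K
lam0, mu0, sig0 = 2.0, 2.0, 0.1
alpha, beta = 0.5, 1.0            # toy linear drift/diffusion coefficients (placeholders)

x = np.zeros(K + 1)
x[0] = 5.0                        # x_0(0) = 5 as in (23)
for k in range(K):
    dW = rng.normal(0.0, np.sqrt(dt))
    n = rng.poisson(lam0 * dt)    # number of jumps in (t_k, t_{k+1}]
    dJ = rng.normal(mu0, sig0, size=n).sum()
    x[k + 1] = x[k] + alpha * x[k] * dt + beta * x[k] * dW + 2.0 * dJ
print("x(T) =", x[-1])
```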
Figure 1 shows the consistency of the mean-field estimation and the interactive influence between the mean-field term $\bar x^{(500)}(\cdot)$ and the major agent's state $\bar x_0(\cdot)$. With $N=500$ minor agents, as shown in Figure 1, the curves of $\bar x^{(500)}(\cdot)$ and $x^{(0)}(\cdot)$ coincide well, which illustrates the consistency of the mean-field estimation indicated by Proposition 3.3.
Figure 2 illustrates the state trajectories of the major agent and all the minor agents. As shown in Figure 2, for each fixed $i$, the trajectory $\bar x_i(\cdot)$ of $\mathcal{A}_i$ is influenced not only by its own initial value and parameters but also by the major agent and the collective behavior of all the minor agents.
To illustrate how the key parameters in the control strategies (15) and (16) influence the system's dynamic behavior, we choose another set of initial values for the $N+1$ agents, with $x_0(0)=-5$ and independent and identically distributed initial states $\{a_i\sim\mathcal{N}(5,1),\ i=1,\cdots,500\}$. Figures 3 and 4 show the consistency of the mean-field estimation and the trajectories of $\bar x_i$, $i=0,1,2,\cdots,500$, in this setting.
Motivated by the lack of relevant theory and by some practical applications, this paper is concerned with linear-quadratic-Gaussian mean-field games involving mixed agents of a stochastic large-population system with random jumps. There are two types of agents: (i) a major agent and (ii) a population of N minor agents, where N is very large. The coupling between the major and minor agents appears in both their state dynamics and their individual cost functionals. To deal with the dimensionality difficulty and obtain decentralized strategies, the NCE methodology is applied to yield a set of decentralized strategies, which is verified to be an ϵ-Nash equilibrium. We provide numerical examples to illustrate both the consistency of the mean-field estimation and the impact of the population's collective behavior. In the future, an interesting research direction is to extend the modeling and analysis to the social optima case, which may admit more applications in practice and generate more challenges in theory. Another potential direction is to study the uniqueness of the equilibrium strategy, which may be more valuable and challenging.
Conceptualization and methodology, R. X.; writing-original draft, review and editing, K. D., J. Z. and Y. Z.; supervision, R. X. All authors have read and agreed to the published version of the manuscript.
The authors declare that they have not used Artificial Intelligence (AI) tools in the creation of this article.
This research is partially supported by the Natural Science Foundation of Shandong Province of China (Grant no. ZR2020MA031, ZR2021MA049), Qilu University of Technology (Shandong Academy of Sciences) Major Innovation Project of Science, Education and Industry Integration Pilot Project (2024ZDZX11), National Natural Science Foundation of China (11971266), and the Colleges and Universities Twenty Terms Foundation of Jinan City (2021GXRC100).
All authors declare no conflicts of interest.
[1] C. Benazzoli, L. Campi, L. Di Persio, ϵ-Nash equilibrium in stochastic differential games with mean-field interaction and controlled jumps, Stat. Probabil. Lett., 154 (2019), 108522. https://doi.org/10.1016/j.spl.2019.05.021
[2] R. Carmona, F. Delarue, Probabilistic analysis of mean-field games, SIAM J. Control Optim., 51 (2013), 2705–2734. https://doi.org/10.1137/120883499
[3] Y. Hu, J. Huang, T. Nie, Linear-quadratic-Gaussian mixed mean-field games with heterogeneous input constraints, SIAM J. Control Optim., 56 (2018), 2835–2877. https://doi.org/10.1137/17M1151420
[4] M. Huang, P. E. Caines, R. P. Malhamé, Social optima in mean field LQG control: centralized and decentralized strategies, IEEE T. Automat. Contr., 57 (2012), 1736–1751. https://doi.org/10.1109/TAC.2012.2183439
[5] J. Huang, Z. Qiu, S. Wang, Z. Wu, A unified relation analysis of linear-quadratic mean-field game, team, and control, IEEE T. Automat. Contr., 69 (2024), 3325–3332. https://doi.org/10.1109/TAC.2023.3323576
[6] B. Wang, J. Zhang, Social optima in mean field linear-quadratic-Gaussian models with Markov jump parameters, SIAM J. Control Optim., 55 (2017), 429–456. https://doi.org/10.1137/15M104178X
[7] H. Wang, R. Xu, Time-inconsistent LQ games for large-population systems and applications, J. Optim. Theory Appl., 197 (2023), 1249–1268. https://doi.org/10.1007/s10957-023-02223-2
[8] R. Xu, F. Zhang, ϵ-Nash mean-field games for general linear-quadratic systems with applications, Automatica, 114 (2020), 108835. https://doi.org/10.1016/j.automatica.2020.108835
[9] H. Yuan, Q. Zhu, The well-posedness and stabilities of mean-field stochastic differential equations driven by G-Brownian motion, SIAM J. Control Optim., 63 (2025), 596–624. https://doi.org/10.1137/23M1593681
[10] H. Yuan, Q. Zhu, The stabilities of delay stochastic McKean-Vlasov equations in the G-framework, Sci. China Inf. Sci., 68 (2025), 112203. https://doi.org/10.1007/s11432-024-4075-2
[11] M. Huang, P. E. Caines, R. P. Malhamé, Large-population cost-coupled LQG problems with nonuniform agents: individual-mass behavior and decentralized ϵ-Nash equilibria, IEEE T. Automat. Contr., 52 (2007), 1560–1571. https://doi.org/10.1109/TAC.2007.904450
[12] M. Huang, Large-population LQG games involving a major player: the Nash certainty equivalence principle, SIAM J. Control Optim., 48 (2010), 3318–3353. https://doi.org/10.1137/080735370
[13] M. Nourian, P. E. Caines, ϵ-Nash mean field game theory for nonlinear stochastic dynamical systems with major and minor agents, SIAM J. Control Optim., 51 (2013), 3302–3331. https://doi.org/10.1137/120889496
[14] J. M. Lasry, P. L. Lions, Jeux à champ moyen. I – Le cas stationnaire, C. R. Math., 343 (2006), 619–625. https://doi.org/10.1016/j.crma.2006.09.019
[15] J. M. Lasry, P. L. Lions, Jeux à champ moyen. II – Horizon fini et contrôle optimal, C. R. Math., 343 (2006), 679–684. https://doi.org/10.1016/j.crma.2006.09.018
[16] J. M. Lasry, P. L. Lions, Mean field games, Jpn. J. Math., 2 (2007), 229–260. https://doi.org/10.1007/s11537-007-0657-8
[17] A. Bensoussan, J. Frehse, P. Yam, Mean field games and mean field type control theory, New York: Springer, 2013. https://doi.org/10.1007/978-1-4614-8508-7
[18] J. Huang, S. Wang, Z. Wu, Backward mean-field linear-quadratic-Gaussian (LQG) games: full and partial information, IEEE T. Automat. Contr., 61 (2016), 3784–3796. https://doi.org/10.1109/TAC.2016.2519501
[19] T. Nie, S. Wang, Z. Wu, Linear-quadratic delayed mean-field social optimization, Appl. Math. Optim., 89 (2024), 4. https://doi.org/10.1007/s00245-023-10067-5
[20] R. Xu, J. Shi, ϵ-Nash mean-field games for linear-quadratic systems with random jumps and applications, Int. J. Control, 94 (2021), 1415–1425. https://doi.org/10.1080/00207179.2019.1651940
[21] R. Xu, T. Wu, Risk-sensitive large-population linear-quadratic-Gaussian games with major and minor agents, Asian J. Control, 25 (2023), 4391–4403. https://doi.org/10.1002/asjc.3106
[22] H. Wang, R. Xu, Time-inconsistent large-population linear-quadratic games with major and minor agents, Int. J. Control, 2025. https://doi.org/10.1080/00207179.2025.2491823
[23] B. Düring, P. Markowich, J. F. Pietschmann, M. T. Wolfram, Boltzmann and Fokker-Planck equations modelling opinion formation in the presence of strong leaders, P. Roy. Soc. A Math. Phy., 465 (2009), 3687–3708. https://doi.org/10.1098/rspa.2009.0239
[24] Z. Ma, D. S. Callaway, I. A. Hiskens, Decentralized charging control of large populations of plug-in electric vehicles, IEEE T. Control Syst. Technol., 21 (2011), 67–78. https://doi.org/10.1109/TCST.2011.2174059
[25] J. Shi, Z. Wu, Maximum principle for forward-backward stochastic control system with random jumps and applications to finance, J. Syst. Sci. Complex., 23 (2010), 219–231. https://doi.org/10.1007/s11424-010-7224-8
[26] J. Shi, Z. Wu, A risk-sensitive stochastic maximum principle for optimal control of jump diffusions and its applications, Acta Math. Sci., 31 (2011), 419–433. https://doi.org/10.1016/S0252-9602(11)60242-7
[27] R. Cont, P. Tankov, Financial modelling with jump processes, Chapman and Hall/CRC, 2003.
[28] W. He, Y. Wang, Distributed optimal variational GNE seeking in merely monotone games, IEEE/CAA J. Automatic., 11 (2024), 1621–1630. https://doi.org/10.1109/JAS.2024.124284
[29] R. Merton, Option pricing when underlying stock returns are discontinuous, J. Financ. Econ., 3 (1976), 125–144. https://doi.org/10.1016/0304-405X(76)90022-2
[30] E. Platen, N. Bruti-Liberati, Numerical solution of stochastic differential equations with jumps in finance, Berlin: Springer-Verlag, 2010. https://doi.org/10.1007/978-3-642-13694-8