An efficient, lightweight MobileNetV2-based fine-tuned model for COVID-19 detection using chest X-ray images

Shubashini Velu; Shubashini Velu

doi:10.3934/mbe.2023368

Mathematical Biosciences and Engineering

2023, Volume 20, Issue 5: 8400-8427. doi: 10.3934/mbe.2023368

Previous Article Next Article

Research article

An efficient, lightweight MobileNetV2-based fine-tuned model for COVID-19 detection using chest X-ray images

Shubashini Velu ^,

Department of Management Information System, College of Business, Prince Mohammad Bin Fahd University, 617, Al Jawharah, Khobar, Dhahran, Saudi Arabia

Received: 25 September 2022 Revised: 01 January 2023 Accepted: 02 January 2023 Published: 02 March 2023

In recent years, deep learning's identification of cancer, lung disease and heart disease, among others, has contributed to its rising popularity. Deep learning has also contributed to the examination of COVID-19, which is a subject that is currently the focus of considerable scientific debate. COVID-19 detection based on chest X-ray (CXR) images primarily depends on convolutional neural network transfer learning techniques. Moreover, the majority of these methods are evaluated by using CXR data from a single source, which makes them prohibitively expensive. On a variety of datasets, current methods for COVID-19 detection may not perform as well. Moreover, most current approaches focus on COVID-19 detection. This study introduces a rapid and lightweight MobileNetV2-based model for accurate recognition of COVID-19 based on CXR images; this is done by using machine vision algorithms that focused largely on robust and potent feature-learning capabilities. The proposed model is assessed by using a dataset obtained from various sources. In addition to COVID-19, the dataset includes bacterial and viral pneumonia. This model is capable of identifying COVID-19, as well as other lung disorders, including bacterial and viral pneumonia, among others. Experiments with each model were thoroughly analyzed. According to the findings of this investigation, MobileNetv2, with its 92% and 93% training validity and 88% precision, was the most applicable and reliable model for this diagnosis. As a result, one may infer that this study has practical value in terms of giving a reliable reference to the radiologist and theoretical significance in terms of establishing strategies for developing robust features with great presentation ability.

Keywords:

Citation: Shubashini Velu. An efficient, lightweight MobileNetV2-based fine-tuned model for COVID-19 detection using chest X-ray images[J]. Mathematical Biosciences and Engineering, 2023, 20(5): 8400-8427. doi: 10.3934/mbe.2023368

Related Papers:

[1]	Wanshun Zhao, Kelin Li, Yanchao Shi . Exponential synchronization of neural networks with mixed delays under impulsive control. Electronic Research Archive, 2024, 32(9): 5287-5305. doi: 10.3934/era.2024244
[2]	Wei Ji . Optimal control problems with time inconsistency. Electronic Research Archive, 2023, 31(1): 492-508. doi: 10.3934/era.2023024
[3]	Zhaoyan Meng, Shuting Lyu, Mengqing Zhang, Xining Li, Qimin Zhang . Sufficient and necessary conditions of near-optimal controls for a stochastic listeriosis model with spatial diffusion. Electronic Research Archive, 2024, 32(5): 3059-3091. doi: 10.3934/era.2024140
[4]	Yang Song, Beiyan Yang, Jimin Wang . Stability analysis and security control of nonlinear singular semi-Markov jump systems. Electronic Research Archive, 2025, 33(1): 1-25. doi: 10.3934/era.2025001
[5]	Daliang Zhao, Yansheng Liu . Controllability of nonlinear fractional evolution systems in Banach spaces: A survey. Electronic Research Archive, 2021, 29(5): 3551-3580. doi: 10.3934/era.2021083
[6]	Changling Xu, Huilai Li . Two-grid methods of finite element approximation for parabolic integro-differential optimal control problems. Electronic Research Archive, 2023, 31(8): 4818-4842. doi: 10.3934/era.2023247
[7]	N. U. Ahmed, Saroj Biswas . Optimal strategy for removal of greenhouse gas in the atmosphere to avert global climate crisis. Electronic Research Archive, 2023, 31(12): 7452-7472. doi: 10.3934/era.2023376
[8]	Chengbo Yi, Jiayi Cai, Rui Guo . Synchronization of a class of nonlinear multiple neural networks with delays via a dynamic event-triggered impulsive control strategy. Electronic Research Archive, 2024, 32(7): 4581-4603. doi: 10.3934/era.2024208
[9]	Haijun Wang, Gege Kang, Ruifang Zhang . On optimality conditions and duality for multiobjective fractional optimization problem with vanishing constraints. Electronic Research Archive, 2024, 32(8): 5109-5126. doi: 10.3934/era.2024235
[10]	Xinzheng Xu, Yanyan Ding, Zhenhu Lv, Zhongnian Li, Renke Sun . Optimized pointwise convolution operation by Ghost blocks. Electronic Research Archive, 2023, 31(6): 3187-3199. doi: 10.3934/era.2023161

Abstract

1. Introduction

Impulsive differential equations are employed for the analysis of real-world phenomena that are characterized by the state of instantaneous system changes. It has been found to play a vital role in many areas, such as sampled-data control, communication networks, industrial robots, biology, and so on. Due to the widespread existence of impulsive perturbation, extensive research has been dedicated to the exploration of the stability of impulsive differential systems (see ^[1,2,3,4]). Meanwhile, some scholars have focused on studying the optimal impulsive control problem, which has resulted in numerous interesting findings (see ^[5,6,7,8,9]).

Finding the necessary optimality conditions is one of the central tasks in optimal control theory. Pontryagin and his co-authors have made milestone contributions (see ^[10]). As pointed out in ^[11]: "The mathematical significance of the maximum principle lies in that maximizing the Hamiltonian is much easier than the original control problem that is infinite-dimensional". Essentially, the Hamiltonian maximization occurs pointwise. Since Pontryagin's maximum principle was discovered, the first-order and second-order necessary conditions for optimal control problems in both finite- and infinite-dimensional spaces have been extensively researched (see ^[11,12]).

However, it is not always possible to find the optimal control by pointwise maximizing the Hamiltonian, and this kind of optimal control problem is often referred to as the singular control problems. For singular control problems, the first task is to discover new necessary conditions to distinguish optimal singular controls from other singular controls; one way to do this is to look for second-order conditions. It is common to seek a second-order necessary condition that requires a quadratic functional to be non-negative. Ideally, we would prefer the second-order necessary conditions to have pointwise characteristics that are similar to Pontryagin's maximum principle, which means that the pointwise mode is preferable for maximizing a particular function. The pointwise necessary optimality conditions are reviewed in ^[13]; the Jacobson conditions and the Goh conditions are generally considered as two types of pointwise second-order necessary optimality conditions for the optimal singular control problem. In addition, references ^[14,15,16] are very comprehensive references on the singular control problem. Regarding the Goh conditions, the original contribution can be seen in ^[17]. In short, the Goh conditions are obtained by applying Goh's transformation. Goh's transformation approaches have been designed to transform the original singular problem into a new nonsingular one; then, for the nonsingular optimal control problem, the classical Legendre-Clebsch condition may be applied. In this process, Goh's transformation may be used several times; therefore, the Goh conditions are also called the generalized Legendre-Clebsch conditions; recent research results are very abundant, for example, references ^{[18,19,20,21]} and related references. Conversely, limited attention has been given to investigating the Jacobson-type conditions.

There is an interesting story about Jacobson-type necessary conditions. Until Jacobson discovered the Jacobson-type necessary conditions, it was thought that there was no Riccati-type matrix differential equation for singular control problems. In fact, Jacobson introduced a linear matrix Riccati differential equation that is similar to the nonlinear matrix Riccati differential equation in the standard LQ problem. Thus, a "new" necessary condition for singular control was obtained in ^[22]. After this, Jacobson found that this new condition is different from the generalized Legendre-Clebsch condition, and Jacobson has demonstrated that these two necessary conditions are generally insufficient for optimality in ^[23]. Therefore, the matrix Riccati differential equation can not only solve nonsingular problems, they can also solve important optimal singular control problems; singular control and nonsingular control can even be considered under a unified framework (see Theorem 4.2 in ^[15,24]).

Recent studies ^[25,26] have established the Jacobson-type pointwise second-order necessary optimality conditions for deterministic and stochastic optimal singular control problems, respectively. Particularly, the relaxation methods for control problems have been used to solve the singular control problem of a lack of a linear structure in control problems; also, the pointwise second-order necessary conditions have been obtained in ^[25]. When discussing the second-order necessary conditions, only the allowable set of singular control is considered, rather than the original allowable control set, which can greatly reduce the computational expense of singular control problems. References ^[25,26] have also been applied to derive the pointwise second-order necessary optimality conditions for singular control problems with constraints in finite- or infinite-dimensional spaces. For further details, please refer to ^{[13,27,28,29,30,31]}. Theorem 4.3 in ^[25] describes the Jacobson-type second-order necessary optimality condition of singular control problems governed by pulseless controlled systems according to Pontryagin. For the definition of singular control in the classical sense and in the Pontryagin sense, see Definitions 1 and 2 in ^[14], as well as (4.4) and (4.5) in ^[25].

Regarding the importance of the impulsive system, the second-order necessary conditions for the optimal control problem governed by impulsive differential systems have been studied in ^[32], which focuses on impulsive systems with multi-point nonlocal and integral boundary conditions; the second-order necessary conditions of the integral form has been obtained by introducing the impulsive matrix function directly. When discussing the second-order necessary conditions for singular control, the perturbation control takes its value in full space, and not in the singular control region (see (3.13)); therefore, the conclusion is not reached by the pointwise Jacobson-type necessary conditions. In addition, references ^[33,34,35] consider second-order necessary conditions whereby the measure is impulsive control; the system can experience an infinite number of jumps in a finite amount of time, but this is different from what we are going to consider.

Inspired by the above discussions, we aimed to study the following optimal control problem, as governed by impulsive differential systems, which differs from the previous works.

Problem P: Let $T > 0$ and $\Lambda = \{\; t_i\mid\; 0 < t_1 < t_2 < \cdots < t_k < T\; \} \subset(0, T)$ be given; $U\subseteq\mathbb{R}^{m}$ is a nonempty bounded convex open set.

$\begin{eqnarray} \min J(u(\cdot)) = \int_{0}^{T}l(t,y(t),u(t))dt+G(y(T)), \end{eqnarray}$

(1.1)

subject to

$\begin{eqnarray} \left\{ \begin{array}{ll} \dot{y}(t) = f(t,y(t),u(t)),&t\in[0,T]\setminus \Lambda,\\ y(t_i+)-y(t_i) = J_{i}(y(t_{i})),&t_i\in \Lambda,\\ y(0) = y_0, \end{array}\right. \end{eqnarray}$

(1.2)

where $u(\cdot)\in\mathcal{U}_{ad} = \left\{\; u(\cdot)\; \mid\; u(\cdot)\; \; \text{measurable}, u(t)\in U \right\}$ , and $l$ , $G$ , $f$ , and $J_{i} \; (i = 1, 2, \cdots, k)$ are the given maps.

Here, we generalize the control model of ^[15,25] to an impulsive controlled system. For Problem P, the difficulty lies in the introduction of the appropriate impulsive adjoint matrix differential equation, which we overcome by borrowing the method presented in ^[25]. The main conclusions include Theorems 3.1 and 4.6, where the former is a generalization of Theorem 4.2 in ^[15] to impulsive controlled systems; the latter is a similar conclusion of Theorem 4.3 in ^[25], but the difference is that Theorem 4.6 is a conclusion of the impulsive controlled systems, and in the classical sense, while Theorem 4.3 in ^[25] is a conclusion for the pulseless controlled systems, and in the Pontryagin sense.

The main novelties and contributions of this paper can be summarized as follows: (i) generalization of pulseless controlled systems to impulsive controlled systems, which is helpful to increase the applicability of the model; (ii) through the use of $C([0, T])$ as a dense subspace in $L^{1}([0, T])$ , as per the results of functional analysis, the condition of conclusion is weakened; (iii) pointwise Jacobson-type necessary conditions have been obtained, which facilitates the calculations for and distinguishes the optimal singular control from other singular controls in the classical sense (see Definition 2.2).

The outline of this paper is as follows. Some preliminaries are proposed in Section 2. In Section 3, the integral-form second-order necessary conditions are derived. In Section 4, the pointwise Jacobson-type necessary conditions are given. An example is considered to elucidate the proposed main results in Section 5, and Section 6 concludes this paper.

2. Preliminaries

In this section, we will present some preliminaries, which includes basic assumptions, the definition of the singular control in the classical sense, the solvability of impulsive systems, and a lemma that has been obtained via functional analysis.

Let $B^{\top}$ denote the transposition of a matrix $B$ . Define $C^{1}\left([0, T]\setminus\Lambda, \mathbb{R}^{n}\right) \; = \big\{y : [0, T]\longrightarrow\mathbb{R}^{n}\; |\; y$ is continuous differential at $t \in[0, T]\setminus\Lambda \big\}$ and $PC_{l}\left([0, T], \mathbb{R}^{n}\right)\left(PC_{r}\left([0, T], \; \mathbb{R}^{n}\right)\right) \; = \big\{y : [0, T]\longrightarrow\mathbb{R}^{n}\; |\; y$ is continuous at $t \in[0, T]\setminus\Lambda, \; y$ is left (right) continuous and exists right (left) limit at $t\in\Lambda\big\}$ , obviously endowed with the norm $\|y\|_{PC} = \sup \big\{\|y(t+)\|, \|y(t-)\|\; |\; \; t\in[0, T]\big\}$ , $PC_{l}\left([0, T], \mathbb{R}^{n}\right)$ ; also, $PC_{r}\left([0, T], \mathbb{R}^{n}\right)$ denotes Banach spaces.

Let us assume the following:

(A1) $U\subseteq \mathbb{R}^{m}$ is a nonempty bounded convex open set.

(A2) The functions denoted by $F = \left(\begin{array}{cc}f\\l\end{array}\right): [0, T ]\times \mathbb{R}^{n}\times U\longrightarrow\mathbb{R}^{n+1}$ are measurable in $t$ and twice continuously differentiable in $(y, u)$ ; for any $\rho > 0$ , there exists a constant $L(\rho) > 0$ such that, for all $y, \hat{y}\in \mathbb{R}^{n}$ and $u, \hat{u}\in U$ with $\|y\|, \|\hat{y}\|, \|u\|, \|\hat{u}\|\leq \rho$ , and for all $t\in[0, T]$ such that

$\begin{eqnarray*} \label{2.1} \left\{\begin{array}{ll} \|F(t,y,u)-F(t,\hat{y},\hat{u})\|\leq L(\rho)\left(\|y-\hat{y}\|+\|u-\hat{u}\|\right),\\ \|F_{y}(t,y,u)-F_{y}(t,\hat{y},\hat{u})\|\leq L(\rho)\left(\|y-\hat{y}\|+\|u-\hat{u}\|\right),\\ \|F_{u}(t,y,u)-F_{u}(t,\hat{y},\hat{u})\|\leq L(\rho)\left(\|y-\hat{y}\|+\|u-\hat{u}\|\right), \end{array}\right. \end{eqnarray*}$

there is a constant $h > 0$ such that

$\begin{eqnarray} \|F(t,y,u)\|\leq h\left(1+\|y\|\right),\mbox{ for all }(t,u)\in [0,T]\times U, \end{eqnarray}$

(2.1)

where $F_{y}(t, y, u)^{\top}$ (or $F_{u}(t, y, u)^{\top}$ ) denotes the Jacobi matrix of $F$ in $y$ (or $u$ ).

(A3) the functions denoted by $\tilde{J}_{i} = \left(\begin{array}{cc}J_{i}\\G\end{array}\right): \mathbb{R}^{n}\longrightarrow\mathbb{R}^{n+1}$ ( $i = 1, 2, \cdots, k$ ) are twice continuously differentiable in $x$ , and, for any $\rho > 0$ , there exists a constant $L(\rho) > 0$ such that, for all $y, \hat{y}\in \mathbb{R}^{n}$ with $\|y\|, \|\hat{y}\|\leq \rho$ , we have

$\begin{eqnarray*} \label{2.3} \left\{ \begin{array}{ll} \|\tilde{J}_{i}(y)-\tilde{J}_{i}(\hat{y})\|\leq L(\rho)\|y-\hat{y}\|,\\ \|\tilde{J}_{ix}(y)-\tilde{J}_{iy}(\hat{y})\|\leq L(\rho)\|y-\hat{y}\|, \end{array}\right. \end{eqnarray*}$

where $\tilde{J}_{iy}(y)^{\top}$ denotes the Jacobi matrix of $\tilde{J}_{i}$ in $y$ .

Denote $H(t) = l\left(t, y(t), u(t)\right) + \langle f(t, y(t), u(t)), \varphi(t) \rangle$ . To simplify the notation, $[t]$ is used to replace $(t, \bar{y}(t), \overline{\varphi}(t), \bar{u}(t))$ when evaluating the dynamics $f$ and the Hamiltonian $H$ ; for example, $f[t] = f(t, \bar{y}(t), \bar{u}(t))$ and $H[t] = l\left(t, \bar{y}(t), \bar{u}(t)\right) + \langle f(t, \bar{y}(t), \bar{u}(t)), \overline{\varphi}(t) \rangle$ .

Remark 2.1. $H(t) = l\left(t, y(t), u(t)\right) + \langle f(t, y(t), u(t)), \varphi(t) \rangle$ is often referred to as the Hamiltonian function, or, simply, the Hamiltonian. The Hamiltonian function is represented in many reports as $\widetilde{H}(t) = -l\left(t, y(t), u(t)\right) + \langle f(t, y(t), u(t)), \varphi(t) \rangle$ ; the difference between the two is that Pontryagin's maximum principle maximizes the Hamiltonian $\widetilde{H}$ or minimizes the Hamiltonian $H$ . Accordingly, it also leads to a difference between positivity and negativity of the optimal inequality in the maximum principle. In this article, we use $H$ to denote the Hamiltonian; in other words, we need to minimize the Hamiltonian $H$ .

Now, we will prove an elementary theorem that will be useful in the following sections.

Theorem 2.2. Let (A1)–(A3) hold; for any fixed $u\in \mathcal{U}_{ad}$ , the control system (1.2) has a unique solution $y^{u}\in PC_{l}\left([0, T], \mathbb{R}^{n}\right)$ given by

$\begin{eqnarray} y^{u}(t) = y_{0}+\int_{0}^{t}f\left(s,y^{u}(s),u(s)\right)ds+\sum\limits_{0 < t_i < t}J_{i}\left(y^{u}(t_i)\right), \end{eqnarray}$

(2.2)

and there exists a constant $M = M\left(h, T, y_0, J_{1}(0), J_{2}(0), \cdots, J_{k}(0)\right)$ such that

$\begin{eqnarray} \left\|y^{u}(t)\right\|\leq M\;\mathit{\mbox{for all}}\; (t,u)\in [0,T]\times\mathcal{U}_{ad}. \end{eqnarray}$

(2.3)

Proof. By the qualitative theory of differential equations, it follows from (A1)–(A3) that the system of equations

$\begin{eqnarray*} \left\{ \begin{array}{ll} \dot{y}(t) = f(t,y(t),u(t)),&t\in[0,t_1],\\ y(0) = y_0, \end{array}\right. \end{eqnarray*}$

has a unique solution $y^{u}\in C\left([0, t_1], \mathbb{R}^{n}\right)$ given by

$y^{u}(t) = y_{0}+\int_{0}^{t}f\left(s,y^{u}(s),u(s)\right)ds,\quad t\in [0,t_1],$

and

$\left\|y^{u}(t)\right\|\leq e^{ht}\left(ht_1+\|y_0\|\right)\mbox{ for all } (t,u)\in [0,t_1]\times\mathcal{U}_{ad}.$

Let

$y_1 = y^{u}(t_1)+J_1\left(y^{u}(t_1)\right),$

then, one also can infer that the system of equations

$\begin{eqnarray*} \left\{ \begin{array}{ll} \dot{y}(t) = f(t,y(t),u(t)),&t\in(t_1,t_2],\\ y(t_1+) = y_1, \end{array}\right. \end{eqnarray*}$

has a unique solution $y^{u}\in C\left((t_1, t_2], \mathbb{R}^{n}\right)$ given by

$y^{u}(t) = y_{0}+\int_{0}^{t}f\left(s,y^{u}(s),u(s)\right)ds+J_1\left(y^{u}(t_1)\right), t\in (t_1,t_2],$

and

$\left\|y^{u}(t)\right\|\leq e^{h(t-t_1)}\left(h(t_2-t_1)+\|y_1\|\right)\mbox{ for all } (t,u)\in (t_1,t_2]\times\mathcal{U}_{ad}.$

Using a step by step method, together with

$\left\|J_i\left(y^{u}(t_i)\right)\right\|\leq L\left(\left\|y^{u}(t_i)\right\|\right)\left\|y^{u}(t_i)\right\|+\|J_i(0)\|,\; i = 1,2,\cdots,k,$

it is not difficult to claim that there exists a constant $M$ such that (2.2) and (2.3) hold. Therefore, we have completed the proof of Theorem 2.2.

To establish the main conclusion of Theorem 4.6, we now introduce the definition of singular control in the classical sense (see Definition 2 in ^[14], as well as (4.4) in ^[25]).

Definition 2.3. We refer to the elements in the following equation as singular controls in the classical sense:

$\begin{eqnarray*} \label{2.6} \overline{\mathcal{U}}_{ad} = \left\{v(\cdot)\in \mathcal{U}_{ad}\; |\; H_{u} (t, \bar{y}(t), v(t), \overline{\varphi}(t)) = 0, H_{uu} (t, y(t), v(t), \varphi(t))\equiv 0 \right\}. \end{eqnarray*}$

Remark 2.4. If the Hamiltonian H is linear in control $u$ , then $\overline{\mathcal{U}}_{ad}$ is a singular control set in the classical sense.

Now, we shall introduce a fundamental lemma, as derived from functional analysis, that will be utilized to establish the necessary condition for Problem P.

Lemma 2.5. Let $h(t)$ be an $n$ -dimensional piecewise continuous vector value function on $[0, T]$ , and suppose that

$\int_{0}^{T}\langle h(t),a(t)\rangle dt = 0,$

for all $n$ -dimensional piecewise continuous vector value functions $a(t)$ on $[0, T]$ ; then, $h(t) = 0$ at all continuous moments of $h(t)$ on $[0, T]$ .

Proof. Suppose that $h(t)$ at some continuous moments $\bar{t}$ when $h(\bar{t})\neq 0$ because $h(t)$ is an $n$ -dimensional piecewise continuous vector value function; therefore, there exists an interval $I_{\bar{t}}$ at $\bar{t}$ such that

$h(t)\neq 0, t\in I_{\bar{t}}.$

In this case, given the following:

$\begin{eqnarray*} a(t) = \left\{ \begin{array}{ll} h(t),&t\in I_{\bar{t}},\\ 0,&t\in [0,T]\; \backslash \; I_{\bar{t}}, \end{array}\right. \end{eqnarray*}$

then

$\int_{0}^{T}h^{\top}(t)h(t)dt = \int_{I_{\bar{t}}}h^{\top}(t)h(t)dt = \int_{I_{\bar{t}}}\|h(t)\|^{2}dt > 0.$

This is a contradiction. Therefore, we have finished the proof of Lemma 2.5.

3. Integral-form second-order necessary condition

The purpose of this section is to prove Theorem 3.1, which establishes the integral form of the second-order necessary conditions for Problem P. The basic idea is that the condition of being second-order variational and non-negative is a necessary condition for an optimal control problem. We will borrow the method adopted in ^[25] and introduce a linear impulsive adjoint matrix to prove it.

Theorem 3.1. Let (A1)–(A3) hold and $\bar{u}$ represent the optimal control of $J$ over $\mathcal{U}_{ad}$ ; it is necessary that there exist functions $(\overline{y}, \overline{\varphi}, \overline{W}, \overline{\Phi})\in PC_{l}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n\times n}\right) \times PC_{l}\left([0, T], \mathbb{R}^{n\times n}\right)$ such that the following equations and inequality hold:

$\begin{eqnarray} \left\{ \begin{array}{ll} \dot{\bar{y}}(t) = f(t,\bar{y}(t),\bar{u}(t)),&t\in[0,T]\setminus \Lambda,\\ \bar{y}(t_i+)-\bar{y}(t_i) = J_{i}(\bar{y}(t_{i})),&t_i\in \Lambda,\\ \bar{y}(0) = y_0; \end{array}\right. \end{eqnarray}$

(3.1)

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{\overline{\varphi}}(t) = -f_{y}(t,\bar{y}(t),\bar{u}(t))\overline{\varphi}(t) -l_{y}(t,\bar{y}(t),\bar{u}(t)),&t\in[0,T]\setminus \Lambda,\\ \overline{\varphi}(t_i-) = \overline{\varphi}(t_i)+J_{iy}(\bar{y}(t_{i}))\overline{\varphi}(t_{i}),&t_i\in \Lambda,\\ \overline{\varphi}(T) = G_{y}(\bar{y}(T)); \end{array}\right. \end{eqnarray}$

(3.2)

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{\overline{W}}(t) = -f_{y}(t,\bar{y}(t),\bar{u}(t))\overline{W}(t)-\overline{W}(t)f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}\\ \qquad\qquad- \overline{\varphi}(t)^{\top}f_{yy}(t,\bar{y}(t),\bar{u}(t))- l_{yy}(t,\bar{y}(t),\bar{u}(t)),&t\in[0,T]\setminus \Lambda,\\ \overline{W}(t_i-) = \overline{W}(t_i)+J_{iy}(\bar{y}(t_{i}))\overline{W}(t_i)+\overline{W}(t_i+)J_{iy}(\bar{y}(t_{i}))^{\top}\\ \qquad\qquad +J_{iy}(\bar{y}(t_{i}))\overline{W}(t_i)J_{iy}(\bar{y}(t_{i}))^{\top}+\overline{\varphi}(t_i)^{\top}J^{}_{iyy}(\bar{y}(t_{i})),&t_i\in \Lambda,\\ \overline{W}(T) = G_{yy}(\bar{y}(T)); \end{array}\right.\quad \end{eqnarray}$

(3.3)

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{\overline{\Phi}}(t) = f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}\overline{\Phi}(t),&t\in[0,T]\setminus \Lambda,\\ \overline{\Phi}(t_i+) = \overline{\Phi}(t_i)+J_{iy}(\bar{y}(t_{i}))^{\top}\overline{\Phi}(t_i),&t_i\in \Lambda,\\ \overline{\Phi}(0) = I; \end{array}\right. \end{eqnarray}$

(3.4)

$\begin{eqnarray*} \label{3.5} &&\frac{1}{2}\int_{0}^{T}\langle H_{uu}[t][u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle dt\nonumber\\ &&+\int_{0}^{T}\int_{0}^{t}\langle \left[H_{uy}[t]+\overline{W}(t)f_{u}[t]^{\top}\right][u(t)-\bar{u}(t)], \\ &&\qquad \overline{\Phi}(t)\overline{\Phi}(s)^{-1}f_{u}[s]^{T}[u(s)-\bar{u}(s)]ds\rangle dt \geq0\;\mathit{\mbox{for all}}\;u(\cdot)\in \mathcal{U}_{ad} \nonumber. \end{eqnarray*}$

Proof. Now, let $(\bar{y}(\cdot), \bar{u}(\cdot))$ be the given optimal pair and $\varepsilon\in(0, 1]$ . For an arbitrary but fixed $u(\cdot)\in \mathcal{U}_{ad}$ , let $u^{\varepsilon}(\cdot) = \bar{u}(\cdot)+\varepsilon (u(\cdot)-\bar{u}(\cdot))$ . It follows from the assumption (A1) that $u^{\varepsilon}(\cdot)\in \mathcal{U}_{ad}$ ; according to Theorem 2.2, $u^{\varepsilon}$ determines the unique allowed state $y^{\varepsilon}(\cdot)$ ; then, we can get from (A2), (A3) and Theorem 2.2 (see (2.2) and (2.3)) that

$\begin{eqnarray*} &&\left\|y^{\varepsilon}(t)-\bar{y}(t)\right\|\\ &\leq&\int_{0}^{t}\left\|f\left(s,y^{\varepsilon}(s),u^{\varepsilon}(s)\right)-f\left(s,\bar{y}(s),\bar{u}(s)\right)\right\|ds+ \sum\limits_{0 < t_i < t}\left\|J_i\left(y^{\varepsilon}(t_i)\right)-J_i\left(\bar{y}(t_i)\right)\right\|\\ &\leq&L(M)\int_{0}^{t}\left(\left\|y^{\varepsilon}(t)-\bar{y}(t)\right\|+\varepsilon\left\|u(t)-\bar{u}(t)\right\|\right)ds+L(M) \sum\limits_{0 < t_i < t}\left\|y^{\varepsilon}(t_i)-\bar{y}(t_i)\right\|. \end{eqnarray*}$

Using the impulse integral inequality (see ^[1]), we have

$\begin{eqnarray} \lim\limits_{\varepsilon\rightarrow0}\left\|y^{\varepsilon}-\bar{y}\right\|_{PC} = 0. \end{eqnarray}$

(3.5)

Let

$\begin{eqnarray*} \label{3.7} Y(t) = \lim\limits_{\varepsilon\rightarrow0}Y^{\varepsilon}(t) = \lim\limits_{\varepsilon\rightarrow0} \frac{y^{\varepsilon}(t)-y(t)}{\varepsilon}. \end{eqnarray*}$

In the same way as for (3.5), it is easy to show that

$\begin{eqnarray} \lim\limits_{\varepsilon\rightarrow0}\left\|Y^{\varepsilon}-Y\right\|_{PC} = 0, \end{eqnarray}$

(3.6)

and $Y$ solves the following system of variational equations:

$\begin{eqnarray} \left\{ \begin{array}{ll} \dot{Y}(t) = f_{y}[t]^{\top}Y(t)+f_{u}[t]^{\top}(u(t)-\bar{u}(t)),&t\in [0,T]\setminus\Lambda,\\ Y(t_i+) = Y(t_i)+J_{iy}\left(\bar{y}(t_{i})\right)^{\top}Y(t_{i}), &t_i\in\Lambda,\\ Y(0) = 0. \end{array} \right. \end{eqnarray}$

(3.7)

To obtain the first-order necessary condition for Problem P, the following proposition will be used.

Proposition 3.2. Let (A2) and (A3) hold and $\overline{\varphi}\in PC_{r}\left([0, T], \mathbb{R}^{n}\right)$ be the solution of the impulsive adjoint equation given by (3.2). Then

$\begin{eqnarray} &&\int_{0}^{T}\langle f_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)\overline{\varphi}(t),u(t)-\bar{u}(t)\rangle dt\\ & = &\int_{0}^{T} \langle l_{y}\left(s,\bar{y}(s),\bar{u}(s)\right),Y(s) \rangle ds+\langle G_{y}\left(\bar{y}(T)\right),Y(T)\rangle. \end{eqnarray}$

(3.8)

Proof. Since $C([0, T])$ is a dense subspace in $L^{1}([0, T])$ , there exist function sequences $\left\{f^{\alpha}_{y}\right\}$ , $\left\{l_{y}^{\alpha}\right\}\subseteq C([0, T]$ such that

$\begin{eqnarray} f^{\alpha}_{y}(\cdot)\longrightarrow f_{y}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot))\mbox{ and }l^{\alpha}_{y}(\cdot)\longrightarrow l_{y}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot))\mbox{ in } L^{1}([0,T])\mbox{ as }\alpha\rightarrow \infty. \end{eqnarray}$

(3.9)

Moreover, it follows from (A3) that the system of linear impulsive differential equations given by

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{\varphi}_{\alpha}(t) = -f^{\alpha}_{y}(t)\varphi_{\alpha}(t) -l_{y}^{\alpha}(t),&t\in[0,T]\setminus \Lambda,\\ \varphi_{\alpha}(t_i-) = \varphi_{\alpha}(t_i)+J_{iy}(\bar{y}(t_{i}))\varphi_{\alpha}(t_{i}),&t_i\in \Lambda,\\ \varphi_{\alpha}(T) = G_{y}(\bar{y}(T)), \end{array}\right. \end{eqnarray}$

(3.10)

has a unique solution $\varphi_{\alpha}\in PC_{r}\left([0, T], \mathbb{R}^{n}\right)\bigcap C^{1}\left([0, T]\setminus\Lambda, \mathbb{R}^{n}\right)$ , and that there is a constant $\beta > 0$ such that

$\|\varphi_{\alpha}\|_{PC}\leq\beta\mbox{ for all } \alpha.$

Hence, we have

$\begin{eqnarray*} &&\left\|\varphi_{\alpha}(t)-\overline{\varphi}(t)\right\|\\ &\leq& \int_{t}^{T}\left\|l_{y}^{\alpha}(s)-l_{y}(s,\bar{y}(s),\bar{u}(s))\right\|ds+\int_{t}^{T}\left\|f^{\alpha}_{y}(s)-f_{y}(s,\bar{y}(s),\bar{u}(s))\right\|\left\|\varphi_{\alpha}(s)\right\|ds\\ &&+\int_{t}^{T}\left\|f_{y}(s,\bar{y}(s),\bar{u}(s))\right\|\left\|\varphi_{\alpha}(s)-\overline{\varphi}(s)\right\|ds+\sum _{t < t_i < T}\left\|J_{iy}(\bar{y}(t_{i}))\right\|\left\|\varphi_{\alpha}(t_{i})-\overline{\varphi}(t_{i})\right\|. \end{eqnarray*}$

Therefore, using the same method as for (3.5), it is not difficult to show that

$\begin{eqnarray} \lim\limits_{\alpha\rightarrow \infty}\left\|\varphi_{\alpha}-\overline{\varphi}\right\|_{PC} = 0. \end{eqnarray}$

(3.11)

Consequently, we can infer from (3.7) and (3.10) that

$\begin{eqnarray*} &&\int_{0}^{T}\langle f_{u}\left(t,\bar{y}(t),\bar{u}(t)\right) \varphi_{\alpha}(t), u(t)-\bar{u}(t)\rangle dt\nonumber\\ & = &\int_{0}^{T}\langle \varphi_{\alpha}(t), f_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)^{\top}(u(t)-\bar{u}(t))\rangle dt\nonumber\\ & = &\int_{0}^{T}\langle \varphi_{\alpha}(t), \dot{Y}(t)-f^{\top}_{y}\left(t,\bar{y}(t),\bar{u}(t)\right)Y(t)\rangle dt\nonumber\\ & = &\langle \varphi_{\alpha}(T),Y(T)\rangle-\sum\limits_{i = 1}^{k}\left[\langle \varphi_{\alpha}(t_i),Y(t_i+)\rangle-\langle \varphi_{\alpha}(t_i-),Y(t_i)\rangle\right]\\ &&-\int_{0}^{T}\langle \dot{\varphi}_{\alpha}(t)+f_{y}^{\alpha}(t)\varphi_{\alpha}(t),Y(t)\rangle dt\nonumber\\ & = &\langle G_{y}(\bar{y}(T)),Y(T)\rangle-\int_{0}^{T}\langle \dot{\varphi}_{\alpha}(t)+f_{y}^{\alpha}(t)\varphi_{\alpha}(t),Y(t)\rangle dt\nonumber\\ & = &\langle G_{y}(\bar{y}(T)),Y(T)\rangle+\int_{0}^{T}\langle l_{y}^{\alpha}(t),Y(t)\rangle dt.\nonumber\\ \end{eqnarray*}$

Let $\alpha\longrightarrow\infty$ in the above expression; using (3.9) and (3.11), we have (3.8). Therefore, we have finished the proof of Proposition 3.2.

Based on the above proposition, we now continue to prove Theorem 3.1.

By the optimality of $\bar{u}$ , one can ascertain from the assumptions (A2) and (A3), (3.2), (3.6), (3.7), and Proposition 3.2 (see (3.8)) that

$\begin{eqnarray*} 0&\leq&\lim\limits_{\varepsilon\rightarrow0}\frac{J\left(u^{\varepsilon}\right)-J\left(\bar{u}\right)}{\varepsilon}\\ & = &\lim\limits_{\varepsilon\rightarrow0}\int_{0}^{T}\langle\int_{0}^{1}l_{y}\left(t,\bar{y}(t)+\tau\left(y^{\varepsilon}(t)-\bar{y}(t)\right),\bar{u}(t)\right)d\tau,Y^{\varepsilon}(t)\rangle dt\\ &&+\lim\limits_{\varepsilon\rightarrow0}\int_{0}^{T}\langle\int_{0}^{1}l_{u}\left(t,y^{\varepsilon}(t),\bar{u}(t)+\tau \varepsilon\left(u(t)-\bar{u}(t)\right)\right)d\tau,u(t)-\bar{u}(t)\rangle dt\\ &&+\lim\limits_{\varepsilon\rightarrow0}\langle\int_{0}^{1} G_{y}\left(\bar{y}(T)+\tau\left(y^{\varepsilon}(T)-\bar{y}(T)\right)\right)d\tau,Y^{\varepsilon}(T)\rangle\nonumber\\ & = &\int_{0}^{T} \langle l_{y}\left(t,\bar{y}(t),\bar{u}(t)\right),Y(t) \rangle dt+\int_{0}^{T}\langle l_{u}\left(t,\bar{y}(t),\bar{u}(t)\right),u(t)-\bar{u}(t)\rangle dt\\ &&+ \langle G_{y}\left(\bar{y}(T)\right),Y(T)\rangle\\ & = &\int_{0}^{T}\langle l_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)+f_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)\overline{\varphi}(t),u(t)-\bar{u}(t)\rangle dt, \end{eqnarray*}$

which leads to the following optimal inequality:

$\begin{eqnarray*} \int_{0}^{T}\langle l_{u}(t,\bar{y}(t),\bar{u}(t))+f_{u}(t,\bar{y}(t) ,\bar{u}(t))\overline{\varphi}(t),u(t)-\bar{u}(t)\rangle dt\geq0. \end{eqnarray*}$

Because of the arbitrariness of $u\in \mathcal{U}_{ad}$ , we get

$\begin{eqnarray} \int_{0}^{T}\langle l_{u}(t,\bar{y}(t),\bar{u}(t))+f_{u}(t,\bar{y}(t) ,\bar{u}(t))\overline{\varphi}(t),u(t)-\bar{u}(t)\rangle dt = 0. \end{eqnarray}$

(3.12)

Moreover, combining this with (A1) and Proposition 3.2, it follows that, for all continuous time points $t\in [0, T]$ of $\bar{u}(t)$ , we have

$\begin{eqnarray*} \label{3.15} H_{u}[t] = l_{u}(t,\bar{y}(t),\bar{u}(t))+\langle f_{u}(t,\bar{y}(t) ,\bar{u}(t)), \overline{\varphi}(t) \rangle = 0. \end{eqnarray*}$

We define

$\begin{eqnarray} \overline{U}(t) = \left\{v\in U\mid l_{u}(t,\bar{y}(t),v)+\langle f_{u}(t,\bar{y}(t) ,v), \overline{\varphi}(t) \rangle = 0 \right\}. \end{eqnarray}$

(3.13)

We refer to this as the singular control region in the classical sense, which will be used later.

Let

$Z^{\varepsilon}(\cdot) \equiv\left(Z_{1}^{\varepsilon}(\cdot) \quad Z_{2}^{\varepsilon}(\cdot) \quad \cdots \quad Z_{n}^{\varepsilon}(\cdot)\right)^{\top} = \frac{Y^{\varepsilon}(t)-Y(t)}{\varepsilon},$

and

$\begin{eqnarray*} \label{3.16} Z_{k}(t) = \lim\limits_{\varepsilon\rightarrow0}Z_{k}^{\varepsilon}(t). \end{eqnarray*}$

In the same way as for (3.5), one can also claim from (A2) and (A3) that

$\begin{eqnarray} \lim\limits_{\varepsilon\rightarrow0}\left\|Z^{\varepsilon}-Z\right\|_{PC([0,T])} = 0, \end{eqnarray}$

(3.14)

and $Z(\cdot)$ denotes the solution of the following system of equations:

$\begin{eqnarray} \left\{ \begin{array}{ll} \dot{Z}(t) = f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}Z(t)\\ \qquad+\frac{1}{2} \langle f_{uu}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],[u(t)-\bar{u}(t)]\rangle\\ \qquad+ \langle f_{uy}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)], Y(t) \rangle\\ \qquad+\frac{1}{2} \langle f_{yy}(t,\bar{y}(t),\bar{u}(t))Y(t),Y(t)\rangle,&t\in (0,T]\setminus\Lambda, \\ Z(t_i+) = Z(t_i)+J^{\top}_{iy}(\bar{y}(t_{i}))Z(t_i)\\ \quad \qquad+\frac{1}{2}\langle J_{iyy}(\bar{y}(t_{i}))Y(t_{i}),Y(t_{i}) \rangle,&t_i\in \Lambda,\\ Z(0) = 0. \end{array}\right. \end{eqnarray}$

(3.15)

where

$\begin{eqnarray*} &&\frac{1}{2} \langle f_{uu}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],[u(t)-\bar{u}(t)]\rangle\\ & = & \frac{1}{2} \left\{ \begin{array}{ll} \left.\begin{gathered} \langle f^1_{uu}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],[u(t)-\bar{u}(t)]\rangle \\ \langle f^2_{uu}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],[u(t)-\bar{u}(t)]\rangle \\ \vdots\\ \langle f^n_{uu}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],[u(t)-\bar{u}(t)]\rangle \\ \end{gathered} \right\}, \end{array} \right. \end{eqnarray*}$

$\begin{eqnarray*} &&\frac{1}{2} \langle f_{yy}(t,\bar{y}(t),\bar{u}(t))Y(t),Y(t)\rangle\\ & = & \frac{1}{2} \left\{ \begin{array}{ll} \left.\begin{gathered} \langle f^1_{yy}(t,\bar{y}(t),\bar{u}(t))Y(t),Y(t)\rangle \\ \langle f^2_{yy}(t,\bar{y}(t),\bar{u}(t))Y(t),Y(t)\rangle \\ \vdots\\ \langle f^n_{yy}(t,\bar{y}(t),\bar{u}(t))Y(t),Y(t)\rangle \\ \end{gathered} \right\}, \end{array} \right. \end{eqnarray*}$

and

$\begin{eqnarray*} && \langle f_{uy}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)], Y(t) \rangle\\ & = & \left\{ \begin{array}{ll} \left.\begin{gathered} \langle f^1_{uy}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)]),Y(t)\rangle \\ \langle f^2_{uy}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],Y(t)\rangle \\ \vdots\\ \langle f^n_{uy}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)],Y(t)\rangle \\ \end{gathered} \right\}. \end{array} \right. \end{eqnarray*}$

Meanwhile, one can infer from (3.7) that $X: = YY^{\top}$ is the solution to the following system of equations:

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{X}(t) = f_{y}(t,\bar{y}(t),\bar{u}(t))^\top X(t)\\ \qquad+ X(t)f_{y}(t,\bar{y}(t),\bar{u}(t))\\ \qquad+f_{u}(t,\bar{y}(t),\bar{u}(t))^\top(u(t)-\bar{u}(t)) Y^{\top}(t)\\ \qquad+Y(t)\left(f_{u}(t,\bar{y}(t),\bar{u}(t))^\top(u(t)-\bar{u}(t))\right)^{\top},&t\in [0,T]\setminus\Lambda, \\ X(t_i+) = X(t_i)+J^{\top}_{iy}(\bar{y}(t_{i}))X(t_{i})+X(t_{i})J_{iy}(\bar{y}(t_{i}))\\ \qquad+J^{\top}_{iy}(\bar{y}(t_{i}))X(t_{i})J_{iy}(\bar{y}(t_{i})),&t_i\in \Lambda,\\ X(0) = 0. \end{array}\right. \end{eqnarray}$

(3.16)

The subsequent proposition plays a crucial role in obtaining the integral form of the second-order necessary conditions.

Proposition 3.3. Let (A2) and (A3) hold and $W\in PC_{r}\left([0, T], \mathbb{R}^{n\times n}\right)$ , $\overline{\Phi}\in PC_{l}\left([0, T], \mathbb{R}^{n\times n}\right)$ be the solution of (3.3) and (3.4). Then,

$\begin{eqnarray} &&\int_{0}^{T} \langle \overline{W}(t)f_{u}(t,\bar{y}(t),\bar{u}(t))^{\top}(u(t)-\bar{u}(t)),Y(t)\rangle dt\\ & = &\frac{1}{2}\int_{0}^{T}\langle \left[l_{yy}\left(t,\bar{y}(t),\bar{u}(t)\right)+\overline{\varphi}(t)^{\top}f_{yy}(t,\bar{y}(t),\bar{u}(t))\right] Y(t),Y(t)\rangle dt\\ &&+\frac{1}{2}\langle G_{yy}\left(\bar{y}(T)\right) Y(T),Y(T)\rangle+\frac{1}{2}\sum\limits_{i = 1}^{k}\langle \overline{\varphi}(t_i)^{\top}J_{iyy}(\bar{y}(t_{i}))Y(t_{i})),Y(t_{i}))\rangle, \end{eqnarray}$

(3.17)

and

$\begin{eqnarray} &&\int_{0}^{T}\langle (l_{uy}(t,\bar{y}(t),\bar{u}(t))+\overline{\varphi}(t)^{\top}f_{uy}(t,\bar{y}(t),\bar{u}(t))+\overline{W}(t)f_{u}(t,\bar{y}(t),\bar{u}(t))^{T})[u(t)-\bar{u}(t)], Y(t)\rangle dt\\ & = &\int_{0}^{T}dt\int_{0}^{t}\langle (l_{uy}(t,\bar{y}(t),\bar{u}(t))+\overline{\varphi}(t)^{\top}f_{uy}(t,\bar{y}(t),\bar{u}(t))+\overline{W}(t)f_{u}(t,\bar{y}(t),\bar{u}(t))^{T})[u(t)-\bar{u}(t)], .\\ &&\qquad \qquad \qquad \qquad \qquad \qquad \overline{\Phi}(t)\overline{\Phi}(s)^{-1}f_{u}(s,\bar{y}(s),\bar{u}(s))^{T}[u(s)-\bar{u}(s)]\rangle ds. \end{eqnarray}$

(3.18)

Proof. Since $C([0, T])$ is a dense subspace in $L^{1}([0, T])$ , there exist function sequences $\left\{f^{\alpha}_{y}\right\}$ , $\left\{f^{\alpha}_{yy}\right\}$ , $\left\{f^{\alpha}_{uy}\right\}$ , $\left\{f^{\alpha}_{u}\right\}$ , $\left\{l^{\alpha}_{y}\right\}$ , $\left\{l_{yy}^{\alpha}\right\}$ , $\left\{l_{uy}^{\alpha}\right\}\subseteq C([0, T]$ such that

$\begin{eqnarray} \left\{\begin{array}{ll} f^{\alpha}_{y}(\cdot)\longrightarrow f_{y}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot)),&l^{\alpha}_{y}(\cdot)\longrightarrow l_{y}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot)),\\ f^{\alpha}_{yy}(\cdot)\longrightarrow f_{yy}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot)),&l^{\alpha}_{yy}(\cdot)\longrightarrow l_{yy}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot)),\\ f^{\alpha}_{uy}(\cdot)\longrightarrow f_{uy}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot)),&l^{\alpha}_{uy}(\cdot)\longrightarrow l_{uy}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot)),\\ f^{\alpha}_{u}\longrightarrow f_{u},& \end{array}\right.\mbox{ in } L^{1}([0,T])\mbox{ as }\alpha\rightarrow \infty. \end{eqnarray}$

(3.19)

Consequently, by (A3) and (3.19), one can infer that the systems of linear impulsive matrix differential equations given by

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{W}_{\alpha}(t) = -f^{\alpha}_{y}(t)W_{\alpha}(t)-W_{\alpha}(t)f_{y}^{\alpha}(t)^{\top}- \overline{\varphi}(t)^{\top}f^{\alpha}_{yy}(t)- l^{\alpha}_{yy}(t),&t\in[0,T]\setminus \Lambda,\\ W_{\alpha}(t_i-) = W_{\alpha}(t_i+)+J_{iy}(\bar{y}(t_{i}))W_{\alpha}(t_i+)+W_{\alpha}(t_i+)J_{iy}(\bar{y}(t_{i}))^{\top}\\ \qquad\qquad +J_{iy}(\bar{y}(t_{i}))W_{\alpha}(t_i+)J_{iy}(\bar{y}(t_{i}))^{\top}+\overline{\varphi}(t_i)^{\top}J_{iyy}(\bar{y}(t_{i})),&t_i\in \Lambda,\\ W_{\alpha}(T) = G_{yy}(\bar{y}(T)), \end{array}\right. \end{eqnarray}$

(3.20)

and

$\begin{eqnarray} \left\{\begin{array}{ll} \dot{\Phi}_{\alpha}(t) = f^{\alpha}_{y}(t)^{\top}\Phi_{\alpha}(t),&t\in[0,T]\setminus \Lambda,\\ \Phi_{\alpha}(t_i+) = \Phi_{\alpha}(t_i)+J_{iy}(\bar{y}(t_{i}))^{\top} \Phi_{\alpha}(t_i),&t_i\in \Lambda,\\ \Phi_{\alpha}(0) = I, \end{array}\right. \end{eqnarray}$

(3.21)

each have a unique solution $W_{\alpha}\in PC_{r}\left([0, T], \mathbb{R}^{n\times n}\right)\bigcap C^{1}\left([0, T]\setminus\Lambda, \mathbb{R}^{n\times n}\right)$ and $\Phi_{\alpha}\in PC_{l}\big([0, T], \; \mathbb{R}^{n\times n}\big)\bigcap$ $C^{1}\left([0, T]\setminus\Lambda, \mathbb{R}^{n\times n}\right)$ , respectively. Not only that, there exists a constant $\gamma > 0$ such that

$\|W_{\alpha}\|_{PC}\leq\gamma \; \text{and}\; \|\Phi_{\alpha}\|_{PC}\leq\gamma \mbox{ for all } \alpha.$

Moreover, we have

$\begin{eqnarray*} &&\left\|W_{\alpha}(t)-\overline{W}(t)\right\|\\ &\leq& \int_{t}^{T}\left\|l_{yy}^{\alpha}(s)-l_{yy}(s,\bar{y}(s),\bar{u}(s))\right\|ds+\int_{t}^{T}\|\overline{\varphi}(s)^{\top}\|\left\|f_{yy}^{\alpha}(s)-f_{yy}(s,\bar{y}(s),\bar{u}(s))\right\|ds\\ &&+2\gamma\int_{t}^{T}\left\|f^{\alpha}_{y}(s)-f_{y}(s,\bar{y}(s),\bar{u}(s))\right\|ds+2\int_{t}^{T}\left\|W_{\alpha}(s)-\overline{W}(s)\right\|\left\|f^{\top}_{y}(s,\bar{y}(s),\bar{u}(s))\right\|ds\\ &&+2\sum _{t < t_i < T}\left(\left\|J_{iy}(\bar{y}(t_{i}))\right\|+\left\|J_{iy}(\bar{y}(t_{i}))\right\|^{2}\right)\left\|W_{\alpha}(t_{i})-\overline{W}(t_{i})\right\|, \end{eqnarray*}$

and

$\begin{eqnarray*} &&\left\|\Phi_{\alpha}(t)-\overline{\Phi}(t)\right\|\\ &\leq& \gamma\int_{0}^{t}\left\|f^{\alpha}_{y}(s)-f_{y}(s,\bar{y}(s),\bar{u}(s))\right\|ds+\int_{0}^{t}\left\|f_{y}(s,\bar{y}(s),\bar{u}(s))\right\|\left\|\Phi_{\alpha}(s)-\overline{\Phi}(s)\right\|ds\\ &+&\sum _{0 < t_i < t}\left\|J_{iy}(\bar{y}(t_{i}))\right\|\left\|\Phi_{\alpha}(t_{i})-\overline{\Phi}(t_{i})\right\|.\\ \end{eqnarray*}$

In the same way as for (3.5), we obtain

$\begin{eqnarray} \lim\limits_{\alpha\rightarrow \infty}\left\|W_{\alpha}-\overline{W}\right\|_{PC} = 0 \; \text{and}\; \lim\limits_{\alpha\rightarrow \infty}\left\|\Phi_{\alpha}-\overline{\Phi}\right\|_{PC} = 0. \end{eqnarray}$

(3.22)

In addition, it is obvious from (3.20) that $W^{\top}_{\alpha}$ is also a solution of (3.20). This means that

$\begin{eqnarray} W^{\top}_{\alpha}(t) = W_{\alpha}(t)\mbox{ for all } t\in [0,T]. \end{eqnarray}$

(3.23)

Since $tr(AB) = tr(BA)$ for all $k \times j$ matrix $A$ and $j \times k$ matrix $B$ , we can get from (3.2), (3.7), (3.16), (3.19), (3.20), (3.22), and (3.23) that

$\begin{eqnarray*} && 2\int_{0}^{T} \langle \overline{W}(t)f_{u}(t,\bar{y}(t),\bar{u}(t))^{\top}(u(t)-\bar{u}(t)),Y(t)\rangle dt\\ & = &2\lim\limits_{\alpha\rightarrow \infty}\int_{0}^{T} \langle W_{\alpha}(t)\left(\dot{Y}(t)-f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}Y(t)\right),Y(t)\rangle dt\\ & = &\lim\limits_{\alpha\rightarrow \infty}tr\int_{0}^{T} \bigg[ W_{\alpha}(t)\left(\dot{Y}(t)-f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}Y(t)\right)Y^{\top}(t)\\ &&+W_{\alpha}(t)Y(t)\left(\dot{Y}(t)^{\top}-Y(t)^{\top}f_{y}(t,\bar{y}(t),\bar{u}(t))\right) \bigg]dt\\ & = &\lim\limits_{\alpha\rightarrow \infty}tr\int_{0}^{T} W_{\alpha}(t)\left[\dot{X}(t)-f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}X(t)-X(t)f_{y}(t,\bar{y}(t),\bar{u}(t))\right]dt\\ & = &\lim\limits_{\alpha\rightarrow \infty}\bigg\{-\int_{0}^{T} \langle \left[\dot{W}_{\alpha}(t)+W_{\alpha}(t)f_{y}(t,\bar{y}(t),\bar{u}(t))^{\top}+f_{y}(t,\bar{y}(t),\bar{u}(t))W_{\alpha}(t)\right]Y(t),Y(t)\rangle dt\\ &&+\langle W_{\alpha}(T)Y(T),Y(T)\rangle+\sum\limits_{i = 1}^{k}\left[ \langle W_{\alpha}(t_i-)Y(t_i),Y(t_i)\rangle-\langle W_{\alpha}(t_i)Y(t_i+),Y(t_i+)\rangle\right]\bigg\}\\ & = &\lim\limits_{\alpha\rightarrow \infty}\bigg\{-\int_{0}^{T}\langle \left[\dot{W}_{\alpha}(t)+W_{\alpha}(t)f^{\alpha}_{y}(t)^{\top}+f^{\alpha}_{y}(t)W_{\alpha}(t)\right]Y(t),Y(t)\rangle dt\\ &&+\langle W_{\alpha}(T)Y(T),Y(T)\rangle+\sum\limits_{i = 1}^{k}\bigg[\langle (W_{\alpha}(t_i-)-W_{\alpha}(t_i))Y(t_i),Y(t_i)\rangle\\ &&-\langle\left( J_{iy}\left(\bar{y}(t_{i})\right)W_{\alpha}(t_i)+W_{\alpha}(t_i)J_{iy}\left(\bar{y}(t_{i})\right)^{\top}\right)Y(t_i),Y(t_{i})\rangle\\ &&-\langle J_{iy}\left(\bar{y}(t_{i})\right)W_{\alpha}(t_i)J_{iy}\left(\bar{y}(t_{i})\right)^{\top}Y(t_{i}),Y(t_i)\rangle\bigg]\bigg\}\\ & = &\lim\limits_{\alpha\rightarrow \infty}\bigg\{\int_{0}^{T}\langle \left[ l^{\alpha}_{yy}(t)+\overline{\varphi}(t)^{\top} f^{\alpha}_{yy}(t) \right]Y(t),Y(t) \rangle dt+\langle W_{\alpha}(T)Y(T),Y(T)\rangle\\ &&+\sum\limits_{i = 1}^{k}\bigg[\langle (W_{\alpha}(t_i-)-W_{\alpha}(t_i))Y(t_i),Y(t_i)\rangle\\ &&-\langle\left( J_{iy}\left(\bar{y}(t_{i})\right)W_{\alpha}(t_i)+W_{\alpha}(t_i)J_{iy}\left(\bar{y}(t_{i})\right)^{\top}\right)Y(t_i),Y(t_{i})\rangle\\ &&-\langle J_{iy}\left(\bar{y}(t_{i})\right)W_{\alpha}(t_i)J_{iy}\left(\bar{y}(t_{i})\right)^{\top}Y(t_{i}),Y(t_i)\rangle\bigg]\bigg\}\\ & = &\int_{0}^{T} \langle \left[l_{yy}(t,\bar{y}(t),\bar{u}(t))+\overline{\varphi}(t)^{\top}f_{yy}(t,\bar{y}(t),\bar{u}(t))\right]Y(t),Y(t)\rangle dt\\ &&+\langle G_{yy}(\bar{y}(T))Y(T),Y(T)\rangle+\sum\limits_{i = 1}^{k}\langle \overline{\varphi}(t_i)^{\top}J_{iyy}(\bar{y}(t_{i}))Y(t_{i}),Y(t_{i}) \rangle, \end{eqnarray*}$

i.e., (3.17) holds.

Now, let us prove (3.18). By (3.4), (3.7), (3.21), and (3.22), we have

$\begin{eqnarray*} Y(t)& = &\overline{\Phi}(t)\int_{0}^{t}\overline{\Phi}(s)^{-1}f_{u}[s]^{\top}(u(s)-\bar{u}(s))ds\\ & = &\lim\limits_{\alpha\rightarrow 0}\Phi_{\alpha}(t)\int_{0}^{t}\Phi_{\alpha}(s)^{-1}f_{u}[s]^{\top}(u(s)-\bar{u}(s))ds,\\ \end{eqnarray*}$

which means that (3.18) holds. Therefore, we have finished the proof of Proposition 3.3.

Based on the above propositions, we now continue to prove Theorem 3.1.

Since $\bar{u}$ represents optimal control of J over $\mathcal{U}_{ad}$ , together with Proposition 3.2 (see (3.8)) and (3.12), we have

$\begin{eqnarray} &&\int_{0}^{T} \langle l_{y}\left(t,\bar{y}(t),\bar{u}(t)\right),Y(t) \rangle ds+\int_{0}^{T}\langle l_{u}\left(t,\bar{y}(t),\bar{u}(t)\right),u(t)-\bar{u}(t)\rangle dt+ \langle G_{y}\left(\bar{y}(T)\right),Y(T)\rangle\\ & = &\int_{0}^{T}\langle l_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)+f_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)\varphi(t),u(t)-\bar{u}(t)\rangle dt\\ & = &0\mbox{ for all }u\in \mathcal{U}_{ad}. \end{eqnarray}$

(3.24)

Taken together with (A2), (A3), and (3.24), one can get

$\begin{eqnarray*} &&\frac{J\left(u^{\varepsilon}(\cdot)\right)-J\left(\bar{u}(\cdot)\right)}{\varepsilon}\\ & = &\int_{0}^{T}\langle\int_{0}^{1}\left(l_{u}\left(t,y^{\varepsilon}(t),\bar{u}(t)+\tau \varepsilon\left(u(t)-\bar{u}(t)\right)\right)-l_{u}\left(t,\bar{y}(t),\bar{u}(t)\right)\right)d\tau,u(t)-\bar{u}(t)\rangle dt\\ &&+\int_{0}^{T}\langle\int_{0}^{1}\left(l_{y}\left(t,\bar{y}(t)+\tau\left(y^{\varepsilon}(t)-\bar{y}(t)\right),\bar{u}(t)\right)-l_{y}\left(t,\bar{y}(t),\bar{u}(t)\right)\right)d\tau,Y^{\varepsilon}(t)\rangle dt\\ &&+\int_{0}^{T} \langle l_{y}\left(t,\bar{y}(t),\bar{u}(t)\right),Y^{\varepsilon}(t)-Y(t) \rangle dt+\langle G_{y}\left(\bar{y}(T)\right),Y^{\varepsilon}(T)-Y(T)\rangle\\ &&+\langle\int_{0}^{1} \left(G_{y}\left(\bar{y}(T)+\tau\left(y^{\varepsilon}(T)-\bar{y}(T)\right)\right)-G_{y}\left(\bar{y}(T)\right)\right)d\tau,Y^{\varepsilon}(T)\rangle\\ & = &\varepsilon\int_{0}^{T}\langle\int_{0}^{1}\tau\int_{0}^{1}l_{uu}\left(t,y^{\varepsilon}(t),\bar{u}(t)+\nu\tau \varepsilon\left(u(t)-\bar{u}(t)\right)\right)d\nu d\tau[u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle dt\\ &&+\varepsilon\int_{0}^{T}\langle\int_{0}^{1}l_{uy}\left(t,\bar{y}(t)+\tau\left(y^{\varepsilon}(t)-\bar{y}(t)\right),\bar{u}(t)\right) (u(t)-\bar{u}(t))d\tau ,Y^{\varepsilon}(t)\rangle dt\\ &&+\varepsilon\int_{0}^{T}\langle\int_{0}^{1}\tau\int_{0}^{1}l_{yy}\left(t,\bar{y}(t)+\nu\tau\left(y^{\varepsilon}(t)-\bar{y}(t)\right),\bar{u}(t)\right)d\nu d\tau Y^{\varepsilon}(t),Y^{\varepsilon}(t)\rangle dt\\ &&+\varepsilon\int_{0}^{T} \langle l_{y}\left(t,\bar{y}(t),\bar{u}(t)\right),Z^{\varepsilon}(t)\rangle dt+\varepsilon\langle G_{y}\left(\bar{y}(T)\right),Z^{\varepsilon}(T)\rangle\\ &&+\varepsilon\langle\int_{0}^{1} \tau\int_{0}^{1}G_{yy}\left(\bar{y}(T)+\nu\tau\left(y^{\varepsilon}(T)-\bar{y}(T)\right)\right)d\nu d\tau Y^{\varepsilon}(T),Y^{\varepsilon}(T)\rangle. \end{eqnarray*}$

Then, combining this with (3.6) and (3.14), the above expression, (A2), and (A3) leads to the following:

$\begin{eqnarray*} \label{3.28} &&\frac{1}{2}\int_{0}^{T}\langle l_{uu}\left(t,\bar{y}(t),\bar{u}(t)\right)[u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle dt\nonumber\\ &&+\int_{0}^{T}\langle l_{uy}\left(t,\bar{y}(t),\bar{u}(t)\right) (u(t)-\bar{u}(t)),Y(t)\rangle ds\nonumber\\ &&+\frac{1}{2}\int_{0}^{T}\langle l_{yy}\left(t,\bar{y}(t),\bar{u}(t)\right) Y(t),Y(t)\rangle dt+\frac{1}{2}\langle G_{yy}\left(\bar{y}(T)\right) Y(T),Y(T)\rangle\\ &&+\int_{0}^{T} \langle l_{y}(t,\bar{y}(t),\bar{u}(t)) ,Z(t)\rangle ds+\langle G_{y}\left(\bar{y}(T)\right),Z(T)\rangle\geq0\mbox{ for all }u\in \mathcal{U}_{ad}.\nonumber \end{eqnarray*}$

By (3.2), we have

$\begin{eqnarray} &&\frac{1}{2}\int_{0}^{T}\langle l_{uu}\left(t,\bar{y}(t),\bar{u}(t)\right)[u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle dt\\ &&+\int_{0}^{T}\langle l_{uy}\left(t,\bar{y}(t),\bar{u}(t)\right) (u(t)-\bar{u}(t)),Y(t)\rangle ds\\ &&+\frac{1}{2}\int_{0}^{T}\langle l_{yy}\left(t,\bar{y}(t),\bar{u}(t)\right) Y(t),Y(t)\rangle dt+\frac{1}{2}\langle G_{yy}\left(\bar{y}(T)\right) Y(T),Y(T)\rangle\\ &&-\int_{0}^{T} \langle \dot{\overline{\varphi}}(t)+f_{y}(t,\bar{y}(t),\bar{u}(t))\overline{\varphi}(t) ,Z(t)\rangle ds+\langle G_{y}\left(\bar{y}(T)\right),Z(T)\rangle\geq0\mbox{ for all }u\in \mathcal{U}_{ad}. \end{eqnarray}$

(3.25)

Since $C([0, T])$ is a dense subspace in $L^{1}([0, T])$ , there exist function sequences $\left\{u^{\alpha}\right\}, \left\{f_{uu}^{\alpha}\right\}\subseteq C([0, T])$ such that

$\begin{eqnarray} \lim\limits_{\alpha\rightarrow \infty}\left\|u^{\alpha}-[u-\bar{u}]\right\|_{L^{1}} = 0\mbox{ and }\lim\limits_{\alpha\rightarrow \infty}\left\|f_{uu}^{\alpha}(\cdot)-f_{uu}(\cdot,\bar{y}(\cdot),\bar{u}(\cdot))\right\|_{L^{1}} = 0. \end{eqnarray}$

(3.26)

It follows immediately from (3.15), (3.19), and (3.26) that the system of equations given by

$\begin{eqnarray} \left\{ \begin{array}{ll} \dot{Z}^{\alpha}(t) = f^{\alpha}_{y}(t)^\top Z^{\alpha}(t)+\frac{1}{2}u^{\alpha}(t)^\top f^{\alpha}_{uu}(t)u^{\alpha}(t)\\ \qquad\qquad + u^{\alpha}(t)^\top f^{\alpha}_{yu}(t)Y(t) +\frac{1}{2}Y(t)^\top f^{\alpha}_{yy}(t)Y(t),\qquad \qquad t\in [0,T]\setminus\Lambda, \\ Z^{\alpha}(t_i+) = Z^{\alpha}(t_i)+\frac{1}{2}Y(t_{i}))^{\top}J_{iyy}(\bar{y}(t_{i}))Y(t_{i}))+J_{iy}(\bar{y}(t_{i}))^{\top}Z^{\alpha}(t_i),\qquad t_i\in \Lambda,\\ Z^{\alpha}(0) = 0, \end{array}\right. \end{eqnarray}$

(3.27)

has a unique solution $Z^{\alpha}\in PC_{l}\left([0, T], \mathbb{R}^{n}\right)\bigcap C^{1}\left([0, T]\setminus\Lambda, \mathbb{R}^{n}\right)$ and

$\begin{eqnarray} \lim\limits_{\alpha\rightarrow \infty}\left\|Z^{\alpha}-Z\right\|_{PC} = 0, \end{eqnarray}$

(3.28)

where $Z(\cdot)$ is the solution to (3.15).

Moreover, we can infer from (3.19) and (3.26)–(3.28) that

$\begin{eqnarray} &&-\int_{0}^{T} \langle \dot{\overline{\varphi}}(t)+f_{y}(t,\bar{y}(t),\bar{u}(t))\overline{\varphi}(t) ,Z(t)\rangle dt+\langle G_{y}\left(\bar{y}(T)\right),Z(T)\rangle\\ & = &-\lim\limits_{\alpha\rightarrow \infty}\int_{0}^{T} \langle \dot{\overline{\varphi}}(t)+f^{\alpha}_{y}(t)\overline{\varphi}(t) ,Z^{\alpha}(t)\rangle dt+\langle G_{y}\left(\bar{y}(T)\right),Z(T)\rangle\\ & = &\lim\limits_{\alpha\rightarrow \infty}\left\{\int_{0}^{T} \langle \overline{\varphi}(t), \dot{Z}^{\alpha}(t)-f^{\alpha\top}_{y}(t)Z^{\alpha}(t) \rangle dt-\sum\limits_{i = 1}^{k}\left[\langle \overline{\varphi}(t_i-),Z^{\alpha}(t_i) \rangle-\langle\overline{\varphi}(t_i),Z^{\alpha}(t_i+)\rangle\right]\right\}\\ & = &\frac{1}{2} \lim\limits_{\alpha\rightarrow \infty}\int_{0}^{T} \langle \overline{\varphi}(t),u^{\alpha}(t)^\top f^{\alpha}_{uu}(t)u^{\alpha}(t)+ 2u^{\alpha}(t)^\top f^{\alpha}_{yu}(t)Y(t) +Y(t)^\top f^{\alpha}_{yy}(t)Y(t)\rangle dt\\ &&+\frac{1}{2}\sum\limits_{i = 1}^{k}\langle \overline{\varphi}(t_i), Y(t_{i}))^{\top}J_{iyy}(\bar{y}(t_{i}))Y(t_{i}))\rangle\\ & = &\frac{1}{2} \int_{0}^{T} \langle \overline{\varphi}(t),[u(t)-\bar{u}(t)]^{\top}f_{uu}(t,\bar{y}(t),\bar{u}(t))[u(t)-\bar{u}(t)]\rangle dt\\ &&+\int_{0}^{T} \langle \overline{\varphi}(t), (u(t)-\bar{u}(t))^{\top}(t)f_{uy}(t,\bar{y}(t),\bar{u}(t))Y(t)\rangle dt\\ &&+\frac{1}{2} \int_{0}^{T} \langle \overline{\varphi}(t),Y(t)^{\top}f_{yy}(t,\bar{y}(t),\bar{u}(t))Y(t)\rangle dt\\ &&+\frac{1}{2}\sum\limits_{i = 1}^{k}\langle \overline{\varphi}(t_i),Y(t_{i})^{\top}J_{iyy}(\bar{y}(t_{i}))Y(t_{i})))\rangle. \end{eqnarray}$

(3.29)

Taken together with (3.29), we deduce from (3.25) that

$\begin{eqnarray} &&\frac{1}{2}\int_{0}^{T}\langle \left[l_{uu}\left(t,\bar{y}(t),\bar{u}(t)\right)+\overline{\varphi}(t)^{\top}f_{uu}(t,\bar{y}(t),\bar{u}(t))\right][u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle ds\\ &&+\int_{0}^{T}\langle \left[l_{uy}\left(t,\bar{y}(t),\bar{u}(t)\right) +\overline{\varphi}(t)^{\top}f_{uy}(t,\bar{y}(t),\bar{u}(t))\right][u(t)-\bar{u}(t)],Y(t)\rangle dt\\ &&+\frac{1}{2}\int_{0}^{T}\langle \left[l_{yy}\left(t,\bar{y}(t),\bar{u}(t)\right)+\overline{\varphi}(t)^{\top}f_{yy}(t,\bar{y}(t),\bar{u}(t))\right] Y(t),Y(t)\rangle dt\\ &&+\frac{1}{2}\langle G_{yy}\left(\bar{y}(T)\right) Y(T),Y(T)\rangle+\frac{1}{2}\sum\limits_{i = 1}^{k}\langle \overline{\varphi}(t_i)^{\top}J_{iyy}(\bar{y}(t_{i}))Y(t_{i})),Y(t_{i}))\rangle \geq0\mbox{ for all }u\in \mathcal{U}_{ad}. \end{eqnarray}$

(3.30)

The following inequality follows from (3.30) and (3.17):

$\begin{eqnarray} &&\frac{1}{2}\int_{0}^{T}\langle [l_{uu}(t,\bar{y}(t),\bar{u}(t))+\overline{\varphi}(t)^{\top}f_{uu}(t,\bar{y}(t),\bar{u}(t))][u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle dt\\ &&+\int_{0}^{T}\langle [l_{uy}(t,\bar{y}(t),\bar{u}(t)) +\overline{\varphi}(t)^{\top}f_{uy}(t,\bar{y}(t),\bar{u}(t))+\overline{W}(t)f_{u}(t,\bar{y}(t),\bar{u}(t))^{\top}][u(t)-\bar{u}(t)],. \\ &&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad Y(t)\rangle dt \geq0\mbox{ for all }u\in \mathcal{U}_{ad}. \end{eqnarray}$

(3.31)

Then, by (3.18) and (3.31), we can show that

$\begin{eqnarray} &&\frac{1}{2}\int_{0}^{T}\langle [l_{uu}(t,\bar{y}(t),\bar{u}(t))+\overline{\varphi}(t)^{\top}f_{uu}(t,\bar{y}(t),\bar{u}(t))][u(t)-\bar{u}(t)],u(t)-\bar{u}(t)\rangle dt\\ &&+\int_{0}^{T}\int_{0}^{t}\langle [l_{uy}(t,\bar{y}(t),\bar{u}(t)) +\overline{\varphi}(t)^{\top}f_{uy}(t,\bar{y}(t),\bar{u}(t))+\overline{W}(t)f_{u}(t,\bar{y}(t),\bar{u}(t))^{\top}][u(t)-\bar{u}(t)],. \\ &&\qquad\qquad\qquad\overline{\Phi}(t)\overline{\Phi}(s)^{-1}f_{u}(s,\bar{y}(s),\bar{u}(s))^{T}[u(s)-\bar{u}(s)]ds\rangle dt \geq0\mbox{ for all }u\in \mathcal{U}_{ad}, \end{eqnarray}$

(3.32)

Thus, we have finished the proof of Theorem 3.1.

Remark 3.4. Theorem 3.1 does not establish whether the optimal control of Problem P is singular or nonsingular; it is a unified conclusion, similar to the equations in (4.5.2) of Theorem 4.2 in ^[15]. Therefore, Theorem 3.1 is a generalization of Theorem 4.2 in ^[15] to impulsive controlled systems. Based on Theorem 3.1, singular control and nonsingular control in the classical sense can be considered under a unified framework for Problem P.

4. Pointwise Jacobson type necessary conditions

In this section, on the basis of Theorem 3.1 in the previous section, we first obtain the Legendre-Clebsch condition; then, we give a corollary for the integral form of the second-order necessary optimality conditions for optimal singular control; finally, we give the pointwise Jacobson type necessary conditions and the pointwise Legendre-Clebsch condition.

Corollary 4.1. Let (A1)–(A3) hold and $\bar{u}$ denote the optimal control of $J$ over $\mathcal{U}_{ad}$ ; it is necessary that there exist a pair of functions $(\overline{y}, \overline{\varphi})\in PC_{l}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n}\right)$ such that (3.1), (3.2), and

$\begin{eqnarray} H_{uu}[t]\geq0,\;\mathit{\text{for all the continuous time of}}\; \bar{u}(t), t\in [0,T], \end{eqnarray}$

(4.1)

hold.

Proof. To prove Corollary 4.1, let $\bar{u}(\cdot)\in \mathcal{U}_{ad}$ ; take the special control variational problem as follows:

$\begin{eqnarray} u(t)-\bar{u}(t) = \left\{\begin{array}{ll} 0,\; & t\in[t_{0},\bar{t}),\\ h,\; & t\in[\bar{t}, \bar{t}+\varepsilon ),\\ 0,\; &t\in[\bar{t}+\varepsilon , T],\\ \end{array}\right.\qquad \end{eqnarray}$

(4.2)

where $h\in \mathbb{R}^{r}$ is a constant vector, $\varepsilon$ is a sufficiently small positive number, $\bar{t}$ is any continuous time of $\bar{u}(t)$ . For $u(t)-\bar{u}(t)$ , $\overline{\Phi}(t)$ satisfies (3.4); then, the solution $Y(t)$ of the variational problem (3.7) is given by

$\begin{eqnarray*} Y(t) = \left\{\begin{array}{ll} 0,\; & t\in[t_{0},\bar{t}),\\ \int_{\bar{t}}^{t}\overline{\Phi}(t)\overline{\Phi}(s)^{-1}f_{u}[s]^{\top}hds,\; & t\in[\bar{t}, \bar{t}+\varepsilon ),\\ \int_{\bar{t}}^{\bar{t}+\varepsilon}\overline{\Phi}(t)\overline{\Phi}(s)^{-1}f_{u}[s]^{\top}hds,\; & t\in[\bar{t}+\varepsilon , T].\\ \end{array}\right.\qquad \end{eqnarray*}$

Then

$\begin{eqnarray*} \|Y(t)\| = \left\{\begin{array}{ll} 0,\; & t\in[t_{0},\bar{t}),\\ O(\varepsilon),\; & t\in[\bar{t}, T],\\ \end{array}\right.\qquad \end{eqnarray*}$

where $\lim\limits_{\varepsilon\rightarrow 0}\frac{O(\varepsilon)}{\varepsilon} = C\; (\text{nonzero constant})$ . Utilizing the continuity of the mean value theorem for integrals, we have

$\begin{eqnarray} \int_{0}^{T}Y(t)^{\top}H_{yy}[t]Y(t)dt& = &\int_{\bar{t}}^{\bar{t}+\varepsilon}Y(t)^{\top}H_{yy}[t]Y(t)dt = O(\varepsilon^{2}) = o(\varepsilon),\\ \int_{0}^{T}Y(t)^{\top}H_{uy}[t][u(t)-\bar{u}(t)]dt & = &\int_{\bar{t}}^{\bar{t}+\varepsilon}Y(t)^{\top}H_{uy}[t]hdt\\ & = &\varepsilon Y(\bar{t})^{\top}H_{uy}[\bar{t}]h+ o(\varepsilon),\\ \int_{0}^{T} [u(t)-\bar{u}(t)]^{\top}H_{uu}[t][u(t)-\bar{u}(t)]dt& = &\int_{\bar{t}}^{\bar{t}+\varepsilon}h^{\top}H_{uu}[t]hdt\\ & = &\varepsilon h^{\top}H_{uu}[\bar{t}]h+ o(\varepsilon). \end{eqnarray}$

(4.3)

Substituting (4.3) into (3.32), we have

$\begin{eqnarray*} \varepsilon h^{\top}H_{uu}[t]h+ o(\varepsilon )\geq 0.\nonumber \end{eqnarray*}$

Observe that $h^{\top}H_{uu}[t]h$ is independent of $\varepsilon$ and $\varepsilon$ can be arbitrarily small; we have

$\begin{eqnarray*} \label{4.4} h^{\top}H_{uu}[t]h\geq 0. \end{eqnarray*}$

Since $h\in \mathbb{R}^{r}$ is an arbitrary vector and $t$ denotes arbitrary continuous time, $(4.1)$ holds.

Remark 4.2. Condition (4.1) represents the Legendre-Clebsch condition for the optimal control problem Problem P. At the same time, it also shows the rationality of the conventional hypothesis $H_{uu}[t]\geq 0, t\in [t_{0}, T]$ .

Remark 4.3. For the LQ problem, where $R[t] = H_{uu}[t]$ , when $R[t] > 0, \forall t\in [t_{0}, T]$ , the problem is called a nonsingular problem; when $R[t] = 0, \forall t\in [t_{0}, T]$ , the problem is called the totally singular case; when $R[t]\geq0, \forall t\in [t_{0}, T]$ , the problem is called the partially singular case (see Definitions (4.4)–(4.6) in ^[15]). Whether or not the optimal control problem is singular, and regardless of the kind of singularity, this classification standard is often adopted.

The following is a corollary of Theorem 3.1 in the case in which $H_{uu}[t]\equiv0$ , that is, Problem P is a totally singular problem according to the definitions in ^[15].

Corollary 4.4. Let (A1)–(A3) hold and $\bar{u}(\cdot)\in \overline{\mathcal{U}}_{ad}$ denotes the optimal singular control of $J$ over $\mathcal{U}_{ad}$ ; it is necessary that there exist functions $(\overline{y}, \overline{\varphi}, \overline{W}, \overline{\Phi})\in PC_{l}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n\times n}\right) \times PC_{l}\left([0, T], \mathbb{R}^{n\times n}\right)$ that satisfy (3.1)–(3.4) and

$\begin{eqnarray*} \label{4.5} &&\int_{0}^{T}dt\int_{0}^{t}\langle \left(\overline{W}(t)f_{u}[t]^{\top}+H_{uy}[t]\right) [u(t)-\bar{u}(t)],\overline{\Phi}(t)\overline{\Phi}(s)^{-1}f_{u}[s]^{\top}[u(s)-\bar{u}(s)]\rangle ds\nonumber\\ &&\geq 0,\quad \forall u\in \overline{\mathcal{U}}_{ad}. \end{eqnarray*}$

Remark 4.5. Corollary 4.4 is a similar conclusion of Theorem 4.3 in ^[25] for Problem P.

The following is the pointwise Jacobson-type second-order necessary optimality condition for Problem P. The conclusion is similar to that of Theorem 4.3 in ^[25]. Note that the set $\overline{U}(t)$ (see (3.13)) of values for $v$ differs from the set $\mathbb{R}^{m}$ of values that is described in ^[32]; thus, it essentially confirms the pointwise characteristic. The author of ^[25] has proven similar conclusions under the weaker condition, suggesting that the control region $U$ is a Polish space. In fact, under our basic assumption (A1), we can use the method on page 93 in ^[15] to prove it.

Theorem 4.6. Let (A1)–(A3) hold and $\bar{u}(\cdot)\in \overline{\mathcal{U}}_{ad}$ denote the optimal singular control of $J$ over $\mathcal{U}_{ad}$ ; it is necessary that there exist functions $(\overline{y}, \overline{\varphi}, \overline{W}, \overline{\Phi})\in PC_{l}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n\times n}\right) \times PC_{l}\left([0, T], \mathbb{R}^{n\times n}\right)$ that satisfy (3.1)–(3.4), and, for all continuous time of $\bar{u}(t), t\in [0, T]$ , we have

$\begin{eqnarray} \langle \left(\overline{W}(t)f_{u}[t]^{\top}+H_{uy}[t]\right)[v-\bar{u}(t)], f_{u}[t]^{\top}[v-\bar{u}(t)]\rangle \geq 0,\quad \forall v\in \overline{U}(t). \end{eqnarray}$

(4.4)

Proof. {Note that Definition 2.3 implies that $H_{uu}[t]\equiv0 \; \text{for all} \; t\in [0, T]$ , apply the same control perturbation as for (4.2);} we have

$\begin{eqnarray} &&\int_{0}^{T}\langle \left(\overline{W}(t)f_{u}[t]^{\top}+H_{yu}[t]\right) h,Y(t)\rangle dt \\ & = &\int_{\bar{t}}^{\bar{t}+\varepsilon}\langle \left(\overline{W}(t)f_{u}[t]^{\top}+H_{yu}[t]\right) h,Y(t)\rangle dt, \end{eqnarray}$

(4.5)

and the dominant term in the expansion of (4.5) for sufficiently small $\varepsilon$ is given by

$(\varepsilon )^{2}\langle \left(\overline{W}(t)f_{u}[t]^{\top}+H_{yu}[t]\right)h,f_{u}[t]^{\top}h\rangle \big|_{\bar{t}}.$

Since $\bar{t}$ can be chosen as any continuous time of $\bar{u}(t), t\in [0, T]$ , let $h = v-\bar{u}(t), \forall v\in \overline{U}(t)$ ; thus, (4.4) holds and we have finished the proof of Theorem 4.6.

Using the same idea as in Theorem 4.6, we can also obtain the pointwise Legendre-Clebsch necessary optimality condition corresponding to Corollary 4.1.

Corollary 4.7. Let (A1)–(A3) hold and $\bar{u}$ denote the optimal control of $J$ over $\mathcal{U}_{ad}$ ; it is necessary that there exist a pair of functions $(\overline{y}, \overline{\varphi})\in PC_{l}\left([0, T], \mathbb{R}^{n}\right) \times PC_{r}\left([0, T], \mathbb{R}^{n}\right)$ such that (3.1), (3.2), and, for all continuous time of $\bar{u}(t), t\in [0, T]$

$\begin{eqnarray} (v-\bar{u}(t))^{\top}H_{uu}[t](v-\bar{u}(t))\geq0, \forall v\in \overline{U}(t), \end{eqnarray}$

(4.6)

hold.

Remark 4.8. Comparing Corollaries 4.7 and 4.1, it can be found that if the pointwise condition is satisfied, $H_{uu}\geq 0$ is not required, as only (4.6) needs to be satisfied.

5. Example

In this section, we will give an example to illustrate the effectiveness of Theorem 4.6.

Let

$\begin{eqnarray*} \label{1.1} \min J(u(\cdot)) = y_{2}(1), \end{eqnarray*}$

subject to

$\begin{eqnarray*} \left\{ \begin{array}{ll} \dot{y}_{1}(t) = u,&t\in[0,1]\setminus 0.5,\\ \dot{y}_{2}(t) = -y_{1}^{2},&t\in[0,1]\setminus 0.5,\\ y_{1}(0.5+) = y_{1}(0.5)+y_{1}(0.5),\\ y_{2}(0.5+) = y_{2}(0.5),\\ y_{1}(0) = 0,\\ y_{2}(0) = 0. \end{array}\right. \end{eqnarray*}$

Obviously, the Hamiltonian $H(t, y, u, \varphi) = \varphi_{1}u-\varphi_{2}y_{1}^{2}$ satisfies the conditions for linear control, as denoted by $u$ and $u\in U = \mathbb{R}$ . According to Remark (2.4), the problem is singular, i.e., $\overline{U}(t) = \mathbb{R}$ . It is not difficult to assert that $\bar{u}\equiv 0$ denotes singular control; this is because, by $\bar{u}\equiv 0$ , we can get that $\bar{y}_{1}\equiv 0, \bar{y}_{2}\equiv 0$ , and $\overline{\varphi}_{1} = 0, t\in [0, 1]$ ; consequently, $H(t, y, u, \varphi) = \overline{\varphi}_{1}\bar{u}-\overline{\varphi}_{2}\bar{y}_{1}^{2}\equiv 0$ , $H_{u}\equiv 0$ , and $H_{uu}\equiv 0$ ; by Definition 2.3, $\bar{u}\equiv 0, t\in [0, 1]$ denotes singular control. The question is whether it is optimal singular control. Now, let us use Theorem 4.6 to determine that it must not be optimal singular control.

By (3.2), we have

$\begin{eqnarray*} \left\{ \begin{array}{ll} \dot{\overline{\varphi}}_{1}(t) = 2\overline{\varphi}_{2}\bar{y}_{1},&t\in[0,1]\setminus 0.5,\\ \dot{\overline{\varphi}}_{2}(t) = 0,&t\in[0,1]\setminus 0.5,\\ \overline{\varphi}_{1}(0.5-) = \overline{\varphi}_{1}(0.5),\\ \overline{\varphi}_{2}(0.5-) = \overline{\varphi}_{1}(0.5)+\overline{\varphi}_{2}(0.5),\\ \overline{\varphi}_{1}(1) = 0,\\ \overline{\varphi}_{2}(1) = 1.\\ \end{array}\right. \end{eqnarray*}$

Using (3.3), we have

$\begin{eqnarray*} \dot{\overline{W}}(t)& = &\left[{\begin{array}{cc} \dot{\overline{w}}_{11}(t) & \dot{\overline{w}}_{12}(t)\\ \dot{\overline{w}}_{21}(t) & \dot{\overline{w}}_{22}(t)\\ \end{array}}\right] = -\left[{\begin{array}{cc} 0 & -2\bar{y}_{1}\\ 0 & 0\\ \end{array}}\right] \left[{\begin{array}{cc} \overline{w}_{11}(t) & \overline{w}_{12}(t)\\ \overline{w}_{21}(t) & \overline{w}_{22}(t)\\ \end{array}}\right]\\ && -\left[{\begin{array}{cc} \overline{w}_{11}(t) & \overline{w}_{12}(t)\\ \overline{w}_{21}(t) & \overline{w}_{22}(t)\\ \end{array}}\right] \left[{\begin{array}{cc} 0 & 0\\ -2\bar{y}_{1} & 0\\ \end{array}}\right] +\left[{\begin{array}{cc} \overline{\varphi}_{2}(t) & 0\\ 0 & 0\\ \end{array}}\right], \end{eqnarray*}$

and

$\begin{eqnarray*} \left[{\begin{array}{cc} \overline{w}_{11}(1) & \overline{w}_{12}(1)\\ \overline{w}_{21}(1) & \overline{w}_{22}(1)\\ \end{array}}\right] = \left[{\begin{array}{cc} 0 & 0\\ 0 & 0\\ \end{array}}\right], \end{eqnarray*}$

and

$\begin{eqnarray*} \overline{W}(0.5-)& = &\left[{\begin{array}{cc} \overline{w}_{11}(0.5-) & \overline{w}_{12}(0.5-)\\ \overline{w}_{21}(0.5-) & \overline{w}_{22}(0.5-)\\ \end{array}}\right]\\ & = &\left[{\begin{array}{cc} 0 & 0\\ 1 & 0\\ \end{array}}\right] \left[{\begin{array}{cc} \overline{w}_{11}(0.5) & \overline{w}_{12}(0.5)\\ \overline{w}_{21}(0.5) & \overline{w}_{22}(0.5)\\ \end{array}}\right]\\ &+& \left[{\begin{array}{cc} \overline{w}_{11}(0.5) & \overline{w}_{12}(0.5)\\ \overline{w}_{21}(0.5) & \overline{w}_{22}(0.5)\\ \end{array}}\right] \left[{\begin{array}{cc} 0 & 1\\ 0 & 0\\ \end{array}}\right]\\ &+&\left[{\begin{array}{cc} 0 & 0\\ 1 & 0\\ \end{array}}\right] \left[{\begin{array}{cc} \overline{w}_{11}(0.5) & \overline{w}_{12}(0.5)\\ \overline{w}_{21}(0.5) & \overline{w}_{22}(0.5)\\ \end{array}}\right] \left[{\begin{array}{cc} 0 & 1\\ 0 & 0\\ \end{array}}\right]. \end{eqnarray*}$

Substituting $\bar{u}\equiv 0, \bar{y}_{1}\equiv 0$ , and $\bar{y}_{2}\equiv 0$ directly into the above equations, the following results can be obtained directly

$\begin{eqnarray*} \left\{ \begin{array}{ll} \overline{\varphi}_{1}(t)\equiv 0,\\ \overline{\varphi}_{2}(t)\equiv 1, \end{array}\right. \quad t\in[0,1], \end{eqnarray*}$

and

$\begin{eqnarray*} \left\{ \begin{array}{ll} \overline{w}_{11} = t-1, t\in[0,1],\\ \overline{w}_{12} = \overline{w}_{21} = \overline{w}_{22} = \left\{\begin{array}{ll} 0,&t\in[0.5,1],\\ -0.5,&t\in[0,0.5).\\ \end{array}\right.\\ \end{array}\right. \end{eqnarray*}$

By (4.4), the necessary condition for the singular control $\bar{u}\equiv 0$ to be the optimal control scheme is given by

$\begin{eqnarray} (t-1)v^{2}\geq 0, \forall v\; \in \overline{U}(t). \end{eqnarray}$

(5.1)

But, because of the above equations, (5.1) cannot be true for arbitrary but fixed $t\in (0, 1)$ . Therefore, regarding singular control $\bar{u} = 0$ , according to Theorem 4.6, it must not be optimal singular control.

6. Conclusions

In this paper, we have investigated the pointwise Jacobson type necessary conditions for Problem P. By introducing an impulsive linear matrix Riccati differential equation, we have derived the integral representation of the functional second-order variational equation. On this basis, we obtained the integral form of the second-order necessary conditions and the pointwise Jacobson type necessary conditions for optimal singular control in the classical sense. Incidentally, the Legendre-Clebsch condition and the pointwise Legendre-Clebsch condition were also obtained. These conclusions have been derived under weaker conditions, thereby enriching existing conclusions. In the future, we will continue to research the pointwise Jacobson-type second-order necessary optimality conditions in the Pontryagin sense.

Use of AI tools declaration

The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

Acknowledgments

The authors are grateful to the anonymous referees for their helpful comments and valuable suggestions which have improved the quality of the manuscript. This work was supported by the National Natural Science Foundation of China (No. 12061021 and No. 11161009).

Conflict of interest

The authors declare that there is no conflict of interest.

References

[1]	A. A. Abdelhamid, E. Abdelhalim, M. A. Mohamed, F. Khalifa, Multi-classification of chest X-rays for COVID-19 diagnosis using deep learning algorithms, Appl. Sci., 12 (2022), 2080. https://doi.org/10.3390/app12042080 doi: 10.3390/app12042080
[2]	W. S. McCulloch, W. Pitts, A logical calculus of the ideas immanent in nervous activity, Bull. Math. Biophys., 5 (1943), 115–133.
[3]	Z. Li, F. Liu, W. Yang, S. Peng, J. Zhou, A survey of convolutional neural networks: Analysis, applications and prospects, IEEE Trans. Neural Netw. Learn Syst., 12 (2022), 6999–7019. https://doi.org/10.1109/TNNLS.2021.3084827 doi: 10.1109/TNNLS.2021.3084827
[4]	J. P. Cohen, L. Dao, K. Roth, P. Morrison, Y. Bengio, A. F. Abbasi, et al., Predicting COVID-19 pneumonia severity on chest X-ray with deep learning, Cureus, 12 (2020), e9448. https://doi.org/10.7759/cureus.9448 doi: 10.7759/cureus.9448
[5]	V. Ravi, H. Narasimhan, T. D. Pham, A cost‐sensitive deep learning‐based meta‐classifier for pediatric pneumonia classification using chest X‐rays, Expert Syst., (2020), e12966. https://doi.org/10.1111/exsy.12966 doi: 10.1111/exsy.12966
[6]	I. Borlea, R. Precup, A. Borlea, D. Iercan, A unified form of fuzzy C-means and K-means algorithms and its partitional implementation, Knowledge-Based Syst., 214 (2021), 106731. http://dx.doi.org/10.1016/j.knosys.2020.106731 doi: 10.1016/j.knosys.2020.106731
[7]	D. Varshni, K. Thakral, L. Agarwal, R. Nijhawan, A.Mittal, Pneumonia detection using CNN based feature extraction, in IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT), (2019), 1–7.
[8]	M. Taresh, N. Zhu, T. A. A. Ali, Transfer learning to detect COVID-19 automatically from X-ray images, using convolutional neural networks, Int. J. Biomed. Imaging, (2021), 8828404. https://doi.org/10.1155/2021/8828404 doi: 10.1155/2021/8828404
[9]	S. R. Velu, V. Ravi, K. Tabianan, Data mining in predicting liver patients using classification model, Health Technol. (Berl), 12 (2022), 1211–1235. https://doi.org/10.1007/s12553-022-00713-3 doi: 10.1007/s12553-022-00713-3
[10]	M. H. Alsharif, Y. H. Alsharif, K. Yahya, O. A. Alomari, M. A. Albreem, A. Jahid, Deep learning applications to combat the dissemination of COVID-19 disease: A review, Eur. Rev. Med. Pharmacol. Sci., 24 (2020), 11455–11460. https://doi.org/10.26355/eurrev_202011_23640 doi: 10.26355/eurrev_202011_23640
[11]	S. Sharma, Drawing insights from COVID-19-infected patients using CT scan images and machine learning techniques: A study on 200 patients, Environ. Sci. Pollut. Res., 27 (2020), 37155–37163. https://doi.org/10.1007/s11356-020-10133-3 doi: 10.1007/s11356-020-10133-3
[12]	A. Narin, C. Kaya, Z. Pamuk, Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks, Pattern Anal, Appl., 24 (2021), 1207–1220. https://doi.org/10.1007/s10044-021-00984-y doi: 10.1007/s10044-021-00984-y
[13]	H. Panwar, P. K. Gupta, M. K. Siddiqui, R. Morales-Menendez, V. Singh, Application of deep learning for fast detection of COVID-19 in X-Rays using nCOVnet, Chaos Solitons Fract., 138 (2020), 109944. https://doi.org/10.1016/j.chaos.2020.109944 doi: 10.1016/j.chaos.2020.109944
[14]	M. Singh, S. Bansal, S. Ahuja, R. K. Dubey, Panigrahi, B. K. Dey, Transfer learning–based ensemble support vector machine model for automated COVID-19 detection using lung computerized tomography scan data, Med. Biol. Eng. Comput., 59 (2021), 825–839. https://doi.org/10.1007/s11517-020-02299-2 doi: 10.1007/s11517-020-02299-2
[15]	A. M. Alqudah, S. Qazan, A. Alqudah, Automated systems for detection of COVID-19 using chest X-ray images and lightweight convolutional neural networks, Emerg. Radiol., 4 (2020). https://doi.org/10.1007/s13246-020-00865-4 doi: 10.1007/s13246-020-00865-4
[16]	I. D. Apostolopoulos, T. A. Mpesiana, COVID-19: Automatic detection from X-ray images utilizing transfer learning with convolutional neural networks, Phys. Eng. Sci. Med., 43 (2020), 635–640. https://doi.org/10.1016/j.eng.2020.04.010 doi: 10.1016/j.eng.2020.04.010
[17]	X. Xu, X.Jiang, C. Ma, P. Du, X. Li, S. Lv, et al., deep learning system to screen novel A Coronavirus Disease 2019 pneumonia, Engineering, 6 (2020), 1122–1129. https://doi.org/10.1016/j.eng.2020.04.010 doi: 10.1016/j.eng.2020.04.010
[18]	E. Hussain, M. Hasan, M. A. Rahman, I. Lee, T. Tamanna, M. Z. Parvez, CoroDet: A deep learning based classification for COVID-19 detection using chest X-ray images, Chaos Solitons Fract., 142 (2021), 110495. https://doi.org/10.1016/j.chaos.2020.110495 doi: 10.1016/j.chaos.2020.110495
[19]	S. Wang, B. Kang, J. Ma, X. Zeng, M. Xiao, J. Guo, et al., A deep learning algorithm using CT images to screen for Corona Virus Disease (COVID-19), Eur Radiol., 31 (2021), 6096–6104. https://doi.org/10.1007/s00330-021-07715-1 doi: 10.1007/s00330-021-07715-1
[20]	L. L, L. Qin, Z.Xu, Y. Yin, X. Wang, B. Kong, et al., Artificial intelligence distinguishes COVID-19 from community acquired pneumonia on chest CT, Radiology, 296 (2020). https://doi.org/10.1148/radiol.2020200905 doi: 10.1148/radiol.2020200905
[21]	A. N. J Raj, H. Zhu, A. Khan, Z. Zhuang, Z. Yang, V. G. V. Mahesh, et al., ADID-UNET—a segmentation model for COVID-19 infection from lung CT scans, PeerJ Comput. Sci., 7 (2021), e349. https://doi.org/10.7717/PEERJ-CS.349 doi: 10.7717/PEERJ-CS.349
[22]	H. Khalid, M. Hussain, M. A. Al Ghamdi, T. Khalid, K. Khalid, M. A. Khan, et al., A comparative systematic literature review on knee bone reports from MRI, X-rays and CT scans using deep learning and machine learning methodologies, Diagnostics, 10 (2020), 518. https://doi.org/10.3390/diagnostics10080518 doi: 10.3390/diagnostics10080518
[23]	G. Puneet, Pneumonia detection using convolutional neural networks, Int. J. Sci. Technol. Res., 7 (2021), 77–80. https://doi.org/10.46501/ijmtst070117 doi: 10.46501/ijmtst070117
[24]	X. Ding, Y. Guo, G. Ding, J. Han, Acnet: Strengthening the kernel skeletons for powerful CNN via asymmetric convolution blocks, in IEEE/CVF international conference on computer vision (ICCV), (2019), pp. 1911–1920. http://dx.doi.org/10.1109/ICCV.2019.00200
[25]	S. Kostadinov, What is deep transfer learning and why is it becoming so popular? Towards Data Science, (2019).
[26]	M. Lascu, Deep learning in classification of Covid-19 coronavirus, pneumonia and healthy lungs on CXR and CT images, J. Med. Biol. Eng., 41 (2021), 514–522. http://dx.doi.org/10.1007/s40846-021-00630-2 doi: 10.1007/s40846-021-00630-2
[27]	X. Ma, B. Zheng, Y. Zhu, F. Yu, R. Zhang, B. Chen, Covid-19 lesion discrimination and localization network based on multi-receptive field attention module on CT images, Optik, 241 (2021), 167100. http://dx.doi.org/10.1016/j.ijleo.2021.167100 doi: 10.1016/j.ijleo.2021.167100
[28]	R. Kundu, R. Das, Z. W. Geem, G. T. Han, R. Sarkar, Pneumonia detection in chest X-ray images using an ensemble of deep learning models, PLoS One, 16 (2021), e0256630. https://doi.org/10.1371/journal.pone.0256630 doi: 10.1371/journal.pone.0256630

Reader Comments

Your name:*

Email:*
© 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)