Features gradient-based signals selection algorithm of linear complexity for convolutional neural networks

Yuto Omae; Yusuke Sakai; Hirotaka Takahashi; Yuto Omae; Yusuke Sakai; Hirotaka Takahashi

doi:10.3934/math.2024041

AIMS Mathematics

2024, Volume 9, Issue 1: 792-817. doi: 10.3934/math.2024041

Previous Article Next Article

Research article Special Issues

Features gradient-based signals selection algorithm of linear complexity for convolutional neural networks

1.
College of Industrial Technology, Nihon University, 1-2-1, Izumi, Narashino, Chiba 275-8575, Japan
2.
Research Center for Space Science, Advanced Research Laboratories and Department of Design and Data Science, Tokyo City University, Kanagawa 224-8551, Japan

Received: 19 August 2023 Revised: 06 November 2023 Accepted: 15 November 2023 Published: 04 December 2023
MSC : 68T01, 68T07, 68T20

Recently, convolutional neural networks (CNNs) for classification by time domain data of multi-signals have been developed. Although some signals are important for correct classification, others are not. The calculation, memory, and data collection costs increase when data that include unimportant signals for classification are taken as the CNN input layer. Therefore, identifying and eliminating non-important signals from the input layer are important. In this study, we proposed a features gradient-based signals selection algorithm (FG-SSA), which can be used for finding and removing non-important signals for classification by utilizing features gradient obtained by the process of gradient-weighted class activation mapping (grad-CAM). When we defined $n_ \mathrm{s}$ as the number of signals, the computational complexity of FG-SSA is the linear time $\mathcal{O}(n_ \mathrm{s})$ (i.e., it has a low calculation cost). We verified the effectiveness of the algorithm using the OPPORTUNITY dataset, which is an open dataset comprising of acceleration signals of human activities. In addition, we checked the average of 6.55 signals from a total of 15 signals (five triaxial sensors) that were removed by FG-SSA while maintaining high generalization scores of classification. Therefore, FG-SSA can find and remove signals that are not important for CNN-based classification. In the process of FG-SSA, the degree of influence of each signal on each class estimation is quantified. Therefore, it is possible to visually determine which signal is effective and which is not for class estimation. FG-SSA is a white-box signal selection algorithm because it can understand why the signal was selected. The existing method, Bayesian optimization, was also able to find superior signal sets, but the computational cost was approximately three times greater than that of FG-SSA. We consider FG-SSA to be a low-computational-cost algorithm.

Keywords:

Citation: Yuto Omae, Yusuke Sakai, Hirotaka Takahashi. Features gradient-based signals selection algorithm of linear complexity for convolutional neural networks[J]. AIMS Mathematics, 2024, 9(1): 792-817. doi: 10.3934/math.2024041

Related Papers:

[1]	Azmy S. Ackleh, Rainey Lyons, Nicolas Saintier . Finite difference schemes for a structured population model in the space of measures. Mathematical Biosciences and Engineering, 2020, 17(1): 747-775. doi: 10.3934/mbe.2020039
[2]	Azmy S. Ackleh, Vinodh K. Chellamuthu, Kazufumi Ito . Finite difference approximations for measure-valued solutions of a hierarchicallysize-structured population model. Mathematical Biosciences and Engineering, 2015, 12(2): 233-258. doi: 10.3934/mbe.2015.12.233
[3]	Azmy S. Ackleh, Mark L. Delcambre, Karyn L. Sutton, Don G. Ennis . A structured model for the spread of Mycobacterium marinum: Foundations for a numerical approximation scheme. Mathematical Biosciences and Engineering, 2014, 11(4): 679-721. doi: 10.3934/mbe.2014.11.679
[4]	Horst R. Thieme . Discrete-time population dynamics on the state space of measures. Mathematical Biosciences and Engineering, 2020, 17(2): 1168-1217. doi: 10.3934/mbe.2020061
[5]	Qihua Huang, Hao Wang . A toxin-mediated size-structured population model: Finite difference approximation and well-posedness. Mathematical Biosciences and Engineering, 2016, 13(4): 697-722. doi: 10.3934/mbe.2016015
[6]	Inom Mirzaev, David M. Bortz . A numerical framework for computing steady states of structured population models and their stability. Mathematical Biosciences and Engineering, 2017, 14(4): 933-952. doi: 10.3934/mbe.2017049
[7]	Suzanne M. O'Regan, John M. Drake . Finite mixture models of superspreading in epidemics. Mathematical Biosciences and Engineering, 2025, 22(5): 1081-1108. doi: 10.3934/mbe.2025039
[8]	Ugur G. Abdulla, Vladislav Bukshtynov, Saleheh Seif . Cancer detection through Electrical Impedance Tomography and optimal control theory: theoretical and computational analysis. Mathematical Biosciences and Engineering, 2021, 18(4): 4834-4859. doi: 10.3934/mbe.2021246
[9]	József Z. Farkas, Peter Hinow . Physiologically structured populations with diffusion and dynamic boundary conditions. Mathematical Biosciences and Engineering, 2011, 8(2): 503-513. doi: 10.3934/mbe.2011.8.503
[10]	Nalin Fonseka, Jerome Goddard Ⅱ, Alketa Henderson, Dustin Nichols, Ratnasingham Shivaji . Modeling effects of matrix heterogeneity on population persistence at the patch-level. Mathematical Biosciences and Engineering, 2022, 19(12): 13675-13709. doi: 10.3934/mbe.2022638

Abstract

1. Introduction

Coagulation-fragmentation (CF) equations have been used to model many physical and biological phenomena ^[1,2]. In particular, when combined with transport terms, these equations can be used to model the population dynamics of oceanic phytoplankton ^[3,4,5]. Setting such models in the space of Radon measures allows for the unified study of both discrete and continuous structures. Not only are the classical discrete and continuous CF equations special cases of the measure valued model (as shown in ^[6]), but this setting allows for a mixing of the two structures, which has become of interest in particular applications ^[7,8].

With the above applications in mind, numerical schemes to solve CF equations are of great importance to researchers. In particular, finite difference methods offer numerical schemes which are easy to implement and approximate the solution with a high order of accuracy. The latter benefit is especially important in the study of stability and optimal control of such equations.

The purpose of this article is to make improvements on two of the three first-order schemes presented in ^[9], namely the fully explicit and semi-implicit schemes. These schemes are shown to have certain advantages and disadvantages as discussed in the aforementioned study. In particular, the fully explicit scheme has the qualitative property of conservation of mass through coagulation. On the other hand, the semi-implicit scheme has a more relaxed Courant–Friedrichs–Lewy (CFL) condition, which does not depend on the initial condition. We have decided not to attempt to improve the third scheme presented in ^[9] as there does not seem to be a significant advantage of the named conservation law scheme to outweigh the drastic computational cost. The main improvement here is to lift-up these two first-order schemes to second-order ones on the space of Radon measures; however, as this state space contains singular elements (including point measures), the improvement of these schemes must be handled with care. As shown in ^[10], discontinuities and singularities in the solution can cause drastic changes in not only the order of convergence of the scheme, but also in the behavior of the scheme. To address these issues, we turn to a high resolution scheme studied with classical structured population models (i.e. without coagulation-fragmentation) in ^[11,12,13]. This scheme makes use of a minmod flux limiter to control any oscillatory behavior of the scheme caused by irregularities. With this new flux, we show that it is possible for second order convergence rates to be obtained for continuous density solutions. However, as the solutions become more irregular, one should expect the convergence rate to decline. Such a phenomenon is demonstrated in ^[10,13], and we direct the reader to these manuscripts for more discussion.

The layout of the paper is as follows. In Section 2, we present the notation and preliminary results about the model and state space used throughout the paper. In Section 3, we describe the model and state all assumptions imposed on the model parameters. In Section 4, we present the numerical schemes, their CFL conditions, and state the main theorem of the paper. In Section 5, we test the convergence rate of the schemes against well-known examples. In Section 6, we provide a conclusion and in the Appendix (Section 7) we provide proofs for some of our results.

2. Notation

We make use of standard notations for function spaces. The most common examples of these are $C^1(\mathbb{R}^+)$ for the space of real valued continuously differentable functions and $W^{1, \infty}(\mathbb{R}^+)$ for the usual Sobelov space. The space of Radon measures will be denoted with $\mathcal{M}(\mathbb{R}^+)$ , with $\mathcal{M}^+(\mathbb{R}^+)$ representing its positive cone. This space will be equipped with the Bounded-Lipschitz (BL) norm given by

$\|\mu \|_{BL} : = \sup\limits_{\|\phi\|_{W^{1, \infty}}\leq 1} \left \lbrace \int_{\mathbb{R}^+} \phi(x) \mu(dx) : \phi \in W^{1, \infty}( \mathbb{R}^+) \right \rbrace .$

Another norm of interest to this space is the well studied Total Variation (TV) norm given by

$\Vert \nu \Vert_{TV} = |\nu|( \mathbb{R}^+) = \sup\limits_{\| f \|_{\infty}\leq 1} \left\lbrace\int_{ \mathbb{R}^+}f(x) \nu(dx) : f\in C_c( \mathbb{R}^+) \right\rbrace.$

For more information about these particular norms and their relationship we direct the reader to ^[14,15]. For lucidity, we use operator notation in place of integration when we believe it necessary, namely

$(\mu, f) : = \int_{A} f(x) \mu(dx),$

where the set $A$ is the support of the measure $\mu$ . Finally, we denote the minmod function by $\text{mm}(a, b)$ and use the following definition

$\text{mm}(a, b) : = \frac{\text{sign}(a) + \text{sign}(b)}{2}\max(|a|, |b|).$

3. Model and assumptions

The model of interest is the size-structured coagulation fragmentation model given by

$\begin{equation} \left \lbrace \begin{split} &\partial_t \mu + \partial_x(g(t, \mu)\mu) + d(t, \mu)\mu = K[\mu] + F[\mu], \qquad (t, x) \in (0, T)\times (0, \infty), \\ &g(t, \mu)(0) D_{dx}\mu (0) = \int_{ \mathbb{R}^+}\beta(t, \mu)(y) \mu(dy), \quad \quad \quad \quad t\in [0, T], \\ &\mu(0) = \mu_0 \in \mathcal{M}^+( \mathbb{R}^+), \end{split} \right. , \end{equation}$

(3.1)

where $\mu(t) \in \mathcal{M}^+(\mathbb{R}^+)$ represents individuals' size distribution at time $t$ and the functions $g, d, \beta$ are their growth, death, and reproduction rate, respectively. The coagulation and fragmentation processes of a population distributed according to $\mu \in \mathcal{M}^+(\mathbb{R}^+)$ are modeled by the measures $K[\mu]$ and $F[\mu]$ given

$(K[\mu], \phi) = \frac12 \int_{\mathbb{R}^+} \int_{\mathbb{R}^+} \kappa(y, x)\phi(x+y)\, \mu(dx) \, \mu(dy) - \int_{\mathbb{R}^+} \int_{\mathbb{R}^+} \kappa(y, x)\phi(x)\, \mu(dy) \, \mu(dx)$

and

$(F[\mu], \phi) = \int_{ \mathbb{R}^+} (b(y, \cdot), \phi) a(y)\, \mu(dy) - \int_{\mathbb{R}^+} a(y)\phi(y)\mu(dy)$

for any test function $\phi$ . Here, $\kappa(x, y)$ is the rate at which individuals of size $x$ coalesce with individuals of size $y$ , $a(y)$ is the global fragmentation rate of individuals of size $y$ , and $b(y, \cdot)$ is a measure supported on $[0, y]$ such that $b(y, A)$ represents the probability a particle of size $y$ fragments to a particle with size in the Borel set $A$ .

Definition 3.1. Given $T\geq 0$ , we say a function $\mu \in C([0, T], \mathcal{M^+}(\mathbb{R}^+))$ is a weak solution to (3.1) if for all $\phi \in (C^1 \cap W^{1, \infty})([0, T]\times \mathbb{R}^+)$ and for all $t\in [0, T]$ , the following holds:

$\begin{equation} \begin{split} & \int_{0}^{\infty} \phi(t, x)\mu_t(dx) - \int_{0}^{\infty}\phi(0, x)\mu_0(dx) = \\ &\quad \int_0^t \int_{0}^{\infty} \left[ \partial_t \phi(s, x) +g(s, \mu_s)(x) \partial_x \phi(s, x) - d(s, \mu_s)(x)\phi(s, x)\right]\mu_s(dx)ds \\ &\quad \quad +\int_0^t (K[\mu_s]+F[\mu_s], \phi(s, \cdot))\, ds + \int_0^t \int_{0}^{\infty} \phi(s, 0)\beta(s, \mu_s)(x) \mu_s(dx) ds. \end{split} \end{equation}$

(3.2)

For the numerical scheme, we will restrict ourselves to a finite domain, $[0, x_{\max}]$ . Thus, we impose the following assumptions on the growth, death and birth functions:

$({\rm{A1}})$ For any $R > 0$ , there exists $L_R > 0$ such that for all $\| \mu_i\|_{TV}\leq R$ and $t_i\in [0, \infty)$ ( $i = 1, 2$ ) the following hold for $f = g, d, \beta,$

$\|f(t_1, \mu_1) - f(t_2 , \mu_2)\|_{\infty} \leq L_R(|t_1 - t_2| + \| \mu_1 - \mu_2 \|_{BL}),$

$({\rm{A2}})$ There exists $\zeta > 0$ such that for all $T > 0$ ,

$\sup\limits_{t\in [0, T]} \sup\limits_{\mu \in \mathcal{M^+}( \mathbb{R}^+)} \| g(t, \mu) \|_{W^{1, \infty}} +\| d(t, \mu) \|_{W^{1, \infty}}+\| \beta(t, \mu) \|_{W^{1, \infty}} < \zeta,$

$({\rm{A3}})$ For all $(t, \mu)\in [0, \infty)\times \mathcal{M^+}(\mathbb{R}^+)$ ,

$g(t, \mu)(0) > 0 \quad \text{and} \quad g(t, \mu)( x_{\max}) = 0$

for some large $x_{\max} > 0$ .

We assume that the coagulation kernel $\kappa$ satisfies the following assumption:

$({\rm{K1}})$ $\kappa$ is symmetric, nonnegative, bounded by a constant $C_\kappa$ , and globally Lipschitz with Lipschitz constant $L_\kappa$ .

$({\rm{K2}})$ $\kappa(x, y) = 0 \text{ whenever } x+y > x_{\max}.$

We assume that the fragmentation kernel satisfies the following assumptions:

$({\rm{F1}})$ $a\in W^{1, \infty}(\mathbb{R}^+)$ is non-negative,

$({\rm{F2}})$ for any $y\ge 0$ , $b(y, dx)$ is a measure such that

$({\rm{i}})$ $b(y, dx)$ is non-negative and supported in $[0, y]$ , and there exist a $C_b > 0$ such that $b(y, \mathbb{R}^+)\le C_b$ for all $y > 0$ ,

$({\rm{ii}})$ there exists $L_b$ such that for any $y, \bar y\ge 0$ ,

$\|b(y, \cdot)-b(\bar y, \cdot)\|_{BL}\le L_b|y-\bar y|$

$({\rm{iii}})$ for any $y\ge 0$ ,

$(b(y, dx), x) = \int_0^y x\, b(y, dx) = y.$

The existence and uniqueness of mass conserving solutions of model (3.1) under these assumptions were established in ^[6].

4. Numerical methods

We adopt the numerical discretization presented in ^[6]. For some fixed mesh sizes $\Delta x, \Delta t > 0$ , we discretize the size domain $[0, x_{\max}]$ with the cells

$\Lambda^{\Delta x}_j : = [(j-\frac{1}{2})\Delta x, (j+\frac{1}{2})\Delta x), \text{ for } j = 1, \dots, J,$

and

$\Lambda^{\Delta x}_0 : = [0, \frac{\Delta x}{2}).$

We denote the midpoints of these grids by $x_j$ . The initial condition $\mu_0 \in \mathcal{M}^+(\mathbb{R}^+)$ will be approximated by a combination of Dirac measures

$\mu_0^{\Delta x} = \sum\limits_{j = 0}^J m_j^0 \delta_{x_j}, \text{ where } m_j^0 : = \mu_0(\Lambda^{\Delta x}_j).$

We first approximate the model coefficients $\kappa$ , $a$ , $b$ as follows. For the physical ingredients, we define

$a^{\Delta x}_i = \frac{1}{\Delta x}\int_{\Lambda^{\Delta x}_{i}} a(y)dy, \qquad \kappa^{\Delta x}_{i, j} = \frac{1}{\Delta x^2}\int_{\Lambda^{\Delta x}_{i}\times \Lambda^{\Delta x}_{j}} \kappa(x, y)dxdy$

for $i, j\ge 1$ , and

$a^{\Delta x}_0 = \frac{2}{\Delta x}\int_{\Lambda^{\Delta x}_0} a(y)dy, \qquad \kappa^{\Delta x}_{0, 0} = \frac{4}{\Delta x^2}\int_{\Lambda^{\Delta x}_{0}\times \Lambda^{\Delta x}_{0}} \kappa(x, y)dxdy$

(with the natural modifications for $\kappa^{\Delta x}_{0, j}$ and $\kappa^{\Delta x}_{i, 0}$ , $i\ge 1$ ). We then let $a^{\Delta x}\in W^{1, \infty}(\mathbb{R}^+)$ and $\kappa^{\Delta x}\in W^{1, \infty}(\mathbb{R}^+\times \mathbb{R}^+)$ be the linear interpolation of the $a^{\Delta x}_i$ and $\kappa^{\Delta x}_{i, j}$ , respectively. Finally, we define the measure $b^{\Delta x}(x_j, \cdot)\in \mathcal{M}^+(\Delta x \mathbb{N})$ by

$b^{\Delta x}(x_j, \cdot) = \sum\limits_{i\le j} b(x_j, \Lambda^{\Delta x}_{i}) \delta_{x_j} = : \sum\limits_{i\le j}b^{\Delta x}_{j, i}\delta_{x_j}$

and then $b^{\Delta x}(x, \cdot)\in \mathcal{M}^+(\Delta x \mathbb{N}_0)$ for $x\ge 0$ as the linear interpolate between the $b^{\Delta x}(x_j, \cdot)$ . When the context is clear, we omit the $\Delta x$ from the notation above.

We make use of these approximations to combine the high-resolution scheme presented in ^[13] with the fully explicit and semi-implicit schemes presented in ^[9]. Together, these schemes give us the numerical scheme

$\begin{equation} \left \lbrace \begin{split} m_j^{k+1} & = m_j^k -\frac{\Delta t}{\Delta x} (f_{j+\frac{1}{2}}^k - f_{j-\frac{1}{2}}^k) -\Delta t d_j^k m_j^k +\Delta t \left(\mathcal{C}_{j, k} + \mathcal{F}_{j, k} \right), \qquad j = 1, .., J, \\ g_0^k m_0^{k} & = \Delta x \sum\limits_{j = 1}^{J}{}^{*} \beta_j^k m_j^k : = \Delta x \left(\frac{3}{2}\beta_1^k m_1^k + \frac{1}{2}\beta_J^k m_J^k + \sum\limits_{j = 2}^{J-1} \beta_j^k m_j^k \right) \end{split}\right. , \end{equation}$

(4.1)

where the flux term is given by

$\begin{equation} f^k_{j+\frac{1}{2}} = \left \lbrace \begin{split} &g_j^k m_j^k +\frac{1}{2}(g_{j+1}^k - g_{j}^k)m_j^k + \frac{1}{2} g_j^k \; {\rm mm}(\Delta_+ m_j^k , \Delta_- m_j^k) & j = 2, 3, \dots, J-2 \\ &g_j^k m_j^k & j = 0, 1, J-1, J \end{split}\right. , \end{equation}$

(4.2)

the fragmentation term, $\mathcal{F}_{j, k}$ , is given by

$\begin{equation} \mathcal{F}_{j, k} : = \sum\limits_{i = j}^J b_{i, j}a_i m_i^k -a_jm_j^k , \end{equation}$

(4.3)

and the coagulation term, $\mathcal{C}_j$ , is either given by an explicit discretization as

$\begin{equation} \mathcal{C}^{\text{exp}}_{j, k} : = \frac{1}{2}\sum\limits_{i = 1}^{j-1} \kappa_{i, j-i}m_i^{k} m_{j-i}^k - \sum\limits_{i = 1}^{J} \kappa_{i, j} m_i^k m_j^{k} , \end{equation}$

(4.4)

or by an implicit one as

$\begin{equation} \mathcal{C}^{\text{imp}}_{j, k} : = \frac{1}{2}\sum\limits_{i = 1}^{j-1} \kappa_{i, j-i}m_i^{k+1} m_{j-i}^k - \sum\limits_{i = 1}^{J} \kappa_{i, j} m_i^k m_j^{k+1} . \end{equation}$

(4.5)

As discussed in ^[9], the explicit scheme which uses (4.4) to approximate the coagulation term and the semi-implicit scheme which instead uses (4.5) to approximate the coagulation term behave differently with respect to the mass conservation and have different Courant–Friedrichs–Lewy (CFL) conditions. The assumed CFL condition for the schemes are

$\begin{equation} \begin{matrix} \text{Explicit:} & \Delta t\Big(C_\kappa \|\mu_0 \|_{TV} \exp ((\zeta+ C_bC_a)T) + C_a \max\{1, C_b\} + (1+\frac{3}{2\Delta x})\zeta\Big) \leq 1\\ \text{Semi-Implicit:} & \bar{\zeta} (2 + \frac{3}{2\Delta x})\Delta t \leq 1 , \end{matrix} \end{equation}$

(4.6)

where $\bar{\zeta} = \max\{ \zeta, \|a\|_{W^{1, \infty}}\}$ , $C_a = \|a\|_\infty$ . The CFL conditions above are similar to those used in ^[9], but are adjusted due to the flux limiter term as in ^[13]. It is clear that the semi-implicit scheme has a less restrictive and simpler CFL condition than the explicit scheme. In particular, the CFL condition of the semi-implicit scheme is independent on the initial condition, unlike its counterpart. The trade off for this is a loss of qualitative behavior of the scheme in the sense of mass conservation. Indeed as shown in ^[9], when $\beta = d = g = a = 0$ , the semi-implicit coagulation term does not conserve mass, whereas the explicit term does. As shown in the appendix, this loss is controlled by the time step size, $\Delta t$ .

It is useful to define the following coefficients:

$A_j^k = \begin{cases} g_j^k &j = 1, J, \\ \frac{1}{2}\left( g_{j+1}^k +g_j^k +g_j^k \frac{ \; {\rm mm}(\Delta_+ m_j^k , \Delta_- m_j^k)}{\Delta_- m_j^k} \right) & j = 2, \\ \frac{1}{2}\left(g_{j+1}^k +g_j^k +g_j^k \frac{ \; {\rm mm}(\Delta_+ m_j^k , \Delta_- m_j^k)}{\Delta_- m_j^k} -g_{j-1}^k \frac{ \; {\rm mm}(\Delta_- m_j^k , \Delta_- m_{j-1}^k)}{\Delta_- m_j^k} \right) &j = 3, \dots, J-2, \\ \frac{1}{2}\left(2g_{j}^k -g_{j-1}^k \frac{ \; {\rm mm}(\Delta_- m_j^k , \Delta_- m_{j-1}^k)}{\Delta_- m_j^k}\right) & j = J-1, \end{cases}$

and

$B_j^k = \begin{cases} \Delta_- g_j^k & j = 1, J, \\ \frac{1}{2} \Delta_+ g_j^k & j = 2, \\ \frac{1}{2}(\Delta_+ g_j^k + \Delta_-g_j^k) & j = 3, \dots, J-2, \\ \frac{1}{2} \Delta_- g_j^k & j = J-1. \end{cases}.$

Notice, $|A_j^k| \leq \frac{3\Delta t}{2\Delta x}\zeta$ and $A_j^k -B_j^k \geq 0$ as

$2(A_j^k - B_j^k) = \begin{cases} 2g_{j-1}^k & j = 1, J, \\ g_j^k \left( 2+\frac{ \; {\rm mm}(\Delta_+ m_j^k , \Delta_- m_j^k)}{\Delta_- m_j^k}\right) & j = 2, \\ g_j^k \left( 1+\frac{ \; {\rm mm}(\Delta_+ m_j^k , \Delta_- m_j^k)}{\Delta_- m_j^k}\right) + g_{j-1}^k \left( 1-\frac{ \; {\rm mm}(\Delta_- m_j^k , \Delta_- m_{j-1}^k)}{\Delta_- m_j^k} \right) & j = 3, \dots , J-2, \\ g_j^n + g_{j-1}^n \left(1-\frac{ \; {\rm mm}(\Delta_- m_j^n, \Delta_- m_{j-1}^n)}{\Delta_- m_j^n} \right) &j = J-1. \end{cases}.$

Scheme (4.1) can then be rewritten as

$\begin{equation} \left \lbrace \begin{split} m_j^{k+1} & = (1-\frac{\Delta t}{\Delta x}A_j^k - \Delta t (d_j^k + a_j))m_j^k + \frac{\Delta t}{\Delta x}(A_j^k - B_j^k)m_{j-1}^k \\ & \qquad + \Delta t \sum\limits_{i = j}^J b_{i, j}a_i m_i^k +\Delta t. \mathcal{C}_{j, k}\\ g_0^k m_0^{k} & = \Delta x \sum\limits_{j = 1}^{J}{}^* \beta_j^k m_j^k \; . \end{split}. \right. \end{equation}$

(4.7)

Depending on the choice of coagulation term, this formulation leads to either

$\begin{equation} \left \lbrace \begin{split} m_j^{k+1} & = (1-\frac{\Delta t}{\Delta x}A_j^k - \Delta t (d_j^k + a_j)- \Delta t \sum\limits_{i = 1}^J \kappa_{i, j}m_i^k )m_j^k + \frac{\Delta t}{\Delta x}(A_j^k - B_j^k)m_{j-1}^k \\ & \qquad + \Delta t \sum\limits_{i = j}^J b_{i, j}a_i m_i^k +\frac{\Delta t}{2} \sum\limits_{i = 1}^{j-1} \kappa_{i, j-i} m_i^k m_{j-i}^k\\ g_0^k m_0^{k} & = \Delta x \sum\limits_{j = 1}^{J}{}^* \beta_j^k m_j^k \; \end{split}, \right. \end{equation}$

(4.8)

for the explicit term, $\mathcal{C}^{\text{exp}}_{j, k}$ , or

$\begin{equation} \left \lbrace \begin{split} (1+ \Delta t \sum\limits_{i = 1}^{J} \kappa_{i, j}m_i^k)m_j^{k+1} & = (1-\frac{\Delta t}{\Delta x}A_j^k - \Delta t (d_j^k + a_j))m_j^k + \frac{\Delta t}{\Delta x}(A_j^k - B_j^k)m_{j-1}^k \\ & \qquad + \Delta t \sum\limits_{i = j}^J b_{i, j}a_i m_i^k +\frac{\Delta t}{2} \sum\limits_{i = 1}^{j-1} \kappa_{i, j-i} m_i^{k+1} m_{j-i}^k\\ g_0^k m_0^{k} & = \Delta x \sum\limits_{j = 1}^{J}{}^* \beta_j^k m_j^k \; , \end{split}. \right. \end{equation}$

(4.9)

for the implicit term, $\mathcal{C}^{\text{imp}}_{j, k}$ .

For these, schemes, we have the following Lemmas which are proven in the appendix:

Lemma 4.1. For each $k = 1, 2, \dots, \bar{k}$ ,

● $m_j^k \geq 0$ for all $j = 1, 2, \dots J$ ,

● $\| \mu^k_{\Delta x}\|_{TV} \leq \|\mu_0 \|_{TV} \exp((\zeta + C_b C_a)T).$

Lemma 4.2. For any $l, p = 1, 2, \dots, \bar{k}$ ,

$\|\mu_{\Delta x}^l - \mu_{\Delta x}^p \|_{BL} \leq \mathcal{L}_T |l-p|.$

Using the above two Lemmas, we can arrive at analogous results for the linear interpolation (4.10):

$\begin{equation} \mu_{\Delta x}^{\Delta t}(t): = \mu_{\Delta x}^0 \chi_{\{0\}}(t) + \sum\limits_{k = 0}^{\bar{k}-1} \left[ (1- \frac{t-k\Delta t}{\Delta t})\mu^k_{\Delta x} + \frac{t-k\Delta t}{\Delta t} \mu^{k+1}_{\Delta x} \right] \chi_{(k\Delta t, (k+1)\Delta t]}(t). \end{equation}$

(4.10)

Thus by the well know Ascoli-Arzela Theorem, we have the existence of a convergent subsequence of the net $\{\mu_{\Delta x}^{\Delta t}(t) \}$ in $C([0, T], \mathcal{M}^+([0, x_{\max}])$ . We now need only show any convergent subsequence converges to the unique solution (3.2).

Theorem 4.1. As $\Delta x, \Delta t\to 0$ the sequence $\mu_{\Delta x}^{\Delta t}$ converges in $C([0, T], \mathcal{M^+}([0, x_{\max}]))$ to the solution of (3.1).

Proof. By multiplying (4.1) by a superfluously smooth test function $\phi \in (W^{1, \infty} \cap C^2)([0, T]\times \mathbb{R})$ , denoting $\phi_j^k : = \phi (k\Delta t, x_j)$ , summing over all $j$ and $k$ , and rearranging we arrive at

$\begin{align} \sum\limits_{k = 0}^{\bar{k}-1} \sum\limits_{j = 1}^{J} \left((m_j^{k+1}-m_j^k)\phi_j^k + \frac{\Delta t}{\Delta x} (f_{j+\frac{1}{2}}^k - f_{j-\frac{1}{2}}^k)\phi_j^k \right) +\Delta t \sum\limits_{k = 0}^{\bar{k}-1} \sum\limits_{j = 1}^{\infty} d_j^k m_j^k \phi_j^k \\ = \Delta t \sum\limits_{k = 1}^{\bar{k}-1}\sum\limits_{j = 1}^J \phi_j^k \left( \frac{1}{2}\sum\limits_{i = 1}^{j-1} \kappa_{i, j-i}m_i^{k} m_{j-i}^k - \sum\limits_{i = 1}^{J} \kappa_{i, j} m_i^k m_j^{k} + \sum\limits_{i = j}^J b_{i, j}a_i m_i^k -a_jm_j^k\right). \end{align}$

(4.11)

The left-hand side of equation (4.11) was shown in ^[13] to be equivalent to

$\begin{equation*} \begin{split} & \int_{0}^{ x_{\max}} \phi(T, x)d\mu^{\bar{k}}_{\Delta x}(x) - \int_{0}^{ x_{\max}} \phi(0, x)d\mu^0_{\Delta x}(x) \\ & - \Delta t\sum\limits_{k = 0}^{\bar{k}-1} \left( \int_{0}^{ x_{\max}} \partial_t\phi(t_k, x)d\mu^k_{\Delta x}(x) + \int_{0}^{ x_{\max}} \partial_x\phi(t_k, x)g(t_k, \mu^k_{\Delta x})(x) d\mu^k_{\Delta x}(x) \right. \\ & \left. \quad -\int_{ \mathbb{R}^+} d(t_k, \mu_{\Delta x}^k)(x) \phi(t_k, x)d\mu_{\Delta x}^k(x) +\int_{0}^{ x_{\max}} \phi(t_k, \Delta x)\beta(t_k, \mu^k_{\Delta x})(x) d\mu_{\Delta x}^k(x)\right) + o(1), \end{split} \end{equation*}$

where $o(1) \longrightarrow 0$ as $\Delta t, \Delta x \longrightarrow 0$ .

The right-hand side of (4.11) was shown in ^[9] to be equal to

$\Delta t \sum\limits_{k = 1}^{\bar{k}-1} \Big\{ (K[\mu_{\Delta x}^{\Delta t}(t_k)], \phi(t_k, \cdot))+(F[\mu_{\Delta x}^{\Delta t}(t_k)], \phi(t_k, \cdot))\Big\} + O(\Delta x).$

Making use of results, it is then easy to see (4.11) is equivalent to

$\begin{align*} & \int_{0}^{ x_{\max}} \phi(T, x )d\mu^{\Delta t}_{\Delta x}(T)(x) -\int_{0}^{ x_{\max}} \phi(0, x)d\mu^0_{\Delta x}(x) \\ & = \int_0^T \left( \int_{0}^{ x_{\max}} \partial_t\phi(t, x) + \partial_x\phi(t, x) g(t, \mu_{\Delta x}^{\Delta t} (t))(x) d\mu^{\Delta t}_{\Delta x}(t)(x) \right. \\ & \left. \quad -\int_{0}^{ x_{\max}} d(t, \mu_{\Delta x}^{\Delta t} (t))(x) \phi(t, x)d\mu^{\Delta t}_{\Delta x}(t)(x) +\int_{0}^{ x_{\max}} \phi(t, \Delta x)\beta(t, \mu_{\Delta x}^{\Delta t} (t))(x) d\mu^{\Delta t}_{\Delta x}(t)(x) \right) dt\\ &\quad +\int_0^T (K[\mu_{\Delta x}^{\Delta t}(t)], \phi(t, \cdot)) + (F[\mu_{\Delta x}^{\Delta t}(t)], \phi(t, \cdot)) \, dt + o(1). \end{align*}$

Passing the limit as $\Delta t, \Delta x \longrightarrow 0$ along a converging subsequence, we then obtain that equation (3.2) holds for any $\phi \in (C^2\cap W^{1, \infty})([0, T]\times \mathbb{R}^+)$ with compact support. A standard density argument shows that equation (3.2) holds for any $\phi \in (C^1\cap W^{1, \infty})([0, T]\times \mathbb{R}^+)$ . As the weak solution is unique ^[6], we conclude the net $\{\mu_{\Delta x}^{\Delta t}\}$ converges to the solution of model (3.1). □

We point out that while these schemes are higher-order in space, they are only first order in time. To lift these schemes into a second-order in time as well, we make use of the second-order Runge-Kutta time discretization ^[16] for the explicit scheme and second-order Richardson extrapolation ^[17] for the semi-implicit scheme.

5. Numerical examples

In this section, we provide numerical simulations which test the order of the explicit and semi-implicit schemes developed in the previous sections. We test each component separately, beginning first with a pure coagulation equation in example 1 (where we set $g = \beta = d = a = 0$ ), then a pure fragmentation equation in example 2 (where we set $g = \beta = d = \kappa = 0$ ). In example 3, we consider all components of model (3.1) including the boundary term which is implemented as in scheme (4.7). For readers interested in the schemes performance in the absence of the coagulation-fragmentation processes, we direct you to ^[11,12,13]. For each example, we give the BL error and the order of convergence. To appreciate the gain in the order of convergence compared to those studied in ^[9], which are based on a first order approximation of the transport term, we add some of the numerical results from the scheme presented in ^[9].

In some of the following examples, the exact solution of the model problem is given. In these cases, we approximate the order of accuracy, $q$ , with the standard calculation:

$q = \log_2\left(\dfrac{\rho (\mu^{\Delta t}_{\Delta x}(T), \mu(T))}{\rho (\mu^{0.5\Delta t}_{0.5\Delta x}(T), \mu(T))} \right)$

where $\mu$ represents the exact solution of the examples considered. In the cases where the exact solutions are unknown, we approximate the order by

$q = \log_2\left(\dfrac{\rho (\mu^{\Delta t}_{\Delta x}(T), \mu^{2\Delta t}_{2\Delta x}(T))}{\rho (\mu^{0.5\Delta t}_{0.5\Delta x}(T), \mu^{\Delta t}_{\Delta x}(T))} \right)$

and we report the numerator of the log argument as the error. The metric $\rho$ we use here was introduced in ^[18] and is equivalent to the BL metric, namely

$C\rho(\mu, \nu) \leq \|\mu -\nu \|_{BL} \leq \rho(\mu, \nu)$

for some constant $C$ (dependent on the finite domain). As discussed in ^[18], this metric is more efficient to compute than the BL norm and maintains the same order of convergence. An alternative to this algorithm would be to make use of the algorithms presented in ^[19], where convergence in the Fortet-Mourier distance is considered.

Example 1 In this example, we test the quality of the finite difference schemes against coagulation equations. To this end, we take $\kappa (x, y) \equiv 1$ and $\mu_0 = e^{-x} dx$ with all other ingredients set to $0$ . This example has an exact solution given by

$\mu_t = \left(\frac{2}{2+t}\right)^2 \exp\left(-\frac{2}{2+t}x \right)dx$

see ^[20] for more details. The simulation is performed over the truncated domain $(x, t) \in [0, 20]\times [0, 0.5]$ . We present the BL error and the numerical order of convergence for both schemes in Table 1.

Table 1. Error, order, and computation time for example 1. Here, Nx and Nt represent the number of points in

$x$ and

$t$ , respectively. The numerical result in the last row for the 1st order variant is generated from the scheme presented in ^[9].

		Explicit			Semi-Implicit
Nx	Nt	BL Error	Order	Time (secs)	BL Error	Order	Time (secs)
100	250	0.0020733		1.0374	0.0020886		0.79633
200	500	0.00054068	1.9391	6.8224	0.00054408	1.9407	5.1724
400	1000	0.00013802	1.9699	98.525	0.00013883	1.9705	73.298
800	2000	3.4842e-05	1.9860	2430.2	3.5040e-05	1.9862	1792.5
1600	4000	8.7417e-06	1.9948	43381	8.7906e-06	1.9950	32361
		Explicit (1st order)			Semi-Implicit (1st order)
800	2000	0.015675	0.96974	523.11	0.010996	0.97418	1393.3

| Show Table

DownLoad: CSV

Example 2 In this example, we test the quality of the finite difference scheme against fragmentation equations. We point out that in this case, the two schemes are identical in the spacial component. For this demonstration, we take $\mu_0 = e^{-x}dx$ , $b(y, \cdot) = \frac{2}{y}dx \text{ and } a(x) = x$ . As given in ^[21], this problem has an exact solution of

$\mu_t = (1+t)^2\exp(-x(1+t))dx .$

The simulation is performed over the finite domain $(x, t) \in [0, 20]\times [0, 0.5]$ . We present the BL error and the numerical order of convergence for both schemes in Table 2. Note as compared to coagulation, the fragmentation process is more affected by the truncation of the domain. This results in the numerical order of the scheme being further from 2 than example 1.

Table 2. Error, order, and computation time for example 2. Here, Nx and Nt represent the number of points in

$x$ and

$t$ , respectively. The numerical result in the last row for the 1st order variant is generated from the scheme presented in ^[9].

		Explicit			Semi-Implicit
Nx	Nt	BL Error	Order	Time (secs)	BL Error	Order	Time (secs)
100	250	0.0053857		1.0148	0.0053836		0.78499
200	500	0.0014548	1.8883	6.7398	0.0014536	1.8890	5.1448
400	1000	0.00037786	1.9449	99.38	0.00037753	1.9449	73.587
800	2000	9.6317e-05	1.9720	2369.4	9.6322e-05	1.9707	1763.3
1600	4000	2.4468e-05	1.9769	43512	2.4514e-05	1.9743	32585
		Explicit (1st order)			Semi-Implicit (1st order)
800	2000	0.059804	0.9128	574.91	0.096943	0.86667	1368.9

| Show Table

DownLoad: CSV

Example 3 In this example, we test the schemes against the complete model (i.e., with all biological and physical processes). To this end, we take $\mu_0 = e^{-x}dx$ , $g(x) = 2-2e^{x-20}$ , $\beta (x) = 2$ , $d(x) = 1$ , $\kappa (x, y) = 1$ , $a(x) = x$ , and $b(y, \cdot) = \frac{2}{y}$ . The simulation is performed over the finite domain $(x, t) \in [0, 20]\times [0, 0.5]$ . To our knowledge, the solution of this problem is unknown.

Table 3. Error, order, and computation time for example 3. Here, Nx and Nt represent the number of points in

$x$ and

$t$ , respectively. The numerical result in the last row for the 1st order variant is generated from the scheme presented in ^[9].

		Explicit			Semi-Implicit
Nx	Nt	BL Error	Order	Time (secs)	BL Error	Order	Time (secs)
100	250	0.0023026		1.0332	0.0028799		0.74398
200	500	0.00085562	1.4282	6.8831	0.00076654	1.9096	5.5104
400	1000	0.0002743	1.6412	100.57	0.00076654	1.9549	75.055
800	2000	7.5404e-05	1.8631	2371.2	5.021e-05	1.9775	1739.1
1600	4000	1.9495e-05	1.9515	43779	1.2651e-05	1.9887	32286
		Explicit (1st order)			Semi-Implicit (1st order)
800	2000	0.0092432	0.97728	625.3	0.0014192	0.98355	1112.8

| Show Table

DownLoad: CSV

Example 4 As mentioned in ^[9], the mixed discrete and continuous fragmentation model studied in ^[7,8], with adjusted assumptions, is a special case of model (3.1). Indeed, by removing the biological and coagulation terms and letting the kernel

$(b(y, \cdot), \phi) = \sum\limits_{i = 1}^N b_i(y) \phi(ih) + \int_{Nh}^y \phi(x) b^c(y, x) dx$

with $\text{supp} \ b^c(y, \cdot) \subset [Nh, y]$ for some $h > 0$ , we have the mixed model in question. We wish to demonstrate the finite difference scheme presented here maintains this mixed structure.

To this end, we take the fragmentation kernel

$b^c(y, x) = \frac{2}{y}, \quad b_i(y) = \frac{2}{y}, \text{ and } a(x) = x^{-1},$

with initial condition $\mu = \sum_{i = 1}^5 \delta_i + \chi_{[5,15]}(x)dx$ , where $\chi_A$ represents the characteristic function over the set $A$ . This is similar to some examples in ^[8], where more detail and analysis are provided. In Figure 1, we present the simulation of this example. Notice, the mixed structure is preserved in finite time. For examples of this type, the scheme could be improved upon by the inclusion of mass conservative fragmentation terms similar to those presented in ^[6].

Figure 1. Initial condition and numerical solution at time

$T = 4$ of example 4.

DownLoad: Full-Size Img PowerPoint

6. Conclusion

In this paper, we have lifted two of the first order finite difference schemes presented in ^[9] to second order high resolution schemes using flux limiter methods. The difference between both schemes is only found in the coagulation term, where the semi-implicit scheme is made linear. In context of standard structured population models (i.e. without coagulation or fragmentation), these type of schemes have been shown to be well-behaved in the presences of discontinuities and singularities. This quality makes them a well fit tool for studying PDEs in spaces of measures. We prove the convergence of both schemes under the assumption of natural CFL conditions. The order of convergence of both schemes is then tested numerically with previously used examples.

In summary, the schemes preform as expected in the presence of smooth initial conditions. In all such simulations, the numerical schemes presented demonstrate a convergence rate of order 2. For simulations with biological terms, this convergence rate is expected to drop when singularities and discontinuities occur, as demonstrated in ^[13]. Mass conservation of the schemes, an important property for coagulation/fragmentation processes, is discussed in detail in ^[6,9].

Acknowledgments

The research of ASA is supported in part by funds from R.P. Authement Eminent Scholar and Endowed Chair in Computational Mathematics at the University of Louisiana at Lafayette. RL is grateful for the support of the Carl Tryggers Stiftelse via the grant CTS 21:1656.

7. Appendix

7.1. Proof of Lemmas 4.1 and 4.2

In this section, we present the proofs of Lemmas 4.1 and 4.2 for the explicit coagulation term. The semi-implicit term follows from similar arguments in the same fashion as ^[9].

Proof of Lemma 4.1

Proof. We first prove via induction that for any $k = 1, 2, \dots, \bar{k},$ $\mu_{\Delta x}^k$ satisfies the following:

$({\rm{i}})$ $\mu^k_{\Delta x} \in \mathcal{M}^+(\mathbb{R}^+)$ i.e. $m_j^k \geq 0$ for all $j = 1, \dots, J$ ,

$({\rm{ii}})$ $\|\mu_{\Delta x}^k\|_{TV} \leq \|\mu^0_{\Delta x}\|_{TV} (1+(\zeta + C_b C_a)\Delta t)^k.$

Then, the TV bound in the Lemma follows from standard arguments (see e.g. Lemma 4.1 in ^[9]). We prove this Theorem for the choice of the explicit coagulation term, $\mathcal{C}_{j, k}^{\text{exp}},$ as the implicit case is similar and more straight forward.

We begin by showing that $m_j^{k+1} \geq 0$ for every $j = 1, 2, \dots, J$ . Notice by way of (4.8), this reduces down to showing

$\begin{equation*} \frac{\Delta t}{\Delta x}A_j^k + \Delta t (d_j^k + a_j)+ \Delta t \sum\limits_{i = 1}^J \kappa_{i, j}m_i^k \leq 1. \end{equation*}$

Indeed, by the CFL condition (4.6), induction hypothesis, and

$\sum\limits_{i = 1}^{J}\kappa_{i, j}m_i^k \le C_\kappa \sum\limits_{i = 1}^{J} m_i^k = C_\kappa \|\mu^k_{\Delta x}\|_{TV} \le C_\kappa \|\mu^0_{\Delta x}\|_{TV} \exp((\zeta + C_b C_a )T),$

we arrive at the result.

For the TV bound, we have since the $m_j^k$ are non-negative, $\|\mu_{\Delta x}^k\|_{TV} = \sum_{j = 1}^{J} m_j^k$ . By rearranging (4.8) and summing over $j = 1, 2, \dots, J$ we have

$\begin{equation} \begin{split} \|\mu^{k+1}_{\Delta x}\|_{TV} & \le \sum\limits_{j = 1}^{J} m^{k}_j + \frac{\Delta t}{\Delta x} \sum\limits_{j = 1}^{J} \Big(f^k_{j-\frac12}-f^k_{j+\frac12} \Big) + \Delta t \sum\limits_{j = 1}^{J}\sum\limits_{i = j}^J b_{i, j}a_i m_i^k \\ & \quad \left. \qquad+ \Delta t \Big(\frac{1}{2} \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{j-1} \kappa_{i, j-i}m_i^{k} m_{j-i}^k - \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{J} \kappa_{i, j} m_i^k m_j^{k} \Big). \right. \end{split} \end{equation}$

(7.1)

To bound the right-hand side of equation (7.1), we directly follow the arguments of Lemma 4.1 in ^[9] which yields

$\|\mu^{k+1}_{\Delta x}\|_{TV}\le (1+(\zeta+C_aC_b)\Delta t) \sum\limits_{j = 1}^{J} m^k_j = (1+(\zeta+C_aC_b)\Delta t) \|\mu^k_{\Delta x}\|_{TV}.$

Using the induction hypothesis, we obtain $\|\mu^{k+1}_{\Delta x}\|_{TV}\le \|\mu^0_{\Delta x}\|_{TV} (1+(\zeta + C_b C_a)\Delta t)^{k+1}$ as desired. □

Proof of Lemma 4.2

Proof. For $\phi \in W^{1, \infty}(\mathbb{R}^+)$ with $\|\phi\|_{W^{1, \infty}} \leq 1$ , and denoting $\phi_j: = \phi(x_j)$ , we have for any $k$ ,

$\begin{align*} (\mu^{k+1}_{\Delta x}-\mu^k_{\Delta x}, \phi) = & \sum\limits_{j = 1}^{J} (m^{k+1}_{j}-m^k_{j}) \phi_j \\ \leq& \Delta t \sum\limits_{j = 1}^{J} \phi_j \Big(\frac{1}{\Delta x} (f_{j-\frac12}^k -f_{j+\frac12}^k) - d_j^km_j^k - a_jm_j^k \\ & \quad +\frac{1}{2} \sum\limits_{i = 1}^{j-1}\kappa_{i, j-i}m_i^{k} m_{j-i}^k - \sum\limits_{i = 1}^{J} \kappa_{i, j} m_i^k m_j^{k} + \sum\limits_{i = j}^J b_{i, j}a_i m_i^k\Big). \end{align*}$

Let $C$ be the right-hand side of the TV-bound from Lemma 4.1, we then see

$(\mu^{k+1}_{\Delta x}-\mu^k_{\Delta x}, \phi) \le \frac{\Delta t}{\Delta x} \sum\limits_{j = 1}^{J} \phi_j (f_{j-\frac12}^k -f_{j+\frac12}^k) + \Delta t (\zeta+C_a+C_bC_a+\frac32 C_\kappa C^* )C^*.$

Moreover, since $g_J^k = 0$ the sum in the right-hand side takes the form

$\begin{eqnarray*} \phi_1g_0^km_0^k + \sum\limits_{j = 1}^{J-1} (\phi_{j+1}-\phi_j) f_{j+\frac12}^k = \Delta x \phi_1 \sum\limits_{j = 1}^{J} {}^* \beta_j^k m_j^k + \sum\limits_{j = 1}^{J-1} (\phi_{j+1}-\phi_j) f_{j+\frac12}^k \le 3.5\Delta x\zeta C^*. \end{eqnarray*}$

We thus obtain

$\begin{align*} (\mu^{k+1}_{\Delta x}-\mu^k_{\Delta x}, \phi) \le L\Delta t, \qquad L: = (3.5\zeta+C_a+C_bC_a+\frac32 C_\kappa C^* )C^*. \end{align*}$

Taking the supremum over $\phi$ gives $\|\mu^{k+1}_{\Delta x}-\mu^k_{\Delta x}\|_{BL}\le L\Delta t$ for any $k$ . The result follows. □

7.2. Estimate of mass loss for the implicit coagulation discretization given by ${\mathcal C}^{\text{imp}}_{j, k}$

In this section, we consider the semi-implicit scheme (4.9) without any biological ingredients or fragmentation (i.e., $a, g, d, \beta = 0$ ) where mass is not conserved (observed by the numerical experiments in Section 5) and show this change is controlled by the time step. For a bound on the loss of mass via fragmentation, we direct the reader to Section 6.1 of ^[6]. Multiplying (4.9) by $x_j$ and summing over $j$ , we can arrive at

$\sum\limits_{j = 1}^{J} x_j m_j^{k+1} = \sum\limits_{j = 1}^{J} x_j m_j^k + \frac{\Delta t}{2} \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{j-1} x_j \, \kappa_{i, j-i} m_i^{k+1} m_{j-i}^k - \Delta t \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{J} \, x_j \kappa_{i, j} m_i^{k} m_j^{k+1}.$

In [6, Section 6.1] it was shown that the explicit scheme conserves mass through the coagulation process, i.e.,

$\frac{\Delta t}{2} \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{j-1} x_j \, \kappa_{i, j-i} m_i^{k} m_{j-i}^k - \Delta t \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{J} \, x_j \kappa_{i, j} m_i^{k} m_j^{k} = 0.$

Adding this to the previous equation we have

$\begin{align*} \sum\limits_{j = 1}^{J} x_j m_j^{k+1} & = \sum\limits_{j = 1}^{J} x_j m_j^k + \frac{\Delta t}{2} \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{j-1} x_j \, \kappa_{i, j-i} (m_i^{k+1}-m_i^{k}) m_{j-i}^k \\ & \qquad - \Delta t \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{J} \, x_j \kappa_{i, j} m_i^{k} (m_j^{k+1} - m_j^{k})\\ & = \sum\limits_{j = 1}^{J} x_j m_j^k + \frac{\Delta t}{2} \sum\limits_{i = 1}^{J} \sum\limits_{l = 1}^{J} x_{l+i} \, \kappa_{i, l} (m_i^{k+1}-m_i^{k}) m_{l}^k \\ & \qquad - \Delta t \sum\limits_{j = 1}^{J} \sum\limits_{i = 1}^{J} \, x_j \kappa_{i, j} m_i^{k} (m_j^{k+1} - m_j^{k}), \end{align*}$

where in the last equality, we change the order of integration and introduce the new index $l = j-i$ . Noticing that due to the uniform mesh size $x_{l+i} = x_l + x_i$ , we can split the second term on the right-hand side and obtain the equation

$\sum\limits_{j = 1}^{J} x_j m_j^{k+1} = \sum\limits_{j = 1}^{J} x_j m_j^k + \frac{\Delta t}{2} \sum\limits_{i = 1}^{J} \sum\limits_{l = 1}^{J} x_{i} \, \kappa_{i, l} \big(m_{l}^k (m_i^{k+1}-m_i^{k}) - m_{i}^k (m_l^{k+1}-m_l^{k}) \big).$

Since $x_j \leq x_{\max}$ , we can bound the last term on the right-hand side

$\left| \frac{\Delta t}{2} \sum\limits_{i = 1}^{J} \sum\limits_{l = 1}^{J} x_{i} \, \kappa_{i, l} \big(m_{l}^k (m_i^{k+1}-m_i^{k}) - m_{i}^k (m_l^{k+1}-m_l^{k}) \big)\right| \leq \Delta t \, C_\kappa x_{\max} \|\mu_{\Delta x}^k \|_{TV} \, \|\mu_{\Delta x}^{k+1} - \mu_{\Delta x}^{k}\|_{BL}.$

Using Lemmas 4.1 and 4.2, we have the estimate

$\left| \sum\limits_{j = 1}^{J} \frac{x_j m_j^{k+1} - x_j m_j^{k}}{\Delta t} \right| \leq C_\kappa x_{\max} L \exp((\zeta + C_a C_b)T) \|\mu^0\|_{TV} \Delta t .$

References

[1]	N. Shahini, Z. Bahrami, S. Sheykhivand, S. Marandi, M. Danishvar, S. Danishvar, et al., Automatically identified EEG signals of movement intention based on CNN network (end-to-end), Electronics, 11 (2022), 3297. https://doi.org/10.3390/electronics11203297 doi: 10.3390/electronics11203297
[2]	T. Zebin, P. J. Scully, K. B. Ozanyan, Human activity recognition with inertial sensors using a deep learning approach, Proceedings IEEE Sensors, (2017), 1–3. https://doi.org/10.1109/ICSENS.2016.7808590
[3]	W. Xu, Y. Pang, Y. Yang, Y. Liu, Human activity recognition based on convolutional neural network, Proceedings of the International Conference on Pattern Recognition, (2018), 165–170. https://doi.org/10.1109/ICPR.2018.8545435
[4]	Y. Omae, M. Kobayashi, K. Sakai, T. Akiduki, A. Shionoya, H. Takahashi, Detection of swimming stroke start timing by deep learning from an inertial sensor, ICIC Express Letters Part B: Applications ICIC International, 11 (2020), 245–251. https://doi.org/10.24507/icicelb.11.03.245 doi: 10.24507/icicelb.11.03.245
[5]	D. Sagga, A. Echtioui, R. Khemakhem, M. Ghorbel, Epileptic seizure detection using EEG signals based on 1D-CNN approach, Proceedings of the 20th International Conference on Sciences and Techniques of Automatic Control and Computer Engineering, (2020), 51–56. https://doi.org/10.1109/STA50679.2020.9329321
[6]	N. Dua, S. N. Singh, V. B. Semwal, Multi-input CNN-GRU based human activity recognition using wearable sensors, Computing, 103 (2021), 1461–1478. https://doi.org/10.1007/s00607-021-00928-8 doi: 10.1007/s00607-021-00928-8
[7]	Y. H. Yeh, D. P. Wong, C. T. Lee, P. H. Chou, Deep learning-based real-time activity recognition with multiple inertial sensors, Proceedings of the 2022 4th International Conference on Image, Video and Signal Processing, (2022), 92–99. https://doi.org/10.1145/3531232.3531245
[8]	J. P. Wolff, F. Grützmacher, A. Wellnitz, C. Haubelt, Activity recognition using head worn inertial sensors, Proceedings of the 5th International Workshop on Sensor-based Activity Recognition and Interaction, (2018), 1–7. https://doi.org/10.1145/3266157.3266218
[9]	R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, D. Batra, Grad-CAM: Visual explanations from deep networks via gradient-based localization, Int. J. Comput. Vision, 128 (2016), 336–359. https://doi.org/10.1109/ICCV.2017.74 doi: 10.1109/ICCV.2017.74
[10]	B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, A. Torralba, Learning deep features for discriminative localization, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2016), 2921–2929.
[11]	M. Kara, Z. Öztürk, S. Akpek, A. A. Turupcu, P. Su, Y. Shen, COVID-19 diagnosis from chest ct scans: A weakly supervised CNN-LSTM approach, AI, 2 (2021), 330–341. https://doi.org/10.3390/ai2030020 doi: 10.3390/ai2030020
[12]	M. Kavitha, N. Yudistira, T. Kurita, Multi instance learning via deep CNN for multi-class recognition of Alzheimer's disease, 2019 IEEE 11th International Workshop on Computational Intelligence and Applications, (2019), 89–94. https://doi.org/10.1109/IWCIA47330.2019.8955006
[13]	J. G. Nam, J. Kim, K. Noh, H. Choi, D. S. Kim, S. J. Yoo, et al., Automatic prediction of left cardiac chamber enlargement from chest radiographs using convolutional neural network, Eur. Radiol., 31 (2021), 8130–8140. https://doi.org/10.1007/s00330-021-07963-1 doi: 10.1007/s00330-021-07963-1
[14]	T. Matsumoto, S. Kodera, H. Shinohara, H. Ieki, T. Yamaguchi, Y. Higashikuni, et al., Diagnosing heart failure from chest X-ray images using deep learning, Int. Heart J., 61 (2020), 781–786. https://doi.org/10.1536/ihj.19-714 doi: 10.1536/ihj.19-714
[15]	Y. Hirata, K. Kusunose, T. Tsuji, K. Fujimori, J. Kotoku, M. Sata, Deep learning for detection of elevated pulmonary artery wedge pressure using standard chest X-ray, Can. J. Cardiol., 37 (2021), 1198–1206. https://doi.org/10.1016/j.cjca.2021.02.007 doi: 10.1016/j.cjca.2021.02.007
[16]	M. Dutt, S. Redhu, M. Goodwin, C. W. Omlin, SleepXAI: An explainable deep learning approach for multi-class sleep stage identification, Appl. Intell., 53 (2023), 16830–16843. https://doi.org/10.1007/s10489-022-04357-8 doi: 10.1007/s10489-022-04357-8
[17]	S. Jonas, A. O. Rossetti, M. Oddo, S. Jenni, P. Favaro, F. Zubler, EEG-based outcome prediction after cardiac arrest with convolutional neural networks: Performance and visualization of discriminative features, Human Brain Mapp., 40 (2019), 4606–4617. https://doi.org/10.1002/hbm.24724 doi: 10.1002/hbm.24724
[18]	C. Barros, B. Roach, J. M. Ford, A. P. Pinheiro, C. A. Silva, From sound perception to automatic detection of schizophrenia: An EEG-based deep learning approach, Front. Psychiatry, 12 (2022), 813460. https://doi.org/10.3389/fpsyt.2021.813460 doi: 10.3389/fpsyt.2021.813460
[19]	Y. Yan, H. Zhou, L. Huang, X. Cheng, S. Kuang, A novel two-stage refine filtering method for EEG-based motor imagery classification, Front. Neurosci., 15 (2021), 657540. https://doi.org/10.3389/fnins.2021.657540 doi: 10.3389/fnins.2021.657540
[20]	M. Porumb, S. Stranges, A. Pescapè, L. Pecchia, Precision medicine and artificial intelligence: A pilot study on deep learning for hypoglycemic events detection based on ECG, Sci. Rep-UK., 10 (2020), 170. https://doi.org/10.1038/s41598-019-56927-5 doi: 10.1038/s41598-019-56927-5
[21]	S. Raghunath, A. E. U. Cerna, L. Jing, D. P. vanMaanen, J. Stough, D. N. Hartzel, et al., Prediction of mortality from 12-lead electrocardiogram voltage data using a deep neural network, Nat. Med., 26 (2020), 886–891. https://doi.org/10.1038/s41591-020-0870-z doi: 10.1038/s41591-020-0870-z
[22]	H. Shin, Deep convolutional neural network-based hemiplegic gait detection using an inertial sensor located freely in a pocket, Sensors, 22 (2022), 1920. https://doi.org/10.3390/s22051920 doi: 10.3390/s22051920
[23]	G. Aquino, M. G. Costa, C. F. C. Filho, Explaining one-dimensional convolutional models in human activity recognition and biometric identification tasks, Sensors, 22 (2022), 5644. https://doi.org/10.3390/s22155644 doi: 10.3390/s22155644
[24]	R. Ge, M. Zhou, Y. Luo, Q. Meng, G. Mai, D. Ma, et al, , Mctwo: A two-step feature selection algorithm based on maximal information coefficient, BMC Bioinformatics, 17 (2016), 142. https://doi.org/10.1186/s12859-016-0990-0 doi: 10.1186/s12859-016-0990-0
[25]	T. Naghibi, S. Hoffmann, B. Pfister, Convex approximation of the NP-hard search problem in feature subset selection, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, (2013), 3273–3277. https://doi.org/10.1109/ICASSP.2013.6638263
[26]	D. S. Hochba, Approximation algorithms for NP-hard problems, ACM SIGACT News, 28 (1997), 40–52. https://doi.org/10.1145/261342.571216 doi: 10.1145/261342.571216
[27]	C. Yun, J. Yang, Experimental comparison of feature subset selection methods, Seventh IEEE International Conference on Data Mining Workshops, (2007), 367–372. https://doi.org/10.1109/ICDMW.2007.77
[28]	W. C. Lin, Experimental study of information measure and inter-intra class distance ratios on feature selection and orderings, IEEE T. Syst. Man Cy-S, 3 (1973), 172–181. https://doi.org/10.1109/TSMC.1973.5408500 doi: 10.1109/TSMC.1973.5408500
[29]	W. Y. Loh, Classification and regression trees, Data Mining and Knowledge Discovery, 1 (2011), 14–23. https://doi.org/10.1002/widm.8 doi: 10.1002/widm.8
[30]	M. R. Osborne, B. Presnell, B. A. Turlach, On the lasso and its dual, J. Comput. Graph. Stat., 9 (2000), 319–337. https://doi.org/10.1080/10618600.2000.10474883
[31]	R. J. Palma-Mendoza, D. Rodriguez, L. de Marcos, Distributed Relieff-based feature selection in spark, Knowl. Inf. Syst., 57 (2018), 1–20. https://doi.org/10.1007/s10115-017-1145-y doi: 10.1007/s10115-017-1145-y
[32]	Y. Huang, P. J. McCullagh, N. D. Black, An optimization of Relieff for classification in large datasets, Data Knowl. Eng., 68 (2009), 1348–1356. https://doi.org/10.1016/j.datak.2009.07.011 doi: 10.1016/j.datak.2009.07.011
[33]	R. Yao, J. Li, M. Hui, L. Bai, Q. Wu, Feature selection based on random forest for partial discharges characteristic set, IEEE Access, 8 (2020), 159151–159161. https://doi.org/10.1109/ACCESS.2020.3019377 doi: 10.1109/ACCESS.2020.3019377
[34]	M. Mori, R. G. Flores, Y. Suzuki, K. Nukazawa, T. Hiraoka, H. Nonaka, Prediction of Microcystis occurrences and analysis using machine learning in high-dimension, low-sample-size and imbalanced water quality data, Harmful Algae, 117 (2022), 102273. https://doi.org/10.1016/j.hal.2022.102273 doi: 10.1016/j.hal.2022.102273
[35]	Y. Omae, M. Mori, E2H distance-weighted minimum reference set for numerical and categorical mixture data and a Bayesian swap feature selection algorithm, Mach. Learn. Know. Extr., 5 (2023), 109–127. https://doi.org/10.3390/make5010007 doi: 10.3390/make5010007
[36]	R. Garriga, J. Mas, S. Abraha, J. Nolan, O. Harrison, G. Tadros, et al., Machine learning model to predict mental health crises from electronic health records, Nat. Med., 28 (2022), 1240–1248. https://doi.org/10.1038/s41591-022-01811-5 doi: 10.1038/s41591-022-01811-5
[37]	G. Chandrashekar, F. Sahin, A survey on feature selection methods, Comput. Electr. Eng., 40 (2014), 16–28. https://doi.org/10.1016/j.compeleceng.2013.11.024 doi: 10.1016/j.compeleceng.2013.11.024
[38]	N. Gopika, M. Kowshalaya, Correlation based feature selection algorithm for machine learning, Proceedings of the 3rd International Conference on Communication and Electronics Systems, (2018), 692–695. https://doi.org/10.1109/CESYS.2018.8723980
[39]	L. Fu, B. Lu, B. Nie, Z. Peng, H. Liu, X. Pi, Hybrid network with attention mechanism for detection and location of myocardial infarction based on 12-lead electrocardiogram signals, Sensors, 20 (2020), 1020. https://doi.org/10.3390/s20041020 doi: 10.3390/s20041020
[40]	F. M. Rueda, R. Grzeszick, G. A. Fink, S. Feldhorst, M. T. Hompel, Convolutional neural networks for human activity recognition using body-worn sensors, Informatics, 5 (2018), 26. https://doi.org/10.3390/informatics5020026 doi: 10.3390/informatics5020026
[41]	T. Thenmozhi, R. Helen, Feature selection using extreme gradient boosting bayesian optimization to upgrade the classification performance of motor imagery signals for BCI, J. Neurosci. Meth., 366 (2022), 109425. https://doi.org/10.1016/j.jneumeth.2021.109425 doi: 10.1016/j.jneumeth.2021.109425
[42]	R. Garnett, M. A. Osborne, S. J. Roberts, Bayesian optimization for sensor set selection, Proceedings of the 9th ACM/IEEE International Conference on Information Processing in Sensor Networks, (2019), 209–219. https://doi.org/10.1145/1791212.1791238
[43]	E. Kim, Interpretable and accurate convolutional neural networks for human activity recognition, IEEE T. Ind. Inform., 16 (2020), 7190–7198. https://doi.org/10.1109/TII.2020.2972628 doi: 10.1109/TII.2020.2972628
[44]	M. Jaén-Vargas, K. M. R. Leiva, F. Fernandes, S. B. Goncalves, M. T. Silva, D. S. Lopes, et al., Effects of sliding window variation in the performance of acceleration-based human activity recognition using deep learning models, PeerJ Comput. Sci., 8 (2022), e1052. https://doi.org/10.7717/peerj-cs.1052 doi: 10.7717/peerj-cs.1052
[45]	R. Chavarriaga, H. Sagha, A. Calatroni, S. T. Digumarti, G. Tröster, J. D. R. Millán, et al., The opportunity challenge: A benchmark database for on-body sensor-based activity recognition, Pattern Recogn. Lett., 34 (2013), 2033–2042. https://doi.org/10.1016/j.patrec.2012.12.014 doi: 10.1016/j.patrec.2012.12.014
[46]	H. Sagha, S. T. Digumarti, J. D. R. Millán, R. Chavarriaga, A. Calatroni, D. Roggen, et al., Benchmarking classification techniques using the opportunity human activity dataset, 2011 IEEE International Conference on Systems, Man and Cybernetics, (2011), 36–40. doi: 10.1109/ICSMC.2011.6083628
[47]	A. Murad, J. Y. Pyun, Deep recurrent neural networks for human activity recognition, Sensors, 17 (2017), 2556. https://doi.org/10.3390/s17112556 doi: 10.3390/s17112556
[48]	J. B. Yang, M. N. Nguyen, P. P. San, X. L. Li, S. Krishnaswamy, Deep convolutional neural networks on multichannel time series for human activity recognition, Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, (2015), 3995–4001.
[49]	O. Banos, J. M. Galvez, M. Damas, H. Pomares, I. Rojas, Window size impact in human activity recognition, Sensors, 14 (2014), 6474–6499. https://doi.org/10.3390/s140406474 doi: 10.3390/s140406474
[50]	T. Tanaka, I. Nambu, Y. Maruyama, Y. Wada, Sliding-window normalization to improve the performance of machine-learning models for real-time motion prediction using electromyography, Sensors, 22 (2022), 5005. https://doi.org/10.3390/s22135005 doi: 10.3390/s22135005
[51]	J. Wu, X. Y. Chen, H. Zhang, L. D. Xiong, H. Lei, S. H. Deng, Hyperparameter optimization for machine learning models based on bayesian optimization, J. Electron. Sci. Technol., 17 (2019), 26–40. https://doi.org/10.11989/JEST.1674-862X.80904120 doi: 10.11989/JEST.1674-862X.80904120
[52]	P. Doke, D. Shrivastava, C. Pan, Q. Zhou, Y. D. Zhang, Using CNN with bayesian optimization to identify cerebral micro-bleeds, Mach. Vision Appl., 31 (2020), 1–14. https://doi.org/10.1007/s00138-020-01087-0 doi: 10.1007/s00138-020-01087-0
[53]	J. Bergstra, R. Bardenet, Y. Bengio, B. Kegl, Algorithms for hyper-parameter optimization, Adv. Neural Inf. Process. Syst., 24 (2011), 2546–2554.
[54]	T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: A next-generation hyperparameter optimization framework, Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, (2019), 2623–2631, https://optuna.readthedocs.io/en/stable/. doi: 10.1145/3292500.3330701
[55]	H. Makino, E. Kita, Stochastic schemata exploiter-based AutoML, 2021 IEEE International Conference on Data Mining Workshops, (2021), 238–245. https://doi.org/10.1109/ICDMW53433.2021.00037
[56]	P. Siirtola, P. Laurinen, J. Roning and H. Kinnunen, Efficient accelerometer-based swimming exercise tracking, IEEE SSCI 2011: Symposium Series on Computational Intelligence, (2011), 156–161. https://doi.org/10.1109/CIDM.2011.5949430
[57]	G. Brunner, D. Melnyk, B. Sigfússon, R. Wattenhofer, Swimming style recognition and lap counting using a smartwatch and deep learning, 2019 International Symposium on Wearable Computers, (2019), 23–31. https://doi.org/10.1145/3341163.3347719

This article has been cited by:

1.	Carlo Bianca, Nicolas Saintier, Thermostatted kinetic theory in measure spaces: Well-posedness, 2025, 251, 0362546X, 113666, 10.1016/j.na.2024.113666
2.	Konstantinos Alexiou, Daniel B. Cooney, Steady-State and Dynamical Behavior of a PDE Model of Multilevel Selection with Pairwise Group-Level Competition, 2025, 87, 0092-8240, 10.1007/s11538-025-01476-4

Reader Comments

Your name:*

Email:*
© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)