Research article

Metastability of solitary waves in diatomic FPUT lattices

  • Received: 14 December 2018 Accepted: 25 February 2019 Published: 30 April 2019
  • It is known that long waves in spatially periodic polymer Fermi-Pasta-Ulam-Tsingou lattices are well-approximated for long, but not infinite, times by suitably scaled solutions of Korteweg-de Vries equations. It is also known that dimer FPUT lattices possess nanopteron solutions, i.e., traveling wave solutions which are the superposition of a KdV-like solitary wave and a very small amplitude ripple. Such solutions have infinite mechanical energy. In this article we investigate numerically what happens over very long time scales (longer than the time of validity of the KdV approximation) to solutions of diatomic FPUT lattices which are initially suitably scaled (finite energy) KdV solitary waves; that is, we omit the ripple. What we find is that the solitary wave continuously leaves behind a very small amplitude "oscillatory wake." This periodic tail saps energy from the solitary wave at a very slow (numerically sub-exponential) rate. We take this as evidence that the diatomic FPUT "solitary wave" is in fact quasi-stationary or metastable.

    Citation: Nickolas Giardetti, Amy Shapiro, Stephen Windle, J. Douglas Wright. Metastability of solitary waves in diatomic FPUT lattices[J]. Mathematics in Engineering, 2019, 1(3): 419-433. doi: 10.3934/mine.2019.3.419



    With the continuous development of computing technology over the past decade, deep learning has made great strides. It can be found in many fields, such as imaging and natural language processing. Scholars have begun to apply deep learning to solve complex partial differential equations (PDEs), including PDEs with high order derivatives [1], high-dimensional PDEs [2,3], subdiffusion problems with noisy data [4] and so on. Based on the deep learning method, Raissi et al. [5] proposed a novel algorithm called the physics informed neural network (PINN), which has achieved excellent results for solving forward and inverse PDEs; it integrates the physical information described by PDEs into a neural network. In recent years, the PINN algorithm has attracted extensive attention. To solve forward and inverse problems of integro-differential equations (IDEs), Yuan et al. [6] proposed the auxiliary physics informed neural network (A-PINN). Lin and Chen [7] designed a two-stage physics informed neural network for approximating localized wave solutions, which introduces the measurement of conserved quantities in stage two. Yang et al. [8] developed Bayesian physics informed neural networks (B-PINNs), which take a Bayesian neural network with Hamiltonian Monte Carlo or variational inference as the prior and posterior estimators, respectively. Scholars have also presented other variants of PINN, such as RPINNs [9] and hp-VPINNs [10]. PINN has also achieved good performance on physical problems, including high-speed flows [11] and heat transfer problems [12]. In addition to integer-order differential equations, authors have studied the application of PINN to fractional differential equations such as fractional advection-diffusion equations (see Pang et al. [13] for fractional physics informed neural networks (fPINNs)), high-dimensional fractional PDEs (see Guo et al. [14] for Monte Carlo physics-informed neural networks (MC-PINNs)) and fractional water wave models (see Liu et al. [15] for the time difference PINN).

    In recent decades, fractional differential equations (FDEs) have received wide attention and been studied in many fields, such as image denoising [16] and physics [17,18,19]. The reason why fractional differential equations have attracted such attention is that they can describe complex physical phenomena more accurately. As an indispensable part of fractional problems, distributed-order differential equations are difficult to solve because of the complexity of distributed-order operators. To solve time multi-term and distributed-order fractional sub-diffusion equations, Gao et al. [20] proposed a second-order numerical difference formula. Jian et al. [21] derived a fast second-order implicit difference scheme for time distributed-order and Riesz space fractional diffusion-wave equations and analyzed its unconditional stability and second-order convergence. Li et al. [22] applied the mid-point quadrature rule with a finite volume method to approximate the distributed-order equation. For the nonlinear distributed-order sub-diffusion model [23], the distributed-order derivative and the spatial direction were approximated by the FBN-$\theta$ formula with a second-order composite numerical integral formula and the $H^1$-Galerkin mixed finite element method, respectively. In [24], Guo et al. adopted the Legendre-Galerkin spectral method for solving 2D distributed-order space-time reaction-diffusion equations. For the two-dimensional Riesz space distributed-order equation, Zhang et al. [25] used Gauss quadrature to calculate the distributed-order derivative and applied an alternating direction implicit (ADI) Galerkin-Legendre spectral scheme to approximate the spatial direction. For the distributed-order fourth-order sub-diffusion equation, Ran and Zhang [26] developed new compact difference schemes and proved their stability and convergence. In [27,28], the authors developed spectral methods for distributed-order time fractional fourth-order PDEs.

    As we know, distributed-order fractional PDEs can be regarded as the limiting case of multi-term fractional PDEs [29]. Moreover, Diethelm and Ford [30] observed that small changes in the order of a fractional PDE lead to only slight changes in the final solution, which gives initial support to the employed numerical integration method. In view of this, we combine the FBN-$\theta$ formula [31,32] and a second-order composite numerical integral formula with a multi-output neural network to solve 1D and 2D nonlinear time distributed-order models. Following the idea of combining a single output neural network with the discrete scheme of fractional models [13], we also use a single output neural network combined with a time discrete scheme to solve the nonlinear time distributed-order models. However, the accuracy of the predicted solution calculated by the single output neural network scheme is low and the training process takes a lot of time. Therefore, we introduce a multi-output neural network to obtain the numerical solution of the time discrete scheme. Compared with the single output neural network scheme, the proposed multi-output neural network scheme has two main advantages:

    ● Saving computing time. The multi-output neural network scheme reduces the sampling domain of the collocation points from the spatiotemporal domain to the spatial domain, which decreases the size of the training dataset and thus reduces the training time.

    ● Improving the accuracy of the predicted solution. Due to the discrete scheme of the distributed-order derivative, the $n$-th output of the multi-output neural network is constrained by the previous $n-1$ outputs.

    The remainder of this article is organized as follows: In Section 2, we describe the components of a neural network and how to construct one. In Section 3, we give the lemmas used to approximate the distributed-order derivative and the process of building the loss function. In Section 4, we provide some numerical results to confirm the capability of our proposed method. Finally, we draw some conclusions in Section 5.

    Faced with different objectives in various fields, scholars have developed many different types of neural networks, such as the feed-forward neural network (FNN) [6], the recurrent neural network (RNN) [33] and the convolutional neural network (CNN) [34]. The FNN considered in this article can effectively solve most PDEs. The input layer, hidden layers and output layer are the three indispensable components of an FNN, which can be given, respectively, by

    $$\begin{aligned}
    \text{input layer:}\quad & \Phi^{0}(x)=x\in\mathbb{R}^{d_{in}},\\
    \text{hidden layers:}\quad & \Phi^{k}(x)=\sigma\left(W^{k}\Phi^{k-1}(x)+b^{k}\right)\in\mathbb{R}^{\lambda_{k}},\quad 1\le k\le K-1,\\
    \text{output layer:}\quad & \Phi^{K}(x)=W^{K}\Phi^{K-1}(x)+b^{K}\in\mathbb{R}^{d_{out}}.
    \end{aligned}$$

    $W^{k}\in\mathbb{R}^{\lambda_{k}\times\lambda_{k-1}}$ and $b^{k}\in\mathbb{R}^{\lambda_{k}}$ represent the weight matrix and the bias vector in the $k$th layer, respectively. We define $\delta=\{W^{k},b^{k}\}_{1\le k\le K}$, which collects the trainable parameters of the FNN. $\lambda_{k}$ represents the number of neurons in the $k$th layer. $\sigma$ is a nonlinear activation function. In this article, the hyperbolic tangent function [3,6] is selected as the activation function. Many other functions can be considered as activation functions, such as the rectified linear unit (ReLU) $\sigma(x)=\max\{x,0\}$ [4] and the logistic sigmoid $\sigma(x)=\frac{1}{1+e^{-x}}$ [35].
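    As a concrete illustration, the following is a minimal sketch of such an FNN, assuming PyTorch is used (the class name, layer widths and depth are illustrative, not the exact configuration of this article):

```python
import torch
import torch.nn as nn

class FNN(nn.Module):
    """Feed-forward network: linear input/output layers, tanh in the hidden layers."""
    def __init__(self, d_in, d_out, width=40, depth=6):
        super().__init__()
        layers = [nn.Linear(d_in, width), nn.Tanh()]
        for _ in range(depth - 1):
            layers += [nn.Linear(width, width), nn.Tanh()]
        layers.append(nn.Linear(width, d_out))      # output layer has no activation
        self.net = nn.Sequential(*layers)

    def forward(self, x):
        return self.net(x)

# A multi-output network mapping a 1D spatial point to N = 20 time levels (illustrative sizes).
model = FNN(d_in=1, d_out=20)
```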

    In this article, we consider a nonlinear distributed-order model with the following general form:

    $$\begin{cases} D_{t}^{\omega}u+\mathcal{N}(u)=f(x,t), & (x,t)\in\Omega\times J,\\ u(x,0)=u_{0}(x), & x\in\bar{\Omega}, \end{cases} \quad (3.1)$$

    where $\Omega\subset\mathbb{R}^{d}$ ($d\le 2$) and $J=(0,T]$. $\mathcal{N}(\cdot)$ is a nonlinear differential operator. $D_{t}^{\omega}u$ represents the distributed-order derivative and has the following definition:

    $$D_{t}^{\omega}u(x,t)=\int_{0}^{1}\omega(\alpha)\,{}_{0}^{C}D_{t}^{\alpha}u(x,t)\,d\alpha, \quad (3.2)$$

    where $\omega(\alpha)\ge 0$, $\int_{0}^{1}\omega(\alpha)\,d\alpha=c_{0}>0$ and ${}_{0}^{C}D_{t}^{\alpha}u(x,t)$ is the Caputo fractional derivative expressed by

    $${}_{0}^{C}D_{t}^{\alpha}u(x,t)=\begin{cases} \dfrac{1}{\Gamma(1-\alpha)}\displaystyle\int_{0}^{t}\dfrac{u_{\eta}(x,\eta)}{(t-\eta)^{\alpha}}\,d\eta, & 0<\alpha<1,\\[2mm] u_{t}(x,t), & \alpha=1. \end{cases} \quad (3.3)$$

    The specific boundary condition is determined by the practical problem.

    For simplicity, choosing a mesh size $\Delta\alpha=\frac{1}{2I}$, we denote the nodes on the interval $[0,1]$ by $\alpha_{i}=i\Delta\alpha$ for $i=0,1,2,\dots,2I$. The time interval $[0,T]$ is divided into a uniform mesh with grid points $t_{n}=n\Delta t$ ($n=0,1,2,\dots,N$), where $\Delta t=T/N$ is the time step size. We denote $v^{n}\approx u^{n}=u(x,t_{n})$, $u_{t}^{n+\frac{1}{2}}=\frac{u^{n+1}-u^{n}}{\Delta t}+O(\Delta t^{2})$ and $u^{n+\frac{1}{2}}:=\frac{u^{n+1}+u^{n}}{2}$. $v^{n}$ is defined as the approximate solution of the time discrete scheme. The following lemmas are introduced to construct the numerical discrete formula of (3.1):

    Lemma 3.1. (See [23]) Supposing $\omega(\alpha)\in C^{2}[0,1]$, we can get

    $$\int_{0}^{1}\omega(\alpha)\,d\alpha=\Delta\alpha\sum_{i=0}^{2I}c_{i}\,\omega(\alpha_{i})-\frac{\Delta\alpha^{2}}{12}\,\omega^{(2)}(\gamma),\quad \gamma\in(0,1), \quad (3.4)$$

    where

    $$c_{i}=\begin{cases} \frac{1}{2}, & i=0,\ 2I,\\ 1, & \text{otherwise}. \end{cases} \quad (3.5)$$
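    For illustration, a small Python sketch of the quadrature in Lemma 3.1 (assuming NumPy and SciPy; the function name is ours, and the choice $\omega(\alpha)=\Gamma(3-\alpha)$ matches the numerical examples of Section 4):

```python
import numpy as np
from scipy.special import gamma

def alpha_quadrature(I, omega):
    """Composite trapezoidal rule of Lemma 3.1 on [0, 1]:
    nodes alpha_i = i * dalpha with dalpha = 1 / (2I),
    weights c_i = 1/2 at the two endpoints and 1 elsewhere."""
    dalpha = 1.0 / (2 * I)
    alpha = dalpha * np.arange(2 * I + 1)
    c = np.ones(2 * I + 1)
    c[0] = c[-1] = 0.5
    integral = dalpha * np.sum(c * omega(alpha))   # approximates the integral of omega over [0, 1]
    return alpha, c, integral

# Example with omega(alpha) = Gamma(3 - alpha); I = 250 gives dalpha = 1/500 as in Section 4.
alpha, c, val = alpha_quadrature(I=250, omega=lambda a: gamma(3 - a))
```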

    Lemma 3.2. From [23,31,32], the discrete formula of the Caputo fractional derivative (3.3) can be obtained by

    $${}_{0}^{C}D_{t}^{\alpha}u(x,t_{n+\frac{1}{2}})=\frac{{}_{0}^{C}D_{t}^{\alpha}u^{n+1}+{}_{0}^{C}D_{t}^{\alpha}u^{n}}{2}+O(\Delta t^{2})=\Delta t^{-\alpha}\sum_{s=0}^{n+1}\tilde{\kappa}^{(\alpha)}_{n+1-s}u^{s}+O(\Delta t^{2}), \quad (3.6)$$

    where

    $$\tilde{\kappa}^{(\alpha)}_{n+1-s}=\begin{cases} \dfrac{\kappa^{(\alpha)}_{0}}{2}, & s=n+1,\\[2mm] \dfrac{\kappa^{(\alpha)}_{n-s}+\kappa^{(\alpha)}_{n+1-s}}{2}, & \text{otherwise}. \end{cases} \quad (3.7)$$

    The parameters $\kappa^{(\alpha)}_{i}$ ($i=0,1,\dots,n+1$) that are the coefficients of FBN-$\theta$ ($\theta\in[\frac{1}{2},1]$) can be given by

    $$\kappa^{(\alpha)}_{i}=\begin{cases}
    \dfrac{2^{\alpha}(1+\alpha\theta)}{(3-2\theta)^{\alpha}}, & i=0,\\[2mm]
    \dfrac{\phi_{0}\,\kappa^{(\alpha)}_{0}}{\psi_{0}}, & i=1,\\[2mm]
    \dfrac{1}{2\psi_{0}}\left[(\phi_{0}-\psi_{1})\kappa^{(\alpha)}_{1}+\phi_{1}\kappa^{(\alpha)}_{0}\right], & i=2,\\[2mm]
    \dfrac{1}{i\psi_{0}}\displaystyle\sum_{j=1}^{3}\left[\phi_{j-1}-(i-j)\psi_{j}\right]\kappa^{(\alpha)}_{i-j}, & i\ge 3,
    \end{cases} \quad (3.8)$$

    where

    $$\phi_{i}=\begin{cases}
    2\alpha(\theta-1)(\alpha\theta+1)+\alpha\theta\left(\theta-\frac{3}{2}\right), & i=0,\\
    \alpha\left(2\theta^{2}-3\alpha\theta+4\alpha\theta^{2}-1\right), & i=1,\\
    \alpha\theta\left(1-2\theta+\alpha-2\alpha\theta\right), & i=2,
    \end{cases} \quad (3.9)$$

    and

    $$\psi_{i}=\begin{cases}
    \frac{1}{2}(3-2\theta)(1+\alpha\theta), & i=0,\\
    \frac{\alpha\theta}{2}(3-2\theta)-2(1-\theta)(\alpha\theta+1), & i=1,\\
    \frac{1}{2}(2\theta-1)(\alpha\theta+1)-2\alpha\theta(\theta-1), & i=2,\\
    \frac{1}{2}\alpha\theta(1-2\theta), & i=3.
    \end{cases} \quad (3.10)$$
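    The recursion (3.8) can be generated numerically as in the following sketch, assuming the scalar $\kappa^{(\alpha)}_{0}$ and the coefficient arrays $\phi$ and $\psi$ have already been evaluated from (3.8)–(3.10) (function and variable names are ours):

```python
import numpy as np

def fbn_theta_kappa(n_max, kappa0, phi, psi):
    """Coefficients kappa_0^{(alpha)}, ..., kappa_{n_max}^{(alpha)} via the recursion (3.8).

    kappa0 : value of kappa_0^{(alpha)} from the closed form in (3.8)
    phi    : array [phi_0, phi_1, phi_2] from (3.9)
    psi    : array [psi_0, psi_1, psi_2, psi_3] from (3.10)
    """
    kappa = np.zeros(n_max + 1)
    kappa[0] = kappa0
    if n_max >= 1:
        kappa[1] = phi[0] * kappa[0] / psi[0]
    if n_max >= 2:
        kappa[2] = ((phi[0] - psi[1]) * kappa[1] + phi[1] * kappa[0]) / (2.0 * psi[0])
    for i in range(3, n_max + 1):
        s = sum((phi[j - 1] - (i - j) * psi[j]) * kappa[i - j] for j in range(1, 4))
        kappa[i] = s / (i * psi[0])
    return kappa
```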

    Lemma 3.3. (See [23]) The distributed-order term $D_{t}^{\omega}u$ at $t=t_{n+\frac{1}{2}}$ can be calculated by the following formula:

    $$D_{t}^{\omega}u(x,t_{n+\frac{1}{2}})=\frac{D_{t}^{\omega}u^{n+1}+D_{t}^{\omega}u^{n}}{2}+O(\Delta t^{2})=\frac{1}{2}\sum_{s=0}^{n+1}\beta^{n+1}_{s}u^{s}+O(\Delta t^{2}+\Delta\alpha^{2}), \quad (3.11)$$

    where

    $$\beta^{n+1}_{s}=\begin{cases} \hat{\kappa}_{n-s}+\hat{\kappa}_{n+1-s}, & 0\le s<n+1,\\ \hat{\kappa}_{0}, & s=n+1, \end{cases} \quad (3.12)$$

    and

    $$\hat{\kappa}_{n-s}=\sum_{i=0}^{2I}\varphi_{i}\,\Delta t^{-\alpha_{i}}\,\kappa^{(\alpha_{i})}_{n-s},\qquad \varphi_{i}=\Delta\alpha\,\omega(\alpha_{i})\,c_{i}. \quad (3.13)$$
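    A possible vectorized evaluation of (3.13), assuming the coefficients $\kappa^{(\alpha_i)}_{j}$ for all quadrature nodes have been precomputed (the array layout is our own choice):

```python
import numpy as np

def kappa_hat(kappa_all, varphi, alpha, dt):
    """hat_kappa_j from (3.13).

    kappa_all : array of shape (2I+1, n_max+1); kappa_all[i, j] = kappa_j^{(alpha_i)}
    varphi    : weights Delta_alpha * omega(alpha_i) * c_i, shape (2I+1,)
    alpha     : quadrature nodes alpha_i, shape (2I+1,)
    dt        : time step size Delta t
    Returns the array [hat_kappa_0, ..., hat_kappa_{n_max}].
    """
    return np.sum(varphi[:, None] * dt ** (-alpha[:, None]) * kappa_all, axis=0)
```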

    Based on the above lemmas, the discrete scheme of the distributed-order model (3.1) at $t=t_{n+\frac{1}{2}}$ ($n=0,1,2,\dots,N-1$) can be expressed by the following equality:

    $$\frac{1}{2}\sum_{s=0}^{n+1}\beta^{n+1}_{s}v^{s}+\mathcal{N}\left(v^{n+\frac{1}{2}}\right)=f^{n+\frac{1}{2}}. \quad (3.14)$$

    Then we can obtain the system of equations as follows:

    $$\begin{cases}
    \frac{1}{2}\displaystyle\sum_{s=0}^{1}\beta^{1}_{s}v^{s}+\mathcal{N}\left(v^{\frac{1}{2}}\right)=f^{\frac{1}{2}},\\
    \frac{1}{2}\displaystyle\sum_{s=0}^{2}\beta^{2}_{s}v^{s}+\mathcal{N}\left(v^{1+\frac{1}{2}}\right)=f^{1+\frac{1}{2}},\\
    \qquad\vdots\\
    \frac{1}{2}\displaystyle\sum_{s=0}^{N}\beta^{N}_{s}v^{s}+\mathcal{N}\left(v^{N-\frac{1}{2}}\right)=f^{N-\frac{1}{2}}.
    \end{cases} \quad (3.15)$$

    The system of Eq (3.15) can be rewritten as the following matrix form:

    $$v(x)M+\mathcal{N}(v(x))+v^{0}(x)\rho_{0}=f(x), \quad (3.16)$$

    where the symbols $\rho_{0}$, $v(x)$, $f(x)$ and $\mathcal{N}(v(x))$ are vectors, which are given, respectively, by

    $$\begin{aligned}
    \rho_{0}&=\left[\tfrac{1}{2}\beta^{1}_{0},\ \tfrac{1}{2}\beta^{2}_{0},\ \tfrac{1}{2}\beta^{3}_{0},\ \dots,\ \tfrac{1}{2}\beta^{N}_{0}\right],\\
    v(x)&=\left[v^{1}(x),\ v^{2}(x),\ \dots,\ v^{N}(x)\right],\\
    f(x)&=\left[f^{\frac{1}{2}}(x),\ f^{1+\frac{1}{2}}(x),\ \dots,\ f^{N-\frac{1}{2}}(x)\right],\\
    \mathcal{N}(v(x))&=\left[\mathcal{N}\left(v^{\frac{1}{2}}(x)\right),\ \mathcal{N}\left(v^{1+\frac{1}{2}}(x)\right),\ \dots,\ \mathcal{N}\left(v^{N-\frac{1}{2}}(x)\right)\right].
    \end{aligned}$$

    The symbol $M$ is an $N\times N$ matrix that has the following definition:

    $$M=\begin{pmatrix}
    \frac{1}{2}\beta^{1}_{1} & \frac{1}{2}\beta^{2}_{1} & \cdots & \frac{1}{2}\beta^{N}_{1}\\
    0 & \frac{1}{2}\beta^{2}_{2} & \cdots & \frac{1}{2}\beta^{N}_{2}\\
    \vdots & \vdots & \ddots & \vdots\\
    0 & 0 & \cdots & \frac{1}{2}\beta^{N}_{N}
    \end{pmatrix}.$$
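    The coefficients $\beta^{n}_{s}$ and the matrix $M$ can be assembled as in the following sketch, assuming kappa_hat_arr holds the values $\hat{\kappa}_{j}$ from (3.13) (indexing conventions are ours):

```python
import numpy as np

def beta_coefficients(kappa_hat_arr, N):
    """beta^{n+1}_s from (3.12), stored as beta[n+1, s] for 0 <= s <= n+1 <= N."""
    beta = np.zeros((N + 1, N + 1))
    for m in range(1, N + 1):            # m plays the role of n + 1
        for s in range(m + 1):
            if s == m:
                beta[m, s] = kappa_hat_arr[0]
            else:
                beta[m, s] = kappa_hat_arr[m - 1 - s] + kappa_hat_arr[m - s]
    return beta

def assemble_M(beta, N):
    """Upper-triangular N x N matrix M with entry M[s-1, n-1] = beta^{n}_{s} / 2 for 1 <= s <= n."""
    M = np.zeros((N, N))
    for n in range(1, N + 1):
        for s in range(1, n + 1):
            M[s - 1, n - 1] = 0.5 * beta[n, s]
    return M
```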

    Now, we introduce a multi-output neural network $v(x;\delta)=\left[v^{1}(x;\delta),v^{2}(x;\delta),\dots,v^{N}(x;\delta)\right]$ into Eq (3.16), which takes $x$ as an input and is used to approximate the time discrete solutions $v(x)=\left[v^{1}(x),v^{2}(x),\dots,v^{N}(x)\right]$. This results in a multi-output PINN residual $\mathcal{R}(x)=\left[\mathcal{R}^{1}(x),\mathcal{R}^{2}(x),\dots,\mathcal{R}^{N}(x)\right]$:

    $$\mathcal{R}(x)=v(x;\delta)M+\mathcal{N}(v(x;\delta))+v^{0}(x)\rho_{0}-f(x), \quad (3.17)$$

    where $\mathcal{R}^{n+1}(x)$ denotes the residual error of the discrete scheme (3.14), which is given by

    $$\mathcal{R}^{n+1}(x)=\frac{1}{2}\sum_{s=1}^{n+1}\beta^{n+1}_{s}v^{s}(x;\delta)+\mathcal{N}\left(v^{n+\frac{1}{2}}(x;\delta)\right)+\frac{1}{2}\beta^{n+1}_{0}v^{0}(x)-f^{n+\frac{1}{2}}(x),\quad n=0,1,2,\dots,N-1. \quad (3.18)$$

    The loss function is constructed in the form of mean square error. Combined with the boundary condition loss, the total loss function can be expressed by the following formula:

    $$MSE_{total}=MSE+MSE_{b}, \quad (3.19)$$

    where

    $$MSE=\frac{1}{N\times N_{x}}\sum_{j=1}^{N}\sum_{i=1}^{N_{x}}\left|\mathcal{R}^{j}(x^{i})\right|^{2}, \quad (3.20)$$

    and boundary condition loss

    $$MSE_{b}=\frac{1}{N\times N_{b}}\sum_{j=1}^{N}\sum_{i=1}^{N_{b}}\left|v^{j}(x_{b}^{i};\delta)-u^{j}(x_{b}^{i})\right|^{2}. \quad (3.21)$$

    Here, $\{x^{i}\}_{i=1}^{N_{x}}$ corresponds to the collocation points in the space domain $\Omega$ and $\{x_{b}^{i}\}_{i=1}^{N_{b}}$ denotes the boundary training data. The schematic diagram of using the multi-output neural network scheme to solve nonlinear time distributed-order models is shown in Figure 1.

    Figure 1.  A multi-output neural network framework to solve nonlinear time distributed-order models, where MSE represents the loss function.
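    A schematic PyTorch implementation of the total loss (3.19)–(3.21) is sketched below. For brevity it treats only a pointwise nonlinearity $G(v)=v^2$ as a stand-in for $\mathcal{N}(\cdot)$; the spatial-derivative terms appearing in the concrete models of Section 4 would additionally require automatic differentiation. All names and shapes are assumptions, not the exact implementation of this article:

```python
import torch

def total_loss(model, M, rho0, x_col, f_half, v0_col, x_bdr, u_bdr, G=lambda v: v ** 2):
    """MSE_total = MSE + MSE_b for the matrix form (3.16)-(3.21), a sketch.

    model  : multi-output network mapping x_col of shape (N_x, d) to v of shape (N_x, N)
    M      : (N, N) tensor from (3.16);  rho0 : (N,) tensor
    f_half : f^{n+1/2}(x) at the collocation points, shape (N_x, N)
    v0_col : initial values u_0(x) at the collocation points, shape (N_x, 1)
    G      : pointwise nonlinearity used here as a stand-in for N(.)
    """
    v = model(x_col)                                            # (N_x, N)
    v_prev = torch.cat([v0_col, v[:, :-1]], dim=1)              # [v^0, v^1, ..., v^{N-1}]
    v_half = 0.5 * (v + v_prev)                                 # v^{n+1/2}
    residual = v @ M + G(v_half) + v0_col * rho0 - f_half       # residual of (3.17)
    mse_r = torch.mean(residual ** 2)                           # (3.20)
    mse_b = torch.mean((model(x_bdr) - u_bdr) ** 2)             # (3.21)
    return mse_r + mse_b
```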

    In this section, we consider two nonlinear time distributed-order equations to verify the feasibility and effectiveness of our proposed method. The performance is evaluated by calculating the relative L2 error between the predicted and exact solutions. The definition of relative L2 error is given by

    $$\|u-v\|_{L_{2}}=\frac{\sqrt{\sum_{j=1}^{N}\sum_{i=1}^{N_{x}}\left|u^{j}(x^{i})-v^{j}(x^{i})\right|^{2}}}{\sqrt{\sum_{j=1}^{N}\sum_{i=1}^{N_{x}}\left|u^{j}(x^{i})\right|^{2}}}. \quad (4.1)$$
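    In code, the relative L2 error (4.1) is simply (a sketch assuming NumPy arrays of shape (N, N_x)):

```python
import numpy as np

def relative_l2_error(u, v):
    """Relative L2 error (4.1); u holds exact values, v holds predictions."""
    return np.sqrt(np.sum(np.abs(u - v) ** 2) / np.sum(np.abs(u) ** 2))
```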

    Table 1 indicates which optimizer is selected for each example to minimize the loss function.

    Table 1.  The hyperparameters configured in numerical examples.
    Example Optimizer Learning rate Iterations
    1 Adam + L-BFGS 0.001 20000
    2 Adam + L-BFGS 0.001 20000
    3 Adam + L-BFGS 0.001 20000
    4 L-BFGS - -


    We use Python to implement our algorithms, and all code runs on a Lenovo laptop with an AMD R7-6800H CPU @ 3.20 GHz and 16.0 GB RAM.

    Here, we solve the following distributed-order sub-diffusion model:

    $$u_{t}+D_{t}^{\omega}u-\Delta u-\Delta u_{t}+G(u)=f(x,t),\quad (x,t)\in\Omega\times J, \quad (4.2)$$

    with boundary condition

    $$u(x,t)=0,\quad x\in\partial\Omega,\ t\in\bar{J}, \quad (4.3)$$

    and initial condition

    $$u(x,0)=u_{0}(x),\quad x\in\bar{\Omega}, \quad (4.4)$$

    where the nonlinear term $G(u)=u^{2}$. The symbol $\Delta$ is the Laplace operator. Based on Eqs (3.14)–(3.21), the loss function $MSE_{total}$ can be obtained by

    $$MSE_{total}=MSE+MSE_{b},$$

    where

    $$MSE=\frac{1}{N\times N_{x}}\sum_{j=1}^{N}\sum_{i=1}^{N_{x}}\left|\mathcal{R}^{j}(x^{i})\right|^{2},\qquad MSE_{b}=\frac{1}{N\times N_{b}}\sum_{j=1}^{N}\sum_{i=1}^{N_{b}}\left|v^{j}(x_{b}^{i};\delta)-0\right|^{2}.$$

    Example 1.

    For this example, we set the space domain $\Omega=(0,1)$ and the time interval $J=(0,\frac{1}{2}]$. The training set consists of the boundary points and $N_{x}=200$ collocation points randomly selected in the space domain $\Omega$. Choosing $\omega(\alpha)=\Gamma(3-\alpha)$ and the source term

    $$f(x,t)=2t\sin(2\pi x)+\frac{\Gamma(3)t(t-1)}{\ln t}\sin(2\pi x)+4t^{2}\pi^{2}\sin(2\pi x)+8t\pi^{2}\sin(2\pi x)+\left(t^{2}\sin(2\pi x)\right)^{2},$$

    then we can obtain the exact solution $u(x,t)=t^{2}\sin(2\pi x)$.
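    For reference, the exact solution and source term of Example 1 can be transcribed directly (a sketch assuming NumPy/SciPy; the removable singularity of $(t-1)/\ln t$ at $t=1$ is not an issue on $(0,\frac{1}{2}]$):

```python
import numpy as np
from scipy.special import gamma

def u_exact(x, t):
    """Exact solution of Example 1: u(x, t) = t^2 sin(2*pi*x)."""
    return t ** 2 * np.sin(2 * np.pi * x)

def f_source(x, t):
    """Source term of Example 1 for model (4.2) with G(u) = u^2 and omega(alpha) = Gamma(3 - alpha)."""
    s = np.sin(2 * np.pi * x)
    return (2 * t * s
            + gamma(3) * t * (t - 1) / np.log(t) * s
            + 4 * t ** 2 * np.pi ** 2 * s
            + 8 * t * np.pi ** 2 * s
            + (t ** 2 * s) ** 2)
```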

    To evaluate the performance of our proposed method, the exact solution and the predicted solution obtained by a multi-output neural network that consists of 6 hidden layers with 40 neurons in each hidden layer are shown in Figure 2.

    Figure 2.  Example 1: the exact solution and predicted solution with $\Delta\alpha=\frac{1}{500}$, $\theta=1$ and $N=20$ at $t=0.5$.

    The influence of different network structures on our proposed method for solving Example 1 is presented in Table 2. The accuracy of the predicted solution fluctuates across network architectures and, on the whole, improves as the number of hidden layers increases. Based on the three network architectures, Figure 3 shows the performance of the proposed method as the time step size is gradually decreased. As the number of grid points in the time interval increases, we observe that the relative L2 error generally presents an upward trend for a fixed network architecture, and that expanding the depth of the network can effectively improve the accuracy of the predicted solution.

    Table 2.  The relative L2 error between the predicted solution with parameters $N=20$, $\Delta\alpha=\frac{1}{500}$, $\theta=1$ and the exact solution for different numbers of hidden layers (rows) and different numbers of neurons per layer (columns).
    Hidden layers \ Neurons 20 30 40 50 60
    2 3.583482e-02 7.850782e-02 5.238403e-02 1.143600e-01 3.761048e-02
    4 3.815002e-02 5.144618e-02 3.622479e-02 2.317038e-02 3.721057e-02
    6 2.433509e-02 1.216386e-01 2.638224e-02 2.837553e-02 2.682958e-02

    Figure 3.  Example 1: the variation trend of the relative L2 error between the predicted solution with $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution.

    Numerical results calculated by the single output and multi-output neural network schemes are presented in Table 3, where we select 200 collocation points in the given spatial domain by the random sampling method and set $N=10$, $\theta=1$ and $\Delta\alpha=\frac{1}{500}$. It is easy to see that the accuracy of the predicted solution calculated by the multi-output neural network scheme is higher than that of the single output neural network scheme, and that replacing the single output neural network with the multi-output neural network saves a lot of computing time.

    Table 3.  The relative L2 error and computing time given by the single output and multi-output neural network schemes.
    Neural network Layers Neurons Relative L2 error CPU time (s)
    multi-output 4 20 1.948281e-02 32.80
    40 1.858120e-02 44.45
    6 20 9.724722e-03 51.40
    40 1.024869e-02 65.88
    single output 4 20 8.741336e-02 661.99
    40 2.702814e-02 742.17
    6 20 2.937448e-02 748.31
    40 2.278774e-02 840.09


    Example 2.

    In this numerical example, considering the space domain $\Omega=(0,1)\times(0,1)$, the time interval $J=(0,\frac{1}{2}]$, $\omega(\alpha)=\Gamma(3-\alpha)$ and the exact solution $u(x,y,t)=t^{2}\sin(2\pi x)\sin(2\pi y)$, the source term can be given by

    $$f(x,y,t)=2t\sin(2\pi x)\sin(2\pi y)+\frac{\Gamma(3)t(t-1)}{\ln t}\sin(2\pi x)\sin(2\pi y)+8t^{2}\pi^{2}\sin(2\pi x)\sin(2\pi y)+16t\pi^{2}\sin(2\pi x)\sin(2\pi y)+\left(t^{2}\sin(2\pi x)\sin(2\pi y)\right)^{2}.$$

    The training dataset is shown in Figure 5 and the collocation and boundary points are selected by random sampling method.

    To better illustrate the behavior of the predicted solution, Figure 4 portrays the contour plot of the exact and predicted solutions, where the training set consists of 961 collocation points and 124 boundary points selected by the equidistant uniform sampling method and the network architecture consists of 12 hidden layers with 60 neurons per layer.

    Figure 4.  Example 2: the contour plot of the exact (a) and predicted (b) solutions with $N=10$, $\theta=1$ and $\Delta\alpha=\frac{1}{500}$ at $t=0.5$.
    Figure 5.  Example 2: distribution of the collocation points (a) sampled in the domain $\Omega$ and the boundary training dataset (b).

    Table 4 shows the impact of the depth and width of the network on the accuracy of the predicted solution. In Figure 6, we present how the relative L2 error changes with respect to the number of grid points $N$. Combining Table 4 and Figure 6, we observe that increasing the number of hidden layers or neurons has, in general, a positive effect on reducing the relative L2 error.

    Table 4.  The relative L2 error between the predicted solution with parameters $N=20$, $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution for different numbers of hidden layers (rows) and neurons per layer (columns).
    Hidden layers \ Neurons 20 30 40 50 60
    4 1.155984e-01 1.101123e-01 7.228454e-02 7.787707e-02 4.492646e-02
    6 6.273340e-02 6.088659e-02 9.802474e-02 6.536637e-02 6.245142e-02
    8 6.163051e-02 7.424834e-02 6.844676e-02 5.500003e-02 5.169303e-02

    Figure 6.  Example 2: the variation trend of the relative L2 error between the predicted solution with $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution.

    The results shown in Table 5 reveal the performance of the single output and multi-output neural network schemes, where we select 40 boundary points and 200 collocation points in the given spatial domain by the random sampling method and set $N=10$, $\theta=1$ and $\Delta\alpha=\frac{1}{500}$. One can see that using the multi-output neural network effectively improves the precision and reduces the computing time.

    Table 5.  The relative L2 error and computing time given by the single output and multi-output neural network schemes.
    Neural network Layers Neurons Relative L2 error CPU time (s)
    multi-output 4 20 1.080369e-01 52.84
    40 7.472573e-02 66.05
    6 20 6.654115e-02 72.71
    40 7.937601e-02 93.77
    single output 4 20 3.443264e-01 731.22
    40 7.054495e-01 728.11
    6 20 2.636401e-01 772.08
    40 1.410780e-01 892.94


    Further, we consider the following distributed-order fourth-order sub-diffusion model:

    $$u_{t}+D_{t}^{\omega}u-\Delta u+\Delta^{2}u+G(u)=f(x,t),\quad (x,t)\in\Omega\times J, \quad (4.5)$$

    with boundary condition

    $$u(x,t)=\Delta u(x,t)=0,\quad x\in\partial\Omega,\ t\in\bar{J},$$

    and initial condition

    $$u(x,0)=u_{0}(x),\quad x\in\bar{\Omega},$$

    where the nonlinear term $G(u)=u^{2}$.

    Similarly, the corresponding loss function MSEtotal can be calculated by

    $$MSE_{total}=MSE+MSE_{b},$$

    where

    $$MSE=\frac{1}{N\times N_{x}}\sum_{j=1}^{N}\sum_{i=1}^{N_{x}}\left|\mathcal{R}^{j}(x^{i})\right|^{2},\qquad MSE_{b}=\frac{1}{N\times N_{b}}\sum_{j=1}^{N}\sum_{i=1}^{N_{b}}\left|v^{j}(x_{b}^{i};\delta)-0\right|^{2}+\frac{1}{N\times N_{b}}\sum_{j=1}^{N}\sum_{i=1}^{N_{b}}\left|\Delta v^{j}(x_{b}^{i};\delta)-0\right|^{2}.$$
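    Enforcing the extra condition $\Delta v=0$ on the boundary requires second derivatives of the network output; a common way to obtain them is automatic differentiation, sketched below under the assumption that PyTorch is used (the helper name is ours):

```python
import torch

def laplacian(v, x):
    """Laplacian of a scalar network output v(x) w.r.t. x (call x.requires_grad_(True) beforehand)."""
    grad = torch.autograd.grad(v, x, grad_outputs=torch.ones_like(v), create_graph=True)[0]
    lap = torch.zeros(x.shape[0], device=x.device)
    for d in range(x.shape[1]):
        lap = lap + torch.autograd.grad(
            grad[:, d], x, grad_outputs=torch.ones_like(grad[:, d]), create_graph=True
        )[0][:, d]
    return lap
```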

    Example 3.

    Here, we define the space-time domain $\Omega\times J=(0,1)\times(0,\frac{1}{2}]$. Considering $\omega(\alpha)=\Gamma(3-\alpha)$ and the source term

    $$f(x,t)=2t\sin(\pi x)+\frac{\Gamma(3)t(t-1)}{\ln t}\sin(\pi x)+t^{2}\pi^{2}\sin(\pi x)+t^{2}\pi^{4}\sin(\pi x)+\left(t^{2}\sin(\pi x)\right)^{2}, \quad (4.6)$$

    the exact solution can be given by $u(x,t)=t^{2}\sin(\pi x)$. Similar to Example 1, we also randomly sample 200 collocation points in the space domain $\Omega$.

    In order to conveniently observe the capability of our proposed method, Figure 7 shows the trajectories of the predicted and exact solutions with respect to the space point $x$, where the parameters are set as $N=20$, $\Delta\alpha=\frac{1}{500}$, $\theta=1$ and the network consists of 6 hidden layers with 50 neurons per layer. Figure 8 portrays the trajectory of the relative L2 error with three different network architectures. Table 6 shows the impact of expanding the depth or width of the network on the accuracy of the predicted solutions. Based on Figure 8 and Table 6, it is easy to observe that increasing the number of hidden layers plays a positive role in improving the accuracy of the predicted solutions.

    Figure 7.  Example 3: the predicted and exact solutions at $t=0.5$.
    Figure 8.  Example 3: the relative L2 error between the predicted solution with $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution for different numbers of grid points.
    Table 6.  The relative L2 error between the predicted solution with parameters $N=20$, $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution for different numbers of hidden layers (rows) and neurons per layer (columns).
    Hidden layers \ Neurons 20 30 40 50 60
    2 8.994465e-03 8.835675e-03 7.976196e-03 1.121301e-02 1.284193e-02
    4 6.698531e-03 6.623791e-03 1.012124e-02 7.129064e-03 6.760145e-03
    6 5.238653e-03 3.178300e-03 4.705057e-03 7.221552e-03 5.002096e-03


    The relative L2 error and CPU time obtained by the multi-output and single output neural network schemes are presented in Table 7, where we select 200 collocation points in the given spatial domain by the random sampling method and set $N=10$, $\theta=1$ and $\Delta\alpha=\frac{1}{500}$. The error of the proposed multi-output neural network scheme is smaller than that of the single output neural network scheme. For this 1D system, the multi-output neural network scheme is more efficient than the single output neural network scheme.

    Table 7.  The relative L2 error and computing time given by the single output and multi-output neural network schemes.
    Neural network Layers Neurons Relative L2 error CPU time (s)
    multi-output 4 20 8.027377e-03 334.07
    40 8.656712e-03 366.52
    6 20 8.315435e-03 557.78
    40 6.788274e-03 601.19
    single output 4 20 2.412753e-02 994.62
    40 2.365781e-02 1317.16
    6 20 2.428662e-02 1344.76
    40 2.139562e-02 1795.75


    Example 4.

    Now we take the space domain $\Omega=(0,1)\times(0,1)$ and the time interval $J=(0,\frac{1}{2}]$. Let $\omega(\alpha)=\Gamma(3-\alpha)$ and the exact solution $u(x,y,t)=t^{2}\sin(\pi x)\sin(\pi y)$. Then we arrive at the source term

    $$f(x,y,t)=2t\sin(\pi x)\sin(\pi y)+\frac{\Gamma(3)t(t-1)}{\ln t}\sin(\pi x)\sin(\pi y)+2t^{2}\pi^{2}\sin(\pi x)\sin(\pi y)+4t^{2}\pi^{4}\sin(\pi x)\sin(\pi y)+\left(t^{2}\sin(\pi x)\sin(\pi y)\right)^{2}. \quad (4.7)$$

    Here, we apply the training data set shown in Figure 5.

    In order to more intuitively demonstrate the feasibility of our proposed method for solving this 2D system, Figure 9 shows the contour plots of $u$ and $|u-v|$, where the training set consists of 900 collocation points and 120 boundary points selected by the equidistant uniform sampling method and the network is composed of 6 hidden layers with 20 neurons per layer.

    Figure 9.  Example 4: the contour plot of $u$ and $|u-v|$ with $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and $N=40$ at $t=0.5$.

    From the behavior of the relative L2 error in Figure 10, one can see that the accuracy of the predicted solutions with a fixed network first shows an increasing trend and then gradually a downward trend. This is because the approximation ability of the neural network reaches saturation as the number of grid points $N$ increases. Table 8 shows the relative L2 error calculated with different network architectures. On the whole, the relative L2 error slightly decreases with expanding depth of the network, while it first slightly decreases and then increases with expanding width of the network. To show the precision and efficiency of the multi-output neural network scheme for this 2D system, the relative L2 error and CPU time obtained by the multi-output and single output neural network schemes are shown in Table 9, where we select 40 boundary points and 200 collocation points in the given spatial domain by the random sampling method and set $N=10$, $\theta=1$ and $\Delta\alpha=\frac{1}{500}$. This illustrates that the multi-output neural network scheme is more accurate and efficient than the single output neural network scheme.

    Figure 10.  Example 4: the relative L2 error between the predicted solution with $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution for different numbers of grid points.
    Table 8.  The relative L2 error between the predicted solution with parameters $N=40$, $\theta=1$, $\Delta\alpha=\frac{1}{500}$ and the exact solution for different numbers of hidden layers (rows) and neurons per layer (columns).
    Hidden layers \ Neurons 20 30 40 50 60
    2 7.593424e-02 4.556132e-02 2.926755e-02 5.850593e-02 4.956921e-02
    4 3.716823e-02 2.498269e-02 3.388639e-02 4.073820e-02 5.282780e-02
    6 2.690082e-02 3.039336e-02 2.769315e-02 4.331023e-02 6.456580e-02

    Table 9.  The relative L2 error and computing time given by the single output and multi-output neural network schemes.
    Neural network Layers Neurons Relative L2 error CPU time (s)
    multi-output 4 20 4.549939e-02 345.03
    40 5.847813e-02 339.46
    6 20 5.680785e-02 466.02
    40 6.110534e-02 528.10
    single output 4 20 1.455785e-01 902.99
    40 1.330922e-01 1309.05
    6 20 1.496581e-01 1469.85
    40 1.322566e-01 1990.96


    In this article, a multi-output physics informed neural network combined with the Crank-Nicolson scheme, including the FBN-$\theta$ method and the composite numerical integral formula, was constructed to solve 1D and 2D nonlinear time distributed-order models. The calculation process is described in detail, and numerical experiments are provided to demonstrate the effectiveness and feasibility of our algorithm. Compared with the results calculated by a single output neural network combined with the FBN-$\theta$ method and the Crank-Nicolson scheme, one can clearly see that the proposed multi-output neural network scheme is more efficient and accurate. Moreover, some numerical methods, such as the finite difference or finite element method, need to linearize the nonlinear term, which gives rise to extra costs; this linearization step can be omitted entirely by the PINN. Further work will investigate the application of the proposed methodology to high-dimensional problems and practical problems [36,37,38,39,40].

    The authors declare that they have not used Artificial Intelligence (AI) tools in the creation of this article.

    The authors would like to thank the editor and all the anonymous referees for their valuable comments, which greatly improved the presentation of the article. This work is supported by the National Natural Science Foundation of China (12061053, 12161063), Natural Science Foundation of Inner Mongolia (2021MS01018), Young innovative talents project of Grassland Talents Project, Program for Innovative Research Team in Universities of Inner Mongolia Autonomous Region (NMGIRT2413, NMGIRT2207), and 2023 Postgraduate Research Innovation Project of Inner Mongolia (S20231026Z).

    The authors declare that they have no conflict of interest.



    [1] Faver TE, Wright JD (2018) Exact diatomic Fermi-Pasta-Ulam-Tsingou solitary waves with optical band ripples at infinity. SIAM J Math Anal 50: 182–250. doi: 10.1137/15M1046836
    [2] Lustri CJ, Porter MA (2018) Nanoptera in a period-2 Toda chain. SIAM J Appl Dyn Syst 17: 1182–1212. doi: 10.1137/16M108639X
    [3] Porter M, Daraio C, Szelengowicz I, et al. (2009) Highly nonlinear solitary waves in heterogeneous periodic granular media. Phys D 238: 666–676. doi: 10.1016/j.physd.2008.12.010
    [4] Gaison J, Moskow S, Wright JD, et al. (2014) Approximation of polyatomic FPU lattices by KdV equations. Multiscale Model Simul 12: 953–995. doi: 10.1137/130941638
    [5] Qin WX (2015) Wave propagation in diatomic lattices. SIAM J Math Anal 47: 477–497. doi: 10.1137/130949609
    [6] Betti M, Pelinovsky DE (2013) Periodic traveling waves in diatomic granular chains. J Nonlinear Sci 23: 689–730. doi: 10.1007/s00332-013-9165-6
    [7] Chirilus-Bruckner M, Chong C, Prill O, et al. (2012) Rigorous description of macroscopic wave packets in infinite periodic chains of coupled oscillators by modulation equations. Discrete Contin Dyn Syst Ser S 5: 879–901. Available from: https://doi.org/10.3934/dcdss.2012.5.879. doi: 10.3934/dcdss.2012.5.879
    [8] Brillouin L (1953) Wave Propagation in Periodic Structures: Electric Filters and Crystal Lattices, 2 Eds., New York: Dover Publications, Inc.
    [9] Tabata Y (1996) Stable solitary wave in diatomic Toda lattice. J Phys Soc Jpn 65: 3689–3691. doi: 10.1143/JPSJ.65.3689
    [10] Okada Y, Watanabe S, Tanaca H (1990) Solitary wave in periodic nonlinear lattice. J Phys Soc Jpn 59: 2647–2658. Available from: https://doi.org/10.1143/JPSJ.59.2647. doi: 10.1143/JPSJ.59.2647
    [11] Hoffman A, Wright JD (2017) Nanopteron solutions of diatomic Fermi-Pasta-Ulam-Tsingou lattices with small mass-ratio. Phys D 358: 33–59. doi: 10.1016/j.physd.2017.07.004
    [12] Vainchtein A, Starosvetsky Y, Wright JD, et al. (2016) Solitary waves in diatomic chains. Phys Rev E 93: 042210. doi: 10.1103/PhysRevE.93.042210
    [13] Schneider G, Wayne CE (1999) Counter-propagating waves on fluid surfaces and the continuum limit of the Fermi-Pasta-Ulam model, In: International Conference on Differential Equations,Vol. 1, 2 (Berlin, 1999), 390–404, World Sci. Publ., River Edge, NJ, 2000.
    [14] Friesecke G, Pego RL (1999) Solitary waves on FPU lattices: I. Qualitative properties, renormalization and continuum limit. Nonlinearity 12: 1601–1627.
    [15] Friesecke G, Pego RL (2002) Solitary waves on FPU lattices: II. Linear implies nonlinear stability. Nonlinearity 15: 1343–1359.
    [16] Friesecke G, Pego RL (2004) Solitary waves on Fermi-Pasta-Ulam lattices: III. Howland-type Floquet theory. Nonlinearity 17: 207–227.
    [17] Friesecke G, Pego RL (2004) Solitary waves on Fermi-Pasta-Ulam lattices: IV. Proof of stability at low energy. Nonlinearity 17: 229–251.
    [18] Mizumachi T (2009) Asymptotic stability of lattice solitons in the energy space. Commun Math Phys 288: 125–144. doi: 10.1007/s00220-009-0768-6
    [19] Boyd JP (1998) Weakly Nonlocal Solitary Waves and Beyond-All-Orders Asymptotics: Generalized Solitons and Hyperasymptotic Perturbation Theory, In series: Mathematics and its Applications. Dordrecht: Kluwer Academic Publishers, vol. 442. Available from: https://doi.org/10.1007/978-1-4615-5825-5.
    [20] Faver T (2017) Nanopteron-stegoton traveling waves in spring dimer Fermi-Pasta-Ulam-Tsingou lattices, in press. Available from: https://arxiv.org/abs/1710.07376.
    [21] Lombardi E (2000) Oscillatory Integrals and Phenomena Beyond All Algebraic Orders with Applications to Homoclinic Orbits in Reversible Systems, In series: Lecture Notes in Mathematics. Berlin: Springer-Verlag, vol. 1741. Available from: https://doi.org/10.1007/BFb0104102.
    [22] Sun SM (1999) Non-existence of truly solitary waves in water with small surface tension. Proc Math Phys Eng Sci 455: 2191–2228. doi: 10.1098/rspa.1999.0399
    [23] Martínez AJ, Kevrekidis PG, Porter MA (2016) Superdiffusive transport and energy localization in disordered granular crystals. Phys Rev E 93: 022902. Available from: https://doi.org/10.1103/physreve.93.022902. doi: 10.1103/PhysRevE.93.022902
    [24] Hairer E, Lubich C, Wanner G (2006) Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations, 2 Eds., In series: Springer Series in Computational Mathematics. Springer, Heidelberg, 2010, vol. 31.
    [25] Beyn WJ, Thümmler V (2004) Freezing solutions of equivariant evolution equations. SIAM J Appl Dyn Syst 3: 85–116. doi: 10.1137/030600515
    [26] Beyn WJ, Otten D, Rottmann-Matthes J (2018) Computation and stability of traveling waves in second order evolution equations. SIAM J Numer Anal 56: 1786–1817. doi: 10.1137/16M108286X
    [27] Beale JT (1991) Exact solitary water waves with capillary ripples at infinity. Commun Pure Appl Math 44: 211–257. doi: 10.1002/cpa.3160440204
    [28] Sun SM (1991) Existence of a generalized solitary wave solution for water with positive Bond number less than 1/3. J Math Anal Appl 156: 471–504. doi: 10.1016/0022-247X(91)90410-2
    [29] LeVeque RJ, Yong DH (2003) Solitary waves in layered nonlinear media. SIAM J Appl Math 63: 1539–1560. doi: 10.1137/S0036139902408151
    [30] LeVeque RJ, Yong DH (2003) Phase plane behavior of solitary waves in nonlinear layered media, In: Hyperbolic Problems: Theory, Numerics, Applications. Berlin: Springer, 43–51.
    [31] Kevrekidis PG, Stefanov AG, Xu H (2016) Traveling waves for the mass in mass model of granular chains. Lett Math Phys 106: 1067–1088. doi: 10.1007/s11005-016-0854-6
    [32] Pnevmatikos S, Flytzanis N, Remoissenet M (1986) Soliton dynamics of nonlinear diatomic lattices. Phys Rev B 33: 2308–2321. doi: 10.1103/PhysRevB.33.2308
  • © 2019 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)