
Hardware-friendly compression and hardware acceleration for transformer: A survey


• The transformer model has recently become a milestone in artificial intelligence. The algorithm has raised the performance of tasks such as Machine Translation and Computer Vision to a previously unattainable level. However, this strong performance comes with high memory overhead and enormous computing requirements, which significantly hinders the deployment of energy-efficient transformer systems. Owing to their high parallelism, low latency, and low power consumption, field-programmable gate arrays (FPGAs) and application-specific integrated circuits (ASICs) achieve higher energy efficiency than Graphics Processing Units (GPUs) and Central Processing Units (CPUs), and they are therefore widely used to accelerate deep learning algorithms. Several papers have addressed deploying the Transformer on dedicated hardware for acceleration, but comprehensive studies of this area are lacking. We therefore summarize hardware-friendly transformer compression algorithms and their hardware accelerator implementations to provide a comprehensive overview of this research domain. This paper first introduces the transformer model framework and computation process. Second, it discusses hardware-friendly compression algorithms for self-attention and the Transformer, along with a review of state-of-the-art hardware accelerator frameworks. Finally, we consider some promising topics in transformer hardware acceleration, such as high-level design frameworks and selecting the optimal device using reinforcement learning.

    Citation: Shizhen Huang, Enhao Tang, Shun Li, Xiangzhan Ping, Ruiqi Chen. Hardware-friendly compression and hardware acceleration for transformer: A survey[J]. Electronic Research Archive, 2022, 30(10): 3755-3785. doi: 10.3934/era.2022192




The Transformer [1] has demonstrated impressive performance gains in Natural Language Processing (NLP) tasks, including Machine Translation, Text Categorization, and Language Modeling [2,3,4]. Because the Transformer does not process its input strictly in sequence order, it can be trained on volumes of data that were previously out of reach. This has in turn led to pretrained models such as BERT [5] and RoBERTa [6], which have achieved breakthroughs in several natural language understanding tasks, including Sentiment Analysis [7] and Semantic Role Labeling [8]. The Transformer and its variant models have become the backbone of many modern NLP tasks.

However, the impressive performance of the Transformer is due not only to innovations in the model but also to improvements in computing hardware, that is, the advent of Graphics Processing Units (GPUs). As the size of transformer models continues to increase, the traditional Central Processing Unit (CPU) can no longer cope with the high latency of deploying the Transformer, a latency that stems from the dramatic increase in memory bandwidth demand and computational complexity. The Transformer usually has many millions of parameters; for example, the BERT [5] model has 340M parameters, and even the distilled BERT model has 67M parameters [9]. Therefore, GPUs, with their high parallelism and memory bandwidth, have become the main platform for transformer training and inference in cloud computing. On the one hand, the growth of high-performance computing places corresponding demands on system power consumption [9]. On the other hand, because of the excellent performance of transformers in NLP, there is a growing trend of deploying the model on mobile terminals such as phones and tablets, platforms with strict power budgets. The FPGA combines low power consumption with high performance, making it well suited as an acceleration platform for the Transformer; FPGAs are also widely used to accelerate deep learning algorithms because of their high parallelism, low latency, and low power consumption. For workloads such as Transformers and Deep Neural Networks (DNNs), FPGAs exhibit higher energy efficiency than GPUs and CPUs. The need for domain-specific hardware accelerators with specialized, customized operations and memory hierarchies is therefore becoming increasingly clear. In addition, although GPU development tools are now very mature, completing the entire function in a high-level language such as C leaves little room to optimize the hardware architecture during development. Therefore, the hardware accelerators discussed in this article refer to ASICs and FPGAs. However, deploying huge models like transformers on these acceleration platforms is challenging because they often have limited on-chip memory, off-chip bandwidth, and resources. Compressing transformer model parameters and reducing computational cost has therefore become an urgent topic. Model compression effectively reduces memory and computational cost [10,11]. Compression techniques are driven by usage [12,13], data [14,15,16], transmission [17], and the model itself [18,19]; however, it is important to consider system performance (e.g., latency, energy cost, storage, and processing capability). On the algorithm side, various algorithms for compressing transformers have been proposed in recent years, such as Knowledge Distillation [9,20,21,22,23], Network Sparsification [24,25,26,27], Data Quantization [28,29,30,31], and Neural Architecture Search [32,33,34]. However, the model parameters produced by many compression algorithms are laid out irregularly in memory, leading to irregular memory accesses that consume considerable hardware resources and reduce operating speed, which is extremely unfriendly to hardware. A survey of transformer model compression algorithms has also been published [35].

However, a comprehensive survey of hardware-friendly transformer compression algorithms and the corresponding hardware-deployed accelerators has not yet appeared. Therefore, the purpose of this paper is to review past hardware-friendly transformer compression algorithms and to present the hardware architectures that apply them. Because self-attention is a central part of the Transformer and most of the Transformer's running time is spent on the self-attention mechanism, we also review hardware-friendly compression algorithms for self-attention and the hardware architectures that deploy the mechanism together with the corresponding compression algorithms. Figure 1 shows our classification of the hardware-friendly compression algorithms for self-attention and introduces the algorithm-hardware co-design of these algorithms, that is, the hardware architectures that combine self-attention and the Transformer with the above algorithms for acceleration. Finally, we conclude and discuss several interesting and promising topics in this field.

    Figure 1.  Overview of transformer compression and acceleration.

The overall organization of this paper is shown in Table 1. Section 2 briefly introduces the basics of the transformer model. Section 3 details hardware-friendly compression algorithms for the self-attention mechanism and the hardware architectures that combine them. Section 4 introduces hardware-friendly compression algorithms for the Transformer and the hardware architectures that combine them. Section 5 presents the discussion of future trends and the summary.

    Table 1.  Content guidance of this article.
    Preliminaries: Brief Preliminaries of Transformer (Section 2)
    Self-attention
        Algorithm: Approximation (Section 3.1.1); Network Sparsification (Section 3.1.2); Tensor Decomposition (Section 3.1.3)
        Hardware: Accelerators with Approximation (Section 3.2.1); Accelerators with Network Sparsification (Section 3.2.2); Accelerators with Tensor Decomposition (Section 3.2.3)
    Transformers
        Algorithm: Tensor Decomposition (Section 4.1.1); Data Quantization (Section 4.1.2); Network Sparsification (Section 4.1.3); Neural Architecture Search (Section 4.1.4)
        Hardware: Accelerator with Tensor Decomposition (Section 4.2.1); Accelerator with Data Quantization (Section 4.2.2); Accelerator with Network Sparsification (Section 4.2.3); Accelerator with Neural Architecture Search (Section 4.2.4)
    Conclusions and Discussion: Conclusions (Section 5.1); Discussion (Section 5.2)


    In this section, we briefly introduce the architecture and background of the Transformer and its variant BERT. In the past, the state-of-the-art methods for Language Modeling and Machine Translation were long short-term memory (LSTM) [36], the Gated Recurrent Unit (GRU) [37], and the Self-Attentional mechanism [1,38]. However, they operate sequentially, with running time linear in the sequence length, making them difficult to parallelize. The Transformer completely abandons this recurrence and uses a Self-Attentional mechanism to describe the dependency between input and output.

    Figure 2 shows the overall architecture of the Transformer. Generally speaking, the Transformer is composed of stacked Encoders and Decoders, and each Encoder and Decoder is in turn built from Self-attention and fully connected (FC) layers. Figure 3(b) shows the calculation process of Self-attention. The role of the Encoder is to use the Self-Attentional mechanism to convert the input sequence into an encoded representation, and the role of the Decoder is to convert the Encoder's output into the corresponding output format. First, the input sequence passes through the Input Embedding layer, which maps each word to a vector; that is, the input embedding layer expresses the input sequence in matrix form, while the Positional Encoding layer indicates each word's position. Because the Transformer completely abandons RNNs and CNNs, it has no inherent word-order information; to introduce it, the Transformer adds a Positional Encoding carrying the relative or absolute position information of the input sequence to the Input Embedding, so that the complete information of the input sequence can be expressed. On the output side, the Output Embedding and Positional Encoding are likewise added and then fed into the Decoder.

    Figure 2.  Model structure of transformer [39].
    Figure 3.  (a) Multi-head attention (b) Self-attention.

    Encoder: The Encoder consists of N identical layers. Each layer is composed of Multi-head Attention, Add & Norm, and FC layers; the Multi-head Attention and FC layers are each followed by Add & Norm, where Add & Norm means the Norm operation after the Add operation. The Add (residual) connection helps the model go deeper because it mitigates the risk of vanishing gradients.

    Decoder: The Decoder is also composed of N identical layers. Each layer consists of three sublayers: a Masked Multi-head Attention layer, a Multi-head Attention layer, and an FC layer, and each sublayer is followed by an Add & Norm block, similar to the Encoder. The Masked Multi-head Attention layer masks the incoming output sequence, hiding the words after the currently processed word so that the prediction at the current position depends only on the words at previous positions. The Multi-head Attention layer differs from its Encoder counterpart: in the Decoder it interacts with the Encoder's output, taking the key and value matrices output by the Encoder and combining them with the Decoder's query matrix.

    Multi-head Attention: As shown in Figure 3, the Multi-head Attention mechanism is a combination of single Self-Attention mechanisms; its purpose is to let the model gather information from multiple subspaces and thereby capture richer feature information. The expressions are given in Eqs (2) and (3), so we mainly introduce the Self-Attention mechanism itself. From Eq (1), the Self-Attention function is not complicated: the input is only the three matrices Q, K, and V, and both the Self-Attention mechanism and the entire Transformer reduce to matrix operations. $QK^T$ is the inner product that produces the similarity scores, the softmax function then yields the corresponding weights, and $\sqrt{d_k}$ is a scaling factor that prevents the gradients from becoming too small and vanishing during backpropagation in training.

    $O_{attention} = \mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{QK^T}{\sqrt{d_k}}\right)V$  (1) [39]

    $O_{multihead} = \mathrm{concat}(O^i_{attention})$  (2)
    $O^i_{attention} = \mathrm{Attention}(Q_i, K_i, V_i) = \mathrm{softmax}\left(\frac{Q_i K_i^T}{\sqrt{d_k}}\right)V_i$  (3)
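
    To make the matrix view concrete, the following NumPy sketch implements Eqs (1)-(3) directly. It is an illustration only: it omits the learned projection matrices that produce Q, K, and V in the full model, and all names are ours.

```python
# Minimal sketch of Eqs (1)-(3); learned projections are omitted.
import numpy as np

def self_attention(Q, K, V):
    """Eq (1): softmax(QK^T / sqrt(d_k)) V for one head."""
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                    # similarity scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)     # row-wise softmax
    return weights @ V

def multi_head_attention(Qs, Ks, Vs):
    """Eqs (2)-(3): run each head independently and concatenate the outputs."""
    return np.concatenate(
        [self_attention(Q, K, V) for Q, K, V in zip(Qs, Ks, Vs)], axis=-1)

# Example: 4 tokens, 2 heads of width d_k = 8 (here Q = K = V per head)
rng = np.random.default_rng(0)
heads = [(rng.standard_normal((4, 8)),) * 3 for _ in range(2)]
Qs, Ks, Vs = zip(*heads)
out = multi_head_attention(Qs, Ks, Vs)                 # shape (4, 16)
```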

    Linear and Softmax: Linear and Softmax follow the Decoder. The Linear layer maps the vector output by the Decoder to a logit (log-odds) vector; for example, if the transformer model learns 10,000 different English words from the training set, the logit vector has 10,000 cells, each holding the score of one word. The Softmax function then converts these scores into probabilities, the highest probability is selected, and its corresponding word is the output for that time step.

    As discussed in Section 2, the previous state-of-the-art methods for Language Modeling and Machine Translation (LSTM, GRU, and Self-Attentional mechanisms) operate sequentially, with running time linear in the sequence length, and are therefore difficult to parallelize, whereas the Transformer abandons this recurrence and instead uses a self-attention mechanism to describe the dependency between input and output. Self-attention is thus central to the Transformer. In this section, we introduce in detail the hardware-friendly compression algorithms applied to the Self-Attention mechanism and the hardware architectures that combine these algorithms to accelerate it.

    As shown in Table 2, we classify the compression algorithms for Self-attention and compare the performance of each algorithm. We divide the compression algorithms proposed in related papers into Approximation, Network Sparsification, and Tensor Decomposition. We first ask whether the compression algorithm selects only part of the key-matrix vectors, filtering out parameters that have little effect on the output; if so, we call it Approximation. If not, we ask whether the parameters or operands of each layer are substantially reduced, which we call Network Sparsification. Otherwise, we check whether the Transformer's tensors are divided into many smaller sub-tensors, or whether one tensor replaces many tensor calculations, which we call Tensor Decomposition.

    Table 2.  Comparison of the hardware-friendly algorithm for self-attention.
    Reference Algorithm Dataset Accuracy Loss Compression Ratio BLEU Year
    A3 [40] Approximation SQuADv1.1 1.3% / / 2020
    ELSA [41] Approximation SQuADv1.1 < 1%/2.5% / / 2021
    Zhang. et al. [42] Network Sparsification Multi30K 0% 95% 25.8 2021
    Lu et al. [43] Tensor Decomposition IWSLT2016 / / 23.57 2020


    In essence, the Self-Attentional mechanism can be viewed as an approximate search over the input sequence. In the conventional computation of self-attention, the similarity score is obtained by taking the dot product of the key matrix and the query vector; the scores are then normalized into weights by the softmax function, and the weights are used to form a weighted sum of the rows of the value matrix. The computational complexity of the multiplication between the key matrix and the query matrix therefore grows with the input sequence length: the longer the input sequence, the larger the computation and the resources required by the model. For some other self-attention models [1,44,45], the situation is even less favorable, because their self-attention computation grows with the square of the input sequence length.

    However, the softmax function converts small similarity scores into weights that are approximately 0. These near-zero weights rarely affect model accuracy during inference (they do matter during training, but the hardware accelerators discussed here only perform inference). Treating such weights as exactly 0 therefore removes a large part of the computation, since the corresponding softmax, value-matrix weighting, and weighted-sum calculations can be skipped. The approximation algorithm proposed in A3 [40] avoids the dot products between the query vector and the key-matrix rows that do not need them: since the algorithm anticipates that the softmax weights of these rows will be close to 0, it can skip the dot product for most rows of the key matrix, accurately select the most important rows, and assign a weight of 0 to the unselected rows. It also preprocesses the key matrix data without affecting the critical path, reducing computation and speeding up inference.

    Specifically, the algorithm is divided into two parts, the first part is Greedy Candidate Search, and the second part is Post-scoring Approximation. The first part is more complicated and can be further divided into two small parts, Preprocessing and Iterative Candidate Selection.

    Figure 4 shows the Preprocessing stage. In Preprocessing, each column of the key matrix is first sorted and stored in the sorted key register. Once the query vector is ready, the candidate search starts. In the first step, if query[j] of the corresponding column is positive, max_ptr is assigned the row index of the largest value in that column of the sorted key matrix; otherwise, max_ptr is set to the row index of the smallest value. min_ptr is set in the same but opposite way. The second step sets maxQ (or minQ): indexed by max_ptr, the corresponding entry of the sorted key matrix is multiplied by the corresponding query element, and the result is inserted into maxQ together with its rowID and colID. After Preprocessing, the Iterative Candidate Selection phase begins. From the preprocessing stage we know that the entry indexed by max_ptr is the largest in maxQ (and the entry indexed by min_ptr is the smallest in minQ). If the similarity score of the entry corresponding to max_ptr is positive (negative for the min_ptr case), it is selected as a candidate and added to the greedy_score array. max_ptr (and min_ptr) is then updated to point to the next-largest (next-smallest) entry, and the corresponding maxQ (and minQ) entry is updated accordingly. This step is repeated M times (M is a user-defined parameter), and the rows with a positive greedy_score are selected as candidate rows for the softmax operation.

    Figure 4.  Illustration of the data structures for the efficient greedy search algorithm. minQ operations are omitted for conciseness. Adapted from [40].

    After the Greedy Candidate Search, Post-scoring Approximation is performed: the dot product of each selected key-matrix row with the query vector is computed directly, the results are fed to the softmax function, and the outputs are the final weight values. After the dot-product results are obtained, a further approximation is applied: the similarity scores are sorted, and the rows with the highest scores are output.
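
    The Python sketch below illustrates the spirit of this greedy candidate search under simplifying assumptions: a software loop replaces the maxQ/minQ hardware queues, and the variable names are ours rather than the paper's. It is not the exact A3 procedure, only a sketch of the selection idea.

```python
# Simplified sketch of A3-style greedy candidate search (not the exact design).
import numpy as np

def greedy_candidate_search(key, query, M):
    n_rows, n_cols = key.shape
    order = np.argsort(key, axis=0)             # preprocessing: sort each column
    ptr = np.zeros(n_cols, dtype=int)           # per-column walk position
    greedy_score = np.zeros(n_rows)
    for _ in range(M):                          # M is user-defined
        best_col, best_val, best_row = None, None, None
        for j in range(n_cols):
            if ptr[j] >= n_rows:
                continue
            # Walk from the largest key value if query[j] > 0, smallest otherwise.
            idx = order[-1 - ptr[j], j] if query[j] > 0 else order[ptr[j], j]
            val = key[idx, j] * query[j]
            if best_val is None or val > best_val:
                best_col, best_val, best_row = j, val, idx
        if best_val is None or best_val <= 0:
            break                               # no positive contribution left
        greedy_score[best_row] += best_val      # accumulate partial score
        ptr[best_col] += 1                      # advance that column's pointer
    return np.flatnonzero(greedy_score > 0)     # candidate rows for softmax

rows = greedy_candidate_search(np.random.randn(16, 8), np.random.randn(8), M=20)
```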

    ELSA [41] similarly proposes an approximation method whose principle is also to filter out parameters that have little effect on the output, greatly reducing the amount of computation. First, the design combines sign random projection (SRP) [46] with the efficient hash computation based on Kronecker product features [47,48] to estimate the angle between the query and key vectors. Second, because the dot product is proportional to the cosine of the angle between two vectors, the estimated angle is used to approximate the dot product of the query and key vectors. Finally, the approximate dot product is compared with a threshold to decide whether the selected key is relevant to the query.

    Figure 5 shows the approximate self-attention algorithm. Its steps are as follows. In preprocessing (step 0), the hash of each key is computed with the efficient hash computation, and the norm of each key is computed accordingly. After that, the approximate dot product of the query with all keys is computed. Specifically, the first step computes the efficient query hash to obtain the query's hash value; the second step computes the Hamming distance between the query hash and all key hashes (i.e., hamming(h(x), h(y))), which serves as an unbiased estimator of the angular distance between them. The third step converts the Hamming distance into the corresponding angle by Eq (4), with θ_bias added for correction. The angle is then fed to the cosine function and multiplied by the corresponding key norm to estimate the dot product between the normalized query and the key. Finally, each estimate is compared with a threshold to check whether the key is relevant to the query. Because the thresholds differ between self-attention layers and models like BERT-large have many layers, the design runs inference of the target model on the training set and inspects the self-attention of each layer, so that each layer's threshold corresponding to a user-specified degree of approximation can be found automatically.

    $\theta_{x,y} \approx \frac{\pi}{k}\,\mathrm{hamming}(h(x), h(y))$  (4)
    Figure 5.  Approximate self-attention algorithm of ELSA. Adapted from [41].
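
    A rough NumPy sketch of this filtering idea is given below. It uses plain sign random projections, omits θ_bias and the Kronecker-product hash of the paper, and all function names are illustrative assumptions rather than ELSA's implementation.

```python
# Rough sketch of hash-based candidate filtering in the spirit of ELSA.
import numpy as np

def srp_hash(X, planes):
    """k-bit sign-random-projection hash of each row of X."""
    return (X @ planes.T > 0).astype(np.uint8)            # shape (n, k)

def elsa_candidates(query, keys, planes, threshold):
    k = planes.shape[0]
    h_q = srp_hash(query[None, :], planes)[0]
    h_k = srp_hash(keys, planes)
    hamming = np.count_nonzero(h_k != h_q, axis=1)        # per-key Hamming distance
    theta = np.pi / k * hamming                           # Eq (4), without theta_bias
    # Estimated dot product between the normalized query and each key.
    approx_dot = np.cos(theta) * np.linalg.norm(keys, axis=1)
    return np.flatnonzero(approx_dot > threshold)         # relevant key indices

rng = np.random.default_rng(1)
keys, query = rng.standard_normal((64, 32)), rng.standard_normal(32)
planes = rng.standard_normal((16, 32))                    # k = 16 hash bits
selected = elsa_candidates(query, keys, planes, threshold=1.0)
```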

    It can be seen from Table 2 that ELSA has a smaller accuracy loss than A3 under the same dataset. This is because the self-attention module of A3 occupies a larger area, which reduces the parallelism and greatly limits the ability of A3 to reach the target accuracy on time.

    To deploy the Self-Attentional mechanism on embedded devices with limited memory, a smaller self-attention model is usually used, or a standard large model is compressed to fit the device. Compressed model weights can be deployed efficiently on CPUs/GPUs because these platforms allocate memory and computational resources flexibly; FPGAs, however, cannot flexibly reallocate memory and computing kernels at runtime. After compression, the weight matrices differ in size, height, and width, which leads to very low FPGA on-chip memory utilization and computing-kernel efficiency. Therefore, Zhang et al. [42] proposed a new structured pruning method incorporating memory-footprint-aware compression.

    This algorithm performs two-stage pruning to ensure hardware efficiency after compression: first coarse-grained pruning and then fine-grained pruning. Coarse-grained pruning prunes the weights uniformly at the same ratio. For each pruning rate, GetIndex finds the smallest γ values in the LayerNorm layer, where γ acts as a scale factor that scales a column of the input up or down and thus reflects the importance of the corresponding weight column. These γ values are mapped to weight-column indices, and the pruning rate is gradually increased from 0%. A mask shields the weight columns at these indices so that the masked columns receive zero gradients during training. When the sparse model's accuracy drops below the baseline (recorded at the beginning), the pruning rate has reached its upper limit. Fine-grained pruning then prunes the remaining γ across layers after the coarse-grained stage has removed the masked columns, handling the Encoder and Decoder separately: the first step groups the cross-layer LayerNorm γ values, and the γ of the corresponding weights is stored after the cross-layer γ groups are collected.

    The algorithm proposed by Lu et al. [43] divides the weight matrix into equally sized sub-matrices and sends them to a systolic array (SA) for computation, so that all general matrix-matrix multiplications (GEMMs) can be completed by the SA module, whose size is limited to s × 64. The tensor shapes in the Self-Attention mechanism can be expressed as [batch size, seq_len, d_model]; since the K tensor always matches V and seq_len_q equals seq_len_v, the three tensor shapes can be represented uniformly as [batch size, s, d_model]. When the batch size is 1, the operations between these tensors reduce to matrix operations. As shown in Table 3, d_model can be divided into h blocks of 64, so most GEMMs can be completed by the s × 64 SA.

    Table 3.  Variations on transformer and the BERT architecture [43].
    Model dmodel h
    Transformer-base 512 8
    Transformer-big 1024 16
    BERTBASE 768 12
    BERTLARGE 1024 16
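
    The partitioning in Table 3 amounts to splitting every GEMM along the d_model dimension into h blocks of width 64 and accumulating the partial products, as in the following NumPy sketch (the function name and the pure-software loop are ours, mirroring the accumulation into the P register in Figure 10):

```python
# Tiled GEMM: accumulate h partial products, each fitting an s x 64 array pass.
import numpy as np

def tiled_gemm(A, B, tile=64):
    """Compute A @ B by accumulating A[:, k:k+64] @ B[k:k+64, :] blocks."""
    s, d_model = A.shape
    assert d_model % tile == 0, "d_model must be a multiple of the tile width"
    P = np.zeros((s, B.shape[1]))
    for k in range(0, d_model, tile):             # h = d_model // tile iterations
        P += A[:, k:k + tile] @ B[k:k + tile, :]  # one s x 64 systolic-array pass
    return P

A = np.random.randn(128, 512)                     # Transformer-base: d_model = 512, h = 8
B = np.random.randn(512, 64)
assert np.allclose(tiled_gemm(A, B), A @ B)
```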


    First, we introduce the hardware architecture of the basic Self-Attentional mechanism in A3 [40] and then the hardware accelerator modules specially designed for the approximation method, namely the candidate selection and post-scoring approximation modules. Figure 6 shows the overall A3 architecture, combining the A3 base design with the candidate selection and post-scoring approximation modules.

    Figure 6.  High-level block diagram of the A3 design with approximation. Adapted from [40].

    The A3 base design module can be divided into three sub-modules: Dot-Product, Exponent Computation, and Output Computation. The Dot-Product module iterates over the rows of the key matrix, multiply-accumulates each row with the query vector, and stores the results in a register; it also obtains the maximum value of the result vector. The Exponent Computation module does not sacrifice many hardware resources: it adopts a look-up table (LUT) method and prevents the overflow that overly large fixed-point inputs would cause. The module first subtracts the maximum value of the input vector from each incoming dot-product value, so that all inputs are less than or equal to 0 and the corresponding exponentials are at most 1. This introduces no error, because adding (or subtracting) the same constant to all softmax inputs leaves the output unchanged. To reduce the LUT size, the module decomposes one exponential into the product of two exponentials. The Output Computation module divides each element produced by the exponent module by the sum of all such elements to complete the normalization, and the result is then multiplied by the value matrix to obtain the final output.
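
    The Python sketch below illustrates the max-subtraction and exponent-decomposition idea described above. The 4-bit split and table sizes are illustrative choices of ours, not the exact A3 parameters.

```python
# Sketch of a LUT-based exponent: exp(-(hi + lo)) = exp(-hi) * exp(-lo).
import numpy as np

FRAC_BITS = 4                                              # fixed-point fractional bits
HI_LUT = {i: np.exp(-i) for i in range(16)}                # integer part of x
LO_LUT = {f: np.exp(-f / 2**FRAC_BITS) for f in range(2**FRAC_BITS)}  # fractional part

def lut_exp(neg_x_fixed):
    """Approximate exp(-x) for non-negative fixed-point x = neg_x_fixed / 2^FRAC_BITS."""
    hi = neg_x_fixed >> FRAC_BITS                          # integer part
    lo = neg_x_fixed & (2**FRAC_BITS - 1)                  # fractional part
    return HI_LUT[min(hi, 15)] * LO_LUT[lo]                # saturate very large magnitudes

def lut_softmax(scores_fixed):
    """Softmax over fixed-point scores using max-subtraction + LUT exponent."""
    m = max(scores_fixed)
    exps = [lut_exp(m - s) for s in scores_fixed]          # shifted inputs are <= 0
    total = sum(exps)
    return [e / total for e in exps]                       # normalization step

print(lut_softmax([40, 35, 10]))                           # scores with 4 fractional bits
```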

    Figure 7 is a detailed diagram of the candidate selection module. The key matrix is sorted and pre-stored in SRAM, together with the row index of each entry. The module includes registers for max_ptr and min_ptr, two multipliers, two sets of multiplication buffers, two comparison trees, and a greedy score register. First, max_ptr (and min_ptr) is used as an index to read the corresponding value of the sorted key matrix and the query vector from SRAM; the pair is sent to the multiplier, and the product enters the multiplication buffer until a full column of data has accumulated, at which point all data in the buffer are sent to the comparison tree. The maximum (and minimum) value is thus obtained, and its row index is sent back to update max_ptr (and min_ptr). At the same time, the dot-product result is written to the greedy score register, and the candidate vector is output once the whole sorted key matrix has been processed.

    Figure 7.  Simplified block diagram of the A3 candidate selection module. Adapted from [40].

    As shown in Figure 6, the Post-scoring Approximation module is placed in front of the exponent calculation module. Its main function is to select the row with the largest dot-product value among all the candidate rows produced by the candidate selection module, so it computes the difference between the maximum dot-product value and each remaining row. Specifically, when the difference between the value being compared and the selected maximum dot-product value is greater than a preset threshold, the compared value can pass through the module to reach the exponent calculation module.

    Figure 8 shows the data flow and pipelining of the ELSA accelerator [41]. Its only inputs are the query, key, and value matrices. Once the query and key matrices have been fetched from their respective memories, the preprocessing stage starts: the hash computation module computes the hash value of every row of the key matrix and stores them in the key hash memory, the norm computation module computes the key norms and stores them in the key norm memory, and the hash value of each row of the query matrix is then computed. The candidate selection module continuously receives the query hash, key hashes, and key norms as input, computes the Hamming distance between each key hash and the query hash with XOR gates and an adder, and uses the distance to index a pre-filled LUT that stores $\cos(\frac{\pi}{k}d_{Hamming} - \theta_{bias})$. The retrieved value is multiplied by the key norm to obtain the approximate similarity score, which is compared with the threshold; if it exceeds the threshold, the key is selected and its index is passed to a queue. Multiple candidate selection modules run in parallel, and their outputs are sent to the arbiter module and, after arbitration, passed on for self-attention computation. On receiving a key index from the arbiter, the attention computation module uses a multiplier and adder tree to compute the dot product of the corresponding key and query. For softmax, the design uses a LUT to compute the exponential of each value, and the sumexp register accumulates the exponential components; the sum is sent to the output div module together with the output of the self-attention computation module. After the exponential of each dot-product value is computed, it is multiplied and accumulated with the corresponding value row. When all selected keys for the current query have been processed, the resulting output vector and sumexp are passed to the output div module, which divides every component output by the self-attention computation module by sumexp, completing the softmax. The attention computation and output div modules are fully pipelined, and the output div module can run in parallel with the other modules.

    Figure 8.  ELSA pipeline block diagram [41].

    Table 4 compares the chip area and power of A3 and ELSA when deployed as ASICs. ELSA requires less area than A3, which indirectly indicates that the approximation scheme proposed in ELSA is more hardware-friendly.

    Table 4.  Comparison of accelerators with approximation.
    Reference Algorithm Area (mm²) Dynamic Power (mW) Static Power (mW) Target Year
    A3 [40] Approximation 2.082 98.92 11.502 ASIC 2020
    ELSA [41] Approximation 1.255 956.05 13.31 ASIC 2021


    The hardware architecture that Zhang et al. [42] co-designed with their Network Sparsification algorithm is shown in Figure 9. This design takes into account that the FPGA's memory resources still cannot hold the compressed model, so the data computation, exchange, and scheduling modules must be designed cooperatively.

    Figure 9.  Attention mechanism accelerator with Network Sparsification architecture overview [42].

    The input, weight, and output data in Figure 9 reside in their corresponding buffers. In front of the DDR interface, a data bus with a ping-pong buffer supports continuous data processing. The processing element (PE) schedule register stores the data-fetch information and execution counts needed to produce one output column and, eventually, the entire output. There are two types of PE, multiplication and addition: two multiplications are packed into one DSP and accumulated through the addition PEs, the multiplication PEs operate simultaneously, and the addition PEs accumulate the two vectors with a tree structure. The corresponding elements are read from the PE schedule register, input buffer, and weight buffer and processed, thereby producing all the elements of an output column. The accelerator can be called recursively to perform the matrix multiplications in the Transformer.

    The hardware design proposed by Lu et al. [43] partitions the weight matrix into several sub-matrices adapted to the SA to accelerate the Self-Attentional mechanism, as shown in Figure 10; the design also includes layer-norm hardware, but this section only covers the self-attention architecture. The s × 64 SA module outputs column by column, Temp1 and Temp2 are intermediate registers, and P is an accumulator for the self-attention output. Because the weight matrix is divided into h parts, the loop is executed h times. Figure 10 shows the overall architecture; its data flow is as follows. Sub-matrix multiplication is performed in the SA module. A multiplexer after Q, K, V, Temp1, and P controls whether each of them is passed to the SA module for multiplication. The S adder adds the bias from the Bias Memory, and the results for Q and K are sent to Temp1 and Temp2 in turn. Each of them is then fed back to the SA module through the multiplexer for further computation; the result is used as the input of the Softmax module and sent to the Temp1 register. After V is processed by the SA and S adders, it is stored in Temp2; Temp1 and Temp2 then enter the SA module together, the result goes to the P register, and the final output is obtained after h accumulations. The SA module therefore has the highest complexity and does not stop running until the Layer-Norm module starts; the next most complex block is the softmax module. Softmax's exponentiation and division operations are likewise critical for hardware. Following Wang et al. [49], the design uses the log-sum-exp trick [50] and an algorithmic strategy that reduces the exponential and logarithmic functions, thereby avoiding operations such as division. As seen from Eq (5), the softmax module is constructed from exponential and logarithmic transformations without regular multipliers or LUTs. Table 5 compares the performance of accelerators with structured pruning and tensor decomposition.

    $\mathrm{Softmax}(X_i) = \frac{\exp(X_i - X_{max})}{\sum_{j=1}^{d_k}\exp(X_j - X_{max})} = \exp\left(X_i - X_{max} - \ln\left(\sum_{j=1}^{d_k}\exp(X_j - X_{max})\right)\right)$  (5)
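    A minimal NumPy illustration of Eq (5) is given below; it shows how folding the division into the exponent via log-sum-exp leaves only exponential and logarithmic operations (emulated here in floating point).

```python
# Eq (5): softmax computed without an explicit division.
import numpy as np

def softmax_logsumexp(x):
    x = np.asarray(x, dtype=np.float64)
    shifted = x - x.max()                        # X_i - X_max, avoids overflow
    lse = np.log(np.exp(shifted).sum())          # ln(sum_j exp(X_j - X_max))
    return np.exp(shifted - lse)                 # exponent absorbs the division

x = np.array([2.0, 1.0, -3.0])
assert np.allclose(softmax_logsumexp(x), np.exp(x) / np.exp(x).sum())
```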
    Figure 10.  The top-level architecture of accelerator with partitioning matrices. Adapted from [43].
    Table 5.  Comparison of accelerators with structured pruning and tensor decomposition.
    Reference Algorithm Latency Throughput Target Year
    Zhang et al. [42] Structured Pruning 8.4 ms 2.04 Gop FPGA 2021
    Lu et al. [43] Tensor Decomposition 8.9 ms / FPGA 2020


    In this section, we introduce the classification method of the transformer compression algorithm, the hardware-friendly algorithm used on the transformer hardware, and the hardware architecture that combines the above corresponding compression algorithms to accelerate the Transformer.

    To classify transformer compression algorithms, we first ask whether the Transformer's tensors are divided into many smaller sub-tensors, or whether one tensor replaces many tensor calculations; this is Tensor Decomposition. If not, we ask whether the bit width of the model parameters is reduced, which is Data Quantization; then whether the parameters or operands of each layer are substantially reduced, which is Network Sparsification. Finally, Neural Architecture Search (NAS) finds an efficient and suitable model by exploring a large search space [51].

    The algorithm proposed by Li et al. [39] builds on the Block-Circulant Matrix (BCM) compression algorithm and proposes an Enhanced Block-Circulant Matrix compression algorithm. CirCNN [52] and C-LSTM [53] adopt BCM for image classification and speech recognition, respectively, and both significantly improve performance; however, following CirCNN, C-LSTM did not study large-scale language representation, and there was a desire to further preserve prediction accuracy. Motivated by this, the Enhanced Block-Circulant Matrix compression algorithm was proposed, offering a larger compression ratio with less accuracy loss. Specifically, the original weight matrix is replaced by one or more circulant matrix blocks to reduce storage, and the input is partitioned accordingly. Previous BCM-based compression used only the first row/column as the index vector, i.e., only the first row/column was stored and computed; Zhao et al. [54] derived the theoretical basis demonstrating its effectiveness, but it lacked an efficient representation of the other rows/columns. In the Enhanced BCM compression, the formula of the index vector is modified to Eq (6), where b denotes the row/column size of each circulant block. For matrix-vector multiplication, the algorithm adopts the fast Fourier transform (FFT), based on the circular convolution theorem [55,56]: the BCM-based matrix-vector multiplication is $W_{ij}X_j = P_{ij} \ast X_j = \mathrm{IFFT}(\mathrm{FFT}(P_{ij}) \circ \mathrm{FFT}(X_j))$, where $\ast$ denotes circular convolution and $\circ$ denotes element-wise multiplication.

    $P_{ij} = \left[\frac{1}{b}\sum_{j=1}^{b}W_{1j}, \ \frac{1}{b}\sum_{j=1}^{b}W_{2j}, \ \cdots, \ \frac{1}{b}\sum_{j=1}^{b}W_{bj}\right]$  (6)
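
    The identity behind this FFT-based multiplication can be checked with a few lines of NumPy. The index vector p below is random for illustration, whereas Eq (6) would fill it with the column means of the original weight block.

```python
# Check: circulant-block times vector == FFT-based circular convolution.
import numpy as np

b = 8
p = np.random.randn(b)                            # index vector P_ij of one block
x = np.random.randn(b)                            # input slice X_j

# Explicit circulant matrix whose first column is p ...
W = np.stack([np.roll(p, i) for i in range(b)], axis=1)
dense_result = W @ x

# ... and the same product via the circular convolution theorem, O(b log b).
fft_result = np.fft.ifft(np.fft.fft(p) * np.fft.fft(x)).real

assert np.allclose(dense_result, fft_result)
```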

    Quantization is one of the common and important methods for model compression. Quantization reduces the bit width of the transformer model's parameter data while retaining the original structure, thereby reducing its huge computational complexity and memory consumption. The quantization bit width and the precision loss have become the distinguishing criteria among quantization algorithms. The quantization method proposed by Liu et al. [57] fully quantizes BERT, including weights, activations, softmax, layer normalization, and all intermediate results, so that computational complexity and memory issues can be better optimized. This algorithm is also hardware-friendly: it quantizes all data to integer or fixed-point values, so the hardware only needs 8/4-bit and 8/8-bit multiplications. The quantization of weights and activations adopts a hardware-friendly symmetric linear strategy, while the bias is quantized to a 32-bit integer using the scale factors of the quantized weights and activations, which facilitates hardware deployment. Other parameters such as softmax and layer normalization are quantized to 8 bits. However, finding the quantization precision that gives the best accuracy previously required manual comparison. The quantized Vision Transformer (ViT) algorithm proposed by Sun et al. [58] automates this: given only the model structure and the required frame rate, it automatically outputs the activation precision required. Figure 11 illustrates VAQF, which builds an FPGA-based ViT inference accelerator. The ViT structure and desired frame rate (target FPS) are provided as inputs, and a compilation step decides the activation precision and accelerator settings needed to satisfy the FPS target when the weights are binary. Specifically, it introduces a binarization method that uses binary precision for weights and low precision for activations to trade off efficiency against precision loss. This binarization method differs from BinaryBERT [30], which directly applies the 1-bit convolution method [59,60] for binary weight quantization. According to the definition of 1-bit convolution (see Eq (7)), the binary weight matrix $W_b$ is obtained from the given real-valued $W_r$, where $\omega_b$ and $\omega_r$ are individual elements of $W_b$ and $W_r$, respectively, and $\frac{\|W_r\|_{l1}}{n}$ is the scale factor that minimizes the difference between the binary and real weight values.

    $\omega_b = \frac{\|W_r\|_{l1}}{n}\,\mathrm{Sign}(\omega_r) = \begin{cases} +\frac{\|W_r\|_{l1}}{n}, & \text{if } \omega_r > 0 \\ -\frac{\|W_r\|_{l1}}{n}, & \text{if } \omega_r \le 0 \end{cases}$  (7)
    Figure 11.  Overall flow of VAQF.
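
    The two quantization schemes discussed above can be sketched as follows. The function names and the calibration-free scale choices are ours, so this is an illustration rather than the exact FQ-BERT or VAQF procedure.

```python
# Sketches: Eq (7)-style weight binarization and symmetric linear quantization.
import numpy as np

def binarize_weights(W_r):
    """Eq (7): W_b = (||W_r||_l1 / n) * sign(W_r), with n the number of elements."""
    alpha = np.abs(W_r).sum() / W_r.size          # l1-norm scale factor
    return alpha * np.where(W_r > 0, 1.0, -1.0)

def symmetric_quantize(x, bits=8):
    """Symmetric linear quantization to signed integers plus a scale factor."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(x).max() / qmax                # illustrative max-based scale
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int32)
    return q, scale                               # dequantize with q * scale

W_b = binarize_weights(np.random.randn(4, 4))
a_q, a_scale = symmetric_quantize(np.random.randn(16), bits=8)
```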

    Table 6 summarizes the data quantization methods introduced above, compared mainly in terms of data bit width, compression ratio, and accuracy loss. In terms of accuracy loss, the fully quantized BERT of Liu et al. [57] loses less accuracy, likely because Sun et al. [58] chose a 1-bit width for weight quantization.

    Table 6.  Comparison of the algorithm of data quantization.
    Reference Dataset Weight Bits Activation Bits Accuracy Loss Compression Ratio Year
    Liu et al. [57] SST-2 4 bit 8 bit 0.81% 7.94× 2021
    Sun et al. [58] ImageNet-1K 1 bit 8 bit 4% / 2022


    The difference between Network Sparsification and Quantization is that Quantization reduces the bit width of each layer's parameters to simplify the arithmetic itself, while sparsification reduces the number of operands, which can greatly reduce the memory and computation needed and thereby achieve acceleration. Deep learning models often employ methods such as irregular pruning [61], structured pruning [62], and pattern pruning [63]. Naturally, sparse operations also entail irregular memory accesses and the overhead of indexing non-zero elements.

    Qi et al. [69] proposed a hardware-friendly Hierarchical Pruning (HP) algorithm that combines block structured pruning (BP) [64] and vector-wise pruning (VW) [65]. Specifically, the BP model serves as the backbone of HP, so a first, coarse-grained pruning is performed with the BP algorithm, which mainly prunes unimportant columns of the block-partitioned weight matrix. A second, fine-grained pruning is then performed with the VW algorithm to remove unimportant parameters from the columns left unpruned in the first stage, maintaining balance by pruning the same number of parameters in each block.

    Which parts are pruned also affects the accuracy of the model, and weight pruning reduces the number of weights and speeds up the Transformer. Peng et al. [66] proposed column-balanced block pruning for transformers, in which column balance is achieved by pruning an equal number of elements from each column of the weight matrix.

    First, the weight matrix is divided into equally sized sub-matrices, the L2 norm of each sub-matrix block is computed, and the blocks with the smallest L2 norms are discarded. After pruning, the remaining sub-matrices are concatenated horizontally to form the trimmed matrix.
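
    A simplified NumPy sketch of this L2-norm block-pruning criterion, with column balance enforced by zeroing the same number of blocks in every block-column, is shown below. The block size and pruning ratio are illustrative, and the function name is ours.

```python
# Simplified column-balanced block pruning with an L2-norm criterion.
import numpy as np

def column_balanced_block_prune(W, block=4, keep_ratio=0.5):
    out = W.copy()
    n_rows, n_cols = W.shape
    blocks_per_col = n_rows // block
    keep = int(np.ceil(blocks_per_col * keep_ratio))
    for c in range(0, n_cols, block):                     # one block-column at a time
        norms = np.array([np.linalg.norm(W[r:r + block, c:c + block])
                          for r in range(0, n_rows, block)])
        drop = np.argsort(norms)[:blocks_per_col - keep]  # lowest-L2-norm blocks
        for b_idx in drop:                                # zero the pruned blocks
            out[b_idx * block:(b_idx + 1) * block, c:c + block] = 0.0
    return out

W_sparse = column_balanced_block_prune(np.random.randn(16, 16))
```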

    Qi et al. [70] proposed block-balanced pruning, a form of structured pruning, because the strong regularity of structured pruning [64,67,68] is hardware-friendly. Ma et al. [63] compared the accuracy of block-balanced pruning (BBP) [69] and block-wise pruning (BW) [68] under different sparsity ratios and finally chose BBP as the compression algorithm: because BBP is more fine-grained, it does not prune an entire block as BW does, which risks deleting important information from the model.

    Our comparison of the pruning methods introduced above is shown in Table 7, which lists each method's compression ratio and accuracy. In general, the tradeoff among sparsity ratio, structure, and accuracy defines each pruning algorithm. In the algorithms above, block-level or structured pruning is applied according to the model structure, and balance is maintained by pruning the same number of parameters per block or column to ensure sufficient hardware compatibility.

    Table 7.  Comparison of algorithm of network sparsification.
    Reference Dataset Model Compression Ratio Accuracy Year
    Qi et al. [69] WikiText-2 Transformer 89.85% 96.13% 2021
    Peng et al. [66] WikiText-2 Transformer 90% 95.12% 2021
    Qi et al. [70] WikiText-2 Transformer 80–90% / 2021


    In addition to the above compression methods, Neural Architecture Search is also an effective way to determine the most appropriate transformer model, as in the Evolved Transformer [71]. However, the search process is prohibitively expensive, and it is more expensive still when the hardware must be taken into account. Wang et al. [72] applied a weight-sharing supernet to identify efficient models: their Hardware-Aware Transformer (HAT) framework uses Neural Architecture Search (NAS) to discover models that are efficient to deploy on the target hardware. The hardware-aware neural architecture search framework is illustrated in Figure 12. First, a large design space is constructed to cover as many encoder, decoder, and heterogeneous layer configurations as possible. Randomly sampled SubTransformers are then iteratively optimized to train a weight-shared SuperTransformer, which provides a performance proxy for every SubTransformer. At this point the hardware platform comes into play: a dataset of SubTransformers and their measured latencies is collected, and a latency predictor is trained to provide fast and accurate latency estimates. Finally, a search under the latency constraint identifies an efficient model for the hardware platform; once the model is obtained, it is trained from scratch to reach its final performance.

    Figure 12.  Hardware-Aware Transformer (HAT) overview [72].

    The hardware architecture of Li et al. [39] combines the Enhanced BCM compression algorithm to accelerate the Transformer. Although the model has been compressed, it is still too large for the FPGA's memory resources, and the transformer parameters cannot fit in the FPGA's on-chip memory. Therefore, in FTRANS, the model is divided into the embedding layer and the Encoder/Decoder stack, where the embedding layer is implemented as a LUT that converts the discrete tokens of the input sentence into a continuous space. To limit the cost of frequent off-chip weight accesses, the embedding layer is offloaded to off-chip memory, and the FPGA's on-chip resources are used exclusively for the Encoder and Decoder stack computations. FTRANS then applies inter-layer coarse-grained pipelining, intra-layer fine-grained pipelining, and computation scheduling to alleviate I/O constraints.

    In general, when the quantization bit width is larger than 8 bits, the loss of precision is negligible; however, lower bit widths can be pursued for higher performance at an acceptable precision loss. Deep learning models usually use two types of quantization: fixed bit width and variable bit width.

    Liu et al. [57] proposed fully quantizing BERT (FQ-BERT) to introduce variable bit widths. Since different layers of BERT use different bit widths, a reconfigurable module design with variable bit width is adopted, and a bit-level reconfigurable multiplication accelerator is proposed that supports 8/4-bit and 8/8-bit multiplication. As shown in Figure 13, adopting a method similar to Bit-fusion [73], a Bit-split Inner-product Module (BIM) is designed so that the same module can be reused to support operations of different bit widths. Each BIM contains multipliers, adder trees, and corresponding shift adders, and each multiplier has a signed flag to distinguish signed from unsigned operands. The shift-add logic shifts the partial sums when the input bit width exceeds 4 bits; there are two possible placements for this shift logic, and Type A saves more resources but requires the input data to be arranged in order. In a processing element (PE) of FQ-BERT, the BIM output is sent to an accumulator to be added to the previous partial sums, the result is written to PsumBuf, and PsumBuf is then fed to the quantization module to obtain the final value including the bias and scale factor.

    Figure 13.  Two types of BIM. Adapted from [57].
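
    The bit-split idea behind the BIM can be illustrated with a toy Python example: an 8-bit operand is split into two 4-bit halves, the halves are multiplied by narrow multipliers, and the partial products are recombined with shift-adds, so the same narrow datapath serves both 8/4-bit and 8/8-bit modes. Sign handling, which the real module covers with its signed flag, is omitted here, and this is a functional sketch rather than the FQ-BERT hardware.

```python
# Toy bit-split multiplication: wide products from 4-bit partial products.

def split4(x):
    """Split an unsigned 8-bit value into (high nibble, low nibble)."""
    return (x >> 4) & 0xF, x & 0xF

def mul_8x4(a8, b4):
    a_hi, a_lo = split4(a8)
    return (a_hi * b4 << 4) + a_lo * b4                    # two 4x4 products, one shift-add

def mul_8x8(a8, b8):
    b_hi, b_lo = split4(b8)
    return (mul_8x4(a8, b_hi) << 4) + mul_8x4(a8, b_lo)    # four 4x4 products total

assert mul_8x4(200, 11) == 200 * 11
assert mul_8x8(200, 173) == 200 * 173
```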

    Turning to fixed bit widths, Sun et al. [58] binarized the weights, quantized the activations to low precision, and accelerated ViT based on this algorithm. It uses loop tiling to divide the input, weight, and output of each ViT layer into tiles, reducing the FPGA memory and computing resources required. Figure 14(a) shows the tiles of an unquantized layer. At the module level, the multi-layer perceptron (MLP), the FC layer of the multi-head attention module, and the self-attention module of ViT are computationally complex and intensive. The multi-head attention module is instantiated multiple times for parallel processing, whereas the FC layer performs only one matrix multiplication; to remain compatible with the FC layer, its input is divided into as many segments as there are parallel multi-head attention modules, and the outputs of the segments are added together. Figure 14(b) shows the calculation flow of these two layers; the computation is made more efficient by unrolling and pipelining the L2, L3, and L4 loops under L1. Figure 14(c) shows the processing flow of a ViT layer depending on whether it is quantized: an unquantized layer occupies the FPGA's DSP resources, whereas a quantized layer replaces the multiplications with the corresponding LUT-based additions and subtractions, because the binarized weights take only the values +1 or -1. In addition to the above contributions, it is worth noting that the flow is automated: given only the model structure and the required frame rate, it automatically outputs the activation precision required.

    Figure 14.  Detailed implementation of ViT accelerator. (a) Loop tiling of input, weight, and output for one model layer; (b) Computation flow in compute engine with loop tiling, pipelining, and unrolling; (c) Processing flow of one model layer based on whether it is quantized or not. The superscript q represents the parameter after Quantization. Adapted from [58].
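
    A schematic Python version of this loop-tiling scheme is given below; the tile sizes are illustrative, and the innermost block product corresponds to the loops that the FPGA unrolls and pipelines.

```python
# Schematic loop tiling for an FC layer: only small tiles are live at a time.
import numpy as np

def tiled_fc(X, W, Tm=8, Tn=8, Tk=16):
    M, K = X.shape
    _, N = W.shape
    Y = np.zeros((M, N))
    for m in range(0, M, Tm):                     # L1: output-row tiles
        for n in range(0, N, Tn):                 # L2: output-column tiles
            acc = np.zeros((Tm, Tn))              # on-chip output buffer
            for k in range(0, K, Tk):             # L3: reduction tiles
                # L4 (innermost): a Tm x Tk by Tk x Tn block product; this is
                # the part that gets unrolled/pipelined in hardware.
                acc += X[m:m + Tm, k:k + Tk] @ W[k:k + Tk, n:n + Tn]
            Y[m:m + Tm, n:n + Tn] = acc
    return Y

X, W = np.random.randn(64, 192), np.random.randn(192, 64)
assert np.allclose(tiled_fc(X, W), X @ W)
```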

    Table 8 compares the throughput, latency, and FPS/W of each accelerator under the corresponding bit widths. The balance between model accuracy and performance should also be considered: INT2 or INT4 can achieve a good balance on small datasets, and quantized INT4 is the best choice for large datasets.

    Table 8.  Comparison of accelerators with data quantization.
    Reference Weight Bits Activation Bits Latency Throughput FPS/W Target Year
    Liu et al. [57] 4 bit 8 bit 43.89 ms / 2.32 FPGA 2021
    Sun et al. [58] 1 bit 8 bit / 861.2 GOPS 2.85 FPGA 2022


    After the pruning algorithm processes the weight matrices, they become sparse; that is, the number of non-zero (NZ) elements is much smaller than the number of zero elements. To skip the computation of the many zero elements in sparse matrices, various sparse matrix storage formats have been proposed that compress sparse matrices and reduce computational and storage resources; typical formats include COO [74], CSR [75], BCSR [75], and MBR [76].
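
    As a concrete example of such formats, the following Python sketch builds a CSR representation and uses it for a sparse matrix-vector product that touches only the non-zero elements; COO, BCSR, and MBR follow the same idea with different index structures.

```python
# CSR: values + column indices + row pointers; matvec skips every zero entry.
import numpy as np

def to_csr(A):
    values, col_idx, row_ptr = [], [], [0]
    for row in A:
        nz = np.flatnonzero(row)                  # non-zero positions of this row
        values.extend(row[nz])
        col_idx.extend(nz)
        row_ptr.append(len(values))               # running offset per row
    return np.array(values), np.array(col_idx), np.array(row_ptr)

def csr_matvec(values, col_idx, row_ptr, x):
    y = np.zeros(len(row_ptr) - 1)
    for r in range(len(y)):
        lo, hi = row_ptr[r], row_ptr[r + 1]
        y[r] = values[lo:hi] @ x[col_idx[lo:hi]]  # only non-zero elements touched
    return y

A = np.array([[0., 2., 0., 0.],
              [1., 0., 0., 3.],
              [0., 0., 0., 0.]])
x = np.array([1., 2., 3., 4.])
vals, cols, ptr = to_csr(A)
assert np.allclose(csr_matvec(vals, cols, ptr, x), A @ x)
```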

    Qi et al. [69] proposed a bitmap format similar to MBR to cooperate with their HP algorithm, which further reduces memory usage. One of two formats is selected according to whether the sparsity ratio of a block is zero: one consists of three arrays (the number of blocks, the column index of each block, and the non-zero elements in each column), while the other omits the column-index array. The accelerator design also differs from previously proposed accelerators [65,77,78,79], whose methods cannot be applied to transformers. To address the challenge of random reads and writes, the design changes the memory access pattern: multiple rows of the input matrix are multiplied by one column of the weight matrix so that results are written sequentially rather than randomly, and the parallelism across rows is exploited to compute the dot products of several input rows with the same weight column at the same time. An input row buffer (IRB), implemented with randomly accessible registers, buffers each row of the input matrix being multiplied.
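
    A simplified, assumed version of such a bitmap-based block format is sketched below: only non-zero blocks are stored, each with its block position, a bitmap marking the non-zero entries, and the packed non-zero values. It follows the spirit of MBR and of the format in [69] rather than reproducing either exactly.

```python
import numpy as np

def to_block_bitmap(dense, bs=2):
    """Store only the non-zero bs x bs blocks of a matrix: for each block, keep
    its block-row/block-column position, a bitmap of non-zero positions, and the
    packed non-zero values (a simplified MBR-like layout)."""
    blocks = []
    rows, cols = dense.shape
    for bi in range(0, rows, bs):
        for bj in range(0, cols, bs):
            blk = dense[bi:bi + bs, bj:bj + bs]
            if np.any(blk):
                bitmap = (blk != 0).astype(np.uint8)   # which entries are non-zero
                blocks.append((bi // bs, bj // bs, bitmap, blk[blk != 0]))
    return blocks

W = np.array([[0, 0, 1, 0],
              [0, 0, 2, 3],
              [0, 0, 0, 0],
              [0, 0, 0, 0]], dtype=float)
print(to_block_bitmap(W))   # only the non-zero block at block position (0, 1) is stored
```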

    Peng et al. [66] also proposed a compressed sparse matrix format, compressed sparse column block (CSCB), based on their column-balanced block pruning algorithm. The format combines the CSB and BCSR formats and requires only one index address per sub-matrix block, so it consumes less memory than either CSB or BCSR. Figure 15 shows the overall architecture of the design. Because the embedding layer is usually implemented as a lookup table, it is kept in external memory to save on-chip memory. The data flow is as follows: the host PC first sends the tokenized sentence to the FPGA through PCIe, the DDR controller then fetches the embedding of each word from DRAM, and the input word sequence is transferred to the FPGA's on-chip memory for use by the encoder and decoder.

    Figure 15.  Overall architecture of the accelerator proposed in [66].

    Sparse matrix multiplication occupies most of the computation during transformer inference, so this design focuses on a dedicated sparse matrix multiplication accelerator. The accelerator combines the column-balanced block pruning algorithm with the CSCB compression format, and a dedicated pipeline is designed by exploiting parallelism both across and within PEs. The compressed sparse weight matrix and its index matrix are stored in BRAM; according to each index value, one element of the vector and one data block of the compressed sparse matrix are fetched, and a dense matrix multiplication is performed inside a PE. The result is then sent to the accumulation module, which traverses all block columns of the sparse matrix to accumulate the final dot-product output.
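
    The PE-level dataflow can be illustrated with the Python sketch below, which assumes a simplified CSCB-like layout in which every block column retains the same number of blocks and each retained block carries a single block-row index; it shows the accumulation over block columns rather than the exact hardware format.

```python
import numpy as np

def block_sparse_matvec(block_vals, block_row_idx, x, n_rows, bs=2):
    """Software analogue of the CSCB dataflow (assumed simplified layout):
    block_vals[c][k]    is the k-th retained bs x bs block in block column c,
    block_row_idx[c][k] is its block-row position (the single index per block)."""
    y = np.zeros(n_rows)
    for c, (blocks, rows) in enumerate(zip(block_vals, block_row_idx)):
        xc = x[c * bs:(c + 1) * bs]                # input slice for this block column
        for blk, r in zip(blocks, rows):           # only the retained (non-pruned) blocks
            y[r * bs:(r + 1) * bs] += blk @ xc     # dense product inside a PE, then accumulate
    return y

# Toy example: a 4x4 weight matrix with one retained block per block column.
W = np.zeros((4, 4))
W[0:2, 0:2] = [[1, 2], [3, 4]]      # block at block position (0, 0) kept
W[2:4, 2:4] = [[5, 6], [7, 8]]      # block at block position (1, 1) kept
block_vals    = [[W[0:2, 0:2]], [W[2:4, 2:4]]]
block_row_idx = [[0], [1]]
x = np.arange(4, dtype=float)
assert np.allclose(block_sparse_matvec(block_vals, block_row_idx, x, 4), W @ x)
```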

    Qi et al. [70] proposed the Compressed Block Row (CBR) format for their block-balanced pruning algorithm. It consists of a three-dimensional array that stores the non-zero elements of the sparse weight matrix and a two-dimensional array that stores the within-block indices of the non-zero sub-rows, so it significantly reduces memory usage and exploits the balanced block structure to save decoding overhead. Figure 16 shows the detailed data flow of the proposed transformer accelerator: FPGA on-chip memory provides weight buffers and intermediate-result buffers for the calculation modules, which include multi-head attention, add & norm, the feed-forward network, and the decoder. These modules run in the order shown in the figure because of the data dependencies between them. The dot-product operations of the multi-head attention module are likewise run in parallel to improve computational efficiency, and the operations in each layer are pipelined to improve throughput. To save memory, the intermediate-result buffers are reused several times per layer. Because the sine and cosine operations used in the embedding and positional encoding are computationally expensive on an FPGA and cannot exploit its parallelism, they are executed on the CPU side and the results are transferred to the FPGA through the PCIe interface.

    Figure 16.  Detailed computation dataflow of the accelerator in [70].
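
    The sketch below gives a simplified, assumed CBR-style encoding in which block-balanced pruning keeps the same number of sub-rows in every row block, so fixed-shape value and index arrays suffice and no per-block length fields are needed when decoding; it illustrates the idea rather than the exact format of [70].

```python
import numpy as np

def to_cbr(dense, bs=4, keep=2):
    """CBR-style encoding (assumed simplified layout): split the matrix into
    bs-row blocks and keep the same number of sub-rows in every block, storing a
    3-D array of values and a 2-D array of within-block sub-row indices."""
    n_blocks = dense.shape[0] // bs
    vals = np.zeros((n_blocks, keep, dense.shape[1]))
    idx = np.zeros((n_blocks, keep), dtype=int)
    for b in range(n_blocks):
        block = dense[b * bs:(b + 1) * bs]
        top = np.sort(np.argsort(-np.abs(block).sum(axis=1))[:keep])  # heaviest sub-rows
        vals[b], idx[b] = block[top], top
    return vals, idx

W = np.random.randn(8, 6)
W[np.abs(W) < 1.0] = 0               # make the matrix sparse
vals, idx = to_cbr(W, bs=4, keep=2)
print(vals.shape, idx)               # (2, 2, 6) value array and the kept sub-row indices
```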

    Table 9 summarizes the compression format used by each accelerator with network sparsification and the main contribution of each format. Table 10 summarizes and compares these accelerators, mainly collecting their latency and throughput figures. The accelerator proposed by Qi et al. [69] yields the best performance. In addition, applying hardware-friendly compression algorithms can eliminate the overhead caused by irregular memory accesses on the hardware.

    Table 9.  Comparison of compression formats.
    Reference Compression Format Contribution Target Year
    Qi et al. [69] MBR Changes the memory access pattern FPGA 2021
    Peng et al. [66] CSCB Improvement over CSB and BCSR FPGA 2021
    Qi et al. [70] CBR Exploits the balanced properties of the blocks FPGA 2021

    Table 10.  Comparison of accelerators with network sparsification.
    Reference Model Latency Throughput Target Year
    Qi et al. [69] Transformer 6.45 ms 14.14 GFLOPS FPGA 2021
    Peng et al. [66] Transformer 10.35 ms 3091.8 FPS FPGA 2021
    Qi et al. [70] Transformer 7.85 ms 0.1136 GFLOPS FPGA 2021


    In terms of hardware, Wang et al. [72] designed an accelerator named SpAtten to co-process the self-attention layers of NLP models. The accelerator supports a novel token pruning scheme that reduces the memory access and computation of the self-attention layer. The entire accelerator is fully pipelined, with one module per operation, so data does not have to be moved back and forth and the data-movement overhead is greatly reduced. The attention-layer inputs are stored in High Bandwidth Memory (HBM). Because token pruning introduces random accesses, a crossbar is used to resolve address conflicts so that the HBM channels stay busy and bandwidth utilization increases accordingly. A highly parallel top-k engine supports dynamic token pruning with low time complexity. For the dynamic low-precision implementation, an on-chip bit-width converter handles the splitting of the fetched bits and the concatenation of the MSBs and LSBs.
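
    A minimal software sketch of such cascade token pruning is given below, assuming that a token's importance score is the accumulated attention probability it receives across heads and queries; the array shapes, keep ratio, and function name are illustrative assumptions rather than SpAtten's exact implementation.

```python
import numpy as np

def prune_tokens(attn_probs, keep_ratio=0.5):
    """Cascade token pruning (simplified sketch): accumulate each token's
    attention probability over all heads and queries as an importance score,
    then keep only the top-k tokens."""
    # attn_probs has shape (heads, queries, tokens)
    scores = attn_probs.sum(axis=(0, 1))                  # importance score per token
    k = max(1, int(keep_ratio * scores.shape[0]))
    kept = np.sort(np.argpartition(scores, -k)[-k:])      # top-k token indices
    return kept

probs = np.random.rand(8, 10, 16)
probs /= probs.sum(axis=-1, keepdims=True)                # normalize like softmax outputs
print(prune_tokens(probs, keep_ratio=0.25))               # indices of the 4 retained tokens
```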

    The detailed SpAtten architecture is shown in Figure 17. For the attention computation of each query, the top-k module first processes the accumulated importance scores, and the resulting K indices are fed to the fetcher. The fetcher computes the addresses of K and passes them to the crossbar, which delivers the corresponding data; the dot products and softmax are then computed to obtain the attention probabilities. These probabilities are provided to the dynamic-precision determination module, which decides whether the LSBs are required, and to the importance-score accumulator, which performs the accumulation. Local token pruning is then carried out by applying top-k to the local tokens, and the k indices of the corresponding V vectors are sent to the fetcher. Finally, the attention output is obtained by multiplying the remaining probabilities with the corresponding V vectors. From Table 11, VAQF incurs the largest accuracy loss, Zhang et al. [42] the smallest, and ELSA the smallest average accuracy loss.

    Figure 17.  SpAtten architecture overview. All of the modules are fully pipelined to achieve high performance [72].
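
    The dynamic-precision idea can be sketched as follows (a simplified software analogue, with an assumed threshold and coarse "MSB-only" scores): attention probabilities are first computed from the most significant bits, and the least significant bits are fetched only when the resulting distribution is too flat.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def progressive_attention(scores_msb, scores_full, prob_threshold=0.1):
    """Simplified sketch of progressive (dynamic) precision: compute attention
    probabilities from MSB-only scores first; only if the distribution is too
    flat (max probability below a threshold) are the LSBs fetched and the
    probabilities recomputed from the full-precision scores."""
    probs = softmax(scores_msb)
    if probs.max() >= prob_threshold:
        return probs, False                  # MSBs were enough; LSBs never fetched
    return softmax(scores_full), True        # flat distribution: refine with LSBs

full = np.random.randn(32)
msb = np.round(full * 2) / 2                 # coarse (MSB-only) approximation of the scores
probs, used_lsb = progressive_attention(msb, full, prob_threshold=0.2)
print(used_lsb, probs.sum())                 # whether LSBs were needed; probabilities sum to 1
```
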
    Table 11.  Comparison of accuracy loss of accelerators.
    A3 [40] ELSA [41] Zhang et al. [42] Lu et al. [43] FQ-BERT [57] VAQF [58] Qi et al. [69] Peng et al. [66] Qi et al. [70]
    Max 0.826% 0.964% 0% / 3.08% 4% 1.37% 0.31% /
    Min 0.826% 0.819% 0% / 0.81% 4% 1.37% 0.31% /
    Average 0.826% 0.898% 0% / 1.94% 4% 1.37% 0.31% /


    Transformers play an increasingly important role in NLP, and their performance is becoming increasingly powerful. This powerful performance is a consequence of model size and the surge in available computing power. At the same time, efficient, low-power hardware platforms are needed to support the trend of deploying electronic devices at the mobile and embedded edge, where resources and computing capabilities are limited. This paper provides a comprehensive and detailed review of the compression and hardware acceleration of self-attention and the transformer from the perspective of hardware-friendly algorithms and hardware deployment. For self-attention, we discussed the hardware architectures proposed in recent years, together with detailed analyses of their data flows and architectures. For the transformer, we reviewed four categories of compression algorithms: tensor decomposition, data quantization, network sparsification, and neural architecture search, as well as the corresponding hardware-friendly compression algorithms and the hardware architectures that combine the transformer with these algorithms.

    The application scenarios of the transformer increasingly appear on mobile devices with demanding edge-computing requirements, such as real-time translation and text processing on smart devices. Existing research focuses mostly on deployment on high-performance computing devices, and few studies directly compare against or optimize for mobile devices such as ARM platforms and Jetson boards. We believe that the iteration of edge devices such as Zynq, PYNQ, and MPSoC makes high-performance edge implementations possible for related research. The challenges lie in three points: 1) further compression and quantization of the model to fit the limited resources of edge devices; 2) efficient memory storage formats and access strategies to improve resource utilization; 3) more efficient architectures to reduce system power consumption. Addressing these points can effectively reduce the time and resources spent on network communication and give users a better offline, real-time experience.

    Further research on the terminal side is also required. With the development of technology, more and more hardware devices can run the transformer after further updates, such as FPGAs (e.g., ZCU102 and Alveo U200). Models with different sizes or constraints can be deployed on many devices with different hardware resources. However, in past work, designers did not always choose the most suitable hardware device for a given model, which may lead to insufficient resource utilization. This places higher requirements on designers' algorithm-hardware co-design capabilities and on the compatibility of development tools. In the future, designers may need to carry out model optimization, software quantization, and hardware architecture design simultaneously. We believe that the continued development of HLS as a tool can bridge this gap. HLS uses a high-level, abstract programming language that provides the following benefits: 1) it matches the development habits of software developers; 2) it reduces the learning cost for hardware developers; 3) it is compatible with the development needs of both software and hardware.

    Furthermore, accelerating with GPUs, FPGAs, or ASICs alone has exposed many shortcomings. For example, frequent data exchange leads to unacceptable communication overhead, and DSP utilization is low when the data are irregular. Although this can be mitigated by implementing a CPU core on the FPGA, sacrificing FPGA throughput for flexibility is not economical. We believe that deploying a domain-specific architecture (DSA) on chip for efficient data exchange and scheduling is highly important for optimizing model acceleration. Although platforms such as Zynq and MPSoC are available, their CPU performance still cannot meet higher demands; HARP and Versal ACAP may be candidates, since they provide high-frequency, high-bandwidth connections between the CPU and the PL and have corresponding toolchain support. The benefits they bring are that 1) more hardware resources can be devoted to processing irregular operations instead of routing, and 2) simplified placement and routing can increase the system's operating frequency.

    This research was partly funded by the Industry-University-Research Collaboration Foundation of Fuzhou University (grant number 0101/01011919) and by the Young Scientist Project of the MOE innovation platform.

    The authors declare there is no conflict of interest.



    [1] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., Attention is all you need, in Proceedings of the 31st International Conference on Neural Information Processing Systems, 2017. https://doi.org/10.48550/arXiv.1706.03762
    [2] Q. Wang, B. Li, T. Xiao, J. Zhu, C. Li, D. F. Wong, et al., Learning deep transformer models for machine translation, preprint, arXiv: 1906.01787.
    [3] S. A. Chowdhury, A. Abdelali, K. Darwish, J. Soon-Gyo, J. Salminen, B. J. Jansen, Improving arabic text categorization using transformer training diversification, in Proceedings of the Fifth Arabic Natural Language Processing Workshop (COLING-WANLP), (2020), 226–236. https://aclanthology.org/2020.wanlp-1.21
    [4] X. Ma, P. Zhang, S. Zhang, N. Duan, Y. Hou, M. Zhou, et al., A tensorized transformer for language modeling, preprint, arXiv: 1906.09777.
    [5] J. Devlin, M. W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, preprint, arXiv: 1810.04805.
    [6] Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, et al., RoBERTa: A robustly optimized BERT pretraining approach, preprint, arXiv: 1907.11692.
    [7] H. Xu, B. Liu, L. Shu, P. S. Yu, BERT post-training for review reading comprehension and aspect-based sentiment analysis, preprint, arXiv: 1904.02232.
    [8] P. Shi, J. Lin, Simple BERT models for relation extraction and semantic role labeling, preprint, arXiv: 1904.05255.
    [9] V. Sanh, L. Debut, J. Chaumond, T. Wolf, DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter, preprint, arXiv: 1910.01108.
    [10] Y. Cheng, D. Wang, P. Zhou, T. Zhang, Model compression and acceleration for deep neural networks: The principles, progress, and challenges, IEEE Signal Process. Mag., 35 (2018), 126–136. https://doi.org/10.1109/MSP.2017.2765695 doi: 10.1109/MSP.2017.2765695
    [11] S. Cheng, D. Lucor, J. P. Argaud, Observation data compression for variational assimilation of dynamical systems, J. Comput. Sci., 53 (2021), 101405. https://doi.org/10.1016/j.jocs.2021.101405 doi: 10.1016/j.jocs.2021.101405
    [12] S. Liu, Y. Lin, Z. Zhou, K. Nan, H. Liu, J. Du, On-demand deep model compression for mobile devices: A usage-driven model selection framework, in Proceedings of the 16th Annual International Conference on Mobile Systems, Applications, and Services, (2018), 389–400. https://doi.org/10.1145/3210240.3210337
    [13] S. Liu, J. Du, K. Nan, Z. Zhou, H. Liu, Z. Wang, et al., AdaDeep: A usage-driven, automated deep model compression framework for enabling ubiquitous intelligent mobiles, IEEE Trans. Mob. Comput., 20 (2021), 3282–3297. https://doi.org/10.1109/TMC.2020.2999956 doi: 10.1109/TMC.2020.2999956
    [14] V. L. Tran, S. E. Kim, Efficiency of three advanced data-driven models for predicting axial compression capacity of CFDST columns, Thin-Walled Struct., 152 (2020), 106744. https://doi.org/10.1016/j.tws.2020.106744 doi: 10.1016/j.tws.2020.106744
    [15] Z. X. Hu, Y. Wang, M. F. Ge, J. Liu, Data-driven fault diagnosis method based on compressed sensing and improved multiscale network, IEEE Trans. Ind. Electron., 67 (2020), 3216–3225. https://doi.org/10.1109/TIE.2019.2912763 doi: 10.1109/TIE.2019.2912763
    [16] S. Cheng, I. C. Prentice, Y. Huang, Y. Jin, Y. K. Guo, R. Arcucci, Data-driven surrogate model with latent data assimilation: Application to wildfire forecasting, J. Comput. Phys., 464 (2022). https://doi.org/10.1016/j.jcp.2022.111302
    [17] S. Yang, Z. Zhang, C. Zhao, X. Song, S. Guo, H. Li, CNNPC: End-edge-cloud collaborative CNN inference with joint model partition and compression, IEEE Trans. Parallel Distrib. Syst., (2022), 1–1. https://doi.org/10.1109/TPDS.2022.3177782 doi: 10.1109/TPDS.2022.3177782
    [18] H. He, S. Jin, C. K. Wen, F. Gao, G. Y. Li, Z. Xu, Model-driven deep learning for physical layer communications, IEEE Wireless Commun., 26 (2019), 77–83. https://doi.org/10.1109/MWC.2019.1800447 doi: 10.1109/MWC.2019.1800447
    [19] Z. Liu, M. del Rosario, Z. Ding, A markovian model-driven deep learning framework for massive MIMO CSI feedback, IEEE Trans. Wireless Commun., 21 (2022), 1214–1228. https://doi.org/10.1109/TWC.2021.3103120 doi: 10.1109/TWC.2021.3103120
    [20] W. Wang, F. Wei, L. Dong, H. Bao, N. Yang, M. Zhou, MiniLM: Deep self-attention distillation for task-agnostic compression of pre-trained transformers, preprint, arXiv: 2002.10957.
    [21] X. Jiao, Y. Yin, L. Shang, X. Jiang, X. Chen, L. Li, et al., TinyBERT: Distilling BERT for natural language understanding, preprint, arXiv: 1909.10351.
    [22] S. Sun, Y. Cheng, Z. Gan, J. Liu, Patient knowledge distillation for BERT model compression, preprint, arXiv: 1908.09355.
    [23] H. Touvron, M. Cord, M. Douze, F. Massa, A. Sablayrolles, H. Jegou, Training data-efficient image transformers & distillation through attention, in Proceedings of the 38th International Conference on Machine Learning (ICML), (2021), 10347–10357. https://doi.org/10.48550/arXiv.2012.12877
    [24] P. Michel, O. Levy, G. Neubig, Are sixteen heads really better than one?, Adv. Neural Inf. Process. Syst., preprint, arXiv: 1905.10650.
    [25] M. A. Gordon, K. Duh, N. Andrews, Compressing BERT: Studying the effects of weight pruning on transfer learning, preprint, arXiv: 2002.08307.
    [26] T. Chen, Y. Cheng, Z. Gan, L. Yuan, L. Zhang, Z. Wang, Chasing sparsity in vision transformers: An end-to-end exploration, Adv. Neural Inf. Process. Syst., (2021), 19974–19988. https://doi.org/10.48550/arXiv.2106.04533 doi: 10.48550/arXiv.2106.04533
    [27] T. Chen, J. Frankle, S. Chang, S. Liu, Y. Zhang, Z. Wang, et al., The lottery ticket hypothesis for pre-trained BERT networks, Adv. Neural Inf. Process. Syst., (2020), 15834–15846. https://doi.org/10.48550/arXiv.2007.12223 doi: 10.48550/arXiv.2007.12223
    [28] S. Shen, Z. Dong, J. Ye, L. Ma, Z. Yao, A. Gholami, et al., Q-BERT: Hessian based ultra low precision quantization of BERT, preprint, arXiv: 1909.05840.
    [29] Z. Liu, Y. Wang, K. Han, S. Ma, W. Gao, Post-training quantization for vision transformer, preprint, arXiv: 2106.14156.
    [30] H. Bai, W. Zhang, L. Hou, L. Shang, J. Jin, X. Jiang, et al., BinaryBERT: Pushing the limit of BERT quantization, preprint, arXiv: 2012.15701.
    [31] O. Zafrir, G. Boudoukh, P. Izsak, M. Wasserblat, Q8BERT: Quantized 8Bit BERT, in the 5th Workshop on Energy Efficient Machine Learning and Cognitive Computing-NeurIPS 2019, (2019), 36–39. https://doi.org/10.1109/EMC2-NIPS53020.2019.00016
    [32] Z. Wu, Z. Liu, J. Lin, Y. Lin, S. Han, Lite transformer with long-short range attention, preprint, arXiv: 2004.11886.
    [33] L. Hou, Z. Huang, L. Shang, X. Jiang, X. Chen, Q. Liu, DynaBERT: Dynamic BERT with adaptive width and depth, preprint, arXiv: 2004.04037.
    [34] M. Chen, H. Peng, J. Fu, H. Ling, AutoFormer: Searching transformers for visual recognition, in 2021 IEEE/CVF International Conference on Computer Vision (ICCV), (2021), 12250–12260. https://doi.org/10.1109/ICCV48922.2021.01205
    [35] P. Ganesh, Y. Chen, X. Lou, M. A. Khan, Y. Yang, H. Sajjad, et al., Compressing large-scale transformer-based models: A case study on BERT, Trans. Assoc. Comput. Linguist., 9 (2021), 1061–1080. https://doi.org/10.1162/tacl_a_00413 doi: 10.1162/tacl_a_00413
    [36] S. Hochreiter, J. Schmidhuber, Long short-term memory, Neural Comput., 9 (1997), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735 doi: 10.1162/neco.1997.9.8.1735
    [37] J. Chung, C. Gulcehre, K. Cho, Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, preprint, arXiv: 1412.3555.
    [38] D. Bahdanau, K. Cho, Y. Bengio, Neural machine translation by jointly learning to align and translate, preprint, arXiv: 1409.0473.
    [39] B. Li, S. Pandey, H. Fang, Y. Lyv, J. Li, J. Chen, et al., FTRANS: energy-efficient acceleration of transformers using FPGA, in Proceedings of the ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), (2020), 175–180. https://doi.org/10.1145/3370748.3406567
    [40] T. J. Ham, S. J. Jung, S. Kim, Y. H. Oh, Y. Park, Y. Song, et al., A3: Accelerating attention mechanisms in neural networks with approximation, in 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA), (2020), 328–341. https://doi.org/10.1109/HPCA47549.2020.00035
    [41] T. J. Ham, Y. Lee, S. H. Seo, S. Kim, H. Choi, S. J. Jung, et al., ELSA: Hardware-software co-design for efficient, lightweight self-attention mechanism in neural networks, in 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA), (2021), 692–705. https://doi.org/10.1109/ISCA52012.2021.00060
    [42] X. Zhang, Y. Wu, P. Zhou, X. Tang, J. Hu, Algorithm-hardware co-design of attention mechanism on FPGA devices, ACM Trans. Embed. Comput. Syst., 20 (2021), 1–24. https://doi.org/10.1145/3477002 doi: 10.1145/3477002
    [43] S. Lu, M. Wang, S. Liang, J. Lin, Z. Wang, Hardware accelerator for multi-head attention and position-wise feed-forward in the transformer, in IEEE International SOC Conference, (2020), 84–89. https://doi.org/10.1109/ISCA52012.2021.00060
    [44] A. Parikh, O. Täckström, D. Das, J. Uszkoreit, A decomposable attention model for natural language inference, in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, (2016), 2249–2255. https://doi.org/10.48550/arXiv.1606.01933
    [45] Z. Lin, M. Feng, C. N. dos Santos, M. Yu, B. Xiang, B. Zhou, et al., A structured self-attentive sentence embedding, preprint, arXiv: 1703.03130
    [46] M. S. Charikar, Similarity estimation techniques from rounding algorithms, in Proceedings of the Thiry-Fourth Annual ACM Symposium on Theory of Computing, (2002), 380–388. https://doi.org/10.1145/509907.509965
    [47] X. Zhang, F. X. Yu, R. Guo, S. Kumar, S. Wang, S. F. Chang, Fast orthogonal projection based on kronecker product, in 2015 IEEE International Conference on Computer Vision (ICCV), (2015), 2929–2937. https://doi.org/10.1109/ICCV.2015.335
    [48] Y. Gong, S. Kumar, H. A. Rowley, S. Lazebnik, Learning binary codes for high-dimensional data using bilinear projections, in 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), (2013), 484–491. https://doi.org/10.1109/CVPR.2013.69
    [49] M. Wang, S. Lu, D. Zhu, J. Lin, Z. Wang, A high-speed and low-complexity architecture for softmax function in deep learning, in 2018 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS), (2018), 223–226. https://doi.org/10.1109/APCCAS.2018.8605654
    [50] R. Hu, B. Tian, S. Yin, S. Wei, Efficient hardware architecture of softmax layer in deep neural network, in 2018 IEEE 23rd International Conference on Digital Signal Processing (DSP), (2018), 1–5. https://doi.org/10.1109/ICDSP.2018.8631588
    [51] L. Deng, G. Li, S. Han, L. Shi, Y. Xie, Model compression and hardware acceleration for neural networks: A comprehensive survey, Proc. IEEE, 108 (2020), 485–532. https://doi.org/10.1109/JPROC.2020.2976475 doi: 10.1109/JPROC.2020.2976475
    [52] C. Ding, S. Liao, Y. Wang, Z. Li, N. Liu, Y. Zhuo, et al., CirCNN: Accelerating and compressing deep neural networks using block-circulant weight matrices, in Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), (2017), 395–408. https://doi.org/10.1145/3123939.3124552
    [53] S. Wang, Z. Li, C. Ding, B. Yuan, Q. Qiu, Y. Wang, et al., C-LSTM: Enabling efficient LSTM using structured compression techniques on FPGAs, in Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), (2018), 11–20. https://doi.org/10.1145/3174243.3174253
    [54] L. Zhao, S. Liao, Y. Wang, Z. Li, J. Tang, B. Yuan, Theoretical properties for neural networks with weight matrices of low displacement rank, in Proceedings of the 34th International Conference on Machine Learning (ICML), (2017), 4082–4090. https://doi.org/10.48550/arXiv.1703.00144
    [55] V. Y. Pan, Structured matrices and displacement operators, in Structured Matrices and Polynomials: Unified Superfast Algorithms, Springer Science & Business Media, (2001), 117–153. https://doi.org/10.1007/978-1-4612-0129-8
    [56] J. O. Smith, Mathematics of the discrete fourier transform (DFT): with audio applications, in Mathematics of the Discrete Fourier Transform (DFT): With Audio Applications, Julius Smith, (2007), 115–164. https://ccrma.stanford.edu/~jos/st/
    [57] Z. Liu, G. Li, J. Cheng, Hardware acceleration of fully quantized BERT for efficient natural language processing, in 2021 Design, Automation & Test in Europe Conference & Exhibition (DATE), (2021), 513–516. https://doi.org/10.23919/DATE51398.2021.9474043
    [58] M. Sun, H. Ma, G. Kang, Y. Jiang, T. Chen, X. Ma, et al., VAQF: Fully automatic software-hardware co-design framework for low-bit vision transformer, preprint, arXiv: 2201.06618.
    [59] Z. Liu, Z. Shen, M. Savvides, K. T. Cheng, ReActNet: Towards precise binary neural network with generalized activation functions, in Computer Vision–ECCV 2020 (ECCV), (eds. Vedaldi. A., Bischof. H., Brox. T., Frahm. J.-M.), Cham, Springer International Publishing, (2020), 143–159. https://doi.org/10.1007/978-3-030-58568-6_9
    [60] M. Rastegari, V. Ordonez, J. Redmon, A. Farhadi, XNOR-Net: ImageNet classification using binary convolutional neural networks, in Computer Vision–ECCV 2016 (ECCV), (eds. Leibe. B., Matas. J., Sebe. N., Welling. M.), Cham, Springer International Publishing, (2016), 525–542. https://doi.org/10.1007/978-3-319-46493-0_32
    [61] S. Han, H. Mao, W. J. Dally, Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding, preprint, arXiv: 1510.00149.
    [62] W. Wen, C. Wu, Y. Wang, Y. Chen, H. Li, Learning structured sparsity in deep neural networks, in Advances in Neural Information Processing Systems (NeurIPS), Curran Associates, (2016). https://doi.org/10.48550/arXiv.1608.03665
    [63] X. Ma, F. M. Guo, W. Niu, X. Lin, J. Tang, K. Ma, et al., PCONV: The missing but desirable sparsity in DNN weight pruning for real-time execution on mobile devices, in Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), (2020), 5117–5124. https://doi.org/10.1609/aaai.v34i04.5954
    [64] B. Li, Z. Kong, T. Zhang, J. Li, Z. Li, H. Liu, et al., Efficient transformer-based large scale language representations using hardware-friendly block structured pruning, preprint, arXiv: 2009.08065.
    [65] S. Cao, C. Zhang, Z. Yao, W. Xiao, L. Nie, D. Zhan, et al., Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity, in Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), (2019), 63–72. https://doi.org/10.1145/3289602.3293898
    [66] H. Peng, S. Huang, T. Geng, A. Li, W. Jiang, H. Liu, et al., Accelerating transformer-based deep learning models on FPGAs using column balanced block pruning, in 2021 22nd International Symposium on Quality Electronic Design (ISQED), (2021), 142–148. https://doi.org/10.1109/ISQED51717.2021.9424344
    [67] C. Ding, A. Ren, G. Yuan, X. Ma, J. Li, N. Liu, et al., Structured weight matrices-based hardware accelerators in deep neural networks: FPGAs and ASICs, in Proceedings of the 2018 on Great Lakes Symposium on VLSI (GLSVLSI), Chicago, IL, USA, Association for Computing Machinery, (2018), 353–358. https://doi.org/10.1145/3194554.3194625
    [68] S. Narang, E. Undersander, G. Diamos, Block-sparse recurrent neural networks, preprint, arXiv: 1711.02782.
    [69] P. Qi, E. H. M. Sha, Q. Zhuge, H. Peng, S. Huang, Z. Kong, et al., Accelerating framework of transformer by hardware design and model compression co-optimization, in 2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), (2021), 1–9. https://doi.org/10.1109/ICCAD51958.2021.9643586
    [70] P. Qi, Y. Song, H. Peng, S. Huang, Q. Zhuge, E. H. M. Sha, Accommodating transformer onto FPGA: Coupling the balanced model compression and FPGA-implementation optimization, in Proceedings of the 2021 on Great Lakes Symposium on VLSI (GLSVLSI), Virtual Event, USA, Association for Computing Machinery, (2021), 163–168. https://doi.org/10.1145/3453688.3461739
    [71] D. So, Q. Le, C. Liang, The evolved transformer, in Proceedings of the 36th International Conference on Machine Learning (ICML), PMLR, (2019), 5877–5886. https://doi.org/10.48550/arXiv.1901.11117
    [72] H. Wang, Efficient algorithms and hardware for natural language processing, Graduate Theses, Retrieved from the Massachusetts Institute of Technology, 2020. https://hdl.handle.net/1721.1/127440.
    [73] H. Sharma, J. Park, N. Suda, L. Lai, B. Chau, V. Chandra, et al., Bit fusion: Bit-Level dynamically composable architecture for accelerating deep neural network, in 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA), (2018), 764–775. https://doi.org/10.1109/ISCA.2018.00069
    [74] R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. Donato, J. Dongarra, et al., Templates for the solution of linear systems: Building blocks for iterative methods, in Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, Society for Industrial and Applied Mathematics, (1994), 39–55. https://doi.org/10.1137/1.9781611971538
    [75] W. Liu, B. Vinter, CSR5: An efficient storage format for cross-platform sparse matrix-vector multiplication, in Proceedings of the 29th ACM on International Conference on Supercomputing (ICS), Newport Beach, California, USA, Association for Computing Machinery, (2015), 339–350. https://doi.org/10.1145/2751205.2751209
    [76] R. Kannan, Efficient sparse matrix multiple-vector multiplication using a bitmapped format, in 20th Annual International Conference on High Performance Computing (HiPC), (2013), 286–294. https://doi.org/10.1109/HiPC.2013.6799135
    [77] W. Jiang, X. Zhang, E. H. M. Sha, L. Yang, Q. Zhuge, Y. Shi, et al., Accuracy vs. efficiency: achieving both through FPGA-implementation aware neural architecture search, in Proceedings of the 56th Annual Design Automation Conference 2019 (DAC), Las Vegas NV USA, ACM, (2019), 1–6. https://doi.org/10.1145/3316781.3317757
    [78] W. Jiang, E. H. M. Sha, X. Zhang, L. Yang, Q. Zhuge, Y. Shi, et al., Achieving super-linear speedup across multi-FPGA for real-time DNN inference, preprint, arXiv: 1907.08985.
    [79] W. Jiang, X. Zhang, E. H. M. Sha, Q. Zhuge, L. Yang, Y. Shi, et al., XFER: A novel design to achieve super-linear performance on multiple FPGAs for real-time AI, in Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA), Seaside, CA, USA, Association for Computing Machinery, (2019), 305. https://doi.org/10.1145/3289602.3293988
    © 2022 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)