
This paper presents a nonlinear robust H-infinity control strategy for improving trajectory following performance of autonomous ground electric vehicles (AGEV) with active front steering system. Since vehicle trajectory dynamics inherently influenced by various driving maneuvers and road conditions, the main objective is to deal with the trajectory following control challenges of parametric uncertainties, system nonlinearities, and external disturbance. The AGEV system dynamics and its uncertain vehicle trajectory following system are first modeled and constructed, in which parameter uncertainties related to the physical limits of tire are considered and handled, then the control-oriented vehicle trajectory following augmented system with dynamic error is developed. The resulting nonlinear robust H-infinity state-feedback controller (NHC) of vehicle trajectory-following system is finally designed by H-infinity performance index and nonlinear compensation under AGEV system requirements, and solved utilizing a set of linear matrix inequalities derived from quadratic H-infinity performance and Lyapunov stability. Simulations for double lane change and serpentine scenes are carried out to verify the effectiveness of the proposed controller with a high-fidelity, CarSim®, full-vehicle model. It is found from the results that the proposed NHC provides improved vehicle trajectory following performance compared with the linear quadratic regulator (LQR) controller and robust H-infinity state-feedback controller (RHC).
Citation: Xianjian Jin, Qikang Wang, Zeyuan Yan, Hang Yang. Nonlinear robust control of trajectory-following for autonomous ground electric vehicles with active front steering system[J]. AIMS Mathematics, 2023, 8(5): 11151-11179. doi: 10.3934/math.2023565
[1] | Saulo Orizaga, Ogochukwu Ifeacho, Sampson Owusu . On an efficient numerical procedure for the Functionalized Cahn-Hilliard equation. AIMS Mathematics, 2024, 9(8): 20773-20792. doi: 10.3934/math.20241010 |
[2] | Martin Stoll, Hamdullah Yücel . Symmetric interior penalty Galerkin method for fractional-in-space phase-field equations. AIMS Mathematics, 2018, 3(1): 66-95. doi: 10.3934/Math.2018.1.66 |
[3] | Naveed Iqbal, Mohammad Alshammari, Wajaree Weera . Numerical analysis of fractional-order nonlinear Gardner and Cahn-Hilliard equations. AIMS Mathematics, 2023, 8(3): 5574-5587. doi: 10.3934/math.2023281 |
[4] | Mingliang Liao, Danxia Wang, Chenhui Zhang, Hongen Jia . The error analysis for the Cahn-Hilliard phase field model of two-phase incompressible flows with variable density. AIMS Mathematics, 2023, 8(12): 31158-31185. doi: 10.3934/math.20231595 |
[5] | Chaeyoung Lee, Sangkwon Kim, Soobin Kwak, Youngjin Hwang, Seokjun Ham, Seungyoon Kang, Junseok Kim . Semi-automatic fingerprint image restoration algorithm using a partial differential equation. AIMS Mathematics, 2023, 8(11): 27528-27541. doi: 10.3934/math.20231408 |
[6] | Joseph L. Shomberg . Well-posedness and global attractors for a non-isothermal viscous relaxationof nonlocal Cahn-Hilliard equations. AIMS Mathematics, 2016, 1(2): 102-136. doi: 10.3934/Math.2016.2.102 |
[7] | Haifeng Zhang, Danxia Wang, Zhili Wang, Hongen Jia . A decoupled finite element method for a modified Cahn-Hilliard-Hele-Shaw system. AIMS Mathematics, 2021, 6(8): 8681-8704. doi: 10.3934/math.2021505 |
[8] | Alain Miranville . The Cahn–Hilliard equation and some of its variants. AIMS Mathematics, 2017, 2(3): 479-544. doi: 10.3934/Math.2017.2.479 |
[9] | Pierluigi Colli, Gianni Gilardi, Jürgen Sprekels . Distributed optimal control of a nonstandard nonlocal phase field system. AIMS Mathematics, 2016, 1(3): 225-260. doi: 10.3934/Math.2016.3.225 |
[10] | Hyun Geun Lee . A mass conservative and energy stable scheme for the conservative Allen–Cahn type Ohta–Kawasaki model for diblock copolymers. AIMS Mathematics, 2025, 10(3): 6719-6731. doi: 10.3934/math.2025307 |
This paper presents a nonlinear robust H-infinity control strategy for improving trajectory following performance of autonomous ground electric vehicles (AGEV) with active front steering system. Since vehicle trajectory dynamics inherently influenced by various driving maneuvers and road conditions, the main objective is to deal with the trajectory following control challenges of parametric uncertainties, system nonlinearities, and external disturbance. The AGEV system dynamics and its uncertain vehicle trajectory following system are first modeled and constructed, in which parameter uncertainties related to the physical limits of tire are considered and handled, then the control-oriented vehicle trajectory following augmented system with dynamic error is developed. The resulting nonlinear robust H-infinity state-feedback controller (NHC) of vehicle trajectory-following system is finally designed by H-infinity performance index and nonlinear compensation under AGEV system requirements, and solved utilizing a set of linear matrix inequalities derived from quadratic H-infinity performance and Lyapunov stability. Simulations for double lane change and serpentine scenes are carried out to verify the effectiveness of the proposed controller with a high-fidelity, CarSim®, full-vehicle model. It is found from the results that the proposed NHC provides improved vehicle trajectory following performance compared with the linear quadratic regulator (LQR) controller and robust H-infinity state-feedback controller (RHC).
Phase field models are a very important class of nonlinear partial differential equations that are used to model a number of physical processes ranging from bio-medical applications to metallurgy. That was the original application considered when the Cahn-Hilliard (CH) equation was first introduced in 1958 [1]. The Cahn-Hilliard equation is a fourth-order nonlinear partial differential equation (PDE) for which explicit methods are not practical since they would require very small steps for numerical stability (h≤Odx4), where h is the time step and dx is the grid spacing in one dimension. On the other hand, fully implicit methods would ensure stability but can be computationally expensive for problems in more than one dimension. For this reason, a number of research works have been devoted to the efficient implementation and computation of the CH equation via semi-implicit methods. David Eyre proposed a very elegant and simple approach to compute the CH equation in what is known as the convexity-splitting (CS) approach [2]. The main idea of the method was to re-express the energy in terms of convex and concave parts, leading to energy-stable solutions for the problem. Since then, a number of extensions based on the CS approach have been implemented to solve the CH with improved order of accuracy [3,4,5,6].
Another formulation that was proposed to solve the CH equation was presented by Andrea Bertozzi [7]. The main idea is to re-express the CH equation with a bi-harmonic term that is to be computed at the implicit level. This, in turn provides another efficient computational approach termed the bi-harmonic-modified method (BHM). The BHM approach can also be understood as a splitting method with an stabilizer parameter. Another more recent formulation for the CH equation is found in the scalar auxiliary variable (SAV) approach, which requires a reformulation of the energy and the use of stabilizer parameters to produce energy-stable results [8,9,10].
The CH equation is an important model for various physical processes in science and engineering since it can be easily extended to the diblock copolymer (BCP) equation [11]. The BCP equation has been used to understand self-assembly properties of block copolymers and their relevance to higher-quality material production. Another extension of the CH equation is the phase-field crystal PFC equation [12], which is used to study the crystal formation at the atomic level in a solidification process. Several authors have worked in the efficient computation of the BCP and PFC equations [6,11,13,14,15,16]. In addition, other important applications of the CH equation are image inpainting [17] and bio-medical applications, such as, tumor growth and drug-delivery [18,19].
The purpose of this paper is to provide a numerical approach for materials science models based on the BHM formulation that is efficient, accurate, and easy to implement for solving phase-field models. In terms of ease of implementation, we believe the CS and BHM method are the best candidates so that our methodology can be used among a broad range of researchers from different disciplines. Also, we aim to propose a simple approach to speed up computations of the proposed schemes via a graphics processing unit (GPU) implementation.
In this paper, we first compare the bi-harmonic-modified (BHM) approach against the more well-known CS approach in terms of accuracy and CPU computation time. We then construct our schemes using the BHM method and couple it, for increased accuracy, with an implicit-explict (IMEX) time stepping formulation. In Section 2, we introduce the CH equation. In Section 3, we introduce the CS, BHM, and IMEX approaches for solving the CH equation. In Section 4, we perform numerical experiments in 2D to test the performance of the algorithms and to display numerical solutions over time. In Section 5, we show how the algorithms can be adapted to 3D simulations and test their performance. Finally, we also show how to speed up 3D simulations by performing graphics processing unit (GPU) computations by using simple MATLAB commands. The main goal of this computational paper is to contribute to the acceleration of methods for related Poisson problems and CH equations in 2D and 3D.
The Cahn-Hilliard energy is given by [1]
E(u)=∫Ωϵ22|∇u|2+W(u)dx, | (2.1) |
where u(x,t) is the phase-field variable, which represents a volume fraction of one component, and W(u) is a double-well potential given by
W(u)=14(u2−1)2, | (2.2) |
which gives that W′(u)=u3−u. W(u) has two minima at u=1 and u=−1, which are associated with the pure state of the materials in the binary mixture; ϵ defines the transition layer thickness separating the two materials. Ω will be considered as a box, Ω=[0,L]d, in 2D or 3D (d=2,3).
The CH equation is the H−1 gradient flow of the energy E(u) that is given in terms of the chemical potential μ=−ϵ2Δu+W′(u). The CH equation reads
∂u∂t=−ϵ2Δ2u+Δ(u3−u). | (2.3) |
The solution u for the CH equation, at the continuous level, will evolve into configurations such that the energy is always decreasing, that is
ddtE(u)=−∫Ω|∇μ|2≤0. |
For this reason, it is very important to design numerical methods that retain this property.
An elegant and very efficient way to generate numerical solutions for the CH equation was proposed by Eyre in 1998 [2]. The method consists of splitting the CH energy into convex and concave parts. The energy can be split in the following way
E(u)=∫Ωϵ22|∇u|2+W(u)dx=∫Ωϵ22|∇u|2+au22+(u44−(1+a)u22)dx, | (3.1) |
which gives the form of the energy E(u)=E+(u)+E−(u) in terms of a convex and concave part, provided that the splitting parameter a>2. The convex part is to be computed implicitly while the concave part is to be computed explicitly. The convexity-splitting applied to the CH equation gives
∂u∂t=(−ϵ2Δ2+aΔ)u+Δ[u3−(1+a)u]. | (3.2) |
We denote time-discretized approximation of the solution as u(xi,tn)≈Un,i where the discrete times will be expressed with respect to the local timestep, h, by tn+1=tn+h. Using the backward Euler difference for the time derivative, it gives
Un+1−Unh=(−ϵ2Δ2+aΔ)Un+1+Δ[U3n−(1+a)Un], | (3.3) |
which is called the convexity-splitting CS scheme. For clarity of presentation, we show the form of the CS scheme in Fourier pseudo-spectral notation [20]. The approximation for U on [0,2π] takes the form (2D case)
U≈N∑kx=1N∑ky=1ˆU(kx,ky,t)exp[2πi(ω(kx)x+ω(ky)y)], |
where ω(kx), ω(ky) are the wave-numbers, k2=ω(kx)2+ω(ky)2, and ˆU is computed using the discrete Fourier transform. Applying the Fourier transform to (3.3)
ˆUn+1−ˆUnh=(−ϵ2k4−ak2)ˆUn+1−k2[^U3n−(1+a)ˆUn], | (3.4) |
Using ^Δ2u=k4ˆu, ^Δu=−k2ˆu, and solving for ˆUn+1, one gets
ˆUn+1=ˆUn−hk2[^U3n−(1+a)ˆUn]1+h(ϵ2k4+ak2), | (3.5) |
where the inverse Fourier transform is applied to the above scheme to obtain Un+1. For more details in the CS method and for an energy decreasing proof, the readers are referred to [6,21].
Another important splitting leading to energy-decreasing property was proposed by Bertozzi for CH equations with variable mobility [7]. We start with the CH equation written as
∂u∂t=∇⋅(M(u)∇[−ϵ2Δu+W′(u)]), | (3.6) |
where M(u) is the variable mobility. We then distribute the divergence operator
∂u∂t=∇⋅(M(u)∇[−ϵ2Δu])+∇⋅(M(u)∇W′(u)), | (3.7) |
and introduce a splitting parameter M1 to re-express the first mobility function that appears in the above equation with M(u)=M(u)−M1+M1, which gives
∂u∂t=−M1ϵ2Δ2u+ϵ2∇⋅[(M(u)−M1)∇Δu]+∇⋅[M(u)∇W′(u)], | (3.8) |
which gives the original BHM approach applied to the CH equation with variable mobility. However, if we consider the case where M(u) is a constant, then the splitting reduces to the following
∂u∂t=−M1ϵ2Δ2u−ϵ2(M(u)−M1)Δ2u+M(u)ΔW′(u), | (3.9) |
where the bi-harmonic term is to be computed implicitly. We then approximate the time derivative to arrive at the BHM scheme, which reads
Un+1−Unh=−M1ϵ2Δ2Un+1−ϵ2(M(u)−M1)Δ2Un+M(u)Δ(U3n−Un), | (3.10) |
where we will always assume, for the rest of the paper, that M(u)=1. We note that (3.10) is a scheme that, to the best of our knowledge, has not been applied to the CH equation with constant mobility, since this case of CH equation has traditionally been computed using the celebrated CS method. We also remark that the BHM method was originally formulated for the case of variable mobility [7]. The Fourier pseudo-spectral notation for the BHM scheme in (3.10) then reads
^Un+1=^Un−hϵ2(M(u)−M1)k2^Un−M(u)k2[^U3n−^Un]1+hM1ϵ2k4. | (3.11) |
For more details on the BHM method and related applications of phase-field models to fluid dynamics, the readers are referred to the work of Bertozzi [7].
For a stability criteria of the BHM method given by (3.11), we will use the approach presented in [22]. Equation (3.9) is of the form ut=G(u), where G(u)=Ψ(u)+Φ(u). Here, Ψ(u)=−M1ϵ2Δ2u and Φ(u)=−ϵ2(M(u)−M1)Δ2u+M(u)ΔW′(u). Using the expression ^Φ(Un)=^G(Un)−^Ψ(Un) in (3.11) gives the following
^Un+1=^Un+h^G(Un)1+hM1ϵ2k4. | (3.12) |
To analyze the conditions for linear stability, we consider the highest order terms in G(u) to arrive at the approximation ^G(Un+en)≈^G(Un)−Coϵ2k4^en, which is then used in (3.12) to obtain the amplification factor associated with the growth of the errors in Un. The amplification factor is given by
σ=1−hCoϵ2k41+hM1ϵ2k4. | (3.13) |
The conditions to guarantee stability of the THM scheme, require that |σ|<1, so one gets
−1<1−hCoϵ2k41+hM1ϵ2k4<1,−2<−hCoϵ2k41+hM1ϵ2k4<0. |
Given the quantities considered in this formulation, the less than zero inequality is satisfied, so one works with
−2(1+hM1ϵ2k4)<−hCoϵ2k4, |
which can be further reduced by re-arranging the terms to arrive at the following inequality
ϵ2hk4(Co−2M1)<2. |
In order to guarantee stability (|σ|<1), we satisfy the above inequality with the requirement that (Co−2M1)<0, which gives M1>Co/2. In the context of a constant mobility case, we can set Co=M, which gives a minimum value for the splitting parameter in the BHM method (3.11). We note that in the analysis performed in [7] a criteria of M1>M was obtained for the original BHM approach. Following the approach in [22], we are able to get an improved criteria for this splitting parameter threshold. We will use M1>M/2 for all the simulations presented in this paper. Similarly, for the CS method, we are able to obtain a stability criteria for the CS parameter a>2 to guarantee energy decreasing property for the scheme. With regards to further details on the CS method and the corresponding energy decreasing property of the CS method, an elegant proof can be found in the work presented by Shen [21]. In addition, a proof based on a functional formulation can be found in [6]. Furthermore, for the two mentioned splitting approaches, it has been shown computationally that reducing the values of the splitting parameters tends to improve accuracy at the expense of introducing numerical instability, while increasing the splitting parameter increases error and provides numerical stability [6,23].
IMEX methods can be understood as time-stepping techniques that are useful after a splitting has taken place in a given model equation. IMEX schemes are known to preserve the energy decreasing property of original formulations (CS and BHM splittings) while increasing the temporal accuracy [23,24]. Our schemes given by (3.3) and (3.10) can be represented in the following form
ut=Ψ(Un+1)+Φ(Un), | (3.14) |
which are then suitable for an implicit-explicit (IMEX) Runge-Kutta (RK) time-stepping technique [25,26]. The main idea of the IMEX formulation is to treat Ψ implicitly and the stiff term Φ explicitly. The IMEX schemes read
U(1)=Un+h(αΨ(U(1))+αΦ(Un)), | (3.15) |
Un+1=Un+h(αΨ(U(n+1)+βΨ(U(1))+γΦ(U(1))+ωΦ(U(n))), | (3.16) |
where α=(2−√2)/2, β=√2/2, γ=1/(2−√2) and ω=(1−√2)/(2−√2). This scheme is a stage 2 scheme and is of second-order accuracy. Several authors have implemented IMEX methods for solving phase-field models in the past. Song [5] proposed an IMEX time-stepping approach for the CH equation. Ceniceros [27] proposed an IMEX time-stepping approach for a CH equation with variable mobility. The IMEX schemes for the BHM approach are different than all the mentioned ones since those authors did not rely on the BHM approach as a basis for their splitting.
The CS method was originally developed for the CH equation with constant mobility, M(u)=1, and the method, which is an order-one scheme, has seen a number of improvements in terms of accuracy. For example, the authors in [6] proposed several schemes that improved the accuracy of the CS method with the use of several extrapolation techniques, iterations, and different time stepping techniques based on backward differentiation formulas (BDF) [28]. In this section, we will perform a direct comparison between the basic form of the CS and BHM method in terms of accuracy. All numerical experiments will be performed on a box subject to periodic boundary conditions, Ω=[0,2π]d, d=2,3 for 2D or 3D space dimensions.
We compare the performance of the CS and BHM scheme by solving the CH equation (2.3) with d=2 on Ω=[0,2π]2 subject to periodic boundary conditions using the initial condition
u(x,y,0)=0.1+0.01rand(x,y), | (4.1) |
which is a suitable benchmark problem for capturing the phase separation of two materials. The random initial condition was evolved for one second to exhibit a well-defined separation process and this solution state was used as the actual initial condition. The simulation using the BHM scheme (3.10) with ϵ=0.05,M=1,M1=5, and tf=1.0 is presented in Figure 1. The solution of the pure state u≈1 is represented in yellow and u≈−1 is represented in blue.
An accurate reference solution Uref(x,y,t) was constructed by using the BHM scheme with a small timestep size of h=10−6 and a final configuration of tf=1.0. We then compute the numerical solution using CS and BHM scheme with difference choices of h and compare against the reference solution at tf=1.0. Computing the error with the formula:
Error(h)=∫Ω|U(x,y,tf;h)−Uref(x,y,tf)|dx. | (4.2) |
The computed errors are presented in Figure 2. The error plot demonstrates that both schemes are able to produce errors that follow the expected order-one convergence, and the errors produced by the BHM scheme are found to be smaller than those obtained by the CS scheme. In summary, we found that the BHM is not only reliable to solve the CH with constant mobility but is also an improvement over the CS method. The BHM-IMEX scheme is used to showcase the ability for the BHM scheme to achieve smaller errors while reaching second-order accuracy (see blue error plot in Figure 2).
We also include another benchmark problem for the CH equation (2.3) in 2D by simulating the coalescence of two drops in Ω=[0,2π]2. The initial condition is given by
u(x,y,t=0)={1,(L/9)2<(x−L/2.8)2+(y−L/2)2,1,(L/9)2<(x−b/1.7)2+(y−L/2)2,−1,otherwise, | (4.3) |
where L=2π. The simulation results using the BHM approach are presented in Figure 3. The corresponding energy evolution is presented in Figure 4. The merging of the drops starts around t=1. Then, by the action of mass combination, the drops exhibit an elliptical shape around t=10, which is then followed by formation of a circular drop around time t=20. We test for the computation time for the CS and BHM methods to solve the drop collision problem with a final time of tf=50. Results are presented in Table 1. The results indicate that the BHM method can be slightly more computationally expensive than the CS method, but this comes with benefits of added accuracy. The IMEX formulation is not much more expensive than BHM for the case of N=1024, which allows for errors to reach second-order accuracy. We also note that the methods presented in this paper, namely CS, BHM, and BHM-IMEX (coupled with a backward Euler approximation), are semi-implicit methods, which are very efficient for 2D problems and can run on a modest laptop [6,23].
N | CS | BHM | IMEX |
256 | 13.76 | 13.62 | 18.51 |
512 | 18.18 | 23.94 | 36.45 |
1024 | 59.17 | 120.74 | 149.13 |
Now that the schemes have been shown to produce accurate results in 2D, we demonstrate the capability of the schemes for working in full 3D space variables. In addition, this section also aims to present the efficient computation and simulation of our schemes via GPU implementation.
The first experiment we present for timing of computations is the Poisson problem in 2D subject to periodic boundary conditions. This numerical experiment is relevant since the Poisson problem is often encountered as one solves more complicated materials science models, very useful as a benchmark problem [13].
Δu=f(x,y). | (5.1) |
For the above problem, we follow the method of manufactured solutions [29,30,31]. The problem is solved on Ω=[0,2π]2. The Fourier spectral scheme [20] for Eq (5.1) is then formulated as
^Δu=ˆf. | (5.2) |
Using ^Δu=−k2ˆu, where k represents the vector containing the wave numbers for the x and y direction, the following equation is given
k2ˆu=−ˆf. | (5.3) |
The equation given by (5.3) can be solved as ˆu=−ˆf/k2; then, use the inverse fast Fourier transform (IFFT) to obtain the solution u for (5.1). We note, however, that the zero Fourier mode will generate an issue during the inversion process. It is possible to simply consider the minimization of the quantity (Δu−f) since f is provided and u was manufactured; we can approximate Δu≈IFFT(−k2ˆu). Another option that would allow to complete the inversion step in the scheme is to solve a modified or perturbed Poisson problem
Δu+ϵu=f(x,y), | (5.4) |
where ϵ=0.1, the exact solution is u=cos(2x)sin(2y) and f(x,y)=−7.9cos(2x)sin(2y). The corresponding formula for ˆu is given by
ˆu=ˆf−k2+ϵ, | (5.5) |
which is now able to be fully inverted via IFFT. The surface plots for u and f are presented in Figure 5 and the computation times for various values of N are given in Table 2.
N | CPU |
128 | 0.0019 |
256 | 0.0032 |
1024 | 0.026 |
2048 | 0.11 |
4096 | 0.43 |
8192 | 1.91 |
The computations for the 2D Poisson problem are very efficient and our CPU is able to handle high resolution grids, N=8192, in just under 2 seconds. This is a considerable task since for this value of N, our 2D array has N2 entries. The efficiency in computation is attributed to the fast Fourier transform (FFT), which is highly optimized in MATLAB. Regarding the spectral error for our test problem, as we varied N for larger sizes, decreasing h, the errors remained under a 10−15 threshold. A representative error plot is given in Figure 6 with N=256.
We now extend the formulation for the Poisson problem in full 3D space variables. We solve the same problem given by Eq (5.1) on Ω=[0,2π]3 with the corresponding exact solution u=cos(2x)sin(2y)cos(2z) and f=−11.9cos(2x)sin(2y)cos(2z) as for simplicity, we have adopted similar form for u as we move from the 2D to 3D problem. Corresponding 3D plots for u and f are given in Figure 7. We solve the Poisson equation in 3D by using the spectral scheme given by Eq (5.5). To declare variables on the GPU, the simple MATLAB command can be used
![]() |
which then loads up the solution u and forcing term f into the GPU so that computations can be performed using the multiple cores provided by GPU architecture. For more details of the actual code used, please see the Appendix. Computation time for both CPU and GPU are presented in Table 3.
N | CPU | 2060 Max-Q | A6000 |
32 | 0.0018 | 0.0016 | 0.0007 |
64 | 0.0051 | 0.0021 | 0.0017 |
128 | 0.037 | 0.0024 | 0.0026 |
256 | 0.35 | 0.0069 | 0.0058 |
512 | 2.79 | 2.47 | 0.089 |
From Table 3, we learn that it is possible to speed up computations by a factor of about 2X using the GPU. We also note that for a single execution, CPU and GPU computations could be comparable; in fact, the CPU can be very competitive to the GPU in terms of computation time for the case of N=512. This is attributed to the fact that the GPU memory for the 2060 at N=512 is reaching its maximum capacity of about 8 GB and the CPU has 64 GB of on-board memory. The slowing down in computation at N=512 is not observed when using a more capable GPU with higher memory, as shown in the third column of Table 3. These computation times are for the A6000 (a higher-end GPU with 48 GB of memory). GPU computations allow to speed up computations by several factors, as shown in the recent work from Liu et al. related to CH problems [32]. Other GPU accelerations were reported in the work of Lam et al. for the thin-film TF equation from lubrication theory [33]. In the next section, we will show how this is possible by using the BHM method along with simple commands that will allow GPU computations.
We consider the CH equation in full 3D space variables using the BHM scheme (3.10) subject to periodic boundary conditions. We consider the CH equation (2.3) with an initial condition representing a mixed state of materials with u≈0.5 given by
u(x,y,z,0)=0.5+0.01rand(x,y,z), | (5.6) |
with ϵ=0.05,M1=5 on Ω=[0,2π]3 using N=256. The simulation dynamics with a final computation time of tf=100 are presented in Figure 8. The simulation illustrates the phase-separation process of a binary mixture in full 3D space dimensions with uneven volume fraction of materials. The circular phase, which is traditionally expected in 2D simulations, has translated into spheres in full 3D computations.
In addition to showcasing the schemes in 3D, we are also interested in speeding up the computations to counter act the more expensive computations associated with the larger arrays that result from the additional dimension. It is also relevant to explore ways that are efficient, yet easy to implement, for speeding up the computations for the case of large-scale dynamics or long-term behavior of solutions [33,34,35,36]. We concentrate on the computation presented in Figure 8 for tf=100. This computation, with the given time step and 10,000 iterations, presents a good opportunity to benchmark the computational performance of the GPU implementation. Our reference point will be the computation time required by our CPU alone, which is an Intel-Core i9 with Windows 10 and 64 GB of ram. We will then use four Nvidia GPUs, two of which are consumer grade: the 2060 MaxQ and the 2070S, both with 8 GB of memory. The other two GPUs, the A6000 with 48 GB of memory and the A100 with 80 GB of memory, are enterprise. We solve the CH using the BHM method with a final time of tf=100 using the CPU and the four GPUs and summarize our results in Table 4. The results show that it is possible to speed up 3D phase-field model computations by a factor of 8x to 15x if we simply consider the modest GPUs 2060 and 2070. We note that these speed ups are already substantial and that these GPUs are entry level with very easy access from a commercial point of view. On the other hand, if one has access to the more advanced enterprise GPUs like the A6000 and A100, then speeds ups ranging from 25x to 80x are certainly possible. We note that the last row in Table 4 contains approximated values obtained via extrapolation for CPU, 2060 and 2070S GPUs based on N=256. For those three devices, the computations took longer than one day due to hardware reaching maximum capacity (N=512).
N | CPU | 2060 Max-Q | 2070S | A6000 | A100 |
64 | 6.27 | 0.81 | 0.63 | 0.45 | 0.17 |
128 | 56.67 | 6.68 | 4.45 | 2.26 | 0.72 |
256 | 444.42 | 52.84 | 34.74 | 17.03 | 5.33 |
512 | 3517.21 | 417.82 | 277.85 | 123.39 | 41.86 |
We also include a simulation in which we solve the CH equation and consider even volume fractions with the use of the following initial condition
u(x,y,z,0)=0.1+0.01rand(x,y,z). | (5.7) |
Using the same parameters as the previous 3D problem, we compute the numerical solution using the BHM method (3.10), and the results are presented in Figure 9.
It can be seen that the expected dynamics for a binary mixture with near equal volume fractions, u≈0.1, will undergo full separation without sphere formation. But, before reaching a steady-state, a meta-stable structure that resembles stripes in higher dimensions [11,37] at around t=100 is obtained. The BHM method retains the energy-decreasing property during the entire simulation run, as shown in Figure 10.
The dynamics of the solution to the CH equation should reach a full separation for long enough simulation runs, since it is modeling the separation of two materials [1]. In order to fully understand the evolution of the CH equation in 3D and to get the correct representation of the plots presented in Figure 9, we have added slices to those results and extended the simulations to better illustrate the separation process. Figure 11 represents the solution for t=10,100,200,300,400,500. Full separation of materials is observed at t=500, where slices have been placed at x=0, y=0 and z=2π. We note that for other phase-field models, full separation of materials may be prevented by material architecture and other modeling conditions [11,12,13].
For the final simulation example, we extend the drop coalescence in 3D space dimensions and test the performance of the proposed second-order method BHM-IMEX on Ω=[0,2π]3. Initial condition is given by
u(x,y,t=0)={1,(L/9)2<(x−L/2.8)2+(y−L/2)2+(z−L/2)2,1,(L/9)2<(x−b/1.7)2+(y−L/2)2+(z−L/2)2,−1,otherwise, | (5.8) |
where L=2π. The simulation results are presented in Figure 12 with snapshots taken at t=0,2,5,10. An energy-stable solution is obtained while capturing the correct dynamics in full 3D space dimensions. Similar benchmark problems in 3D can be found in the work of Liu et al. [32].
We tested several strategies for accessing the GPU in the most efficient way. One option to perform a GPU computation and generate a plot was to save the end result of the GPU computation as a data file, then follow up with the CPU loading up that file, and generating a plot. The basic idea is summarized with the following sample code
![]() |
To further explore the main benefits of using the BHM methods as our main basis for the proposed approach to accelerate computations, we also include a more complicated case of the CH equation. That is, we consider the logarithmic case of the potential as done by Dai [38]. We consider the type of potential for the CH equation that has singularities at u=±1.
Wlog(u)=θ2[(1+u)ln(1+u)+(1−u)ln(1−u)]+12(1−u2), |
where we choose θ=3/4 to attain inflection points at u=±1/2. The above potential has been proposed since the early developments of the CH equation, but this form of potential is not as popular as the polynomial type (W(u)=1/4u4−1/2u2). This potential has a local maximum on u=0 and minimum values approaching u=±1. Hence, mixtures near the spinodal region will evolve into the pure phases but, due to the singular nature of the potential, overshoots or undershoots are prevented on [−1,1]. Hence, for this type of potential, one expects for the solution u to be in-fact bounded on [−1,1]. We tested the adaptability of our schemes by solving the Cahn-Hilliard equation with a singular potential. The nonlinear contribution to the CH equation with logarithmic potential becomes
W′log(u)=φ(u)=38ln(1+u1−u)−u. |
Applying the BHM formulation to the CH equation with logarithmic potential then gives the following scheme
^Un+1=^Un−hϵ2(M(u)−M1)k2^Un−M(u)k2[^φ(Un)]1+hM1ϵ2k4. | (5.9) |
We consider the CH equation with logarithmic potential on Ω=[0,2π]3 with N=256, M=1, M1=5, and ϵ=0.05 using the initial condition from (5.7). Simulation snapshots are presented in Figure 13. The simulation undergoes phase separation in full 3D and this is done while preserving a bound on the solution u on [−1,1] due to the structure of the logarithmic potential. Solution boundedness under singular potentials is beyond the scope of this paper, but we can refer the readers to studies using a free boundary formulation for the CH equation under the case of different mobilities and different potentials [34,38].
We presented numerical approaches for phase-field models that are efficient, accurate, and easy to implement. The methods retain the energy-decreasing property and provide small errors. The BHM method is very efficient and shares the ease of implementation as the CS method. A second-order method based on the BHM was introduced using an IMEX formulation. The methods were tested with a variety of benchmark problems in 2D and 3D for short, and long-term simulations demonstrating their good performance. In addition to the classic CH equation with polynomial potential, we also demonstrated how the BHM easily applies to the case of a logarithmic potential. One of the main advantages for the BHM method is that the nonlinearities associated with a more complicated potential are easily handled at the current time level (explicitly). This is a drawback for the CS method, since separating the energy into convex and concave parts may be not possible. With our proposed work, users can choose a lower-accuracy method with the choice of a smaller h if accuracy requirements are modest. On the other hand, users can choose the higher-order schemes (with slightly higher computation cost) if accuracy demands are higher. We note that all schemes presented here are very efficient and can run in a modest laptop. The presented methods can be used as a powerful tool to study a large class of phase-field models. For speeding up the computations, we found that with modest GPU equipment, 8x and 15x speed ups are possible with the use of a few simple commands in MATLAB to access the GPU. Using higher-end equipment, it is possible to reach 25x or even 80x speed ups. We believe that the CPU and GPU implementation examples, with actual code that the users can use openly, could serve as a powerful tool for a broad range of researchers interested in efficient computational aspects of phase-field models. While the main phase-field model studied in this paper was the CH equation, a large class of phase field-models, including the BCP equation, phase field crystal (PFC) equation, the functionalized Cahn-Hilliard (FCH) equation, and the thin-film (TF) equation [11,13,39,40,41,42,43], can be approximated with the BHM approach. Many useful studies could benefit from the work presented here; in particular, large-scale computation of complex phase-field models and complicated simulations in full 3D space dimensions. Future work will include the rigorous analysis for the schemes with regards to energy-decreasing property and convergence analysis of the scheme, as well as the applicability of the methods to GPU accelerations in systems of CH equations [44] and systems of aggregation equations [45].
Saulo Orizaga: Project administration, Supervision, Formal analysis, Methodology, Writing-review & editing; Maurice Fabien: Supervision, Resources, Validation, Writing - original draft; Michael Millard: Investigation, Software, Visualization. All authors have read and agreed to the published version of the manuscript.
SO was supported by an Institutional Development Award (IDeA) from the National Institute of General Medical Sciences of the National Institutes of Health under grant number P20GM103451. MF acknowledges support for this research to the Office of the Vice Chancellor for Research and Graduate Education at the University of Wisconsin–Madison with funding from the Wisconsin Alumni Research Foundation.
All authors declare no conflicts of interest in this paper.
The following MATLAB codes are used to simulate the Poisson problem in 3D and the phase-separation process with random initial state for CH equation in 2D and 3D using the BHM scheme. After minor modifications, the codes can be easily extended for other simulations in this paper. The codes used for these experiments are available on the freely accessible GitHub repository https://github.com/sauloorizaga.
Detecting GPU
![]() |
Poisson equation in 3D
![]() |
CH 2D with GPU
![]() |
CH 3D with GPU
![]() |
Remark: The presented codes with use of GPU can run in non-GPU devices by simply commenting out the associated GPU declaration commands X = gpuArray(X), where X is the array in question. We also note that the codes have been given in the Appendix for maximum reproducibility of the results presented in this paper. The BHM approach can be easily adapted to more complicated phase-field models (BCP, PFC, and TF equations) as explained in the main body of the manuscript. For users running extensive simulations, we recommend the GPU strategy presented in Section 5.5. The authors welcome conversations or questions about accelerated computations for phase-field models.
[1] |
G. Demesure, M. Defoort, A. Bekrar, D. Trentesaux, M. Djemaï , Decentralized motion planning and scheduling of AGVs in an FMS, IEEE Trans. Ind. Informat., 14 (2018), 1744–1752. https://doi.org/10.1109/TII.2017.2749520 doi: 10.1109/TII.2017.2749520
![]() |
[2] |
S. Riazi, K. Bengtsson, B. Lennartson, Energy optimization of large-scale AGV systems, IEEE Trans. Autom. Sci. Eng., 18 (2021), 638–649. https://doi.org/10.1109/TASE.2019.2963285 doi: 10.1109/TASE.2019.2963285
![]() |
[3] |
V. Digani, L. Sabattini, C. Secchi, C. Fantuzzi, Ensemble coordination approach in multi-AGV systems applied to industrial warehouses, IEEE Trans. Autom. Sci. Eng., 12 (2015), 922–934. https://doi.org/10.1109/TASE.2015.2446614 doi: 10.1109/TASE.2015.2446614
![]() |
[4] |
T. Wang, Y. Xu, S. Ahipasaoglu, C. Courcoubetis, Ex-post max-min fairness of generalized AGV mechanisms, IEEE Trans. Autom. Control, 62 (2017), 5275–5281. https://doi.org/10.1109/TAC.2016.2632424 doi: 10.1109/TAC.2016.2632424
![]() |
[5] |
S. Hoshino, J. Ota, A. Shinozaki, H. Hashimoto, Hybrid design methodology and cost-effectiveness evaluation of AGV transportation systems, IEEE Trans. Autom. Sci. Eng., 4 (2007), 360–372. https://doi.org/10.1109/TASE.2006.887162 doi: 10.1109/TASE.2006.887162
![]() |
[6] |
S. Lu, C. Xu, R. Zhong, An active RFID tag-enabled locating approach with multipath effect elimination in AGV, IEEE Trans. Autom. Sci. Eng., 13 (2016), 1333–1342. https://doi.org/10.1109/TASE.2016.2573595 doi: 10.1109/TASE.2016.2573595
![]() |
[7] |
H. Gao, J. Zhu, X. Li, Y. Kang, J. Li, H. Su, Automatic parking control of unmanned vehicle based on switching control algorithm and backstepping, IEEE /ASME Trans. Mech., 27 (2022), 1233–1243. https://doi.org/10.1109/TMECH.2020.3037215 doi: 10.1109/TMECH.2020.3037215
![]() |
[8] |
M. Graf Plessen, D. Bernardini, H. Esen, A. Bemporad, Spatial-based predictive control and geometric corridor planning for adaptive cruise control coupled with obstacle avoidance, IEEE Trans. Control. Syst. Technol., 26 (2018), 38–50. https://doi.org/10.1109/TCST.2017.2664722 doi: 10.1109/TCST.2017.2664722
![]() |
[9] |
R. Utriainen, M. Pöllänen, H. Liimatainen, The safety potential of lane keeping assistance and possible actions to improve the potential, IEEE Trans. Intell. Veh., 5 (2020), 556–564. https://doi.org/10.1109/TIV.2020.2991962 doi: 10.1109/TIV.2020.2991962
![]() |
[10] |
W. Li, Q. Li, S. Li, R. Li, Y. Ren, W. Wang, Indirect shared control through non-zero sum differential game for cooperative automated driving, IEEE Trans. Intell. Transp. Syst., 23 (2022), 15980–15992. https://doi.org/10.1109/TITS.2022.3146895 doi: 10.1109/TITS.2022.3146895
![]() |
[11] |
X. Jin, J. Wang, X. He, Z. Yan, L. Xu, C. Wei, et al., Improving vibration performance of electric vehicles based on in-wheel motor-active suspension system via robust finite frequency control, IEEE Trans. Intell. Transp. Syst., 24 (2023), 1631–1643. https://doi.org/10.1109/TITS.2022.3224609 doi: 10.1109/TITS.2022.3224609
![]() |
[12] |
X. Jin, J. Wang, Z. Yan, L. Xu, G. Yin, N. Chen, Robust vibration control for active suspension system of in-wheel-motor-driven electric vehicle via μ-synthesis methodology, ASME J. Dyn. Sys. Meas. Control, 144 (2022), 051007. https://doi.org/10.1115/1.4053661 doi: 10.1115/1.4053661
![]() |
[13] |
X. Jin, Q. Wang, Z. Yan, H. Yang, J. Wang, G. Yin, A learning-based evaluation for lane departure warning system considering driving characteristics, Proc. Inst. Mech. Eng. D-J. Aut., 2022. https://doi.org/10.1177/09544070221140973 doi: 10.1177/09544070221140973
![]() |
[14] |
K. Nam, H. Fujimoto, Y. Hori, Advanced motion control of electric vehicles based on robust lateral tire force control via active front steering, IEEE/ASME Trans. Mech., 19 (2014), 289–299. https://doi.org/10.1109/TMECH.2012.2233210 doi: 10.1109/TMECH.2012.2233210
![]() |
[15] |
H. Wang, Y. Tian, H. Xu, Neural adaptive command filtered control for cooperative path following of multiple underactuated autonomous underwater vehicles along one path, IEEE Trans. Syst, Man, Cybern. Syst., 52 (2022), 2966–2978. https://doi.org/10.1109/TSMC.2021.3062077 doi: 10.1109/TSMC.2021.3062077
![]() |
[16] |
Y. Wang, B. Nguyen, H. Fujimoto, Y. Hori, Multirate estimation and control of body slip angle for electric vehicles based on onboard vision system, IEEE Trans. Ind. Electron., 61 (2014), 1133–1143. https://doi.org//10.1109/TIE.2013.2271596 doi: 10.1109/TIE.2013.2271596
![]() |
[17] |
G. Wang, Y. Liu, S. Li, Y. Tian, N. Zhang, G. Cui, New integrated vehicle stability control of active front steering and electronic stability control considering tire force reserve capability, IEEE Trans. Veh. Technol., 70 (2021), 2181–2195. https://doi.org//10.1109/TVT.2021.3056560 doi: 10.1109/TVT.2021.3056560
![]() |
[18] |
J. Cho, K. Huh, Active front steering for driver's steering comfort and vehicle driving stability, Int. J. Automt. Technol., 20 (2019), 589–596. https://doi.org/10.1007/s12239-019-0056-1 doi: 10.1007/s12239-019-0056-1
![]() |
[19] |
P. Falcone, F. Borrelli, J. Asgari, H. Tseng, D. Hrovat, Predictive active steering control for autonomous vehicle systems. IEEE Trans. Control Syst. Technol., 15 (2007), 566–580. https://doi.org/10.1109/TCST.2007.894653 doi: 10.1109/TCST.2007.894653
![]() |
[20] |
T. Faulwasser, R. Findeisen, Nonlinear model predictive control for constrained output path following, IEEE Trans. Autom. Control, 61 (2016), 1026–1039. https://doi.org/10.1109/TAC.2015.2466911 doi: 10.1109/TAC.2015.2466911
![]() |
[21] |
P. Liljeback, I. Haugstuen, K. Pettersen, Path following control of planar snake robots using a cascaded approach, IEEE Trans. Control Syst. Technol., 20 (2012), 111–126. https://doi.org/10.1109/TCST.2011.2107516 doi: 10.1109/TCST.2011.2107516
![]() |
[22] |
A. Hladio, C. Nielsen, D. Wang, Path following for a class of mechanical systems, IEEE Trans. Control Syst. Technol., 21 (2013), 2380–2390. https://doi.org/10.1109/TCST.2012.2223470 doi: 10.1109/TCST.2012.2223470
![]() |
[23] |
S. Mobayen, Robust tracking controller for multivariable delayed systems with input saturation via composite nonlinear feedback, Nonlinear. Dyn., 76 (2014), 827–838. https://doi.org/10.1007/s11071-013-1172-5 doi: 10.1007/s11071-013-1172-5
![]() |
[24] |
V. Ghaffari, Optimal tuning of composite nonlinear feedback control in time-delay nonlinear systems, J. Franklin Inst., 357 (2020), 1331–1356. https://doi.org/10.1016/j.jfranklin.2019.12.024 doi: 10.1016/j.jfranklin.2019.12.024
![]() |
[25] |
S. Yu, X. Li, H. Chen, F. Allgöwer, Nonlinear model predictive control for path following problems, Int. J. Robust Nonlinear Control, 25 (2015), 1168–1182. https://doi.org/10.1002/rnc.3133 doi: 10.1002/rnc.3133
![]() |
[26] |
J. Chen, Z. Shuai, H. Zhang, W. Zhao, Path following control of autonomous four-wheel-independent-drive electric vehicles via second-order sliding mode and nonlinear disturbance observer techniques, IEEE Trans. Ind. Electron., 68 (2021), 2460–2469. https://doi.org/10.1109/TIE.2020.2973879 doi: 10.1109/TIE.2020.2973879
![]() |
[27] |
Z. Liu, X. Chen, Adaptive sliding mode security control for stochastic Markov jump cyber-physical nonlinear systems subject to actuator failures and randomly occurring injection attacks, IEEE Trans. Ind. Informat., 2022. https://doi.org/10.1109/TII.2022.3181274 doi: 10.1109/TII.2022.3181274
![]() |
[28] |
X. Zhao, Z. Liu, B. Jiang, C. Gao, Switched controller design for robotic manipulator via neural network-based sliding mode approach, IEEE Trans. Circuits Syst. II: Exp. Briefs, 70 (2022), 561–565. https://doi.org/10.1109/TCSII.2022.3169475 doi: 10.1109/TCSII.2022.3169475
![]() |
[29] |
B. Xu, F. Sun, Y. Pan, B. Chen, Disturbance observer based composite learning fuzzy control of nonlinear systems with unknown dead zone, IEEE Trans. Syst. Man. Cybern. Syst., 47 (2017), 1854–1862. https://doi.org/10.1109/TSMC.2016.2562502 doi: 10.1109/TSMC.2016.2562502
![]() |
[30] |
H. A. Hagras, A hierarchical type-2 fuzzy logic control architecture for autonomous mobile robots, IEEE Trans. Fuzzy. Syst., 12 (2004), 524–539. https://doi.org/10.1109/TFUZZ.2004.832538 doi: 10.1109/TFUZZ.2004.832538
![]() |
[31] |
J. Cervantes, W. Yu, S. Salazar, I. Chairez, Takagi-Sugeno dynamic neuro-fuzzy controller of uncertain nonlinear systems, IEEE Trans. Fuzzy. Syst., 25 (2017), 1601–1615. https://doi.org/10.1109/TFUZZ.2016.2612697 doi: 10.1109/TFUZZ.2016.2612697
![]() |
[32] |
Y. Wu, L. Wang, J. Zhang, F. Li, Path following control of autonomous ground vehicle based on nonsingular terminal sliding mode and active disturbance rejection control, IEEE Trans. Veh. Technol., 68 (2019), 6379–6390. https://doi.org/10.1109/TVT.2019.2916982 doi: 10.1109/TVT.2019.2916982
![]() |
[33] | M. Islam, Y. He, An optimal preview controller for active trailer steering systems of articulated heavy vehicles, SAE Tech. Pap., 2011, 1–14. https://doi.org/10.4271/2011-01-0983 |
[34] |
T. Ding, Y. Zhang, G. Ma, Z. Cao, X. Zhao, B. Tao, Trajectory tracking of redundantly actuated mobile robot by MPC velocity control under steering strategy constraint, Mechatronics, 84 (2022), 102779. https://doi.org/10.1016/j.mechatronics.2022.102779 doi: 10.1016/j.mechatronics.2022.102779
![]() |
[35] |
H. Moradi, G. Vossoughi, M. R. Movahhedy, H. Salarieh, Suppression of nonlinear regenerative chatter in milling process via robust optimal control, J. Process Control, 23 (2013), 631–648. https://doi.org/10.1016/j.jprocont.2013.02.006 doi: 10.1016/j.jprocont.2013.02.006
![]() |
[36] |
Y. Fu, B. Li, J. Fu, Multi-model adaptive switching control of a nonlinear system and its applications in a smelting process of fused magnesia, J. Process Control, 115 (2022), 67–76. https://doi.org/10.1016/j.jprocont.2022.04.009 doi: 10.1016/j.jprocont.2022.04.009
![]() |
[37] |
S. Fahmy, S. Banks, Robust H∞ control of uncertain nonlinear dynamical systems via linear time-varying approximations, Nonlinear Anal., 63 (2005), 2315–2327. https://doi.org/10.1016/j.na.2005.03.030 doi: 10.1016/j.na.2005.03.030
![]() |
[38] |
G. Ju, Y. Wu, W. Sun, Adaptive output feedback asymptotic stabilization of nonholonomic systems with uncertainties, Nonlinear Anal., 71 (2009), 5106–5117. https://doi.org/10.1016/j.na.2009.03.088 doi: 10.1016/j.na.2009.03.088
![]() |
[39] |
E. Jafari, T. Binazadeh, Observer-based improved composite nonlinear feedback control for output tracking of time-varying references in descriptor systems with actuator saturation, ISA Trans., 91 (2019), 1–10. https://doi.org/10.1016/j.isatra.2019.01.035 doi: 10.1016/j.isatra.2019.01.035
![]() |
[40] |
S. Mobayen, An LMI-based robust tracker for uncertain linear systems with multiple time-varying delays using optimal composite nonlinear feedback technique, Nonlinear Dyn., 80 (2015), 917–927. https://doi.org/10.1007/s11071-015-1916-5 doi: 10.1007/s11071-015-1916-5
![]() |
[41] |
V. Ghaffari, S. Mobayen, D. Ud, T. Rojsiraphisal, M. T. Vu, Robust tracking composite nonlinear feedback controller design for time-delay uncertain systems in the presence of input saturation, ISA Trans., 129 (2022), 88–99. https://doi.org/10.1016/j.isatra.2022.02.029 doi: 10.1016/j.isatra.2022.02.029
![]() |
[42] |
X. Jin, Z. Yan, G. Yin, S. Li, C. Wei, An adaptive motion planning technique for on-road autonomous driving, IEEE Access, 9(2020), 2655–2664. https://doi.org/10.1109/ACCESS.2020.3047385 doi: 10.1109/ACCESS.2020.3047385
![]() |
[43] |
B. Paden, M. Čáp, S. Yong, D. Yershov, E. Frazzoli, A survey of motion planning and control techniques for self-driving urban vehicles, IEEE Trans. Intell. Veh., 1 (2016), 33–55. https://doi.org/10.1109/TIV.2016.2578706 doi: 10.1109/TIV.2016.2578706
![]() |
N | CS | BHM | IMEX |
256 | 13.76 | 13.62 | 18.51 |
512 | 18.18 | 23.94 | 36.45 |
1024 | 59.17 | 120.74 | 149.13 |
N | CPU |
128 | 0.0019 |
256 | 0.0032 |
1024 | 0.026 |
2048 | 0.11 |
4096 | 0.43 |
8192 | 1.91 |
N | CPU | 2060 Max-Q | A6000 |
32 | 0.0018 | 0.0016 | 0.0007 |
64 | 0.0051 | 0.0021 | 0.0017 |
128 | 0.037 | 0.0024 | 0.0026 |
256 | 0.35 | 0.0069 | 0.0058 |
512 | 2.79 | 2.47 | 0.089 |
N | CPU | 2060 Max-Q | 2070S | A6000 | A100 |
64 | 6.27 | 0.81 | 0.63 | 0.45 | 0.17 |
128 | 56.67 | 6.68 | 4.45 | 2.26 | 0.72 |
256 | 444.42 | 52.84 | 34.74 | 17.03 | 5.33 |
512 | 3517.21 | 417.82 | 277.85 | 123.39 | 41.86 |
N | CS | BHM | IMEX |
256 | 13.76 | 13.62 | 18.51 |
512 | 18.18 | 23.94 | 36.45 |
1024 | 59.17 | 120.74 | 149.13 |
N | CPU |
128 | 0.0019 |
256 | 0.0032 |
1024 | 0.026 |
2048 | 0.11 |
4096 | 0.43 |
8192 | 1.91 |
N | CPU | 2060 Max-Q | A6000 |
32 | 0.0018 | 0.0016 | 0.0007 |
64 | 0.0051 | 0.0021 | 0.0017 |
128 | 0.037 | 0.0024 | 0.0026 |
256 | 0.35 | 0.0069 | 0.0058 |
512 | 2.79 | 2.47 | 0.089 |
N | CPU | 2060 Max-Q | 2070S | A6000 | A100 |
64 | 6.27 | 0.81 | 0.63 | 0.45 | 0.17 |
128 | 56.67 | 6.68 | 4.45 | 2.26 | 0.72 |
256 | 444.42 | 52.84 | 34.74 | 17.03 | 5.33 |
512 | 3517.21 | 417.82 | 277.85 | 123.39 | 41.86 |