The numerical solution of spatiotemporal partial differential equations (PDEs) using the deep learning method has attracted considerable attention in quantum mechanics, fluid mechanics, and many other natural sciences. In this paper, we propose an interactive temporal physics-informed neural network architecture based on ConvLSTM for solving spatiotemporal PDEs, in which the information feedback mechanism in learning is introduced between the current input and the previous state of network. Numerical experiments on four kinds of classical spatiotemporal PDEs tasks show that the extended models have superiority in accuracy, long-range learning ability, and robustness. Our key takeaway is that the proposed network architecture is capable of learning information correlation of the PDEs model with spatiotemporal data through the input state interaction process. Furthermore, our method also has a natural advantage in carrying out physical information and boundary conditions, which could improve interpretability and reduce the bias of numerical solutions.
Citation: Ruohan Cao, Jin Su, Jinqian Feng, Qin Guo. PhyICNet: Physics-informed interactive learning convolutional recurrent network for spatiotemporal dynamics[J]. Electronic Research Archive, 2024, 32(12): 6641-6659. doi: 10.3934/era.2024310
Related Papers:
[1]
Youtian Hao, Guohua Yan, Renjun Ma, M. Tariqul Hasan .
Linking dynamic patterns of COVID-19 spreads in Italy with regional characteristics: a two level longitudinal modelling approach. Mathematical Biosciences and Engineering, 2021, 18(3): 2579-2598.
doi: 10.3934/mbe.2021131
[2]
Weike Zhou, Aili Wang, Fan Xia, Yanni Xiao, Sanyi Tang .
Effects of media reporting on mitigating spread of COVID-19 in the early phase of the outbreak. Mathematical Biosciences and Engineering, 2020, 17(3): 2693-2707.
doi: 10.3934/mbe.2020147
[3]
Marco Roccetti .
Excess mortality and COVID-19 deaths in Italy: A peak comparison study. Mathematical Biosciences and Engineering, 2023, 20(4): 7042-7055.
doi: 10.3934/mbe.2023304
[4]
Marco Roccetti .
Drawing a parallel between the trend of confirmed COVID-19 deaths in the winters of 2022/2023 and 2023/2024 in Italy, with a prediction. Mathematical Biosciences and Engineering, 2024, 21(3): 3742-3754.
doi: 10.3934/mbe.2024165
[5]
Tianfang Hou, Guijie Lan, Sanling Yuan, Tonghua Zhang .
Threshold dynamics of a stochastic SIHR epidemic model of COVID-19 with general population-size dependent contact rate. Mathematical Biosciences and Engineering, 2022, 19(4): 4217-4236.
doi: 10.3934/mbe.2022195
[6]
S. H. Sathish Indika, Norou Diawara, Hueiwang Anna Jeng, Bridget D. Giles, Dilini S. K. Gamage .
Modeling the spread of COVID-19 in spatio-temporal context. Mathematical Biosciences and Engineering, 2023, 20(6): 10552-10569.
doi: 10.3934/mbe.2023466
[7]
Aili Wang, Xueying Zhang, Rong Yan, Duo Bai, Jingmin He .
Evaluating the impact of multiple factors on the control of COVID-19 epidemic: A modelling analysis using India as a case study. Mathematical Biosciences and Engineering, 2023, 20(4): 6237-6272.
doi: 10.3934/mbe.2023269
[8]
Sarah R. Al-Dawsari, Khalaf S. Sultan .
Modeling of daily confirmed Saudi COVID-19 cases using inverted exponential regression. Mathematical Biosciences and Engineering, 2021, 18(3): 2303-2330.
doi: 10.3934/mbe.2021117
[9]
Peng An, Xiumei Li, Ping Qin, YingJian Ye, Junyan Zhang, Hongyan Guo, Peng Duan, Zhibing He, Ping Song, Mingqun Li, Jinsong Wang, Yan Hu, Guoyan Feng, Yong Lin .
Predicting model of mild and severe types of COVID-19 patients using Thymus CT radiomics model: A preliminary study. Mathematical Biosciences and Engineering, 2023, 20(4): 6612-6629.
doi: 10.3934/mbe.2023284
[10]
Yuto Omae, Yohei Kakimoto, Makoto Sasaki, Jun Toyotani, Kazuyuki Hara, Yasuhiro Gon, Hirotaka Takahashi .
SIRVVD model-based verification of the effect of first and second doses of COVID-19/SARS-CoV-2 vaccination in Japan. Mathematical Biosciences and Engineering, 2022, 19(1): 1026-1040.
doi: 10.3934/mbe.2022047
Abstract
The numerical solution of spatiotemporal partial differential equations (PDEs) using the deep learning method has attracted considerable attention in quantum mechanics, fluid mechanics, and many other natural sciences. In this paper, we propose an interactive temporal physics-informed neural network architecture based on ConvLSTM for solving spatiotemporal PDEs, in which the information feedback mechanism in learning is introduced between the current input and the previous state of network. Numerical experiments on four kinds of classical spatiotemporal PDEs tasks show that the extended models have superiority in accuracy, long-range learning ability, and robustness. Our key takeaway is that the proposed network architecture is capable of learning information correlation of the PDEs model with spatiotemporal data through the input state interaction process. Furthermore, our method also has a natural advantage in carrying out physical information and boundary conditions, which could improve interpretability and reduce the bias of numerical solutions.
1.
Introduction
Since the emergence of the COVID-19 pandemic around December 2019, the outbreak has snowballed globally [1,2], and there is no clear sign that the new confirmed cases and deaths are coming to an end. Though vaccines are rolling out to deter the spread of this pandemic, the mutations of the viruses are already under way [3,4,5,6]. Despite the fact and research that the origin of the pandemic is still in debate [7], many researchers are conducting their study from different aspects and perspectives. They could be categorised mainly into three levels: SARS-CoV-2 genetic level [8], COVID-19 individual country level [9,10,11] and continental levels [12,13]. In this study, we focus on the latter two levels. Regarding these two levels, there are many methods and techniques on these issues. For example, linear and non-linear growth models, together with 2-week-kernel-window regression, are exploited in modelling the exponential growth rate of COVID-19 confirmed cases [14] - which are also generalised to non-linear modelling of COVID-19 pandemic [15,16]. Some research works focus on the prediction of COVID-19 spread by estimating the lead-lag effects between different countries via time warping technique [17], while some utilise clustering analyses to group countries via epidemiological data of active cases, active cases per population, etc.[18]. In addition, there are other researches focusing on tackling the relationship between economic variables and COVID-19 related variables [19,20] - though both the results show there are no relation between economic freedom and COVID-19 deaths and no relation between the performance of equality markets and the COVID-19 cases and deaths.
In this study, we aim to extract the features of daily biweekly growth rates of cases and deaths on national and continental levels. We devise the orthonormal bases based on Fourier analysis [21,22], in particular Fourier coefficients for the potential features. For the national levels, we import the global time series data and sample 117 countries for 109 days [23,24]. Then we calculate the Euclidean distance matrices for the inner products between countries and between days. Based on the distance matrices, we then calculate their variabilities to delve into the distribution of the data. For the continental level, we also import the biweekly changes of cases and deaths for 5 continents as well as the world data with time series data for 447 days. Then we calculate their inner products with respect to the temporal frequencies and find the similarities of extracted features between continents.
For the national levels, the biweekly data bear higher temporal features than spatial features, i..e., as time goes by, the pandemic evolves more in the time dimension than the space (or country-wise) dimension. Moreover, there exists a strong concurrency between features for biweekly changes of cases and deaths, though there is no clear or stable trend for the extracted features. However, in the continental level, one observes that there is a stable trend of features regarding biweekly change. In addition, the extracted features between continents are similar to one another, except Asia whose features bear no clear similarities with other continents.
Our approach is based on orthonormal bases, which serve as the potential features for the biweekly change of cases and deaths. This method is straightforward and easy to comprehend. The limitations of this approach are the extracted features are based on the hidden frequencies of the dynamical structure, which is hard to assign a interpretable meaning for the frequencies, and the data fetched are not complete, due to the missing data in the database. However the results provided in this study could help one map out the evolutionary features of COVID-19.
2.
Method and implementation
Let δ:N→{0,1} be a function such that δ(n)=0 (or δn=0), if n∈2N and δ(n)=1, if n∈2N+1. Given a set of point data D={→v}⊆RN, we would like to decompose each →v into some frequency-based vectors by Fourier analysis. The features of COVID-19 case and death growth rates are specified by the orthogonal frequency vectors BN={→fij:1≤j≤N}Ni=1, which is based on Fourier analysis, in particular Fourier series [22], where
● →f1j=√1N for all 1≤j≤N;
● For any 2≤i≤N−1+δN,
→fij=√2N⋅cos[π2⋅δi−(i−δi)⋅πN⋅j;
(2.1)
● If N∈2N, then →fNj=√1N⋅cos(j⋅π) for all 1≤j≤N.
Now we have constructed an orthonormal basis FN={→f1,→f2,⋯,→fN} as features for RN. Now each →v=N∑i=1<→v,→fi>⋅→fi, where <,> is the inner product. The basis BN could also be represented by a matrix
and the representation of a data column vector →v={(-3,14,5,8,-12)} with respect to B5 is calculated by F5→v=[<→v,→fi>]5i=1 or a column vector or 5-by-1 matrix (5.367,-16.334,-3.271,-6.434,-9.503).
2.1. Data description and handling
There are two main parts of data collection and handling - one for individual countries (or national level) and the other for individual continents (or continental level). In both levels, we fetch the daily biweekly growth rates of confirmed COVID-19 cases and deaths from Our World in Data [23,24]. Then we use R programming 4.1.0 to handle the data and implement the procedures.
Sampled targets: national. After filtering out non-essential data and missing data, the effective sampled data are 117 countries with effective sampled 109 days as shown in Results. The days range from December 2020 to June 2021. Though the sampled days are not subsequent ones (due to the missing data), the biweekly information could still cover such loss. In the latter temporal and spatial analyses, we will conduct our study based on these data.
Sampled targets: continental. As for the continental data, we collect data regarding the world, Africa, Asia, Europe, North and South America. The sampled days range from March 22nd, 2020 to June 11th, 2021. In total, there are 449 days (this is different from the national level). In the latter temporal analysis (there is no spatial analysis in the continental level, due to the limited sampling size), we will conduct our study based on these data.
Notations: national. For further processing, let us utilise some notations to facilitate the introduction. Let the sampled countries be indexed by i=1,...,117. Let the sampled days be indexed by t=1,...,109. Days range from December 3rd 2020 to May 31st 2021. Let ci(t) and di(t) be the daily biweekly growth rates of confirmed cases and deaths in country i on day t, respectively, i.e.,
ci(t):=casei,t+13−casei,tcasei,t;
(2.3)
di(t):=deathi,t+13−deathi,tdeathi,t,
(2.4)
where casei,t and deathi,t denote the total confirmed cases and deaths for country i at day t, respectively. We form temporal and spatial vectors by
the vector ci and di give every count in time for a given country, and the vector v(t) and w(t) give every countries' count for a given time.
Notations: continental. For further processing, let us utilise some notations to facilitate the introduction. Let the sampled continents be indexed by j=1,...,6. Let the 447 sampled days range from March 22nd 2020 to June 11th 2021. We form temporal vectors for confirmed cases and deaths by
For any m-by-n matrix A, we use min(A) to denote the value min{aij:1≤i≤m;1≤j≤n}. Similarly, we define max(A) by the same manner. If →v is a vector, we define min(→v) and max(→v) in the same manner. The implementation goes as follows:
(1) Extract and trim and source data.
Extraction: national. Extract the daily biweekly growth rates of COVID-19 cases and deaths from the database and trim the data. The trimmed data consist of 109 time series data for 117 countries as shown in Table 1, which consists of two 117-by-109 matrices:
Row i in the matrices are regarded as temporal vectors ci and di respectively, and Column t in the matrices are regarded as spatial vectors v(t) and w(t) respectively.
Extraction: continental. As for the continental data, they are collected by two 6-by-447 matrices:
Biweekly_cont_cases=[xj(τ)]τ=1:447j=1:6;
Biweekly_cont_deaths=[yj(t)]τ=1:447j=1:6.
(2) Specify the frequencies (features) for the imported data.
Basis: national. In order to decompose ci and di into some fundamental features, we specify F109 as the corresponding features, whereas to decompose v(t) and w(t), we specify F117 as the corresponding features. The results are presented in Table 2.
Table 2.
Orthonormal temporal frequencies for 109 days (upper block or F109) and orthonormal spatial frequencies for 117 countries (lower block or F117).
Average variability: continental. For each continent j, the temporal variabilities for confirmed cases and deaths are computed by
var_cont_case_time[j]=6∑k=1dE(F447xj,F447xk)447;
var_cont_death_time[j]=6∑k=1dE(F447yj,F447yk)447.
(6) Unify the national temporal and spatial variabilities of cases and deaths. For each country i, the unified temporal and spatial variabilities for cases and deaths are defined by
where σij and βij denotes the value in the (i,j) cells of IP_cont_cases_time and IP_cont_deaths_time, respectively. The results are visualised by figures in Results.
3.
Results
There are two main parts of results shown in this section: national results and continental results.
National results. Based on the method mentioned in section 2, we identify the temporal orthonormal frequencies and spatial ones as shown in Table 2.
The computed inner products at country levels, served as the values for extracted features, for daily biweekly growth rates of cases and deaths with respect to temporal frequencies are shown in Figure 1. Similarly, the computed inner products at a country level for daily biweekly growth rates of cases and deaths with respect to spatial frequencies are shown in Figure 2. Meanwhile, their scaled variabilities are plotted in Figure 3.
Figure 1.
Inner products between growth rates of cases (in solid line) over 109 temporal frequencies; and inner products between growth rates of deaths (in dotted line) over 109 temporal frequencies for some demonstrative countries: Afghanistan, Albania, Algeria, Uruguay, Zambia, and Zimbabwe.
Figure 2.
Inner products between growth rates of cases (in solid line) over 117 spatial frequencies; and inner products between growth rates of deaths (in dotted line) over 117 spatial frequencies for some demonstrative dates: 2020/12/3, 2020/12/4, 2020/12/5, 2021/5/29, 2021/5/30, and 2021/5/31.
Continental results. According to the obtained data, we study and compare continental features of daily biweekly growth rates of confirmed cases and deaths of Africa, Asia, Europe, North America, South America and World. Unlike the missing data in analysing individual countries, the continental data are complete. We take the samples from March 22nd, 2020 to June 11th, 2021. In total, there are 447 days for the analysis. The cosine values which compute the similarities between representations for continents are shown in Table 3. The results of the unified inner products with respect to confirmed cases and deaths are plotted in Figures 4 and 5, respectively.
Table 3.
Cosine values (similarities) between World, Africa, Asia, Europe, North America (No. Am.), and South America (So. Am.).
Figure 5.
Unified inner product, or UIP, for world, Africa, Asia, Europe, North and South America with respect to daily biweekly growth rates of deaths.
Other auxiliary results that support the plotting of the graphs are also appended in Appendix. The names of the sampled 117 countries are provided in Tables A1 and A2. The dates of the sampled days are provided in Figure A1. The tabulated results for inner product of temporal and spatial frequencies on a national level are provided in Table A3. The tabulated results for inner product of temporal frequencies on a continental level are provided in Table A4. The Euclidean distance matrices for temporal and spatial representations with respect to confirmed cases and deaths are tabulated in Table A5 and their average variabilities are tabulated in Table A6.
Summaries of results. Based on the previous tables and figures, we have the following results.
(1) From Figures 1 and 2, one observes that the temporal features are much more distinct that the spatial features, i.e., if one fixes one day and extracts the features from the spatial frequencies, he obtains less distinct features when comparing with fixing one country and extracting the features from the temporal frequencies. This indicates that SARS-CoV-2 evolves and mutates mainly according to time than space.
(2) For individual countries, the features for the biweekly changes of cases are almost concurrent with those of deaths. This indicates biweekly changes of cases and deaths share the similar features. In some sense, the change of deaths is still in tune with the change of confirmed cases, i.e., there is no substantial change between their relationship.
(3) For individual countries, the extracted features go up and down intermittently and there is no obvious trend. This indicates the virus is still very versatile and hard to capture its fixed features in a country-level.
(4) From Figure 3, one observes that there is a clear similarities, in terms of variabilities, for both daily biweekly growth rates of cases and deaths under temporal frequencies. Moreover, the distribution of overall data is not condensed, where middle, labelled countries are scattering around the whole data. This indicates the diversity of daily biweekly growth rates of cases and deaths across countries is still very high.
(5) From Figure 3, the daily biweekly growth rates of deaths with respect to the spatial frequencies are fairly concentrated. This indicates the extracted features regarding deaths are stable, i.e., there are clearer and stabler spatial features for daily biweekly growth rates of deaths.
(6) Comparing the individual graphs in Figures 4 and 5, they bear pretty much the same shape, but in different scale - with death being higher feature oriented (this is also witnessed in a country-level as claimed in the first result above). This indicates there is a very clear trend of features regarding daily biweekly growth rates in a continental level (this is a stark contrast to the third claimed result above).
(7) From Figures 4 and 5, the higher values of inner products lie in both endpoints for biweekly change of cases and deaths, i.e., low temporal frequencies and high temporal frequencies for all the continents, except the biweekly change of deaths in Asia. This indicates the evolutionary patterns in Asia are very distinct from other continents.
(8) From Table 3, the extracted features are all very similar to each continents, except Asia. This echoes the above result.
4.
Conclusions and future work
In this study, we identify the features of daily biweekly growth rates of COVID-19 confirmed cases and deaths via orthonormal bases (features) which derive from Fourier analysis. Then we analyse the inner products which represent the levels of chosen features. The variabilities for each country show the levels of deaths under spatial frequencies are much more concentrated than others. The generated results are summarised in Results 3. There are some limitations in this study and future improvements to be done:
● The associated meanings of the orthonormal features from Fourier analysis are not yet fully explored;
● We use the Euclidean metric to measure the distances between features, which is then used to calculate the variabilities. Indeed Euclidean metric is noted for its geographical properties, but may not be the most suitable in the context of frequencies. One could further introduce other metrics and apply machine learning techniques to find out the optimal ones.
● In this study, we choose the daily biweekly growth rates of confirmed cases and deaths as our research sources. This is a one-sided story. To obtain a fuller picture of the dynamical features, one could add other variables for comparison.
Acknowledgements
This work is supported by the Humanities and Social Science Research Planning Fund Project under the Ministry of Education of China (No. 20XJAGAT001).
Conflict of interest
No potential conflict of interest was reported by the authors.
Table A3.
Inner products w.r.t. temporal (case: upper top and death: upper bottom blocks) and spatial (case: lower top and death: lower bottom blocks) frequencies at a national level.
Table A4.
Temporal inner product for continents (World, Africa, Asia, Europe, North and South America) w.r.t. daily biweekly growth rates of cases (upper block) and deaths (lower block) from March 22nd, 2020 to June 11th, 2021 (447 days).
Table A5.
Distance matrices for daily biweekly growth rates of cases (uppermost block) and deaths (2nd block) w.r.t. temporal frequencies and the ones of cases (3rd block) and deaths (bottommost block) w.r.t. spatial frequencies.
Table A6.
Variability for 117 countries with respect to daily biweekly growth rates of cases and death under temporal frequencies and variability for 109 days with respect to daily biweekly growth rates of cases and death under spatial frequencies.
T. J. R. Hughes, The finite element method. Linear static and dynamic finite element analysis, Comput. Methods Appl. Mech. Eng., 65 (1987), 191–193. https://doi.org/10.1016/0045-7825(87)90013-2 doi: 10.1016/0045-7825(87)90013-2
[2]
M. W. M. G. Dissanayake, N. Phan-Thien, Neural-network-based approximations for solving partial differential equations, Commun. Numer. Methods Eng., 10 (1994), 195–201. https://doi.org/10.1002/cnm.1640100303 doi: 10.1002/cnm.1640100303
[3]
I. E. Lagaris, A. Likas, D. I. Fotiadis, Artificial neural networks for solving ordinary and partial differential equations, IEEE Trans. Neural Networks, 9 (1998), 987–1000. https://doi.org/10.1109/72.712178 doi: 10.1109/72.712178
[4]
T. J. R. Hughes, J. A. Cottrell, Y. Bazilevs, Isogeometric analysis: CAD, finite elements, NURBS, exact geometry and mesh refinement, Comput. Methods Appl. Mech. Eng., 194 (2005), 4135–4195. https://doi.org/10.1016/j.cma.2004.10.008 doi: 10.1016/j.cma.2004.10.008
[5]
J. Han, A. Jentzen, W. E, Solving high-dimensional partial differential equations using deep learning, PNAS, 115 (2018), 8505–8510. https://doi.org/10.1073/pnas.1718942115 doi: 10.1073/pnas.1718942115
[6]
M. Raissi, P. Perdikaris, G. E. Karniadakis, Numerical gaussian processes for time-dependent and nonlinear partial differential equations, SIAM J. Sci. Comput., 40 (2018), A172–A198. https://doi.org/10.1137/17M1120762 doi: 10.1137/17M1120762
[7]
M. Raissi, P. Perdikaris, G. E. Karniadakis, Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, J. Comput. Phys., 378 (2019), 686–707. https://doi.org/10.1016/j.jcp.2018.10.045 doi: 10.1016/j.jcp.2018.10.045
[8]
C. Song, T. Alkhalifah, U. B. Waheed, Solving the frequency-domain acoustic VTI wave equation using physics-informed neural networks, Geophys. J. Int., 225 (2020), 846–859. https://doi.org/10.1093/gji/ggab010 doi: 10.1093/gji/ggab010
[9]
F. Sahli Costabal, Y. Yang, P. Perdikaris, D. E. Hurtado, E. Kuhl, Physics-informed neural networks for cardiac activation mapping, Front. Phys., 8 (2020), 42. https://doi.org/10.3389/fphy.2020.00042 doi: 10.3389/fphy.2020.00042
[10]
J. D. Willard, X. Jia, S. Xu, M. S. Steinbach, V. Kumar, Integrating physics-based modeling with machine learning: A Survey, preprint, arXiv: 2003.04919, 2020. https://doi.org/10.48550/arXiv.2003.04919
[11]
G. E. Karniadakis, I. G. Kevrekidis, L. Lu, P. Perdikaris, S. Wang, L. Yang, Physics-informed machine learning, Nat. Rev. Phys., 3 (2021), 422–440. https://doi.org/10.1038/s42254-021-00314-5 doi: 10.1038/s42254-021-00314-5
[12]
M. Rashtbehesht, C. Huber, K. Shukla, G. Karniadakis, Physics-informed deep learning for wave propagation and full waveform inversions, J. Geophys. Res.: Solid Earth, 2021 (2021). https://doi.org/10.1029/2021JB023120 doi: 10.1029/2021JB023120
[13]
S. Liu, B. B. Kappes, B. Amin-ahmadi, O. Benafan, X. Zhang, A. P. Stebner, Physics-informed machine learning for composition - process - property design: Shape memory alloy demonstration, Appl. Mater. Today, 22 (2021), 100898. https://doi.org/10.1016/j.apmt.2020.100898 doi: 10.1016/j.apmt.2020.100898
[14]
M. Raissi, G. E. Karniadakis, Hidden physics models: Machine learning of nonlinear partial differential equations, J. Comput. Phys., 357 (2018), 125–141. https://doi.org/10.1016/j.jcp.2017.11.039 doi: 10.1016/j.jcp.2017.11.039
[15]
E. Samaniego, C. Anitescu, S. Goswami, V. M. Nguyen-Thanh, H. Guo, K. Hamdia, et al., An energy approach to the solution of partial differential equations in computational mechanics via machine learning: Concepts, implementation and applications, Comput. Methods Appl. Mech. Eng., 362 (2020), 112790. https://doi.org/10.1016/j.cma.2019.112790 doi: 10.1016/j.cma.2019.112790
[16]
C. Rao, H. Sun, Y. Liu, Physics-informed deep learning for computational elastodynamics without labeled data, J. Eng. Mech., 147 (2021), 04021043. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001947 doi: 10.1061/(ASCE)EM.1943-7889.0001947
[17]
B. Wu, O. Hennigh, J. Kautz, S. Choudhry, W. Byeon, Physics informed RNN-DCT networks for time-dependent partial differential equations, preprint, arXiv: 2202.12358, 2022. https://doi.org/10.48550/arXiv.2202.12358
[18]
A. Sherstinsky, Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network, Physica D, 404 (2020), 132306. https://doi.org/10.1016/j.physd.2019.132306 doi: 10.1016/j.physd.2019.132306
Y. Kim, Convolutional neural networks for sentence classification, in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014. https://doi.org/10.3115/v1/D14-1181
[21]
Y. Zhu, N. Zabaras, Bayesian deep convolutional encoder-decoder networks for surrogate modeling and uncertainty quantification, J. Comput. Phys., 366 (2018), 243–266. https://doi.org/10.1016/j.jcp.2018.04.018 doi: 10.1016/j.jcp.2018.04.018
Y. Zhu, N. Zabaras, P. S. Koutsourelakis, P. Perdikaris, Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data, J. Comput. Phys., 394 (2019), 56–81. https://doi.org/10.1016/j.jcp.2019.05.024 doi: 10.1016/j.jcp.2019.05.024
[24]
L. Sun, H. Gao, S. Pan, J. Wang, Surrogate modeling for fluid flows based on physics-constrained deep learning without simulation data, Comput. Methods Appl. Mech. Eng., 351 (2019), 112732. https://doi.org/10.1016/j.cma.2019.112732 doi: 10.1016/j.cma.2019.112732
[25]
H. Gao, L. Sun, J. X. Wang, PhyGeoNet: Physics-informed geometry-adaptive convolutional neural networks for solving parameterized steady-state PDEs on irregular domain, J. Comput. Phys., 428 (2020), 110079. https://doi.org/10.1016/j.jcp.2020.110079 doi: 10.1016/j.jcp.2020.110079
[26]
P. Ren, C. Rao, Y. Liu, Z. Ma, Q. Wang, J. X. Wang, et al., PhySR: Physics-informed deep super-resolution for spatiotemporal data, J. Comput. Phys., 492 (2023), 112438. https://doi.org/10.1016/j.jcp.2023.112438 doi: 10.1016/j.jcp.2023.112438
[27]
A. D. Jagtap, G. E. Karniadakis, Extended physics-informed neural networks (XPINNs): A generalized space-time domain decomposition based deep learning framework for nonlinear partial differential equations, Commun. Comput. Phys., 28 (2020), 2002–2041. https://doi.org/10.4208/cicp.OA-2020-0164 doi: 10.4208/cicp.OA-2020-0164
[28]
P. Ren, N. Erichson, S. Subramanian, O. San, Z. Lukic, M. Mahoney, SuperBench: A super-resolution benchmark dataset for scientific machine learning, preprint, arXiv: 2306.14070, 2023. https://doi.org/10.48550/arXiv.2306.14070
[29]
L. Wang, Z. Zhou, Z. Yan, Data-driven vortex solitons and parameter discovery of 2D generalized nonlinear Schrödinger equations with a PT-symmetric optical lattice, Comput. Math. Appl., 140 (2023), 17–23. https://doi.org/10.1016/j.camwa.2023.03.015 doi: 10.1016/j.camwa.2023.03.015
[30]
M. Sadr, T. Tohme, K. Youcef-Toumi, Data-driven discovery of PDEs via the adjoint method, preprint, arXiv: 2401.17177, 2024. https://doi.org/10.48550/arXiv.2401.17177
[31]
F. J. Aguilar-Canto, C. Brito-Loeza, H. Calvo, Model discovery of compartmental models with graph-supported neural networks, Appl. Math. Comput., 464 (2024), 128392. https://doi.org/10.1016/j.amc.2023.128392 doi: 10.1016/j.amc.2023.128392
[32]
C. Rao, P. Ren, Y. Liu, H. Sun, Discovering nonlinear PDEs from scarce data with physics-encoded learning, preprint, arXiv: 2201.12354, 2022. https://doi.org/10.48550/arXiv.2201.12354
[33]
Y. Hu, T. Zhao, S. Xu, L. Lin, Z. Xu, Neural-PDE: a RNN based neural network for solving time dependent PDEs, Commun. Inf. Syst., 22 (2020), 223–245. https://doi.org/10.4310/CIS.2022.v22.n2.a3 doi: 10.4310/CIS.2022.v22.n2.a3
[34]
P. Ren, C. Rao, Y. Liu, J. X. Wang, H. Sun, PhyCRNet: Physics-informed convolutional-recurrent network for solving spatiotemporal PDEs, Comput. Methods Appl. Mech. Eng., 389 (2022), 114399. https://doi.org/10.1016/j.cma.2021.114399 doi: 10.1016/j.cma.2021.114399
[35]
L. Jiang, L. Wang, X. Chu, Y. Xiao, H. Zhang, PhyGNNet: Solving spatiotemporal PDEs with physics-informed graph neural network, in Proceedings of the 2023 2nd Asia Conference on Algorithms, Computing and Machine Learning, 2022. https://doi.org/10.1145/3590003.3590029
[36]
X. Meng, Z. Li, D. Zhang, G. E. Karniadakis, PPINN: Parareal physics-informed neural network for time-dependent PDEs, Comput. Methods Appl. Mech. Eng., 370 (2020), 113250. https://doi.org/10.1016/j.cma.2020.113250 doi: 10.1016/j.cma.2020.113250
[37]
A. Mavi, A. C. Bekar, E. Haghighat, E. Madenci, An unsupervised latent/output physics-informed convolutional-LSTM network for solving partial differential equations using peridynamic differential operator, Comput. Methods Appl. Mech. Eng., 407 (2023), 115944. https://doi.org/10.1016/j.cma.2023.115944 doi: 10.1016/j.cma.2023.115944
[38]
P. Saha, S. Dash, S. Mukhopadhyay, Physics-incorporated convolutional recurrent neural networks for source identification and forecasting of dynamical systems, Neural Networks, 144 (2021), 359–371. https://doi.org/10.1016/j.neunet.2021.08.033 doi: 10.1016/j.neunet.2021.08.033
[39]
P. R. Kakka, Sequence to sequence AE-ConvLSTM network for modelling the dynamics of PDE systems, preprint, arXiv: 2208.07315, 2022. https://doi.org/10.48550/arXiv.2208.07315
G. Larsson, M. Maire, G. Shakhnarovich, FractalNet: Ultra-deep neural networks without residuals, preprint, arXiv: 1605.07648, 2017. https://doi.org/10.48550/arXiv.1605.07648
[43]
Y. Lu, A. Zhong, Q. Li, B. Dong, Beyond finite layer neural networks: bridging deep architectures and numerical differential equations, preprint, arXiv: 1710.10121, 2020. https://doi.org/10.48550/arXiv.1710.10121
[44]
L. Ruthotto, E. Haber, Deep neural networks motivated by partial differential equations, J. Math. Imaging Vision, 61 (2019), 787–805. https://doi.org/10.1007/s10851-019-00903-1 doi: 10.1007/s10851-019-00903-1
[45]
Z. Long, Y. Lu, B. Dong, PDE-Net 2.0: Learning PDEs from data with a numeric-symbolic hybrid deep network, J. Comput. Phys., 399 (2019), 108925. https://doi.org/10.1016/j.jcp.2019.108925 doi: 10.1016/j.jcp.2019.108925
[46]
M. Raissi, Deep hidden physics models: deep learning of nonlinear partial differential equations, J. Mach. Learn. Res., 19 (2018), 932–955. https://doi.org/10.5555/3291125.3291150 doi: 10.5555/3291125.3291150
Table 2.
Orthonormal temporal frequencies for 109 days (upper block or F109) and orthonormal spatial frequencies for 117 countries (lower block or F117).
Table A3.
Inner products w.r.t. temporal (case: upper top and death: upper bottom blocks) and spatial (case: lower top and death: lower bottom blocks) frequencies at a national level.
Table A4.
Temporal inner product for continents (World, Africa, Asia, Europe, North and South America) w.r.t. daily biweekly growth rates of cases (upper block) and deaths (lower block) from March 22nd, 2020 to June 11th, 2021 (447 days).
Table A5.
Distance matrices for daily biweekly growth rates of cases (uppermost block) and deaths (2nd block) w.r.t. temporal frequencies and the ones of cases (3rd block) and deaths (bottommost block) w.r.t. spatial frequencies.
Table A6.
Variability for 117 countries with respect to daily biweekly growth rates of cases and death under temporal frequencies and variability for 109 days with respect to daily biweekly growth rates of cases and death under spatial frequencies.
Figure 1. PhyICNet architecture. a. By learning the known state variable u0 through T time steps in multiple convolutional layers, interactive learning modules, and ConvLSTM time propagator, the bounded output uT is obtained. b. PhyICNet is updated through r rounds of interaction. c. Padding implements BCs hard constraints. Dirichlet BCs (left) and Neumann BCs (right). d. Key features of the network
Figure 2. Comparison of long-term prediction performance. PhyICNet and baseline model predicted images and errors at randomly selected time T = 3, 7, 12 seconds
Figure 3. The influence of noise level on the three networks
Figure 4. Comparing the difference of the error evolution of different networks over time