Over the years, data-driven regression on univariate functions has been extensively studied. However, fast, effective, and stable algorithms for multivariate function fitting are still lacking. Recently, Kolmogorov-Arnold networks have garnered significant attention among scholars due to their superior accuracy and interpretability compared to multi-layer perceptrons. In this paper, we have demonstrated that the sigma-pi neural network, a form of Kolmogorov-Arnold networks, can efficiently fit multivariate polynomial functions, including fractional-order multivariate polynomials. Three examples were employed to illustrate the regression performance of the designed neural networks. The explainable sigma-pi neural network will lay the groundwork for further development of general tools for multivariate nonlinear function regression problems.
Citation: Xiaoxiang Guo, Zuolin Shi, Bin Li. Multivariate polynomial regression by an explainable sigma-pi neural network[J]. Big Data and Information Analytics, 2024, 8: 65-79. doi: 10.3934/bdia.2024004
Over the years, data-driven regression on univariate functions has been extensively studied. However, fast, effective, and stable algorithms for multivariate function fitting are still lacking. Recently, Kolmogorov-Arnold networks have garnered significant attention among scholars due to their superior accuracy and interpretability compared to multi-layer perceptrons. In this paper, we have demonstrated that the sigma-pi neural network, a form of Kolmogorov-Arnold networks, can efficiently fit multivariate polynomial functions, including fractional-order multivariate polynomials. Three examples were employed to illustrate the regression performance of the designed neural networks. The explainable sigma-pi neural network will lay the groundwork for further development of general tools for multivariate nonlinear function regression problems.
| [1] |
Clauset A, Larremore D, Sinatra R, (2017) Data-driven predictions in the science of science. Science 355: 477–480. https://doi.org/10.1126/science.aal4217 doi: 10.1126/science.aal4217
|
| [2] |
Subrahmanian VS, Kumar S, (2017) Predicting human behavior: The next frontiers. Science 355: 489. https://doi.org/10.1126/science.aam7032 doi: 10.1126/science.aam7032
|
| [3] |
Wang Z, Sun ZH, Yin H, Liu XH, Wang JL, Zhao HT, et al. (2022) Data-driven materials innovation and applications. Adv Mater 34: 2104113. https://doi.org/10.1002/adma.202104113 doi: 10.1002/adma.202104113
|
| [4] |
Chen X, Yan CC, Zhang XT, Zhang X, Dai F, Yin J, et al. (2016) Drug-target interaction prediction: Databases, web servers and computational models. Briefings Bioinf 17: 696–712. https://doi.org/10.1093/bib/bbv066 doi: 10.1093/bib/bbv066
|
| [5] |
Zhao Y, Yin J, Zhang L, Zhang Y, Chen X, (2024) Drug-drug interaction prediction: Databases, web servers and computational models. Briefings Bioinf 25: bbad445. https://doi.org/10.1093/bib/bbad445 doi: 10.1093/bib/bbad445
|
| [6] |
Guo XX, Sun YT, Ren JL, (2020) Low dimensional mid-term chaotic time series prediction by delay parameterized method. Inf Sci 516: 1–19. https://doi.org/10.1016/j.ins.2019.12.021 doi: 10.1016/j.ins.2019.12.021
|
| [7] |
Li B, Guo XX, Fang HY, Ren JL, Yang KJ, Wang F, et al. (2020) Prediction equation for maximum stress of concrete drainage pipelines subjected to various damages and complex service conditions. Constr Build Mater 264: 120238. https://doi.org/10.1016/j.conbuildmat.2020.120238 doi: 10.1016/j.conbuildmat.2020.120238
|
| [8] |
Guo XX, Xiong NN, Wang HY, Ren JL, (2022) Design and analysis of a prediction system about influenza-like illness from the latent temporal and spatial information. IEEE Trans Syst Man Cybern Syst 52: 66–77. https://doi.org/10.1109/TSMC.2020.3048946 doi: 10.1109/TSMC.2020.3048946
|
| [9] |
Li B, Fang HY, Yang KJ, Zhang XJ, Du XM, Wang NN, et al. (2022) Impact of erosion voids and internal corrosion on concrete pipes under traffic loads. Tunnelling Underground Space Technol 130: 104761. https://doi.org/10.1016/j.tust.2022.104761 doi: 10.1016/j.tust.2022.104761
|
| [10] | Rudin W, (1976) Principles of Mathematical Analysis, McGraw-Hill Companies. |
| [11] |
Guo XX, Han WM, Ren JL, (2023) Design of a prediction system based on the dynamical feed-forward neural network. Sci China Inf Sci 66: 112102. https://doi.org/10.1007/s11432-020-3402-9 doi: 10.1007/s11432-020-3402-9
|
| [12] |
Yu LP, Guo XX, Wang G, Sun BA, Han DX, Chen C, et al. (2022) Extracting governing system for the plastic deformation of metallic glasses using machine learning. Sci China Phys Mech Astron 65: 264611. https://doi.org/10.1007/s11433-021-1840-9 doi: 10.1007/s11433-021-1840-9
|
| [13] | Liu Z, Wang Y, Vaidya S, Ruehle F, Halverson J, Soljacic M, et al. (2024) KAN: Kolmogorov-Arnold networks. preprint, arXiv: 2404.19756. https://doi.org/10.48550/arXiv.2404.19756 |
| [14] |
Gurney KN, (1992) Training nets of hardware realizable sigma-pi units. Neural Networks 5: 289–303. https://doi.org/10.1016/S0893-6080(05)80027-9 doi: 10.1016/S0893-6080(05)80027-9
|
| [15] | Penny WD, Stonham TJ, (1995) Generalization in multi-layer networks of sigma-pi units IEEE Trans Neural Networks 6: 506–508. https://doi.org/10.1109/72.363490 |
| [16] | Lundberg SM, Lee SI, (2017) A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 30: 4765–4774. |
| [17] |
Wu LL, Wei GY, Wang G, Wang HY, Ren JL, (2022) Creating win-wins from strength-ductility trade-off in multi-principal element alloys by machine learning. Mater Today Commun 32: 104010. https://doi.org/10.1016/j.mtcomm.2022.104010 doi: 10.1016/j.mtcomm.2022.104010
|
| [18] |
Xiao L, Wang G, Long WM, Liaw PK, Ren JL, (2024) Fatigue life prediction of the FCC-based multi-principal element alloys via domain knowledge-based machine learning. Eng Fract Mech 296: 109860. https://doi.org/10.1016/j.engfracmech.2024.109860 doi: 10.1016/j.engfracmech.2024.109860
|