×

On sparse regression, \(L_p\)-regularization, and automated model discovery. (English) Zbl 07901039

Summary: Sparse regression and feature extraction are the cornerstones of knowledge discovery from massive data. Their goal is to discover interpretable and predictive models that provide simple relationships among scientific variables. While the statistical tools for model discovery are well established in the context of linear regression, their generalization to nonlinear regression in material modeling is highly problem-specific and insufficiently understood. Here we explore the potential of neural networks for automatic model discovery and induce sparsity by a hybrid approach that combines two strategies: regularization and physical constraints. We integrate the concept of \(L_p\) regularization for subset selection with constitutive neural networks that leverage our domain knowledge in kinematics and thermodynamics. We train our networks with both, synthetic and real data, and perform several thousand discovery runs to infer common guidelines and trends: \(L_2\) regularization or ridge regression is unsuitable for model discovery; \(L_1\) regularization or lasso promotes sparsity, but induces strong bias that may aggressively change the results; only \(L_0\) regularization allows us to transparently fine-tune the trade-off between interpretability and predictability, simplicity and accuracy, and bias and variance. With these insights, we demonstrate that \(L_p\) regularized constitutive neural networks can simultaneously discover both, interpretable models and physically meaningful parameters. We anticipate that our findings will generalize to alternative discovery techniques such as sparse and symbolic regression, and to other domains such as biology, chemistry, or medicine. Our ability to automatically discover material models from data could have tremendous applications in generative material design and open new opportunities to manipulate matter, alter properties of existing materials, and discover new materials with user-defined properties.
© 2024 The Authors. International Journal for Numerical Methods in Engineering published by John Wiley & Sons Ltd.

MSC:

74S99 Numerical and other methods in solid mechanics
74A20 Theory of constitutive functions in solid mechanics
74B20 Nonlinear elasticity
68T07 Artificial neural networks and deep learning

References:

[1] AlberM, Buganza TepoleA, CannonW, et al. Integrating machine learning and multiscale modeling: perspectives, challenges, and opportunities in the biological, biomedical, and behavioral sciences. NPJ Digit Med. 2019;2:115.
[2] BruntonSL, KutzJN. Data‐Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control. 1st ed.Cambridge University Press; 2019. · Zbl 1407.68002
[3] As’adF, AveryP, FarhatC. A mechanics‐informed artificial neural network approach in data‐driven constitutive modeling. Int J Numer Methods Eng. 2022;123:2738‐2759. · Zbl 1528.74114
[4] FuhgJN, BouklasN, JonesRE. Learning hyperelastic anisotropy from data via a tensor basis neural network. J Mech Phys Solids. 2022;168:105022.
[5] GhaderiA, MorovatiV, DargazanyR. A physics‐informed assembly for feed‐forward neural network engines to predict inelasticity in cross‐linked polymers. Polymers. 2020;12:2628.
[6] HolthusenH, LammL, BrepolsT, ReeseS, KuhlE. Theory and implementation of inelastic constitutive artificial neural networks; 2023. doi:10.48550/arXiv.2311.06380
[7] HolzapfelGA, LinkaK, SherifovaS, CyronC. Predictive constitutive modelling of arteries by deep learning. J R Soc Interface. 2021;18:20210411.
[8] HuangDZ, XuK, FarhatC, DarveE. Learning constitutive relations from indirect observations using deep neural networks. J Comput Phys. 2020;416:109491. · Zbl 1437.65192
[9] KleinDK, FernandezM, MartinRJ, NeffP, WeegerO. Polyconvex anisotropic hyperelasticity with neural networks. J Mech Phys Solids. 2022;159:105703.
[10] MasiF, StefanouI, VannucciP, Maffi‐BerthierV. Thermodynamics‐based artificial neural networks for constitutive modeling. J Mech Phys Solids. 2021;147:04277.
[11] PeirlinckM, LinkaK, HurtadoJA, KuhlE. On automated model discovery and a universal material subroutine for hyperelastic materials. Comput Methods Appl Mech Eng. 2024;418:116534. · Zbl 1539.74067
[12] TacV, Sahli CostabalF, BuganzaTA. Data‐driven tissue mechanics with polyconvex neural ordinary differential equations. Comput Methods Appl Mech Eng. 2022;398:115248. · Zbl 1507.74236
[13] TacV, LinkaK, Sahli CostabalF, KuhlE, BuganzaTA. Benchmarking physics‐informed frameworks for data‐driven hyperelasticity. Comput Mech. 2023;73:49‐65. · Zbl 1542.74081
[14] WangLM, LinkaK, KuhlE. Automated model discovery for muscle using constitutive recurrent neural networks. J Mech Behav Biomed Mater. 2023;145:106021.
[15] LinkaK, HillgartnerM, AbdolaziziKP, AydinRC, ItskovM, CyronCJ. Constitutive artificial neural networks: a fast and general approach to predictive data‐driven constitutive modeling by deep learning. J Comput Phys. 2021;429:110010. · Zbl 07500745
[16] GhaboussiJ, GarrettJH, WuX. Knowledge‐based modeling of material behavior with neural networks. J Eng Mech. 1991;117:132‐153.
[17] LinkaK, KuhlE. A new family of constitutive artificial neural networks towards automated model discovery. Comput Methods Appl Mech Eng. 2023;403:115731. · Zbl 1536.74023
[18] BongardJ, LipsonH. Automated reverse engineering of nonlinear dynamical systems. Proc Natl Acad Sci USA. 2007;104:9943‐9948. · Zbl 1155.37044
[19] SchmidtM, LipsonH. Distilling free‐form natural laws from experimental data. Science. 2009;324:81‐85.
[20] BruntonSL, ProctorJP, KutzJN. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc Natl Acad Sci USA. 2016;113:3932‐3937. · Zbl 1355.94013
[21] PengGCY, AlberM, Buganza TepoleA, et al. Multiscale modeling meets machine learning: what can we learn?Arch Comput Methods Eng. 2021;28:1017‐1037.
[22] PeirlinckM, LinkaK, HurtadoJA, HolzapfelGA, KuhlE. Democratizing biomedical simulation through automated model discovery and a universal material subroutine. bioRxiv, 2023. doi:10.1101/2023.12.06.570487
[23] SimoJC, TaylorRL. Consistent tangent operators for rate‐independent elastoplasticity. Comput Methods Appl Mech Eng. 1985;48:101‐118. · Zbl 0535.73025
[24] ZienkiewiczOC, TaylorRL. The Finite Element Method. 4th ed.McGraw Hill College; 1987.
[25] FlaschelM, KumarS, De LorenzisL. Unsupervised discovery of interpretable hyperelastic constitutive laws. Comput Methods Appl Mech Eng. 2021;381:113852. · Zbl 1506.74051
[26] AbdusalamovR, HillgartnerM, ItskovM. Automatic generation of interpretable hyperelastic models by symbolic regression. Int J Numer Methods Eng. 2023;124:2093‐2104. · Zbl 1532.74014
[27] WangZ, EstradaJB, ArrudaEM, GarikipatiK. Inference of deformation mechanisms and constitutive response of soft material surrogates of biological tissue by full‐field characterization and data‐driven variational system identification. J Mech Phys Solids. 2021;153:104474.
[28] FlaschelM, KumarS, De LorenzisL. Automated discovery of generalized standard material models with EUCLID. Comput Methods Appl Mech Eng. 2023;405:115867. · Zbl 1539.74136
[29] LinkaK, St PierreSR, KuhlE. Automated model discovery for human brain using constitutive artificial neural networks. Acta Biomater. 2023;160:134‐151.
[30] FrankIE, FriedmanJH. A statistical view of some chemometrics regression tools. Dent Tech. 1993;35:109‐135. · Zbl 0775.62288
[31] HoerlAE, KennardRW. Ridge regression: biased estimation for nonorthogonal problems. Dent Tech. 1970;12:55‐67. · Zbl 0202.17205
[32] SantosaF, SymesWW. Linear inversion of band‐limited reflection seismograms. SIAM J Sci Stat Comput. 1986;7:1307‐1330. · Zbl 0602.73113
[33] TibshiraniR. Regression shrinkage and selection via the lasso. J R Stat Soc B Methodol. 1996;58:267‐288. · Zbl 0850.62538
[34] HastieT, TibshiraniR, FriedmanJ. The Elements of Statistical Learning. 2nd ed.Springer; 2009. · Zbl 1273.62005
[35] JamesG, WittenD, HastieT, TibshiraniR. An Introduction to Statistical Learning. 2nd ed.Springer; 2013. · Zbl 1281.62147
[36] KozaJR. Genetic Programming: on the Programming of Computers by Means of Natural Selection. 1st ed.The MIT Press; 1992. · Zbl 0850.68161
[37] SeberGAF, LeeAJ. Linear Regression Analysis. 1st ed.John Wiley & Sons; 1977. · Zbl 0354.62055
[38] AntmanSS. Nonlinear Problems of Elasticity. 2nd ed.Springer; 2005. · Zbl 1098.74001
[39] FuhgJN, BouklasN. On physics‐informed data‐driven isotropic and anisotropic constitutive models through probabilistic machine learning and space‐filling sampling. Comput Methods Appl Mech Eng. 2022;394:114915. · Zbl 1507.74551
[40] HartmannS. Parameter estimation of hyperelastic relations of generalized polynomial‐type with constraint conditions. Int J Solids Struct. 2001;38:7999‐8018. · Zbl 1032.74014
[41] LindenL, KleinDK, KalinkaKA, BrummundJ, WeegerO, KästnerM. Neural networks meet hyperelasticity: a guide to enforcing physics. J Mech Phys Solids. 2023;179:105363.
[42] TacV, RauschMK, Sahli CostabalF, BuganzaTA. Data‐driven anisotropic finite viscoelasticity using neural ordinary differential equations. Comput Methods Appl Mech Eng. 2023;411:116046. · Zbl 1539.74005
[43] TruesdellC, NollW. Non‐linear field theories of mechanics. In: FlüggeS (ed.), ed. Encyclopedia of Physics. Vol III/3. Spinger; 1965. · Zbl 0137.19501
[44] SeberGAF, WildCJ. Nonlinear Regression. 1st ed.John Wiley & Sons; 1989. · Zbl 0721.62062
[45] BatesDM, WattsDG. Nonlinear Regression Analysis and its Applications. 1st ed.John Wiley & Sons; 1988. · Zbl 0728.62062
[46] ChampionK, LuschB, KutzN, BruntonSL. Data‐driven discovery of coordinates and governing equations. Proc Natl Acad Sci USA. 2019;116:22445‐22451. · Zbl 1433.68396
[47] ZhangL, SchaefferH. On the convergence of the SINDy algorithm. Multiscale Model Simul. 2019;17:948‐972. · Zbl 1437.37108
[48] HornikK, StinchcombeM, WhiteH. Multilayer feedforward networks are universal approximators. Neural Netw. 1989;2:359‐366. · Zbl 1383.92015
[49] GoodfellowI, BengioY, CourvilleA. Deep Learning. 1st ed.The MIT Press; 2016. · Zbl 1373.68009
[50] St PierreSR, LinkaK, KuhlE. Principal‐stretch‐based constitutive neural networks autonomously discover a subclass of Ogden models for human brain tissue. Brain Multiphys. 2023;4:100066.
[51] HanS, PoolJ, TranJ, DallyWJ. Learning both weights and connections for efficient neural networks. Proceedings of the 28th International Conference on Neural Information Processing Systems. Vol 1. MIT Press; 2015:1135‐1143.
[52] LeCunY, DenkerJS, SollaSA. Optimal brain damage. Advances in Neural Information Processing Systems 2. Morgan‐Kaufmann; 1989.
[53] FuWJ. The bridge versus the lasso. J Comput Graph Stat. 1998;7:397‐416.
[54] ZouH, HastieT. Regularization and variable selection via the elastic net. J R Stat Soc Series B Stat Methodol. 2005;67:301‐320. · Zbl 1069.62054
[55] FriedmanJH. Sparse regression in and classification. Int J Forecast. 2012;28:722‐738.
[56] HolzapfelGA. Nonlinear Solid Mechanics: A Continuum Approach to Engineering. John Wiley & Sons; 2000. · Zbl 0980.74001
[57] OgdenRW. Large deformation isotropic elasticity—on the correlation of theory and experiment for incompressible rubberlike solids. Proc R Soc Lond A Math Phys Sci. 1972;326:565‐584. · Zbl 0257.73034
[58] BlatzPJ, KoWL. Application of finite elastic theory to the deformation of rubbery materials. Trans Soc Rheol. 1962;6:223‐251.
[59] MooneyM. A theory of large elastic deformations. J Appl Phys. 1940;11:582‐590. · JFM 66.1021.04
[60] RivlinRS. Large elastic deformations of isotropic materials. IV. Further developments of the general theory. Philos Trans R Soc Lond A. 1948;241:379‐397. · Zbl 0031.42602
[61] TreloarLRG. Stresses and birefringence in rubber subjected to general homogeneous strain. Proc Phys Soc. 1948;60:135‐144.
[62] BuddayS, SommerG, BirklC, et al. Mechanical characterization of human brain tissue. Acta Biomater. 2017;48:319‐340.
[63] BuddayS, OvaertTC, HolzapfelGA, SteinmannP, KuhlE. Fifty shades of brain: a review on the material testing and modeling of brain tissue. Arch Comput Methods Eng. 2020;27:1187‐1230.
[64] LinkaK, Buganza TepoleA, HolzapfelGA, KuhlE. Automated model discovery for skin: discovering the best model, data, and experiment. Comput Methods Appl Mech Eng. 2023;410:116007. · Zbl 1539.74004
[65] St PierreSR, RajasekharanD, DarwinEC, LinkaK, LevenstonME, KuhlE. Discovering the mechanics of artificial and real meat. Comput Methods Appl Mech Eng. 2023;415:116236. · Zbl 1539.74246
[66] ValanisK, LandelRF. The strain‐energy function of a hyperelastic material in terms of the extension ratios. J Appl Phys. 1967;38:2997‐3002.
[67] OgdenRW, SaccomandiG, SguraI. Fitting hyperelastic models to experimental data. Comput Mech. 2004;34:484‐502. · Zbl 1109.74320
[68] KingmaDP, BaJ. Adam: a method for stochastic optimization; 2014. doi:10.48550/arXiv.1412.6980
[69] DemirayH. A note on the elasticity of soft biological tissues. J Biomech. 1972;5:309‐311.
[70] HolzapfelGA, GasserTC, OgdenRW. A new constitutive framework for arterial wall mechanics and comparative study of material models. J Elast. 2000;61:1‐48. · Zbl 1023.74033
[71] GentA. A new constitutive relation for rubber. Rubber Chem Technol. 1996;69:59‐61.
[72] NikolovDP, SrivastavaS, AbeidBA, et al. Ogden material calibration via magnetic resonance cartography, parameter sensitivity and variational system identification. Philos Trans A Math Phys Eng Sci. 2022;380:20210324.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.