On sparse regression, Lp-regularization, and automated model discovery
Jeremy A. McCulloch, Skyler R. St. Pierre, Kevin Linka, Ellen Kuhl
TL;DR
This work tackles automatic discovery of interpretable, data-driven constitutive relations for nonlinear material behavior using a hybrid approach that combines $L_p$ regularization with physics-informed constitutive neural networks. It introduces two architectures—an invariant-based network and a principal-stretch based network—that enforce thermodynamic consistency and objectivity, and it systematically examines how $L_p$ regularization ($p$ and $\alpha$) influences sparsity, bias, and robustness on both synthetic and brain-tissue data, with normalization proving crucial for stable discovery. The key finding is that $L_2$ regularization is inadequate for discovery, $L_1$ induces sparsity but can bias results, and $L_0$ regularization provides transparent control over the trade-off between interpretability and predictive accuracy, enabling best-in-class term discovery—though nonlinear cases can exhibit multiple local minima, especially in the invariant-based network. The results suggest broad applicability to other discovery methods (e.g., sparse/symbolic regression) and domains, with potential implications for generative material design and automated discovery of materials with user-defined properties.
Abstract
Sparse regression and feature extraction are the cornerstones of knowledge discovery from massive data. Their goal is to discover interpretable and predictive models that provide simple relationships among scientific variables. While the statistical tools for model discovery are well established in the context of linear regression, their generalization to nonlinear regression in material modeling is highly problem-specific and insufficiently understood. Here we explore the potential of neural networks for automatic model discovery and induce sparsity by a hybrid approach that combines two strategies: regularization and physical constraints. We integrate the concept of Lp regularization for subset selection with constitutive neural networks that leverage our domain knowledge in kinematics and thermodynamics. We train our networks with both, synthetic and real data, and perform several thousand discovery runs to infer common guidelines and trends: L2 regularization or ridge regression is unsuitable for model discovery; L1 regularization or lasso promotes sparsity, but induces strong bias; only L0 regularization allows us to transparently fine-tune the trade-off between interpretability and predictability, simplicity and accuracy, and bias and variance. With these insights, we demonstrate that Lp regularized constitutive neural networks can simultaneously discover both, interpretable models and physically meaningful parameters. We anticipate that our findings will generalize to alternative discovery techniques such as sparse and symbolic regression, and to other domains such as biology, chemistry, or medicine. Our ability to automatically discover material models from data could have tremendous applications in generative material design and open new opportunities to manipulate matter, alter properties of existing materials, and discover new materials with user-defined properties.
