The search for the gust-wing interaction \emph{textbook}
Paolo Olivucci, David E. Rival
TL;DR
The paper tackles unsteady gust–wing interactions, a high‑dimensional and diverse physical problem, by proposing a textbook approach to compress a large experimental database into a small, representative subset for predictive modeling. It builds a purpose‑built random gust generator, collects a dataset of $1031$ gust events, and formulates the textbook selection problem as an unsupervised, submodular optimization using a facility‑location objective; the selected textbook is then used to train an MLP load predictor. Key findings show that a textbook of as few as $m_{txt}=10$ elements can achieve nearly the large‑sample accuracy while reducing data requirements by ~98%, and that textbooks generalize across typical and extreme cases with improved interpretability. The work demonstrates substantial data‑efficiency gains and provides a framework for extracting essential physics from large experimental datasets, with potential applications to other unsteady flow problems and to physics‑aware data summarization strategies.
Abstract
We address whether complex physical relations can be investigated through the synergy of automated high-volume experiments and the reduction of large datasets to a concise, representative subset of canonical examples -- a \emph{textbook}. To this end, we consider the unsteady aerodynamics of wing-gust interactions, which is characterized by its rich, high-dimensional physics. We take advantage of a purpose-built gust generator to systematically produce over 1,000 distinct random gust events and to measure the unsteady loads induced on a delta wing. We then employ a data summarization procedure to identify representative subsets of increasing size from the large-scale database, which then serve as training data for a machine-learning model of the aerodynamic loads from sparse pressure measurements. An appropriately selected \emph{textbook} of a few events can achieve predictive accuracy comparable to random training sets up to two orders of magnitude larger, capturing the intrinsic diversity of the full-scale data and enhancing modeling efficiency and interpretability. Our methodology evidences the potential of distilling the essential information contained in large amounts of experimental observations.
