Function Class Learning with Genetic Programming: Towards Explainable Meta Learning for Tumor Growth Functionals
E. M. C. Sijben, J. C. Jansen, P. A. N. Bosman, T. Alderliesten
TL;DR
This work introduces FC-GOMEA, a GP-based framework that learns a set of function classes representing overarching growth patterns across multiple tumor data sets, with local per-dataset constants $c^{\mathrm{FC}}$ to tailor predictions. By coupling GP-GOMEA with RV-GOMEA and employing a multi-modal, multi-objective search (MM-GP-GOMEA), FC-GOMEA discovers diverse, interpretable function classes that explain tumor growth while enabling dataset-specific refinements. The method is validated on both synthetic data that reflect known logistic and Gompertz growth forms and a real-world paraganglioma dataset, demonstrating the ability to recover meaningful classes and to provide per-tumor predictions via learned constants. Although computationally intensive, the approach yields explainable meta-learning outputs that can be used to assess growth patterns and uncertainty across related patient data, potentially informing treatment timing decisions.
Abstract
Paragangliomas are rare, primarily slow-growing tumors for which the underlying growth pattern is unknown. Therefore, determining the best care for a patient is hard. Currently, if no significant tumor growth is observed, treatment is often delayed, as treatment itself is not without risk. However, by doing so, the risk of (irreversible) adverse effects due to tumor growth may increase. Being able to predict the growth accurately could assist in determining whether a patient will need treatment during their lifetime and, if so, the timing of this treatment. The aim of this work is to learn the general underlying growth pattern of paragangliomas from multiple tumor growth data sets, in which each data set contains a tumor's volume over time. To do so, we propose a novel approach based on genetic programming to learn a function class, i.e., a parameterized function that can be fit anew for each tumor. We do so in a unique, multi-modal, multi-objective fashion to find multiple potentially interesting function classes in a single run. We evaluate our approach on a synthetic and a real-world data set. By analyzing the resulting function classes, we can effectively explain the general patterns in the data.
