Unmasking unlearnable models: a classification challenge for biomedical images without visible cues
Shivam Kumar, Samrat Chatterjee
TL;DR
This paper tackles the challenge of predicting MGMT methylation status from MRI images that lack visually discernible cues, a problem with high clinical relevance due to treatment decisions in glioma. Through a process-driven evaluation, the authors benchmark multiple CNN architectures, explore transfer learning, analyze gradient flow and learning curves, and apply radiomics-based feature selection to probe the roots of limited learnability. They find that current models are largely unlearnable for this task, with transfer learning offering limited gains and complexity unable to overcome fundamental limitations. Radiomic features provide some informative signal, but are not sufficient alone, prompting the authors to advocate domain-informed feature maps and indigenous architectural strategies to address non-visible cues and improve real-world applicability.
Abstract
Predicting traits from images lacking visual cues is challenging, as algorithms are designed to capture visually correlated ground truth. This problem is critical in biomedical sciences, and their solution can improve the efficacy of non-invasive methods. For example, a recent challenge of predicting MGMT methylation status from MRI images is critical for treatment decisions of glioma patients. Using less robust models poses a significant risk in these critical scenarios and underscores the urgency of addressing this issue. Despite numerous efforts, contemporary models exhibit suboptimal performance, and underlying reasons for this limitation remain elusive. In this study, we demystify the complexity of MGMT status prediction through a comprehensive exploration by performing benchmarks of existing models adjoining transfer learning. Their architectures were further dissected by observing gradient flow across layers. Additionally, a feature selection strategy was applied to improve model interpretability. Our finding highlighted that current models are unlearnable and may require new architectures to explore applications in the real world. We believe our study will draw immediate attention and catalyse advancements in predictive modelling with non-visible cues.
