STORM: Strategic Orchestration of Modalities for Rare Event Classification
Payal Kamboj, Ayan Banerjee, Sandeep K. S. Gupta
TL;DR
This paper tackles the challenge of multimodal learning for rare-event classification in biomedicine by proposing STORM, an entropy-based modality selection framework that combines expert-guided narrowing with data-driven assessment. It quantifies information content via class-wise entropy and employs an entropy-imbalance metric and information gain to rank and cascade modalities, addressing NP-hard subset selection through a Hunt-style procedure. The method is demonstrated on seizure onset zone detection from resting-state fMRI, revealing that expert-derived spatial features (D1) can be crucial while temporal features (D2) may contribute less to AI performance, and that the basic modality is essential. The work advances data-efficient, clinically applicable AI by enabling principled modality orchestration and could generalize to other rare-event biomedical tasks requiring careful modality budgeting.
Abstract
In domains such as biomedical, expert insights are crucial for selecting the most informative modalities for artificial intelligence (AI) methodologies. However, using all available modalities poses challenges, particularly in determining the impact of each modality on performance and optimizing their combinations for accurate classification. Traditional approaches resort to manual trial and error methods, lacking systematic frameworks for discerning the most relevant modalities. Moreover, although multi-modal learning enables the integration of information from diverse sources, utilizing all available modalities is often impractical and unnecessary. To address this, we introduce an entropy-based algorithm STORM to solve the modality selection problem for rare event. This algorithm systematically evaluates the information content of individual modalities and their combinations, identifying the most discriminative features essential for rare class classification tasks. Through seizure onset zone detection case study, we demonstrate the efficacy of our algorithm in enhancing classification performance. By selecting useful subset of modalities, our approach paves the way for more efficient AI-driven biomedical analyses, thereby advancing disease diagnosis in clinical settings.
