Multi-modal Imaging Genomics Transformer: Attentive Integration of Imaging with Genomic Biomarkers for Schizophrenia Classification
Nagur Shareef Shaik, Teja Krishna Cherukuri, Vince D. Calhoun, Dong Hye Ye
TL;DR
Schizophrenia diagnosis benefits from integrating structural MRI, functional connectome, and genomic biomarkers, but effective multi-modal fusion remains challenging. The authors propose MIGTrans, a three-way attentive transformer that combines Genomic and Connectome Encoders, a Structural MRI Encoder with Spatial Sequence Attention, and a Fusion Transformer that performs cross-modal fusion through stepwise attention. The model achieves an accuracy of $86.05\%$ (±$0.02$) on an FBIRN-derived dataset and provides interpretable biomarkers by highlighting significant SNPs and pivotal brain regions and connections. The work improves diagnostic performance and offers biological insights that could inform personalized interventions for schizophrenia.
Abstract
Schizophrenia (SZ) is a severe brain disorder marked by diverse cognitive impairments and by abnormalities in brain structure, brain function, and genetic factors. Its complex symptoms and overlap with other psychiatric conditions challenge traditional diagnostic methods, necessitating advanced systems to improve precision. Existing studies have mostly focused on imaging data, such as structural and functional MRI, for SZ diagnosis. Less attention has been paid to integrating genomic features, despite their potential for identifying heritable SZ traits. In this study, we introduce a Multi-modal Imaging Genomics Transformer (MIGTrans) that attentively integrates genomics with structural and functional imaging data to capture SZ-related neuroanatomical and connectome abnormalities. MIGTrans demonstrated improved SZ classification performance with an accuracy of 86.05% (± 0.02), offering clear interpretations and identifying significant genomic locations and brain morphological/connectivity patterns associated with SZ.
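
To make the idea of attentive, stepwise integration across three modalities more concrete, the following is a minimal NumPy sketch of generic scaled dot-product cross-attention applied in two steps. It is not the MIGTrans implementation: the token counts, the embedding size, the order in which modalities attend to each other, the residual sums, and the absence of learned projection weights are all assumptions made purely for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(queries, context):
    """Scaled dot-product attention: each query token attends over context tokens.

    No learned projections are used here (an illustrative simplification).
    Returns the attended features and the attention weights.
    """
    d = queries.shape[-1]
    scores = queries @ context.T / np.sqrt(d)  # shape (n_queries, n_context)
    weights = softmax(scores, axis=-1)         # each row sums to 1
    return weights @ context, weights          # shapes (n_queries, d), (n_queries, n_context)


rng = np.random.default_rng(0)
d = 16                                         # hypothetical embedding size
genomic = rng.normal(size=(8, d))              # hypothetical: 8 SNP tokens
connectome = rng.normal(size=(10, d))          # hypothetical: 10 connectivity tokens
smri = rng.normal(size=(12, d))                # hypothetical: 12 spatial sMRI tokens

# Step 1 (assumed order): genomic tokens attend over connectome tokens.
step1, w_gen_conn = cross_attention(genomic, connectome)

# Step 2 (assumed order): the partially fused tokens attend over sMRI tokens.
partial = genomic + step1                      # residual sum, an assumption
step2, w_fused_smri = cross_attention(partial, smri)

# Pool the fused tokens into one subject-level vector for a downstream classifier.
fused = (partial + step2).mean(axis=0)
```

In a sketch like this, the attention weight matrices (`w_gen_conn`, `w_fused_smri`) are the kind of quantity one could inspect to see which connections or regions each genomic token weights most heavily, which is the general mechanism behind attention-based interpretability.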
