Genetic Information Analysis of Age-Related Macular Degeneration Fellow Eye Using Multi-Modal Selective ViT
Yoichi Furukawa, Satoshi Kamiya, Yoichi Sakurada, Kenji Kashiwagi, Kazuhiro Hotta
TL;DR
This work targets non-invasive genetic risk assessment for AMD by predicting the presence of risk alleles in $ARMS2$ and $CFH$ using a multi-modal framework that fuses fundus, OCT, and medical records. The proposed MSViT architecture (MME, ST, and Enhanced head) with TSIA for image-augmentation and a Record Revive Algorithm (RRA) for reconstructing tabular data achieves high accuracy (>80%) in predicting risk allele count $=2$. Key contributions include a multi-modal embedding strategy, selective attention to informative tokens, and reconstruction-driven training, which collectively improve classification performance and provide interpretable token visualizations. The approach addresses irregular modality counts and missing data, offering a practical pathway for integrating imaging and clinical data in AMD risk prediction.
Abstract
In recent years, there has been significant development in the analysis of medical data using machine learning. It is believed that the onset of Age-related Macular Degeneration (AMD) is associated with genetic polymorphisms. However, genetic analysis is costly, and artificial intelligence may offer assistance. This paper presents a method that predict the presence of multiple susceptibility genes for AMD using fundus and Optical Coherence Tomography (OCT) images, as well as medical records. Experimental results demonstrate that integrating information from multiple modalities can effectively predict the presence of susceptibility genes with over 80$\%$ accuracy.
