Health system learning achieves generalist neuroimaging models

Akhil Kondepudi; Akshay Rao; Chenhui Zhao; Yiwei Lyu; Samir Harake; Soumyanil Banerjee; Rushikesh Joshi; Anna-Katharina Meissner; Renly Hou; Cheng Jiang; Asadur Chowdury; Ashok Srinivasan; Brian Athey; Vikas Gulani; Aditya Pandey; Honglak Lee; Todd Hollon

Health system learning achieves generalist neuroimaging models

Akhil Kondepudi, Akshay Rao, Chenhui Zhao, Yiwei Lyu, Samir Harake, Soumyanil Banerjee, Rushikesh Joshi, Anna-Katharina Meissner, Renly Hou, Cheng Jiang, Asadur Chowdury, Ashok Srinivasan, Brian Athey, Vikas Gulani, Aditya Pandey, Honglak Lee, Todd Hollon

TL;DR

The introduction of NeuroVFM, a visual foundation model trained on 5.24 million clinical MRI and CT volumes using a scalable volumetric joint-embedding predictive architecture, establishes health system learning as a paradigm for building generalist medical AI and provides a scalable framework for clinical foundation models.

Abstract

Frontier artificial intelligence (AI) models, such as OpenAI's GPT-5 and Meta's DINOv3, have advanced rapidly through training on internet-scale public data, yet such systems lack access to private clinical data. Neuroimaging, in particular, is underrepresented in the public domain due to identifiable facial features within MRI and CT scans, fundamentally restricting model performance in clinical medicine. Here, we show that frontier models underperform on neuroimaging tasks and that learning directly from uncurated data generated during routine clinical care at health systems, a paradigm we call health system learning, yields high-performance, generalist neuroimaging models. We introduce NeuroVFM, a visual foundation model trained on 5.24 million clinical MRI and CT volumes using a scalable volumetric joint-embedding predictive architecture. NeuroVFM learns comprehensive representations of brain anatomy and pathology, achieving state-of-the-art performance across multiple clinical tasks, including radiologic diagnosis and report generation. The model exhibits emergent neuroanatomic understanding and interpretable visual grounding of diagnostic findings. When paired with open-source language models through lightweight visual instruction tuning, NeuroVFM generates radiology reports that surpass frontier models in accuracy, clinical triage, and expert preference. Through clinically grounded visual understanding, NeuroVFM reduces hallucinated findings and critical errors, offering safer clinical decision support. These results establish health system learning as a paradigm for building generalist medical AI and provide a scalable framework for clinical foundation models.

Health system learning achieves generalist neuroimaging models

TL;DR

Abstract

Health system learning achieves generalist neuroimaging models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (38)