Connectomics Informed by Large Language Models

Elinor Thompson; Tiantian He; Anna Schroder; Ahmed Abdulaal; Alec Sargood; Sonja Soskic; Henry F. J. Tregidgo; Daniel C. Alexander

Connectomics Informed by Large Language Models

Elinor Thompson, Tiantian He, Anna Schroder, Ahmed Abdulaal, Alec Sargood, Sonja Soskic, Henry F. J. Tregidgo, Daniel C. Alexander

TL;DR

The paper examines using large language models to generate anatomical priors for connectomics, addressing tractography’s false positives and negatives. It develops a pipeline that combines prompting strategies with retrieval-augmented generation to ground LLM outputs in parcellation context and neuroscience literature, and integrates these priors into tractography filtering. The study demonstrates near 90% edge-classification accuracy and shows that LLM-derived priors can improve a network diffusion model of pathology spread, with RAG providing verifiable citations. Limitations include potential LLM hallucinations and evaluation biases, but the framework offers a scalable, knowledge-grounded means to enhance connectome construction and interpretation.

Abstract

Tractography is a unique method for mapping white matter connections in the brain, but tractography algorithms suffer from an inherent trade-off between sensitivity and specificity that limits accuracy. Incorporating prior knowledge of white matter anatomy is an effective strategy for improving accuracy and has been successful for reducing false positives and false negatives in bundle-mapping protocols. However, it is challenging to scale this approach for connectomics due to the difficulty in synthesising information relating to many thousands of possible connections. In this work, we develop and evaluate a pipeline using large language models (LLMs) to generate quantitative priors for connectomics, based on their knowledge of neuroanatomy. We benchmark our approach against an evaluation set derived from a gold-standard tractography atlas, identifying prompting techniques to elicit accurate connectivity information from the LLMs. We further identify strategies for incorporating external knowledge sources into the pipeline, which can provide grounding for the LLM and improve accuracy. Finally, we demonstrate how the LLM-derived priors can augment existing tractography filtering approaches by identifying true-positive connections to retain during the filtering process. We show that these additional connections can improve the accuracy of a connectome-based model of pathology spread, which provides supporting evidence that the connections preserved by the LLM are valid.

Connectomics Informed by Large Language Models

TL;DR

Abstract

Connectomics Informed by Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)