Table of Contents
Fetching ...

A New Perspective on ADHD Research: Knowledge Graph Construction with LLMs and Network Based Insights

Hakan T. Otal, Stephen V. Faraone, M. Abdullah Canbaz

TL;DR

This work tackles ADHD's heterogeneous etiology by building a multimodal knowledge graph that fuses literature, clinical data, and expert-curated content via LLM-driven semantic extraction. The authors construct and refine a semantically rich graph using contextual proximity and redundancy reduction (via embeddings and DBSCAN), yielding a network of $N=2347$ nodes and $E=8655$ edges. Network analysis reveals a sparse but modular structure with key core concepts up to $k=16$, identifying central health conditions and cognitive outcomes, and distinct communities around comorbidities and neuroimaging/genetics. The resulting Graph-RAG framework supports accurate, context-aware information retrieval for ADHD research and clinical practice, while highlighting areas for data expansion and methodological refinement to enhance interpretability and decision support.

Abstract

Attention-Deficit/Hyperactivity Disorder (ADHD) is a challenging disorder to study due to its complex symptomatology and diverse contributing factors. To explore how we can gain deeper insights on this topic, we performed a network analysis on a comprehensive knowledge graph (KG) of ADHD, constructed by integrating scientific literature and clinical data with the help of cutting-edge large language models. The analysis, including k-core techniques, identified critical nodes and relationships that are central to understanding the disorder. Building on these findings, we curated a knowledge graph that is usable in a context-aware chatbot (Graph-RAG) with Large Language Models (LLMs), enabling accurate and informed interactions. Our knowledge graph not only advances the understanding of ADHD but also provides a powerful tool for research and clinical applications.

A New Perspective on ADHD Research: Knowledge Graph Construction with LLMs and Network Based Insights

TL;DR

This work tackles ADHD's heterogeneous etiology by building a multimodal knowledge graph that fuses literature, clinical data, and expert-curated content via LLM-driven semantic extraction. The authors construct and refine a semantically rich graph using contextual proximity and redundancy reduction (via embeddings and DBSCAN), yielding a network of nodes and edges. Network analysis reveals a sparse but modular structure with key core concepts up to , identifying central health conditions and cognitive outcomes, and distinct communities around comorbidities and neuroimaging/genetics. The resulting Graph-RAG framework supports accurate, context-aware information retrieval for ADHD research and clinical practice, while highlighting areas for data expansion and methodological refinement to enhance interpretability and decision support.

Abstract

Attention-Deficit/Hyperactivity Disorder (ADHD) is a challenging disorder to study due to its complex symptomatology and diverse contributing factors. To explore how we can gain deeper insights on this topic, we performed a network analysis on a comprehensive knowledge graph (KG) of ADHD, constructed by integrating scientific literature and clinical data with the help of cutting-edge large language models. The analysis, including k-core techniques, identified critical nodes and relationships that are central to understanding the disorder. Building on these findings, we curated a knowledge graph that is usable in a context-aware chatbot (Graph-RAG) with Large Language Models (LLMs), enabling accurate and informed interactions. Our knowledge graph not only advances the understanding of ADHD but also provides a powerful tool for research and clinical applications.
Paper Structure (15 sections, 2 figures, 2 tables)

This paper contains 15 sections, 2 figures, 2 tables.

Figures (2)

  • Figure 1: ADHD Knowledge Graph Visualization (interactive version is at https://github.com/AI-in-Complex-Systems-Lab/ADHD-KnowledgeGraph)
  • Figure 2: k-core Network (K=16) on the left, and Nodes in the Maximum k-core (16 Cores) on the right.