A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
Yangxinyu Xie, Bowen Jiang, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su, Camillo J. Taylor
TL;DR
This work tackles the challenge of context-specific, location-aware wildfire risk decision-support by introducing WildfireGPT, a retrieval-augmented, multi-agent LLM system. The architecture combines a user_profile_agent, planning_agent, and analyst_agent under a task_orchestrator to personalize analysis and recommendations through data fusion of hazard projections, observations, demographics, and literature, with interactive geospatial visualizations. A three-stage evaluation framework—data/literature retrieval comparison, personalization ablation, and domain-expert plus LLM-as-a-judge assessments—demonstrates that WildfireGPT outperforms baseline tools in data provision, location specificity, and data accuracy, while delivering high contextual relevance. The study also explores LLM-based automated evaluation (LLM-as-a-Judge) and scalable evaluation considerations, offering insights into deployment-time quality assurance and future directions for adaptive, domain-focused AI-assisted hazard decision-making. Overall, the system represents a pragmatic, scalable approach to integrating localized data and expert knowledge to support resilient adaptation to natural hazards across diverse stakeholder groups.
Abstract
Large language models (LLMs) are a transformational capability at the frontier of artificial intelligence and machine learning that can support decision-makers in addressing pressing societal challenges such as extreme natural hazard events. As generalized models, LLMs often struggle to provide context-specific information, particularly in areas requiring specialized knowledge. In this work we propose a retrieval-augmented generation (RAG)-based multi-agent LLM system to support analysis and decision-making in the context of natural hazards and extreme weather events. As a proof of concept, we present WildfireGPT, a specialized system focused on wildfire hazards. The architecture employs a user-centered, multi-agent design to deliver tailored risk insights across diverse stakeholder groups. By integrating natural hazard and extreme weather projection data, observational datasets, and scientific literature through an RAG framework, the system ensures both the accuracy and contextual relevance of the information it provides. Evaluation across ten expert-led case studies demonstrates that WildfireGPT significantly outperforms existing LLM-based solutions for decision support.
