OceanAI: A Conversational Platform for Accurate, Transparent, Near-Real-Time Oceanographic Insights
Bowen Chen, Jayesh Gajbhar, Gregory Dusek, Rob Redmon, Patrick Hogan, Paul Liu, DelWayne Bohnenstiehl, Dongkuan Xu, Ruoying He
TL;DR
OceanAI addresses hallucination and data-access barriers in ocean science by grounding conversational queries in real-time NOAA datasets through a function-calling, multi-agent architecture. It couples a coordinating LLM with modular data-retrieval, computation, and visualization components that access CO-OPS, CORA, and CRW datasets, producing verifiable outputs with provenance. Empirical comparisons show OceanAI outperforms baseline AI chat interfaces on structured NOAA data tasks, due to robust grounding and transparent metadata. This framework enables near-real-time, reproducible ocean insights with potential for operational decision support in hazards, ecology, and water-quality monitoring.
Abstract
Artificial intelligence is transforming the sciences, yet general conversational AI systems often generate unverified "hallucinations" undermining scientific rigor. We present OceanAI, a conversational platform that integrates the natural-language fluency of open-source large language models (LLMs) with real-time, parameterized access to authoritative oceanographic data streams hosted by the National Oceanic and Atmospheric Administration (NOAA). Each query such as "What was Boston Harbor's highest water level in 2024?" triggers real-time API calls that identify, parse, and synthesize relevant datasets into reproducible natural-language responses and data visualizations. In a blind comparison with three widely used AI chat-interface products, only OceanAI produced NOAA-sourced values with original data references; others either declined to answer or provided unsupported results. Designed for extensibility, OceanAI connects to multiple NOAA data products and variables, supporting applications in marine hazard forecasting, ecosystem assessment, and water-quality monitoring. By grounding outputs and verifiable observations, OceanAI advances transparency, reproducibility, and trust, offering a scalable framework for AI-enabled decision support within the oceans. A public demonstration is available at https://oceanai.ai4ocean.xyz.
