A Scalable System for Visual Analysis of Ocean Data
Toshit Jain, Upkar Singh, Varun Singh, Vijay Kumar Boda, Ingrid Hotz, Sathish S. Vadhiyar, P. N. Vinayachandran, Vijay Natarajan
TL;DR
This paper introduces pyParaOcean, a scalable, interactive visualization system for 3D time-varying ocean data built as ParaView plugins to leverage parallelism and To accelerate exploratory analysis of complex features such as eddies and surface fronts. It integrates a Cinema database generator to overcome I/O bottlenecks and provides ocean-specific filters for seed-based fieldlines, isovolumes, depth profiles, front tracking, and eddy detection, all within a client-server ParaView architecture. Comprehensive scaling studies on ROMS and GLORYS datasets demonstrate near-linear performance and the effectiveness of depth-based partitioning and data redistribution, with practical gains in I/O via Cinema. A detailed Bay of Bengal case study showcases the system’s ability to analyze monsoon-driven currents, salinity transport, and submesoscale filaments, highlighting its potential for rapid, in-depth oceanographic analysis. Overall, pyParaOcean offers a flexible, extensible framework that can adapt to larger datasets and other geoscience domains, enabling researchers to efficiently visualize and track dynamic ocean processes.
Abstract
Oceanographers rely on visual analysis to interpret model simulations, identify events and phenomena, and track dynamic ocean processes. The ever increasing resolution and complexity of ocean data due to its dynamic nature and multivariate relationships demands a scalable and adaptable visualization tool for interactive exploration. We introduce pyParaOcean, a scalable and interactive visualization system designed specifically for ocean data analysis. pyParaOcean offers specialized modules for common oceanographic analysis tasks, including eddy identification and salinity movement tracking. These modules seamlessly integrate with ParaView as filters, ensuring a user-friendly and easy-to-use system while leveraging the parallelization capabilities of ParaView and a plethora of inbuilt general-purpose visualization functionalities. The creation of an auxiliary dataset stored as a Cinema database helps address I/O and network bandwidth bottlenecks while supporting the generation of quick overview visualizations. We present a case study on the Bay of Bengal (BoB) to demonstrate the utility of the system and scaling studies to evaluate the efficiency of the system.
