ARIM-mdx Data System: Towards a Nationwide Data Platform for Materials Science
Masatoshi Hanai, Ryo Ishikawa, Mitsuaki Kawamura, Masato Ohnishi, Norio Takenaka, Kou Nakamura, Daiju Matsumura, Seiji Fujikawa, Hiroki Sakamoto, Yukinori Ochiai, Tetsuo Okane, Shin-Ichiro Kuroki, Atsuo Yamada, Toyotaro Suzumura, Junichiro Shiomi, Kenjiro Taura, Yoshio Mita, Naoya Shibata, Yuichi Ikuhara
TL;DR
The paper addresses the lack of scalable nationwide data platforms for materials science that integrate experimental and computational data. It presents ARIM-mdx, a hybrid architecture combining petascale storage with cloud compute, direct IoT-based data transfer from standalone facilities, and a high-speed national network (SINET6) to support interactive analysis and HPC workflows. Key contributions include a complete system design, containerized compute via Jupyter and MateriApps LIVE!, cloud–storage integration, IoT-based data transfer, and an evaluation reporting low latency and high throughput over a year of operation with hundreds of users. The work shows the feasibility and potential impact of nationwide, cross-institutional data infrastructure for accelerating materials research.
Abstract
In modern materials science, effective, high-volume data management across leading-edge experimental facilities and world-class supercomputers is indispensable for cutting-edge research. However, existing integrated systems that handle data from these resources have focused primarily on smaller-scale cross-institutional or single-domain operations. As a result, they often lack the scalability, efficiency, agility, and interdisciplinarity needed to handle substantial volumes of data from diverse researchers. In this paper, we introduce the ARIM-mdx data system, which aims to serve as a nationwide data platform for materials science in Japan. Currently in its trial phase, the platform involves 11 universities and institutes across Japan and is used by over 800 researchers from around 140 organizations in academia and industry, with its reach intended to expand gradually. As a pioneering nationwide data platform, the ARIM-mdx data system has the potential to contribute to the creation of new research communities and to accelerate innovation.
