SuperCode: Sustainability PER AI-driven CO-DEsign
P. Chris Broekema, Rob V. van Nieuwpoort
TL;DR
The paper tackles the environmental impact of data-intensive science by proposing SuperCode, an AI-driven co-design framework that uses LLMs to generate energy-efficient code for emerging hardware. It defines sustainability as a KPI and introduces $M_S = \frac{TVO}{TCO}$ to balance scientific value against environmental cost. Two radio astronomy use-cases (terrestrial LOFAR/SKA and space-based OLFAR/ALO/DSL) will be used to validate the approach, with energy efficiency as a primary objective. The methodology combines AI-assisted translation and porting across architectures, LoRA fine-tuning, retrieval-augmented generation, and reinforcement learning driven by measured energy, plus human-in-the-loop to ensure correctness. If successful, the project could accelerate sustainable data-intensive science and be applicable to climate research, remote sensing, and earth observation, with open dissemination.
Abstract
Currently, data-intensive scientific applications require vast amounts of compute resources to deliver world-leading science. The climate emergency has made it clear that unlimited use of resources (e.g., energy) for scientific discovery is no longer acceptable. Future computing hardware promises to be much more energy efficient, but without better optimized software this cannot reach its full potential. In this vision paper, we propose a generic AI-driven co-design methodology, using specialized Large Language Models (like ChatGPT), to effectively generate efficient code for emerging computing hardware. We describe how we will validate our methodology with two radio astronomy applications, with sustainability as the key performance indicator. This paper is a modified version of our accepted SuperCode project proposal. We present it here in this form to introduce the vision behind this project and to disseminate the work in the spirit of Open Science and transparency. An additional aim is to collect feedback, invite potential collaboration partners and use-cases to join the project.
