Towards Sustainable Web Agents: A Plea for Transparency and Dedicated Metrics for Energy Consumption
Lars Krupp, Daniel Geißler, Paul Lukowicz, Jakob Karolus
TL;DR
This paper addresses the sustainability of web agents that autonomously browse the web and evaluates energy cost and CO2 emissions by comparing MindAct and LASER on Mind2Web. It demonstrates that design philosophy strongly influences energy consumption, with MindAct using orders of magnitude less energy due to preprocessing with small models, while LASER relies on a heavy GPT-4 core. The authors advocate for dedicated energy metrics, token-level reporting, and modular, small-model pipelines to improve sustainability and comparability. They also highlight the need for transparency around model parameters to enable accurate environmental accounting at scale.
Abstract
Improvements in the area of large language models have shifted towards the construction of models capable of using external tools and interpreting their outputs. These so-called web agents have the ability to interact autonomously with the internet. This allows them to become powerful daily assistants handling time-consuming, repetitive tasks while supporting users in their daily activities. While web agent research is thriving, the sustainability aspect of this research direction remains largely unexplored. We provide an initial exploration of the energy and CO2 cost associated with web agents. Our results show how different philosophies in web agent creation can severely impact the associated expended energy. We highlight lacking transparency regarding the disclosure of model parameters and processes used for some web agents as a limiting factor when estimating energy consumption. As such, our work advocates a change in thinking when evaluating web agents, warranting dedicated metrics for energy consumption and sustainability.
