Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI

Dustin Wright; Christian Igel; Gabrielle Samuel; Raghavendra Selvan

Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI

Dustin Wright, Christian Igel, Gabrielle Samuel, Raghavendra Selvan

TL;DR

The paper argues that optimizing ML systems for efficiency alone is insufficient to achieve environmental sustainability due to complex interactions across compute, lifecycle decisions, and platform hardware. It introduces three discrepancies: (1) compute efficiency $\neq$ energy efficiency $\neq$ carbon efficiency, (2) efficiency effects across the model life cycle can be counterintuitive and lead to higher emissions, and (3) platform-level impacts (embodied emissions, water use, e-waste) limit the benefits of efficiency improvements. Through analysis of the EC-NAS benchmark and literature on carbon intensity, the authors demonstrate how reductions in one metric do not reliably translate to lower emissions in practice and highlight rebound effects. They propose systems thinking as a holistic framework to account for interdependencies among people, processes, and infrastructure, and to identify leverage points beyond efficiency to sustainably integrate ML as a technology with broader societal goals.

Abstract

Artificial intelligence (AI) is currently spearheaded by machine learning (ML) methods such as deep learning which have accelerated progress on many tasks thought to be out of reach of AI. These recent ML methods are often compute hungry, energy intensive, and result in significant green house gas emissions, a known driver of anthropogenic climate change. Additionally, the platforms on which ML systems run are associated with environmental impacts that go beyond the energy consumption driven carbon emissions. The primary solution lionized by both industry and the ML community to improve the environmental sustainability of ML is to increase the compute and energy efficiency with which ML systems operate. In this perspective, we argue that it is time to look beyond efficiency in order to make ML more environmentally sustainable. We present three high-level discrepancies between the many variables that influence the efficiency of ML and the environmental sustainability of ML. Firstly, we discuss how compute efficiency does not imply energy efficiency or carbon efficiency. Second, we present the unexpected effects of efficiency on operational emissions throughout the ML model life cycle. And, finally, we explore the broader environmental impacts that are not accounted by efficiency. These discrepancies show as to why efficiency alone is not enough to remedy the adverse environmental impacts of ML. Instead, we argue for systems thinking as the next step towards holistically improving the environmental sustainability of ML.

Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI

TL;DR

energy efficiency

carbon efficiency, (2) efficiency effects across the model life cycle can be counterintuitive and lead to higher emissions, and (3) platform-level impacts (embodied emissions, water use, e-waste) limit the benefits of efficiency improvements. Through analysis of the EC-NAS benchmark and literature on carbon intensity, the authors demonstrate how reductions in one metric do not reliably translate to lower emissions in practice and highlight rebound effects. They propose systems thinking as a holistic framework to account for interdependencies among people, processes, and infrastructure, and to identify leverage points beyond efficiency to sustainably integrate ML as a technology with broader societal goals.

Abstract

Paper Structure (6 sections, 4 figures)

This paper contains 6 sections, 4 figures.

Introduction
Discrepancy 1: Compute Efficiency $\neq$ Energy Efficiency $\neq$ Carbon Efficiency
Discrepancy 2: Efficiency Across The Model Life Cycle
Discrepancy 3: Efficiency and Platforms
Beyond Efficiency: Systems Thinking
Conclusion

Figures (4)

Figure 1: (a-c) Three plots demonstrating the discrepancy between different metrics of compute and energy, highlighting that changing one may not change another in kind. Note that each dot marker is a CNN model from the EC-NAS dataset bakhtiarifard2022energy. (d) Hourly carbon intensity in terms of gCO2eq/kWh over the time period 2019-2023 for three different regions: Denmark, London, and Edinburgh. The boxes show the median and interquartile range of carbon intensities; points outside the whiskers indicate outliers. Each region has vastly different distributions of carbon intensity, and all three are characterized by high variance with several peaks.
Figure 2: Monthly rolling average carbon intensity (thick blue line) and hourly carbon intensity (thin blue line) for Denmark between 2019 and 2023. To compare with estimates, we show the actual yearly average and yearly average carbon intensities for the same time period in Denmark. Subplots demonstrate how the selection of start time and which carbon intensity measure to use can result in vastly different observed operational emissions for the 423K models in the EC-NAS benchmark (blue points are using real-time carbon intensity, black points are using ElectricityMaps average intensity). Emissions are calculated by averaging the total energy consumption of each model over the selected time period, multiplying the energy consumption by the instantaneous carbon intensity in 5 minute intervals.
Figure 3: The Deep Learning model life cycle. The model development stage consists of data curation, model selection, and model training, while the deployment stage consists of the use of a model for inference in downstream applications and potentially retraining a model on new data.
Figure 4: A sample of different ways to improve efficiency and whether or not it is targeted to development or deployment in a typical scenario (check means yes, dash means no, check within parentheses means it depends on the situation). Examples include data parsimony DBLP:journals/csur/WangYKN20DBLP:journals/corr/abs-2301-07014, model selection DBLP:journals/csur/RenXCHLCW21DBLP:conf/ijcai/BenmezianeMONWW21, model compression DBLP:journals/pieee/DengLHSX20DBLP:conf/icml/GuptaAGN15DBLP:journals/corr/abs-2103-13630DBLP:journals/ijcv/GouYMT21DBLP:journals/ijon/LiangGWSZ21wangexploringDBLP:journals/corr/abs-2210-06640, and hardware configuration DBLP:conf/isca/JouppiYPPABBBBB17DBLP:conf/cvpr/YangCS17DBLP:conf/sc/AllenG16yarally2023uncoveringDBLP:conf/bdcloud/LiCBZ16 for energy efficiency, and job scheduling DBLP:conf/fat/DodgePCOSSLSDB22 for carbon efficiency.

Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI

TL;DR

Abstract

Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI

Authors

TL;DR

Abstract

Table of Contents

Figures (4)