Table of Contents
Fetching ...

HarmonE: A Self-Adaptive Approach to Architecting Sustainable MLOps

Hiya Bhatt, Shaunak Biswas, Srinivasan Rakhunathan, Karthik Vaidhyanathan

TL;DR

HarmonE presents a self-adaptive architectural approach to sustainable MLOps by embedding the $MAPE-K$ loop into MLS pipelines to monitor and adapt to data drift, model degradation, and energy/cost pressures. It defines a design-time Decision Map and a Knowledge Base to guide runtime strategies such as selective retraining, dynamic model switching, and versioned artifact reuse, all while evolving adaptation boundaries through a control-theoretic mechanism. The approach is validated on a Digital Twin of an ITS for traffic-flow forecasting using PeMS data, showing substantial energy savings (up to $54.5\%$) while preserving about $95\%$ of the baseline accuracy ($R^2$), and with low planning overhead. These results indicate that sustainability-aware MLOps can balance long-term environmental and economic goals with short-term predictive performance, with potential applicability across domains including intelligent mobility and beyond.

Abstract

Machine Learning Enabled Systems (MLS) are becoming integral to real-world applications, but ensuring their sustainable performance over time remains a significant challenge. These systems operate in dynamic environments and face runtime uncertainties like data drift and model degradation, which affect the sustainability of MLS across multiple dimensions: technical, economical, environmental, and social. While Machine Learning Operations (MLOps) addresses the technical dimension by streamlining the ML model lifecycle, it overlooks other dimensions. Furthermore, some traditional practices, such as frequent retraining, incur substantial energy and computational overhead, thus amplifying sustainability concerns. To address them, we introduce HarmonE, an architectural approach that enables self-adaptive capabilities in MLOps pipelines using the MAPE-K loop. HarmonE allows system architects to define explicit sustainability goals and adaptation thresholds at design time, and performs runtime monitoring of key metrics, such as prediction accuracy, energy consumption, and data distribution shifts, to trigger appropriate adaptation strategies. We validate our approach using a Digital Twin (DT) of an Intelligent Transportation System (ITS), focusing on traffic flow prediction as our primary use case. The DT employs time series ML models to simulate real-time traffic and assess various flow scenarios. Our results show that HarmonE adapts effectively to evolving conditions while maintaining accuracy and meeting sustainability goals.

HarmonE: A Self-Adaptive Approach to Architecting Sustainable MLOps

TL;DR

HarmonE presents a self-adaptive architectural approach to sustainable MLOps by embedding the loop into MLS pipelines to monitor and adapt to data drift, model degradation, and energy/cost pressures. It defines a design-time Decision Map and a Knowledge Base to guide runtime strategies such as selective retraining, dynamic model switching, and versioned artifact reuse, all while evolving adaptation boundaries through a control-theoretic mechanism. The approach is validated on a Digital Twin of an ITS for traffic-flow forecasting using PeMS data, showing substantial energy savings (up to ) while preserving about of the baseline accuracy (), and with low planning overhead. These results indicate that sustainability-aware MLOps can balance long-term environmental and economic goals with short-term predictive performance, with potential applicability across domains including intelligent mobility and beyond.

Abstract

Machine Learning Enabled Systems (MLS) are becoming integral to real-world applications, but ensuring their sustainable performance over time remains a significant challenge. These systems operate in dynamic environments and face runtime uncertainties like data drift and model degradation, which affect the sustainability of MLS across multiple dimensions: technical, economical, environmental, and social. While Machine Learning Operations (MLOps) addresses the technical dimension by streamlining the ML model lifecycle, it overlooks other dimensions. Furthermore, some traditional practices, such as frequent retraining, incur substantial energy and computational overhead, thus amplifying sustainability concerns. To address them, we introduce HarmonE, an architectural approach that enables self-adaptive capabilities in MLOps pipelines using the MAPE-K loop. HarmonE allows system architects to define explicit sustainability goals and adaptation thresholds at design time, and performs runtime monitoring of key metrics, such as prediction accuracy, energy consumption, and data distribution shifts, to trigger appropriate adaptation strategies. We validate our approach using a Digital Twin (DT) of an Intelligent Transportation System (ITS), focusing on traffic flow prediction as our primary use case. The DT employs time series ML models to simulate real-time traffic and assess various flow scenarios. Our results show that HarmonE adapts effectively to evolving conditions while maintaining accuracy and meeting sustainability goals.

Paper Structure

This paper contains 17 sections, 3 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: Adaptation explained as a sustainability goal
  • Figure 2: A decision map for a traffic flow prediction pipeline
  • Figure 3: HarmonE Architecture
  • Figure 4: Adaptation strategies activated by $\mathcal{H}armon{E}$ during execution.
  • Figure 5: Average energy consumption (mJ) across five runs for all nine approaches.
  • ...and 2 more figures