Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models

Mohammad Saleh Vahdatpour; Yanqing Zhang

Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models

Mohammad Saleh Vahdatpour, Yanqing Zhang

Abstract

The rapid deployment of machine learning across platforms from milliwatt-class TinyML devices to large language models has made energy efficiency a primary constraint for sustainable AI. Across these scales, performance and energy are increasingly limited by data movement and memory-system behavior rather than by arithmetic throughput alone. This work reviews energy efficient software hardware codesign methods spanning edge inference and training to datacenter-scale LLM serving, covering accelerator architectures (e.g., ASIC/FPGA dataflows, processing-/compute-in-memory designs) and system-level techniques (e.g., partitioning, quantization, scheduling, and runtime adaptation). We distill common design levers and trade-offs, and highlight recurring gaps including limited cross-platform generalization, large and costly co-design search spaces, and inconsistent benchmarking across workloads and deployment settings. Finally, we outline a hierarchical decomposition perspective that maps optimization strategies to computational roles and supports incremental adaptation, offering practical guidance for building energy and carbon aware ML systems.

Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models

Abstract

Paper Structure (19 sections, 1 figure, 1 table)

This paper contains 19 sections, 1 figure, 1 table.

INTRODUCTION
BACKGROUND
The Memory Wall and Data Movement Bottleneck
Scale-Dependent Efficiency Challenges
SOFTWARE-HARDWARE CO-DESIGN
Co-Design Methodologies
Cross-Layer Optimization Techniques
ENERGY-EFFICIENT CO-DESIGN ACROSS THE ML SPECTRUM
TinyML: Ultra-Low-Power Edge Intelligence
Mid-Scale: Edge-Cloud Split Computing
Large-Scale: Transformers and LLMs
Cross-Scale Analysis and Challenges
Cross-Scale Comparison
Identified Gaps and Challenges
A HIERARCHICAL PERSPECTIVE ON GREEN CO-DESIGN
...and 4 more sections

Figures (1)

Figure 1: Timeline of representative software–hardware co-design approaches for energy-efficient machine learning (2016–2025), categorized by architectural paradigm.

Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models

Abstract

Energy Efficient Software Hardware CoDesign for Machine Learning: From TinyML to Large Language Models

Authors

Abstract

Table of Contents

Figures (1)