Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability

Ashhadul Islam; Samir Brahim Belhaouari; Amine Bermak

TL;DR

The paper tackles the environmental impact of training large language models by introducing a training-aware pruning method that continually evaluates the importance of individual weights across epochs. By computing a weighted importance score $Imp_i$ and maintaining a weighted-average clone of parameters, the approach guides pruning with explicit thresholds such as $\text{Threshold} = \sigma(W_{abs}) \times \text{PruneRate}$, where $W_{abs} = |W|$. Empirical results on a scaled-down 10.7M-parameter Transformer and a 4.2B-parameter Phi-3-vision model show that moderate pruning can improve efficiency or reduce loss/MAE, while aggressive pruning drastically degrades performance. The findings advocate for sustainable AI development through training-aware sparsity, balancing compression with accuracy in both language and multimodal settings.
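The threshold rule quoted above can be illustrated with a minimal sketch. Only the formula (the standard deviation of the absolute weights, multiplied by the prune rate) comes from the summary. The function name `prune_by_std_threshold`, the use of the population standard deviation, and the choice to zero every weight whose magnitude falls below the threshold are assumptions made for illustration, not the authors' implementation.

```python
import statistics


def prune_by_std_threshold(weights, prune_rate):
    """Zero weights whose magnitude is below sigma(|W|) * prune_rate.

    Hypothetical helper: the masking rule and the population standard
    deviation are assumptions, not details confirmed by the paper.
    """
    # W_abs = |W|
    w_abs = [abs(w) for w in weights]
    # Threshold = sigma(W_abs) * PruneRate
    threshold = statistics.pstdev(w_abs) * prune_rate
    # Assumption: a weight is kept only if its magnitude reaches the threshold.
    pruned = [w if a >= threshold else 0.0 for w, a in zip(weights, w_abs)]
    return pruned, threshold


# Toy weight vector, chosen only for illustration.
weights = [0.5, -0.01, 0.2, -0.8, 0.03, 0.0]
pruned, threshold = prune_by_std_threshold(weights, prune_rate=0.5)

# Fraction of weights that are zero after pruning.
sparsity = sum(1 for w in pruned if w == 0.0) / len(pruned)
```

Because the threshold scales with the spread of the weight magnitudes, a larger prune rate removes more weights from the same distribution, which is consistent with the summary's contrast between moderate and aggressive pruning.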

Abstract

The exponential growth of large language models (LLMs) like ChatGPT has revolutionized artificial intelligence, offering unprecedented capabilities in natural language processing. However, the extensive computational resources required for training these models have significant environmental implications, including high carbon emissions, energy consumption, and water usage. This research presents a novel approach to LLM pruning, focusing on the systematic evaluation of individual weight importance throughout the training process. By monitoring parameter evolution over time, we propose a method that effectively reduces model size without compromising performance. Extensive experiments with both a scaled-down LLM and a large multimodal model reveal that moderate pruning enhances efficiency and reduces loss, while excessive pruning drastically deteriorates model performance. These findings highlight the critical need for optimized AI models to ensure sustainable development, balancing technological advancement with environmental responsibility.
