General Information Metrics for Improving AI Model Training Efficiency
Jianfeng Xu, Congcong Liu, Xiaoying Tan, Xiaojie Zhu, Anpeng Wu, Huan Wan, Weijun Kong, Chun Li, Hu Xu, Kun Kuang, Fei Wu
TL;DR
This work introduces General Information Metrics Evaluation (GIME), a pre-training data selection framework grounded in Objective Information Theory that uses 11 metrics to quantify training data quality. By thresholding high-sensitivity metrics before model training, GIME probabilistically yields datasets that preserve performance while dramatically reducing data size, training time, and energy use. Across CTR Prediction, Civil Case Prediction, Weather Forecasting, and a Judicial AI program, GIME demonstrates substantial efficiency gains with minimal performance degradation, and outperforms baseline full-data and random-sampling approaches, as well as an active-learning baseline. The proposed approach offers a domain-agnostic, theory-backed path toward sustainable and scalable AI development, with clear practical impact in large-scale and resource-constrained settings.
Abstract
To address the growing size of AI model training data and the lack of a universal data selection methodology-factors that significantly drive up training costs -- this paper presents the General Information Metrics Evaluation (GIME) method. GIME leverages general information metrics from Objective Information Theory (OIT), including volume, delay, scope, granularity, variety, duration, sampling rate, aggregation, coverage, distortion, and mismatch to optimize dataset selection for training purposes. Comprehensive experiments conducted across diverse domains, such as CTR Prediction, Civil Case Prediction, and Weather Forecasting, demonstrate that GIME effectively preserves model performance while substantially reducing both training time and costs. Additionally, applying GIME within the Judicial AI Program led to a remarkable 39.56% reduction in total model training expenses, underscoring its potential to support efficient and sustainable AI development.
