What Is The Performance Ceiling of My Classifier? Utilizing Category-Wise Influence Functions for Pareto Frontier Analysis

Shahriar Kabir Nahin; Wenxiao Xiao; Joshua Liu; Anshuman Chhabra; Hongfu Liu

TL;DR

The paper tackles the problem of identifying a classifier's performance ceiling from a category-aware perspective, beyond overall accuracy. It introduces category-wise influence functions and an influence vector $P(z) \in \mathbb{R}^K$, enabling Pareto frontier analysis across $K$ classes. A Pareto-LP-GA framework then reweights training samples via a linear program guided by $P(z)$ to achieve Pareto improvements, with modes for Direct Improvement and Course Correction. The authors validate the approach on synthetic data and real benchmarks (CIFAR-10, STL-10, Emotion, AG_News), showing substantial per-class gains with limited degradation in other classes, thereby providing a practical data-centric tool for per-class optimization and fairer performance tradeoffs.
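
The summary above does not spell out how the influence vector is computed. As a hedged sketch only, assuming the classical up-weighting influence function is applied separately to each class's validation loss (the symbols $V_k$, $\hat\theta$, $H_{\hat\theta}$, $\ell$, and $D$ are illustrative assumptions, not notation confirmed by the paper), one plausible form is:

$$
P(z) = \big(P_1(z), \dots, P_K(z)\big), \qquad
P_k(z) = -\,\nabla_\theta \mathcal{L}(V_k; \hat\theta)^{\top} \, H_{\hat\theta}^{-1} \, \nabla_\theta \ell(z; \hat\theta),
$$

where $V_k$ is the set of validation samples of class $k$, $\mathcal{L}(V_k; \hat\theta)$ is their average loss at the trained parameters $\hat\theta$, and $H_{\hat\theta}$ is the Hessian of the training loss. Under this sign convention, $P_k(z) < 0$ means that up-weighting $z$ is estimated to lower the class-$k$ loss. To first order, a reweighting $\epsilon$ of the training set $D$ would then be predicted to be a Pareto improvement when $\sum_{z \in D} \epsilon_z P_k(z) \le 0$ for every class $k$, with strict inequality for at least one class.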

Abstract

Data-centric learning seeks to improve model performance from the perspective of data quality, and has been drawing increasing attention in the machine learning community. Among its key tools, influence functions provide a powerful framework to quantify the impact of individual training samples on model predictions, enabling practitioners to identify detrimental samples and retrain models on a cleaner dataset for improved performance. However, most existing work focuses on the question: "what data benefits the learning model?" In this paper, we take a step further and investigate a more fundamental question: "what is the performance ceiling of the learning model?" Unlike prior studies that primarily measure improvement through overall accuracy, we emphasize category-wise accuracy and aim for Pareto improvements, ensuring that every class benefits, rather than allowing tradeoffs where some classes improve at the expense of others. To address this challenge, we propose category-wise influence functions and introduce an influence vector that quantifies the impact of each training sample across all categories. Leveraging these influence vectors, we develop a principled criterion to determine whether a model can still be improved, and further design a linear programming-based sample reweighting framework to achieve Pareto performance improvements. Through extensive experiments on synthetic datasets as well as vision and text benchmarks, we demonstrate the effectiveness of our approach in estimating and achieving a model's performance improvement across multiple categories of interest.
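
To make the idea of linear programming-based reweighting concrete, here is a minimal sketch. It is not the authors' formulation: it assumes a max-min linear program, a precomputed influence matrix `P` in which `P[i, k] > 0` means that up-weighting training sample `i` is estimated to help class `k`, and a hypothetical function name `pareto_reweight` with an illustrative `max_shift` bound on how far each weight may move from 1.

```python
import numpy as np
from scipy.optimize import linprog


def pareto_reweight(P, max_shift=1.0, tol=1e-8):
    """Max-min LP over sample-weight shifts (illustrative assumption).

    P[i, k]: estimated benefit to class k from up-weighting sample i
             (sign convention assumed here: positive = helpful).
    Returns (weights, worst_class_gain, can_improve).
    """
    P = np.asarray(P, dtype=float)
    n, K = P.shape

    # Variables: x = [delta_1, ..., delta_n, t]; maximize t == minimize -t.
    c = np.zeros(n + 1)
    c[-1] = -1.0

    # For every class k: sum_i P[i, k] * delta_i >= t
    # rewritten as      -P[:, k] @ delta + t <= 0.
    A_ub = np.hstack([-P.T, np.ones((K, 1))])
    b_ub = np.zeros(K)

    # Each weight shift is bounded; t is free.
    bounds = [(-max_shift, max_shift)] * n + [(None, None)]

    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    if not res.success:
        raise RuntimeError(res.message)

    delta, t = res.x[:n], res.x[-1]
    # t > 0 means every class is predicted to gain (first-order estimate).
    return 1.0 + delta, float(t), bool(t > tol)


# Toy example: the third sample helps both classes, the first two trade off.
weights, gain, can_improve = pareto_reweight([[1, -1], [-1, 1], [1, 1]])
```

In this sketch, an optimal worst-class gain of zero would indicate that no bounded reweighting is predicted to help all classes at once, which is one simple way to read "the model sits on the estimated Pareto frontier". The prediction is only a first-order estimate, so any reweighting found this way would still need to be checked by retraining.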
