Table of Contents
Fetching ...

Top-$k$ Classification and Cardinality-Aware Prediction

Anqi Mao, Mehryar Mohri, Yutao Zhong

TL;DR

This work demonstrates that several prevalent surrogate loss functions in multi-class classification, such as comp-sum and constrained losses, are supported by $H-consistency bounds with respect to the top-$k$ loss, and introduces cardinality-aware loss functions through instance-dependent cost-sensitive learning.

Abstract

We present a detailed study of top-$k$ classification, the task of predicting the $k$ most probable classes for an input, extending beyond single-class prediction. We demonstrate that several prevalent surrogate loss functions in multi-class classification, such as comp-sum and constrained losses, are supported by $H$-consistency bounds with respect to the top-$k$ loss. These bounds guarantee consistency in relation to the hypothesis set $H$, providing stronger guarantees than Bayes-consistency due to their non-asymptotic and hypothesis-set specific nature. To address the trade-off between accuracy and cardinality $k$, we further introduce cardinality-aware loss functions through instance-dependent cost-sensitive learning. For these functions, we derive cost-sensitive comp-sum and constrained surrogate losses, establishing their $H$-consistency bounds and Bayes-consistency. Minimizing these losses leads to new cardinality-aware algorithms for top-$k$ classification. We report the results of extensive experiments on CIFAR-100, ImageNet, CIFAR-10, and SVHN datasets demonstrating the effectiveness and benefit of these algorithms.

Top-$k$ Classification and Cardinality-Aware Prediction

TL;DR

This work demonstrates that several prevalent surrogate loss functions in multi-class classification, such as comp-sum and constrained losses, are supported by k$ loss, and introduces cardinality-aware loss functions through instance-dependent cost-sensitive learning.

Abstract

We present a detailed study of top- classification, the task of predicting the most probable classes for an input, extending beyond single-class prediction. We demonstrate that several prevalent surrogate loss functions in multi-class classification, such as comp-sum and constrained losses, are supported by -consistency bounds with respect to the top- loss. These bounds guarantee consistency in relation to the hypothesis set , providing stronger guarantees than Bayes-consistency due to their non-asymptotic and hypothesis-set specific nature. To address the trade-off between accuracy and cardinality , we further introduce cardinality-aware loss functions through instance-dependent cost-sensitive learning. For these functions, we derive cost-sensitive comp-sum and constrained surrogate losses, establishing their -consistency bounds and Bayes-consistency. Minimizing these losses leads to new cardinality-aware algorithms for top- classification. We report the results of extensive experiments on CIFAR-100, ImageNet, CIFAR-10, and SVHN datasets demonstrating the effectiveness and benefit of these algorithms.
Paper Structure