Multiclass ROC

Liang Wang; Luis Carvalho

Multiclass ROC

Liang Wang, Luis Carvalho

TL;DR

The work tackles multiclass model evaluation by extending ROC/AUC via a binomial matrix factorization of pairwise TPR/FPR counts. It constructs matrices $M^{tp}$ and $M^{fp}$ for all class pairs, and learns a rank-1 representation that yields a ROC-like plot and an AUC-like score while accommodating misclassification costs and bootstrap-based CIs. The method is invariant to class skew and provides a straightforward visualization of classifier performance across confidence thresholds. Empirical results on simulations and real data show competitive performance against pairwise-averaged AUC and demonstrate useful uncertainty quantification.

Abstract

Model evaluation is of crucial importance in modern statistics application. The construction of ROC and calculation of AUC have been widely used for binary classification evaluation. Recent research generalizing the ROC/AUC analysis to multi-class classification has problems in at least one of the four areas: 1. failure to provide sensible plots 2. being sensitive to imbalanced data 3. unable to specify mis-classification cost and 4. unable to provide evaluation uncertainty quantification. Borrowing from a binomial matrix factorization model, we provide an evaluation metric summarizing the pair-wise multi-class True Positive Rate (TPR) and False Positive Rate (FPR) with one-dimensional vector representation. Visualization on the representation vector measures the relative speed of increment between TPR and FPR across all the classes pairs, which in turns provides a ROC plot for the multi-class counterpart. An integration over those factorized vector provides a binary AUC-equivalent summary on the classifier performance. Mis-clasification weights specification and bootstrapped confidence interval are also enabled to accommodate a variety of of evaluation criteria. To support our findings, we conducted extensive simulation studies and compared our method to the pair-wise averaged AUC statistics on benchmark datasets.

Multiclass ROC

TL;DR

The work tackles multiclass model evaluation by extending ROC/AUC via a binomial matrix factorization of pairwise TPR/FPR counts. It constructs matrices

and

for all class pairs, and learns a rank-1 representation that yields a ROC-like plot and an AUC-like score while accommodating misclassification costs and bootstrap-based CIs. The method is invariant to class skew and provides a straightforward visualization of classifier performance across confidence thresholds. Empirical results on simulations and real data show competitive performance against pairwise-averaged AUC and demonstrate useful uncertainty quantification.

Abstract

Paper Structure (21 sections, 42 equations, 8 figures, 1 table, 1 algorithm)

This paper contains 21 sections, 42 equations, 8 figures, 1 table, 1 algorithm.

Introduction
Hard Classification Metric
Soft Classification Metric
Overall Contribution and Plans of the Paper
Backgrounds on Pair-wise AUC
AUC and Mann Whitney U Statistics
Generalizing AUC to Multiclass via Pair-wise Specification
A Joint Binomial Factorization Model
An Example of Binary Matrix Construction
Construction of a Pair-wise Matrix
The Factorization Model
Specifying Pair-wise Misclassification Cost
Connection to sROC
Visualization and Confidence Interval
Algorithm ROC-DMF
...and 6 more sections

Figures (8)

Figure 1: Pair-wise AUC partition
Figure 2: Hisogram of simulated class labels
Figure 3: Discriminative experiment
Figure 4: $\mathbf{\alpha}$ impact on class-skewness
Figure 5: Class-skewness experiment
...and 3 more figures

Multiclass ROC

TL;DR

Abstract

Multiclass ROC

Authors

TL;DR

Abstract

Table of Contents

Figures (8)