A Priori Generalizability Estimate for a CNN
Cito Balsells, Beatrice Riviere, David Fuentes
TL;DR
This work addresses estimating CNN generalizability by formulating a per-input matrix $A[x]$ representing the CNN and computing its truncated SVD via the adjoint $A^T[x]$, enabling a low-rank CNN approximation. It introduces the Adjoint CNN $\mathcal{G}_\theta$ and two projection-based metrics, the Right Projection Ratio $\mathrm{RPR}$ and Left Projection Ratio $\mathrm{LPR}$, to assess whether inputs or labels lie in the CNN's range or nullspace. The approach is validated on MNIST for classification and BraTS for brain tumor segmentation, showing that $\mathrm{RPR}$ correlates with performance on unlabeled data and can reveal class imbalance biases. These results suggest a practical diagnostic tool for sample-level performance estimation and potential use in active learning and bias detection, grounded in a theoretical link between adjoint operators and CNN structure.
Abstract
We formulate truncated singular value decompositions of entire convolutional neural networks. We demonstrate the computed left and right singular vectors are useful in identifying which images the convolutional neural network is likely to perform poorly on. To create this diagnostic tool, we define two metrics: the Right Projection Ratio and the Left Projection Ratio. The Right (Left) Projection Ratio evaluates the fidelity of the projection of an image (label) onto the computed right (left) singular vectors. We observe that both ratios are able to identify the presence of class imbalance for an image classification problem. Additionally, the Right Projection Ratio, which only requires unlabeled data, is found to be correlated to the model's performance when applied to image segmentation. This suggests the Right Projection Ratio could be a useful metric to estimate how likely the model is to perform well on a sample.
