Improving Deep Regression with Tightness
Shihao Zhang, Yuguang Yan, Angela Yao
TL;DR
This work addresses why preserving target ordinality improves deep regression by linking it to representation tightness measured by $H(Z|Y)$. It reveals that standard regressors poorly tighten representations due to gradient directions, and introduces two strategies—Multiple Target (MT) learning and a Regression Optimal Transport Regularizer (ROT-Reg)—to globally and locally tighten the feature space. MT adds extra target dimensions to compress the regression solution space, while ROT-Reg aligns local transport plans between target and representation spaces using self-entropic optimal transport. Experiments on age estimation, depth estimation, and coordinate prediction show that MT and ROT-Reg improve performance and better preserve ordinality, with their combination delivering the strongest gains and faster convergence, while incurring minimal computational overhead. This approach provides a principled path to improved generalization in deep regression by enforcing both global and local structure in learned representations.
Abstract
For deep regression, preserving the ordinality of the targets with respect to the feature representation improves performance across various tasks. However, a theoretical explanation for the benefits of ordinality is still lacking. This work reveals that preserving ordinality reduces the conditional entropy $H(Z|Y)$ of representation $Z$ conditional on the target $Y$. However, our findings reveal that typical regression losses do little to reduce $H(Z|Y)$, even though it is vital for generalization performance. With this motivation, we introduce an optimal transport-based regularizer to preserve the similarity relationships of targets in the feature space to reduce $H(Z|Y)$. Additionally, we introduce a simple yet efficient strategy of duplicating the regressor targets, also with the aim of reducing $H(Z|Y)$. Experiments on three real-world regression tasks verify the effectiveness of our strategies to improve deep regression. Code: https://github.com/needylove/Regression_tightness.
