Derivative-based regularization for regression
Enrico Lopedoto, Maksim Shekhunov, Vitaly Aksenov, Kizito Salako, Tillman Weyde
TL;DR
This paper introduces DLoss, a derivative-based regularizer for regression that aligns a model's directional derivatives with data-derived derivatives estimated from training tuples. By combining a derivative-matching term with the standard mean squared error, and using either nearest-neighbour or random tuple selection to compute data derivatives, the method aims to capture the target function's differential structure. Empirical results on real and synthetic datasets show that DLoss, especially with nearest-neighbour tuples, improves validation MSE on average and often ranks first among considered regularizers, albeit with higher computational cost. The approach provides a data-driven regularization mechanism that can enhance generalization without altering model architecture, and it opens avenues for extension to more models and to classification tasks.
Abstract
In this work, we introduce a novel approach to regularization in multivariable regression problems. Our regularizer, called DLoss, penalises differences between the model's derivatives and derivatives of the data generating function as estimated from the training data. We call these estimated derivatives data derivatives. The goal of our method is to align the model to the data, not only in terms of target values but also in terms of the derivatives involved. To estimate data derivatives, we select (from the training data) 2-tuples of input-value pairs, using either nearest neighbour or random, selection. On synthetic and real datasets, we evaluate the effectiveness of adding DLoss, with different weights, to the standard mean squared error loss. The experimental results show that with DLoss (using nearest neighbour selection) we obtain, on average, the best rank with respect to MSE on validation data sets, compared to no regularization, L2 regularization, and Dropout.
