Designing a Robust, Bounded, and Smooth Loss Function for Improved Supervised Learning
Soumi Mahato, Lineesh M. C
TL;DR
The paper introduces RoBoS-NN, a robust, bounded, and smooth regression loss obtained by extending the RoBoSS classifier loss to continuous outputs. The loss is defined as $L_{RoBoS-NN}(u) = \lambda \{ 1 - ( a \sqrt{u^{2}+\epsilon} - a \sqrt{\epsilon} ) \exp(- ( a \sqrt{u^{2}+\epsilon} - a \sqrt{\epsilon} )) \}$ with $u = |y - \hat{y}|$, and the neural-network objective is $\min_{\theta} \frac{\lambda}{2} \|\theta\|^{2} + \frac{1}{n} \sum_{k=1}^{n} L_{RoBoS-NN}(y_k - f(x_k; \theta))$. A theoretical generalization bound based on Rademacher complexity is derived, showing the bound scales with the Lipschitz-like constant $a/e$ and network norms. Empirically, RoBoS-NN is evaluated on four real-world time-series datasets with injected outliers and outperforms standard losses (MAE, MSE, Huber, Logcosh) in robustness and accuracy, with hyperparameters tuned via HyperOpt/TPE. The results suggest RoBoS-NN is ready to be integrated into broader architectures (e.g., CNNs, RNNs) to enhance robustness in supervised learning and forecasting tasks.
Abstract
The loss function is crucial to machine learning, especially in supervised learning frameworks. It is a fundamental component that controls the behavior and general efficacy of learning algorithms. However, despite their widespread use, traditional loss functions have significant drawbacks when dealing with high-dimensional and outlier-sensitive datasets, which frequently results in reduced performance and slower convergence during training. In this work, we develop a robust, bounded, and smooth (RoBoS-NN) loss function to resolve the aforementioned hindrances. The generalization ability of the loss function has also been theoretically analyzed to rigorously justify its robustness. Moreover, we implement RoboS-NN loss in the framework of a neural network (NN) to forecast time series and present a new robust algorithm named $\mathcal{L}_{\text{RoBoS}}$-NN. To assess the potential of $\mathcal{L}_{\text{RoBoS}}$-NN, we conduct experiments on multiple real-world datasets. In addition, we infuse outliers into data sets to evaluate the performance of $\mathcal{L}_{\text{RoBoS}}$-NN in more challenging scenarios. Numerical results show that $\mathcal{L}_{\text{RoBoS}}$-NN outperforms the other benchmark models in terms of accuracy measures.
