Table of Contents
Fetching ...

RoBoSS: A Robust, Bounded, Sparse, and Smooth Loss Function for Supervised Learning

Mushir Akhtar, M. Tanveer, Mohd. Arshad

TL;DR

A novel robust, bounded, sparse, and smooth (RoBoSS) loss function for supervised learning is proposed and incorporated within the framework of support vector machine (SVM) and a new robust algorithm named <inline-formula><tex-math notation="LaTeX">$\mathcal {L}_{RoBoSS}$</tex-math><alternatives><mml:math><mml:msub><mml:mi mathvariant="script">

Abstract

In the domain of machine learning, the significance of the loss function is paramount, especially in supervised learning tasks. It serves as a fundamental pillar that profoundly influences the behavior and efficacy of supervised learning algorithms. Traditional loss functions, though widely used, often struggle to handle outlier-prone and high-dimensional data, resulting in suboptimal outcomes and slow convergence during training. In this paper, we address the aforementioned constraints by proposing a novel robust, bounded, sparse, and smooth (RoBoSS) loss function for supervised learning. Further, we incorporate the RoBoSS loss within the framework of support vector machine (SVM) and introduce a new robust algorithm named $\mathcal{L}_{RoBoSS}$-SVM. For the theoretical analysis, the classification-calibrated property and generalization ability are also presented. These investigations are crucial for gaining deeper insights into the robustness of the RoBoSS loss function in classification problems and its potential to generalize well to unseen data. To validate the potency of the proposed $\mathcal{L}_{RoBoSS}$-SVM, we assess it on $88$ benchmark datasets from KEEL and UCI repositories. Further, to rigorously evaluate its performance in challenging scenarios, we conducted an assessment using datasets intentionally infused with outliers and label noise. Additionally, to exemplify the effectiveness of $\mathcal{L}_{RoBoSS}$-SVM within the biomedical domain, we evaluated it on two medical datasets: the electroencephalogram (EEG) signal dataset and the breast cancer (BreaKHis) dataset. The numerical results substantiate the superiority of the proposed $\mathcal{L}_{RoBoSS}$-SVM model, both in terms of its remarkable generalization performance and its efficiency in training time.

RoBoSS: A Robust, Bounded, Sparse, and Smooth Loss Function for Supervised Learning

TL;DR

A novel robust, bounded, sparse, and smooth (RoBoSS) loss function for supervised learning is proposed and incorporated within the framework of support vector machine (SVM) and a new robust algorithm named <inline-formula><tex-math notation="LaTeX"></tex-math><alternatives><mml:math><mml:msub><mml:mi mathvariant="script">

Abstract

In the domain of machine learning, the significance of the loss function is paramount, especially in supervised learning tasks. It serves as a fundamental pillar that profoundly influences the behavior and efficacy of supervised learning algorithms. Traditional loss functions, though widely used, often struggle to handle outlier-prone and high-dimensional data, resulting in suboptimal outcomes and slow convergence during training. In this paper, we address the aforementioned constraints by proposing a novel robust, bounded, sparse, and smooth (RoBoSS) loss function for supervised learning. Further, we incorporate the RoBoSS loss within the framework of support vector machine (SVM) and introduce a new robust algorithm named -SVM. For the theoretical analysis, the classification-calibrated property and generalization ability are also presented. These investigations are crucial for gaining deeper insights into the robustness of the RoBoSS loss function in classification problems and its potential to generalize well to unseen data. To validate the potency of the proposed -SVM, we assess it on benchmark datasets from KEEL and UCI repositories. Further, to rigorously evaluate its performance in challenging scenarios, we conducted an assessment using datasets intentionally infused with outliers and label noise. Additionally, to exemplify the effectiveness of -SVM within the biomedical domain, we evaluated it on two medical datasets: the electroencephalogram (EEG) signal dataset and the breast cancer (BreaKHis) dataset. The numerical results substantiate the superiority of the proposed -SVM model, both in terms of its remarkable generalization performance and its efficiency in training time.
Paper Structure (3 sections, 4 equations, 1 figure, 11 tables)

This paper contains 3 sections, 4 equations, 1 figure, 11 tables.

Figures (1)

  • Figure S.1: Sensitivity analysis of the $\mathcal{L}_{RoBoSS}$-SVM model with respect to the loss hyperparameters $a$ and $\lambda$ on abalone9-18, echocardiogram, titanic, and ecoli3 datasets. The 3D surface plots illustrate the model's accuracy variations as $a$ and $\lambda$ are systematically adjusted while keeping other hyperparameters fixed at their optimal values.