Table of Contents
Fetching ...

Theoretical Framework for Tempered Fractional Gradient Descent: Application to Breast Cancer Classification

Omar Naifar

TL;DR

The tempered memory mechanism proves particularly effective in medical classification tasks, where feature correlations benefit from stable gradient averaging, and position TFGD as a robust alternative to conventional optimizers in both theoretical and applied machine learning.

Abstract

This paper introduces Tempered Fractional Gradient Descent (TFGD), a novel optimization framework that synergizes fractional calculus with exponential tempering to enhance gradient-based learning. Traditional gradient descent methods often suffer from oscillatory updates and slow convergence in high-dimensional, noisy landscapes. TFGD addresses these limitations by incorporating a tempered memory mechanism, where historical gradients are weighted by fractional coefficients $|w_j| = \binomα{j}$ and exponentially decayed via a tempering parameter $λ$. Theoretical analysis establishes TFGD's convergence guarantees: in convex settings, it achieves an $\mathcal{O}(1/K)$ rate with alignment coefficient $d_{α,λ} = (1 - e^{-λ})^{-α}$, while stochastic variants attain $\mathcal{O}(1/k^α)$ error decay. The algorithm maintains $\mathcal{O}(n)$ time complexity equivalent to SGD, with memory overhead scaling as $\mathcal{O}(d/λ)$ for parameter dimension $d$. Empirical validation on the Breast Cancer Wisconsin dataset demonstrates TFGD's superiority, achieving 98.25\% test accuracy (vs. 92.11\% for SGD) and 2$\times$ faster convergence. The tempered memory mechanism proves particularly effective in medical classification tasks, where feature correlations benefit from stable gradient averaging. These results position TFGD as a robust alternative to conventional optimizers in both theoretical and applied machine learning.

Theoretical Framework for Tempered Fractional Gradient Descent: Application to Breast Cancer Classification

TL;DR

The tempered memory mechanism proves particularly effective in medical classification tasks, where feature correlations benefit from stable gradient averaging, and position TFGD as a robust alternative to conventional optimizers in both theoretical and applied machine learning.

Abstract

This paper introduces Tempered Fractional Gradient Descent (TFGD), a novel optimization framework that synergizes fractional calculus with exponential tempering to enhance gradient-based learning. Traditional gradient descent methods often suffer from oscillatory updates and slow convergence in high-dimensional, noisy landscapes. TFGD addresses these limitations by incorporating a tempered memory mechanism, where historical gradients are weighted by fractional coefficients and exponentially decayed via a tempering parameter . Theoretical analysis establishes TFGD's convergence guarantees: in convex settings, it achieves an rate with alignment coefficient , while stochastic variants attain error decay. The algorithm maintains time complexity equivalent to SGD, with memory overhead scaling as for parameter dimension . Empirical validation on the Breast Cancer Wisconsin dataset demonstrates TFGD's superiority, achieving 98.25\% test accuracy (vs. 92.11\% for SGD) and 2 faster convergence. The tempered memory mechanism proves particularly effective in medical classification tasks, where feature correlations benefit from stable gradient averaging. These results position TFGD as a robust alternative to conventional optimizers in both theoretical and applied machine learning.

Paper Structure

This paper contains 7 sections, 6 theorems, 38 equations, 2 figures, 2 tables.

Key Result

Lemma 4.1

For $0 < \alpha < 1$ and $\lambda > 0$: Define $d_{\alpha,\lambda} := (1 - e^{-\lambda})^{-\alpha}$ as the alignment coefficient.

Figures (2)

  • Figure 1: Time/Memory trade-off for TFGD on the Breast Cancer Wisconsin dataset ($d=30$, $\lambda=0.5$, $K=100$).
  • Figure 2: Comparison of TFGD ($\alpha=0.6$, $\lambda=0.5$) and SGD on the Breast Cancer Wisconsin dataset.

Theorems & Definitions (14)

  • Definition 2.1: Tempered Caputo Derivative, medved2021
  • Definition 2.2: TFGD Update Rule
  • Lemma 4.1: Tempered Weight Decay
  • proof
  • Theorem 4.1: Convex Convergence
  • proof
  • Lemma 4.2: Variance Bound
  • proof
  • Theorem 4.2: Stochastic Convergence
  • proof
  • ...and 4 more