Table of Contents
Fetching ...

Gradient Descent Efficiency Index

Aviral Dhingra

TL;DR

A new efficiency metric, Ek, designed to quantify the effectiveness of each iteration of gradient descent is introduced, which has the potential to guide more informed decisions in the selection and tuning of optimization algorithms in machine learning applications.

Abstract

Gradient descent is a widely used iterative algorithm for finding local minima in multivariate functions. However, the final iterations often either overshoot the minima or make minimal progress, making it challenging to determine an optimal stopping point. This study introduces a new efficiency metric, Ek, designed to quantify the effectiveness of each iteration. The proposed metric accounts for both the relative change in error and the stability of the loss function across iterations. This measure is particularly valuable in resource-constrained environments, where costs are closely tied to training time. Experimental validation across multiple datasets and models demonstrates that Ek provides valuable insights into the convergence behavior of gradient descent, complementing traditional performance metrics. The index has the potential to guide more informed decisions in the selection and tuning of optimization algorithms in machine learning applications and be used to compare the "effectiveness" of models relative to each other.

Gradient Descent Efficiency Index

TL;DR

A new efficiency metric, Ek, designed to quantify the effectiveness of each iteration of gradient descent is introduced, which has the potential to guide more informed decisions in the selection and tuning of optimization algorithms in machine learning applications.

Abstract

Gradient descent is a widely used iterative algorithm for finding local minima in multivariate functions. However, the final iterations often either overshoot the minima or make minimal progress, making it challenging to determine an optimal stopping point. This study introduces a new efficiency metric, Ek, designed to quantify the effectiveness of each iteration. The proposed metric accounts for both the relative change in error and the stability of the loss function across iterations. This measure is particularly valuable in resource-constrained environments, where costs are closely tied to training time. Experimental validation across multiple datasets and models demonstrates that Ek provides valuable insights into the convergence behavior of gradient descent, complementing traditional performance metrics. The index has the potential to guide more informed decisions in the selection and tuning of optimization algorithms in machine learning applications and be used to compare the "effectiveness" of models relative to each other.

Paper Structure

This paper contains 27 sections, 20 equations, 3 figures.

Figures (3)

  • Figure 1: Contour plot of cost b,w with path of gradient descent $J(w,b)$.
  • Figure 2: Comparison of cost with different domain restrictions.
  • Figure 3: Line graph of efficiency with respect to iterations in a sample dataset.