Table of Contents
Fetching ...

Enhancing Routing in SD-EONs through Reinforcement Learning: A Comparative Analysis

Ryan McCann, Arash Rezaee, Vinod M. Vokkarane

TL;DR

An optimization framework for routing in software-defined elastic optical networks using reinforcement learning algorithms and shows that Q-learning significantly outperforms traditional methods, achieving a reduction in blocking probability of up to 58.8% over KSP-FF, and 81.9% over SPF-FF under lower traffic volumes.

Abstract

This paper presents an optimization framework for routing in software-defined elastic optical networks using reinforcement learning algorithms. We specifically implement and compare the epsilon-greedy bandit, upper confidence bound (UCB) bandit, and Q-learning algorithms to traditional methods such as K-Shortest Paths with First-Fit core and spectrum assignment (KSP-FF) and Shortest Path with First-Fit (SPF-FF) algorithms. Our results show that Q-learning significantly outperforms traditional methods, achieving a reduction in blocking probability (BP) of up to 58.8% over KSP-FF, and 81.9% over SPF-FF under lower traffic volumes. For higher traffic volumes, Q-learning maintains superior performance with BP reductions of 41.9% over KSP-FF and 70.1% over SPF-FF. These findings demonstrate the efficacy of reinforcement learning in enhancing network performance and resource utilization in dynamic and complex environments.

Enhancing Routing in SD-EONs through Reinforcement Learning: A Comparative Analysis

TL;DR

An optimization framework for routing in software-defined elastic optical networks using reinforcement learning algorithms and shows that Q-learning significantly outperforms traditional methods, achieving a reduction in blocking probability of up to 58.8% over KSP-FF, and 81.9% over SPF-FF under lower traffic volumes.

Abstract

This paper presents an optimization framework for routing in software-defined elastic optical networks using reinforcement learning algorithms. We specifically implement and compare the epsilon-greedy bandit, upper confidence bound (UCB) bandit, and Q-learning algorithms to traditional methods such as K-Shortest Paths with First-Fit core and spectrum assignment (KSP-FF) and Shortest Path with First-Fit (SPF-FF) algorithms. Our results show that Q-learning significantly outperforms traditional methods, achieving a reduction in blocking probability (BP) of up to 58.8% over KSP-FF, and 81.9% over SPF-FF under lower traffic volumes. For higher traffic volumes, Q-learning maintains superior performance with BP reductions of 41.9% over KSP-FF and 70.1% over SPF-FF. These findings demonstrate the efficacy of reinforcement learning in enhancing network performance and resource utilization in dynamic and complex environments.

Paper Structure

This paper contains 15 sections, 3 equations, 4 figures, 1 table, 2 algorithms.

Figures (4)

  • Figure 1: NSFNet Topology.
  • Figure 2: BP vs. Episodes $Erlang=500$
  • Figure 3: BP vs. Episodes $Erlang=750$
  • Figure 4: BP vs. Episodes $Erlang=1000$