Table of Contents
Fetching ...

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

Ruonan Zhao, Joseph Geunes

TL;DR

A novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework is presented that integrates railway-specific heuristic solution approaches with a reinforcement learning method, specifically Q-learning and leverages methods to decrease the state-action space and guide exploration during reinforcement learning.

Abstract

Railcar shunting is a core planning task in freight railyards, where yard planners need to disassemble and reassemble groups of railcars to form outbound trains. Classification tracks with access from one side only can be considered as stack structures, where railcars are added and removed from only one end, leading to a last-in-first-out (LIFO) retrieval order. In contrast, two-sided tracks function like queue structures, allowing railcars to be added from one end and removed from the opposite end, following a first-in-first-out (FIFO) order. We consider a problem requiring assembly of multiple outbound trains using two locomotives in a railyard with two-sided classification track access. To address this combinatorially challenging problem class, we decompose the problem into two subproblems, each with one-sided classification track access and a locomotive on each side. We present a novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework that integrates railway-specific heuristic solution approaches with a reinforcement learning method, specifically Q-learning. The proposed framework leverages methods to decrease the state-action space and guide exploration during reinforcement learning. The results of a series of numerical experiments demonstrate the efficiency and quality of the HHRL algorithm in both one-sided access, single-locomotive problems and two-sided access, two-locomotive problems.

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

TL;DR

A novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework is presented that integrates railway-specific heuristic solution approaches with a reinforcement learning method, specifically Q-learning and leverages methods to decrease the state-action space and guide exploration during reinforcement learning.

Abstract

Railcar shunting is a core planning task in freight railyards, where yard planners need to disassemble and reassemble groups of railcars to form outbound trains. Classification tracks with access from one side only can be considered as stack structures, where railcars are added and removed from only one end, leading to a last-in-first-out (LIFO) retrieval order. In contrast, two-sided tracks function like queue structures, allowing railcars to be added from one end and removed from the opposite end, following a first-in-first-out (FIFO) order. We consider a problem requiring assembly of multiple outbound trains using two locomotives in a railyard with two-sided classification track access. To address this combinatorially challenging problem class, we decompose the problem into two subproblems, each with one-sided classification track access and a locomotive on each side. We present a novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework that integrates railway-specific heuristic solution approaches with a reinforcement learning method, specifically Q-learning. The proposed framework leverages methods to decrease the state-action space and guide exploration during reinforcement learning. The results of a series of numerical experiments demonstrate the efficiency and quality of the HHRL algorithm in both one-sided access, single-locomotive problems and two-sided access, two-locomotive problems.
Paper Structure (18 sections, 5 equations, 6 figures, 6 tables, 3 algorithms)

This paper contains 18 sections, 5 equations, 6 figures, 6 tables, 3 algorithms.

Figures (6)

  • Figure 1: One-sided railyard layout example.
  • Figure 2: Two-sided railyard layout example
  • Figure 3: The general setting for RL
  • Figure 4: Group type illustration
  • Figure 5: HHRL framework
  • ...and 1 more figures

Theorems & Definitions (2)

  • Definition 1
  • Definition 2