A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

Ruonan Zhao; Joseph Geunes

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

Ruonan Zhao, Joseph Geunes

TL;DR

A novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework is presented that integrates railway-specific heuristic solution approaches with a reinforcement learning method, specifically Q-learning and leverages methods to decrease the state-action space and guide exploration during reinforcement learning.

Abstract

Railcar shunting is a core planning task in freight railyards, where yard planners need to disassemble and reassemble groups of railcars to form outbound trains. Classification tracks with access from one side only can be considered as stack structures, where railcars are added and removed from only one end, leading to a last-in-first-out (LIFO) retrieval order. In contrast, two-sided tracks function like queue structures, allowing railcars to be added from one end and removed from the opposite end, following a first-in-first-out (FIFO) order. We consider a problem requiring assembly of multiple outbound trains using two locomotives in a railyard with two-sided classification track access. To address this combinatorially challenging problem class, we decompose the problem into two subproblems, each with one-sided classification track access and a locomotive on each side. We present a novel Hybrid Heuristic-Reinforcement Learning (HHRL) framework that integrates railway-specific heuristic solution approaches with a reinforcement learning method, specifically Q-learning. The proposed framework leverages methods to decrease the state-action space and guide exploration during reinforcement learning. The results of a series of numerical experiments demonstrate the efficiency and quality of the HHRL algorithm in both one-sided access, single-locomotive problems and two-sided access, two-locomotive problems.

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

TL;DR

Abstract

Paper Structure (18 sections, 5 equations, 6 figures, 6 tables, 3 algorithms)

This paper contains 18 sections, 5 equations, 6 figures, 6 tables, 3 algorithms.

Introduction
Literature Review
Problem Definition and TS-RSP Transformation
OS-RSP definition
TS-RSP definition
RL-Based Heuristic Solution Method
Decomposing the TS-RSP into coupled OS-RSP subproblems
$Q$-learning Model Formulations
HHRL Approach
Preprocessing
Phase 1
Phase 2
Phase 3
Phase 4
Phase 5
...and 3 more sections

Figures (6)

Figure 1: One-sided railyard layout example.
Figure 2: Two-sided railyard layout example
Figure 3: The general setting for RL
Figure 4: Group type illustration
Figure 5: HHRL framework
...and 1 more figures

Theorems & Definitions (2)

Definition 1
Definition 2

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

TL;DR

Abstract

A Novel Hybrid Heuristic-Reinforcement Learning Optimization Approach for a Class of Railcar Shunting Problems

Authors

TL;DR

Abstract

Table of Contents

Figures (6)

Theorems & Definitions (2)