Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints

Arash Mozhdehi; Mahdi Mohammadizadeh; Xin Wang

Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints

Arash Mozhdehi, Mahdi Mohammadizadeh, Xin Wang

TL;DR

This work tackles the heterogeneous electric vehicle routing problem with time-window constraints ($HEVRPTW$), a practical and NP-hard problem in logistics. It introduces Edge-DIRECT, a Transformer-based DRL method with an edge-enhanced encoder and a dual-attention decoder that explicitly models time-window reachability, energy consumption, and fleet heterogeneity. Empirical results on real-world Edmonton and Calgary data show Edge-DIRECT achieving superior solution quality and faster runtimes than state-of-the-art DRL methods and heuristics, especially when using a sampling decoding strategy. The approach advances scalable, energy-aware EV routing by embedding problem structure into a learned policy that can generalize across fleet configurations and time-window scenarios.

Abstract

In response to carbon-neutral policies in developed countries, electric vehicles route optimization has gained importance for logistics companies. With the increasing focus on customer expectations and the shift towards more customer-oriented business models, the integration of delivery time-windows has become essential in logistics operations. Recognizing the critical nature of these developments, this article studies the heterogeneous electric vehicle routing problem with time-window constraints (HEVRPTW). To solve this variant of vehicle routing problem (VRP), we propose a DRL-based approach, named Edge-enhanced Dual attentIon encoderR and feature-EnhanCed dual aTtention decoder (Edge-DIRECT). Edge-DIRECT features an extra graph representation, the node connectivity of which is based on the overlap of customer time-windows. Edge-DIRECT's self-attention encoding mechanism is enhanced by exploiting the energy consumption and travel time between the locations. To effectively account for the heterogeneity of the EVs' fleet, a dual attention decoder has been introduced. Experimental results based on two real-world datasets reveal that Edge-DIRECT outperforms a state-of-the-art DRL-based method and a well-established heuristic approach in solution quality and execution time. Furthermore, it exhibits competitive performance when compared to another leading heuristic method.

Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints

TL;DR

This work tackles the heterogeneous electric vehicle routing problem with time-window constraints (

), a practical and NP-hard problem in logistics. It introduces Edge-DIRECT, a Transformer-based DRL method with an edge-enhanced encoder and a dual-attention decoder that explicitly models time-window reachability, energy consumption, and fleet heterogeneity. Empirical results on real-world Edmonton and Calgary data show Edge-DIRECT achieving superior solution quality and faster runtimes than state-of-the-art DRL methods and heuristics, especially when using a sampling decoding strategy. The approach advances scalable, energy-aware EV routing by embedding problem structure into a learned policy that can generalize across fleet configurations and time-window scenarios.

Abstract

Paper Structure (12 sections, 6 equations, 1 figure, 3 tables)

This paper contains 12 sections, 6 equations, 1 figure, 3 tables.

Related Works
Problem Definitions and Formulation
Methodology
Policy Network Architecture of Edge-DIRECT
Edge-Enhanced Dual Attention Encoder
Feature-Enhanced Dual Decoder
DRL Algorithm
Experiments
Performance Analysis (RQ1)
Ablation Study (RQ2)
Impact of Charging Station-to-Customer Ratio (RQ3)
Conclusion

Figures (1)

Figure 1: Illustration of Edge-DIRECT's policy network architecture.

Theorems & Definitions (3)

Definition 1
Definition 2
Definition 3

Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints

TL;DR

Abstract

Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints

Authors

TL;DR

Abstract

Table of Contents

Figures (1)

Theorems & Definitions (3)