Table of Contents
Fetching ...

PathletRL++: Optimizing Trajectory Pathlet Extraction and Dictionary Formation via Reinforcement Learning

Gian Alix, Arian Haghparast, Manos Papagelis

TL;DR

The paper tackles the challenge of representing large-scale trajectory data with a compact pathlet dictionary by adopting a bottom-up, edge-disjoint pathlet merging strategy. It introduces PathletRL, a Deep Q-Network–based framework that learns a utility function for merging unit-length pathlets into longer, more informative ones, achieving dramatic memory reductions and high trajectory representability. To further improve performance, PathletRL++ adds richer state representations and enhanced reward shaping, yielding smaller dictionaries with faster convergence while preserving reconstruction quality. The approach outperforms state-of-the-art top-down methods, reducing dictionary size by up to 65.8% and enabling reconstruction of about 85% of trajectories with half of the dictionary, while delivering memory savings up to ~24,000×. Overall, the work offers a scalable, learning-driven solution for trajectory compression, route planning, and offline analysis, with open-source tooling to support reproducibility.

Abstract

Advances in tracking technologies have spurred the rapid growth of large-scale trajectory data. Building a compact collection of pathlets, referred to as a trajectory pathlet dictionary, is essential for supporting mobility-related applications. Existing methods typically adopt a top-down approach, generating numerous candidate pathlets and selecting a subset, leading to high memory usage and redundant storage from overlapping pathlets. To overcome these limitations, we propose a bottom-up strategy that incrementally merges basic pathlets to build the dictionary, reducing memory requirements by up to 24,000 times compared to baseline methods. The approach begins with unit-length pathlets and iteratively merges them while optimizing utility, which is defined using newly introduced metrics of trajectory loss and representability. We develop a deep reinforcement learning framework, PathletRL, which utilizes Deep Q-Networks (DQN) to approximate the utility function, resulting in a compact and efficient pathlet dictionary. Experiments on both synthetic and real-world datasets demonstrate that our method outperforms state-of-the-art techniques, reducing the size of the constructed dictionary by up to 65.8%. Additionally, our results show that only half of the dictionary pathlets are needed to reconstruct 85% of the original trajectory data. Building on PathletRL, we introduce PathletRL++, which extends the original model by incorporating a richer state representation and an improved reward function to optimize decision-making during pathlet merging. These enhancements enable the agent to gain a more nuanced understanding of the environment, leading to higher-quality pathlet dictionaries. PathletRL++ achieves even greater dictionary size reduction, surpassing the performance of PathletRL, while maintaining high trajectory representability.

PathletRL++: Optimizing Trajectory Pathlet Extraction and Dictionary Formation via Reinforcement Learning

TL;DR

The paper tackles the challenge of representing large-scale trajectory data with a compact pathlet dictionary by adopting a bottom-up, edge-disjoint pathlet merging strategy. It introduces PathletRL, a Deep Q-Network–based framework that learns a utility function for merging unit-length pathlets into longer, more informative ones, achieving dramatic memory reductions and high trajectory representability. To further improve performance, PathletRL++ adds richer state representations and enhanced reward shaping, yielding smaller dictionaries with faster convergence while preserving reconstruction quality. The approach outperforms state-of-the-art top-down methods, reducing dictionary size by up to 65.8% and enabling reconstruction of about 85% of trajectories with half of the dictionary, while delivering memory savings up to ~24,000×. Overall, the work offers a scalable, learning-driven solution for trajectory compression, route planning, and offline analysis, with open-source tooling to support reproducibility.

Abstract

Advances in tracking technologies have spurred the rapid growth of large-scale trajectory data. Building a compact collection of pathlets, referred to as a trajectory pathlet dictionary, is essential for supporting mobility-related applications. Existing methods typically adopt a top-down approach, generating numerous candidate pathlets and selecting a subset, leading to high memory usage and redundant storage from overlapping pathlets. To overcome these limitations, we propose a bottom-up strategy that incrementally merges basic pathlets to build the dictionary, reducing memory requirements by up to 24,000 times compared to baseline methods. The approach begins with unit-length pathlets and iteratively merges them while optimizing utility, which is defined using newly introduced metrics of trajectory loss and representability. We develop a deep reinforcement learning framework, PathletRL, which utilizes Deep Q-Networks (DQN) to approximate the utility function, resulting in a compact and efficient pathlet dictionary. Experiments on both synthetic and real-world datasets demonstrate that our method outperforms state-of-the-art techniques, reducing the size of the constructed dictionary by up to 65.8%. Additionally, our results show that only half of the dictionary pathlets are needed to reconstruct 85% of the original trajectory data. Building on PathletRL, we introduce PathletRL++, which extends the original model by incorporating a richer state representation and an improved reward function to optimize decision-making during pathlet merging. These enhancements enable the agent to gain a more nuanced understanding of the environment, leading to higher-quality pathlet dictionaries. PathletRL++ achieves even greater dictionary size reduction, surpassing the performance of PathletRL, while maintaining high trajectory representability.

Paper Structure

This paper contains 40 sections, 2 theorems, 32 equations, 16 figures, 7 tables, 1 algorithm.

Key Result

Theorem 3.1

At any step $i$ of the iterative Algorithm alg:pathlet_rl, then the trajectory representability $\mu$ of some trajectory $\tau \in \mathcal{T}$ by the end of that iteration $i$ is equal to: where $\Phi_0$ and $\Phi_i$ are the pathlet-based representation of trajectory $\tau$ in the initial (iteration 0) and iteration $i$ of the iterative algorithm respectively.

Figures (16)

  • Figure 1: (a) Graph representation of a small area in Toronto; (b) Example of various-length edge-disjoint pathlets in (a).
  • Figure 2: The memory required by top-down (existing) methods that use overlapping pathlets can be reduced by our proposed bottom-up solution that use edge-disjoint pathlets.
  • Figure 3: The overall architecture (including the constructed PDs) of our proposed PathletRL model
  • Figure 4: An illustrative example of Example \ref{['ex:pathlet-merge']}: (a) A toy example of a simple road network; (b) A grid representation of (a); (c) The initial pathlet graph representation (of length-1 pathlets) for the road network in (a); (d) The final (merged) pathlet graph representation after the completion of the pathlet-merging algorithm in Algorithm \ref{['alg:pathlet_rl']}.
  • Figure 5: An illustrative example of the paths (road segments) traversed by six trajectories $\{ \tau_1, \tau_2, \tau_3, \tau_4, \tau_5, \tau_6 \}$ (highlighted by maroon) from the road network of Example \ref{['ex:pathlet-merge']} as seen in Figure \ref{['fig:pathlet-merge-pic1']}; see Table \ref{['tab:pathlet-rep-set-toy']} that lists the sequential pathlet-based representations of each trajectory.
  • ...and 11 more figures

Theorems & Definitions (3)

  • Example 3.1
  • Theorem 3.1: Trajectory Representability Theorem
  • Theorem 3.2: Initial Memory Storage Requirement Theorem