Table of Contents
Fetching ...

Diffusion Models for Multi-target Adversarial Tracking

Sean Ye, Manisha Natarajan, Zixuan Wu, Matthew Gombolay

TL;DR

This paper tackles adversarial target tracking under partial observability in large-scale domains. It introduces CADENCE, a diffusion-model approach with cross-attention and constraint-guided sampling to generate multimodal, obstacle-aware trajectories from sparse detections. Key contributions include (1) a cross-attention diffusion architecture enabling implicit target assignment for multi-target tracking, (2) a constraint-guided sampling scheme that enforces motion and obstacle constraints and reduces mountain collisions by about 90%, and (3) a single-target diffusion model that improves $ADE$ by about 9.2% over the previous state of the art while producing complete trajectories across horizons. The method offers significant potential for autonomous tracking in UAV-enabled security and safety applications, providing richer foresight for pursuit and interdiction strategies while operating under partial observability.

Abstract

Target tracking plays a crucial role in real-world scenarios, particularly in drug-trafficking interdiction, where the knowledge of an adversarial target's location is often limited. Improving autonomous tracking systems will enable unmanned aerial, surface, and underwater vehicles to better assist in interdicting smugglers that use manned surface, semi-submersible, and aerial vessels. As unmanned drones proliferate, accurate autonomous target estimation is even more crucial for security and safety. This paper presents Constrained Agent-based Diffusion for Enhanced Multi-Agent Tracking (CADENCE), an approach aimed at generating comprehensive predictions of adversary locations by leveraging past sparse state information. To assess the effectiveness of this approach, we evaluate predictions on single-target and multi-target pursuit environments, employing Monte-Carlo sampling of the diffusion model to estimate the probability associated with each generated trajectory. We propose a novel cross-attention based diffusion model that utilizes constraint-based sampling to generate multimodal track hypotheses. Our single-target model surpasses the performance of all baseline methods on Average Displacement Error (ADE) for predictions across all time horizons.

Diffusion Models for Multi-target Adversarial Tracking

TL;DR

This paper tackles adversarial target tracking under partial observability in large-scale domains. It introduces CADENCE, a diffusion-model approach with cross-attention and constraint-guided sampling to generate multimodal, obstacle-aware trajectories from sparse detections. Key contributions include (1) a cross-attention diffusion architecture enabling implicit target assignment for multi-target tracking, (2) a constraint-guided sampling scheme that enforces motion and obstacle constraints and reduces mountain collisions by about 90%, and (3) a single-target diffusion model that improves by about 9.2% over the previous state of the art while producing complete trajectories across horizons. The method offers significant potential for autonomous tracking in UAV-enabled security and safety applications, providing richer foresight for pursuit and interdiction strategies while operating under partial observability.

Abstract

Target tracking plays a crucial role in real-world scenarios, particularly in drug-trafficking interdiction, where the knowledge of an adversarial target's location is often limited. Improving autonomous tracking systems will enable unmanned aerial, surface, and underwater vehicles to better assist in interdicting smugglers that use manned surface, semi-submersible, and aerial vessels. As unmanned drones proliferate, accurate autonomous target estimation is even more crucial for security and safety. This paper presents Constrained Agent-based Diffusion for Enhanced Multi-Agent Tracking (CADENCE), an approach aimed at generating comprehensive predictions of adversary locations by leveraging past sparse state information. To assess the effectiveness of this approach, we evaluate predictions on single-target and multi-target pursuit environments, employing Monte-Carlo sampling of the diffusion model to estimate the probability associated with each generated trajectory. We propose a novel cross-attention based diffusion model that utilizes constraint-based sampling to generate multimodal track hypotheses. Our single-target model surpasses the performance of all baseline methods on Average Displacement Error (ADE) for predictions across all time horizons.
Paper Structure (25 sections, 4 equations, 4 figures, 1 table, 1 algorithm)

This paper contains 25 sections, 4 equations, 4 figures, 1 table, 1 algorithm.

Figures (4)

  • Figure 1: Our proposed architecture (CADENCE) is a diffusion probabilistic model that iteratively refines trajectories for hypothesizing the future states of an adversarial target from a given history of past detections. Here we show cross attention across two parallel tracks but the model can be used for any number of $n$ agents. The color gradient shows the time horizon for each trajectory (blue to red).
  • Figure 2: Multi-Hypothesis Trajectory Samples: In one sample (left) the agents take separate paths to the final location while the other sample (right) shows all agents meeting at an intermediary point.
  • Figure 3: Detection Origin Ablation: ADE and minADE over Prediction Horizon. Shaded regions represent standard error of the mean.
  • Figure 4: Original sampling (left) vs Constraint-Based sampling (right): We show key improvements to avoid the mountains in the sample on the right (circled in red).