FlowRL: Flow-Augmented Few-Shot Reinforcement Learning for Semi-Structured Sensor Data

Mohammad Pivezhandi; Abusayeed Saifullah

FlowRL: Flow-Augmented Few-Shot Reinforcement Learning for Semi-Structured Sensor Data

Mohammad Pivezhandi, Abusayeed Saifullah

TL;DR

FlowRL tackles the challenge of few-shot reinforcement learning in DVFS-like environments by generating high-quality synthetic semi-structured sensor data using distribution-aware flow matching. It combines continuous normalizing flows with latent-space bootstrapping and feature-weighted conditioning to preserve critical correlations while expanding data support, leading to improved sample efficiency and faster Q-value convergence. In a DVFS case on the NVIDIA Jetson TX2, FlowRL achieves up to $35\%$ higher frame rates and more robust policy learning than model-based and model-free baselines, and the approach generalizes to robotics and smart grids. The work provides practical mechanisms to leverage flow-based data augmentation in data-scarce RL settings, offering a scalable path for online adaptation in resource-constrained systems.

Abstract

Reinforcement learning (RL) in few-shot scenarios with limited sensor data is challenging due to insufficient training samples, particularly in applications like Dynamic Voltage and Frequency Scaling (DVFS) where sensor readings are semi-structured with inherent correlations. We propose Flow-Augmented Reinforcement Learning (FlowRL), a novel method that leverages continuous normalizing flows to generate high-quality synthetic data for few-shot RL. By integrating latent space bootstrapping for diversity and feature-weighted flow matching to preserve critical data correlations, FlowRL enhances sample efficiency and policy robustness. Evaluated on a DVFS case study using the NVIDIA Jetson TX2, our approach achieves up to 35\% higher frame rates and faster Q-value convergence compared to baselines, demonstrating its effectiveness in resource-constrained environments. FlowRL generalizes to other semi-structured domains, such as robotics and smart grids, offering a scalable solution for data-scarce RL settings.

FlowRL: Flow-Augmented Few-Shot Reinforcement Learning for Semi-Structured Sensor Data

TL;DR

higher frame rates and more robust policy learning than model-based and model-free baselines, and the approach generalizes to robotics and smart grids. The work provides practical mechanisms to leverage flow-based data augmentation in data-scarce RL settings, offering a scalable path for online adaptation in resource-constrained systems.

Abstract

Paper Structure (36 sections, 18 equations, 5 figures, 1 table, 1 algorithm)

This paper contains 36 sections, 18 equations, 5 figures, 1 table, 1 algorithm.

Introduction
Preliminaries
Q-learning and Online RL
Online Setting.
Regret Bounds and Sample Efficiency
Model Realizability and LLN.
Flow Matching for Generative Modeling
Relevance to DVFS RL.
Design of Distribution-Aware Flow Matching for Data Generation
Redefinition of the Regret Function
Flow Matching and Bootstrapping
Bootstrapping in the Latent Space.
Feature Weighting and Conditional Flow Matching
Random Forest Feature Importance.
Weighted Conditional Objective.
...and 21 more sections

Figures (5)

Figure 1: Workflow of the proposed Flow-Augmented Reinforcement Learning (FlowRL) approach.
Figure 2: Correlation matrix comparisons.
Figure 3: Performance comparisons across methods.
Figure 4: Real and (Gen)erated data t-SNE.
Figure 5:

FlowRL: Flow-Augmented Few-Shot Reinforcement Learning for Semi-Structured Sensor Data

TL;DR

Abstract

FlowRL: Flow-Augmented Few-Shot Reinforcement Learning for Semi-Structured Sensor Data

Authors

TL;DR

Abstract

Table of Contents

Figures (5)