StagePilot: A Deep Reinforcement Learning Agent for Stage-Controlled Cybergrooming Simulation
Heajun An, Qi Zhang, Minqian Liu, Xinyi Zhang, Sang Won Lee, Lifu Huang, Pamela J. Wisniewski, Jin-Hee Cho
TL;DR
StagePilot introduces an offline deep reinforcement learning agent that controls stage transitions in cybergrooming simulations, treating dialogue as a long-horizon planning problem over six grooming stages. By combining adjacent-stage transition constraints with a sentiment-distance reward, StagePilot achieves coherent, emotionally engaging, and strategically progressive conversations, validated in LLM-driven predator–victim simulations. IQL+AWAC emerge as the strongest offline-RL baseline, delivering the highest Stage 6 reach and efficient dialogue length while maintaining sentiment alignment, and outperforming BC, CQL, and prompting baselines. The work provides a rigorous evaluation framework, expert validation, and demonstrates the potential of stage-level planning for safety-critical educational tools, with careful attention to ethical safeguards and applicability to broader counseling or negotiation simulations.
Abstract
Cybergrooming is an evolving threat to youth, necessitating proactive educational interventions. We propose StagePilot, an offline RL-based dialogue agent that simulates the stage-wise progression of grooming behaviors for prevention training. StagePilot selects conversational stages using a composite reward that balances user sentiment and goal proximity, with transitions constrained to adjacent stages for realism and interpretability. We evaluate StagePilot through LLM-based simulations, measuring stage completion, dialogue efficiency, and emotional engagement. Results show that StagePilot generates realistic and coherent conversations aligned with grooming dynamics. Among tested methods, the IQL+AWAC agent achieves the best balance between strategic planning and emotional coherence, reaching the final stage up to 43% more frequently than baselines while maintaining over 70% sentiment alignment.
