Preference-Driven Active 3D Scene Representation for Robotic Inspection in Nuclear Decommissioning

Zhen Meng; Kan Chen; Xiangmin Xu; Erwin Jose Lopez Pulgarin; Emma Li; Philip G. Zhao; David Flynn

TL;DR

This work addresses the mismatch between traditional active 3D scene representation, which focuses on geometry and rendering quality, and operator-specific objectives in high-risk environments. It introduces a reinforcement-learning-from-human-feedback (RLHF) framework that learns a reward model from pairwise operator preferences and optimizes viewpoint planning with PPO. The reward model $\hat{r}$ is trained with a cross-entropy loss over the preference likelihood $P[\sigma_1 \succ \sigma_2]$, the probability that the operator prefers candidate $\sigma_1$ over $\sigma_2$. The approach is validated on a UR3e-based reactor-tile inspection setup with 400 reconstructions and expert preferences, demonstrating improved scene fidelity and reduced trajectory length across multiple 3D representations. This operator-centric, online-learning paradigm advances adaptive, safety-critical robotic perception for nuclear decommissioning and similar high-risk tasks, with potential extensions to scalable human feedback and language-model-assisted reasoning.
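To make the preference likelihood and its cross-entropy loss concrete, here is a minimal sketch. It assumes the Bradley-Terry form commonly used in preference-based RLHF, in which the likelihood is a logistic function of the difference in summed predicted rewards. The summary above does not spell out the exact expression, so that functional form, the function names, and the label convention `mu` are illustrative assumptions rather than the authors' implementation.

```python
import math


def preference_probability(rewards_1, rewards_2):
    """P[sigma_1 > sigma_2] under an ASSUMED Bradley-Terry model.

    rewards_1 and rewards_2 hold the predicted per-step rewards r_hat
    for the two candidates being compared (hypothetical inputs).
    """
    diff = sum(rewards_1) - sum(rewards_2)
    # Numerically stable logistic of the return difference; this is
    # algebraically exp(s1) / (exp(s1) + exp(s2)).
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def cross_entropy_loss(rewards_1, rewards_2, mu):
    """Cross-entropy between the operator label and the model likelihood.

    mu is an assumed label convention: 1.0 if the operator preferred
    sigma_1, 0.0 if sigma_2, and 0.5 for no preference.
    """
    p = preference_probability(rewards_1, rewards_2)
    eps = 1e-12  # guards against log(0)
    return -(mu * math.log(p + eps) + (1.0 - mu) * math.log(1.0 - p + eps))


# Example: the reward model scores candidate 1 higher than candidate 2.
p = preference_probability([0.8, 0.9], [0.2, 0.1])
loss_agree = cross_entropy_loss([0.8, 0.9], [0.2, 0.1], mu=1.0)
loss_disagree = cross_entropy_loss([0.8, 0.9], [0.2, 0.1], mu=0.0)
```

When the operator's label agrees with the reward model's ranking, the loss is small. When it disagrees, the loss is larger, so gradient descent pushes $\hat{r}$ toward the operator's stated preference.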

Abstract

Active 3D scene representation is pivotal in modern robotics applications, including remote inspection, manipulation, and telepresence. Traditional methods primarily optimize geometric fidelity or rendering accuracy, but often overlook operator-specific objectives, such as safety-critical coverage or task-driven viewpoints. This limitation leads to suboptimal viewpoint selection, particularly in constrained environments such as nuclear decommissioning. To bridge this gap, we introduce a novel framework that integrates expert operator preferences into the active 3D scene representation pipeline. Specifically, we employ Reinforcement Learning from Human Feedback (RLHF) to guide robotic path planning, reshaping the reward function based on expert input. To capture operator-specific priorities, we conduct interactive choice experiments that evaluate user preferences in 3D scene representation. We validate our framework using a UR3e robotic arm for reactor tile inspection in a nuclear decommissioning scenario. Compared to baseline methods, our approach enhances scene representation while optimizing trajectory efficiency. The RLHF-based policy consistently outperforms random selection, prioritizing task-critical details. By unifying explicit 3D geometric modeling with implicit human-in-the-loop optimization, this work establishes a foundation for adaptive, safety-critical robotic perception systems, paving the way for enhanced automation in nuclear decommissioning, remote maintenance, and other high-risk environments.
