An Efficient Projection-Based Next-best-view Planning Framework for Reconstruction of Unknown Objects
Zhizhou Jia, Shaohui Zhang, Qun Hao
TL;DR
This work tackles the computational burden of Next Best View planning for complete 3D object reconstruction by introducing a projection-based NBV framework. It representations unknown object structure as ellipsoids fitted to voxel clusters via Gaussian Mixture Models and Minimum Volume Enclosing Ellipsoids, and evaluates candidate viewpoints through a projection-based quality function that aggregates weighted ellipsoid projections. A global partitioning strategy is employed to prevent backtracking and greedily selected views, enabling more robust coverage. The approach yields up to about 10× efficiency gains in simulation with comparable coverage and is demonstrated to be feasible in real-world experiments with a robotic arm and 3D camera. The contributions advance practical, scalable NBV planning for industrial robotics and quality inspection tasks.
Abstract
Efficiently and completely capturing the three-dimensional data of an object is a fundamental problem in industrial and robotic applications. The task of next-best-view (NBV) planning is to infer the pose of the next viewpoint based on the current data, and gradually realize the complete three-dimensional reconstruction. Many existing algorithms, however, suffer a large computational burden due to the use of ray-casting. To address this, this paper proposes a projection-based NBV planning framework. It can select the next best view at an extremely fast speed while ensuring the complete scanning of the object. Specifically, this framework refits different types of voxel clusters into ellipsoids based on the voxel structure.Then, the next best view is selected from the candidate views using a projection-based viewpoint quality evaluation function in conjunction with a global partitioning strategy. This process replaces the ray-casting in voxel structures, significantly improving the computational efficiency. Comparative experiments with other algorithms in a simulation environment show that the framework proposed in this paper can achieve 10 times efficiency improvement on the basis of capturing roughly the same coverage. The real-world experimental results also prove the efficiency and feasibility of the framework.
