Table of Contents
Fetching ...

RoboBPP: Benchmarking Robotic Online Bin Packing with Physics-based Simulation

Zhoufeng Wang, Hang Zhao, Juzhan Xu, Shishun Zhang, Zeyu Xiong, Ruizhen Hu, Chenyang Zhu, Zecui Zeng, Kai Xu

TL;DR

RoboBPP addresses the lack of standardized benchmarking for robotic online 3D bin packing by delivering a physics-based simulation environment, three real industrial datasets, and a multi-setting evaluation framework with a weighted multi-metric scoring system. The benchmark enables end-to-end assessment from geometric placement to robotic execution, capturing physical feasibility, stability, and safety during sequential packing. Through extensive evaluation of diverse methods—ranging from geometry-aware transformers to EMS-driven heuristics—it reveals dataset- and setting-dependent strengths, offering practical guidance for industrial deployment. All resources are openly available and accompanied by an online leaderboard to promote reproducibility and community collaboration.

Abstract

Physical feasibility in 3D bin packing is a key requirement in modern industrial logistics and robotic automation. With the growing adoption of industrial automation, online bin packing has gained increasing attention. However, inconsistencies in problem settings, test datasets, and evaluation metrics have hindered progress in the field, and there is a lack of a comprehensive benchmarking system. Direct testing on real hardware is costly, and building a realistic simulation environment is also challenging. To address these limitations, we introduce RoboBPP, a benchmarking system designed for robotic online bin packing. RoboBPP integrates a physics-based simulator to assess physical feasibility. In our simulation environment, we introduce a robotic arm and boxes at real-world scales to replicate real industrial packing workflows. By simulating conditions that arise in real industrial applications, we ensure that evaluated algorithms are practically deployable. In addition, prior studies often rely on synthetic datasets whose distributions differ from real-world industrial data. To address this issue, we collect three datasets from real industrial workflows, including assembly-line production, logistics packing, and furniture manufacturing. The benchmark comprises three carefully designed test settings and extends existing evaluation metrics with new metrics for structural stability and operational safety. We design a scoring system and derive a range of insights from the evaluation results. RoboBPP is fully open-source and is equipped with visualization tools and an online leaderboard, providing a reproducible and extensible foundation for future research and industrial applications (https://robot-bin-packing-benchmark.github.io).

RoboBPP: Benchmarking Robotic Online Bin Packing with Physics-based Simulation

TL;DR

RoboBPP addresses the lack of standardized benchmarking for robotic online 3D bin packing by delivering a physics-based simulation environment, three real industrial datasets, and a multi-setting evaluation framework with a weighted multi-metric scoring system. The benchmark enables end-to-end assessment from geometric placement to robotic execution, capturing physical feasibility, stability, and safety during sequential packing. Through extensive evaluation of diverse methods—ranging from geometry-aware transformers to EMS-driven heuristics—it reveals dataset- and setting-dependent strengths, offering practical guidance for industrial deployment. All resources are openly available and accompanied by an online leaderboard to promote reproducibility and community collaboration.

Abstract

Physical feasibility in 3D bin packing is a key requirement in modern industrial logistics and robotic automation. With the growing adoption of industrial automation, online bin packing has gained increasing attention. However, inconsistencies in problem settings, test datasets, and evaluation metrics have hindered progress in the field, and there is a lack of a comprehensive benchmarking system. Direct testing on real hardware is costly, and building a realistic simulation environment is also challenging. To address these limitations, we introduce RoboBPP, a benchmarking system designed for robotic online bin packing. RoboBPP integrates a physics-based simulator to assess physical feasibility. In our simulation environment, we introduce a robotic arm and boxes at real-world scales to replicate real industrial packing workflows. By simulating conditions that arise in real industrial applications, we ensure that evaluated algorithms are practically deployable. In addition, prior studies often rely on synthetic datasets whose distributions differ from real-world industrial data. To address this issue, we collect three datasets from real industrial workflows, including assembly-line production, logistics packing, and furniture manufacturing. The benchmark comprises three carefully designed test settings and extends existing evaluation metrics with new metrics for structural stability and operational safety. We design a scoring system and derive a range of insights from the evaluation results. RoboBPP is fully open-source and is equipped with visualization tools and an online leaderboard, providing a reproducible and extensible foundation for future research and industrial applications (https://robot-bin-packing-benchmark.github.io).

Paper Structure

This paper contains 29 sections, 16 equations, 9 figures, 13 tables, 11 algorithms.

Figures (9)

  • Figure 1: Overview of our benchmark for online 3D-BPP. We collect 3 real industrial datasets and build a physics-based simulation environment. We define 3 simulation settings and summarize 4 tpyes of metrics.
  • Figure 2: Visualization of the three industrial datasets. Each block contains two parts: the upper row shows the packing results inside a container, while the lower row displays the corresponding item distribution and their tasks.
  • Figure 3: Box plots of item dimensions (length, width, height) for the three datasets.
  • Figure 4: Repeat rates of 15 randomly selected groups for the three datasets, with each bar representing the proportion of duplicate boxes in that group.
  • Figure 5: Three test settings in the simulation environment. Math Pack evaluates purely geometric placement. Physics Pack adds gravity and collisions test under realistic physical constraints. Execution Pack integrates physics and robotic execution for end-to-end evaluation.
  • ...and 4 more figures