GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation
Ruihai Wu, Ziyu Zhu, Yuran Wang, Yue Chen, Jiarui Wang, Hao Dong
TL;DR
Cluttered garment manipulation poses challenges due to deformable, entangled garments. The authors propose point-level affordance learned from 3D point clouds to encode per-point actionability, garment geometry, structure, and inter-group relations, guiding retrieval. When entanglement prevents direct retrieval, an affordance-guided adaptation module iteratively reorganizes the pile via pick-and-place to reach manipulation-friendly states. Evaluations in GarmentLab across multiple scenes and real-world robot experiments show superior performance over baselines, demonstrating robust retrieval and adaptation for deformable garment clutter with practical implications for automation in laundry and wardrobe tasks.
Abstract
Cluttered garments manipulation poses significant challenges due to the complex, deformable nature of garments and intricate garment relations. Unlike single-garment manipulation, cluttered scenarios require managing complex garment entanglements and interactions, while maintaining garment cleanliness and manipulation stability. To address these demands, we propose to learn point-level affordance, the dense representation modeling the complex space and multi-modal manipulation candidates, while being aware of garment geometry, structure, and inter-object relations. Additionally, as it is difficult to directly retrieve a garment in some extremely entangled clutters, we introduce an adaptation module, guided by learned affordance, to reorganize highly-entangled garments into states plausible for manipulation. Our framework demonstrates effectiveness over environments featuring diverse garment types and pile configurations in both simulation and the real world. Project page: https://garmentpile.github.io/.
