Hierarchical Material Recognition from Local Appearance
Matthew Beveridge, Shree K. Nayar
TL;DR
This work introduces a visually grounded hierarchy of materials and a dedicated in-the-wild dataset, Matador, to enable hierarchical material recognition from local appearance. It combines a visual taxonomy with a graph attention network to exploit taxonomic proximity, predicting full hierarchical labels and enabling robust inferences even when fine-grained identification fails. The approach achieves state-of-the-art results on Matador and existing benchmarks, benefits from rendering novel views to improve generalization to real-world, out-of-distribution imaging conditions, and demonstrates rapid few-shot adaptation to unseen materials. The contributions have practical implications for robotics and autonomous systems, providing both material-level classifications and associated mechanical properties to guide interaction and manipulation.
Abstract
We introduce a taxonomy of materials for hierarchical recognition from local appearance. Our taxonomy is motivated by vision applications and is arranged according to the physical traits of materials. We contribute a diverse, in-the-wild dataset with images and depth maps of the taxonomy classes. Utilizing the taxonomy and dataset, we present a method for hierarchical material recognition based on graph attention networks. Our model leverages the taxonomic proximity between classes and achieves state-of-the-art performance. We demonstrate the model's potential to generalize to adverse, real-world imaging conditions, and that novel views rendered using the depth maps can enhance this capability. Finally, we show the model's capacity to rapidly learn new materials in a few-shot learning setting.
