RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets

Isabella Liu; Zhan Xu; Wang Yifan; Hao Tan; Zexiang Xu; Xiaolong Wang; Hao Su; Zifan Shi

RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets

Isabella Liu, Zhan Xu, Wang Yifan, Hao Tan, Zexiang Xu, Xiaolong Wang, Hao Su, Zifan Shi

TL;DR

RigAnything tackles automatic rigging for arbitrary 3D assets without predefined templates by modeling skeletons as BFS-ordered sequences of 3D joints with parent indices and learning skinning weights in a unified autoregressive transformer framework. It combines a diffusion-based joint sampler with a hybrid-attention transformer that processes both global shape context and evolving skeleton structure, and predicts connectivity and skinning weights in tandem. Trained end-to-end on RigNet and a filtered Objaverse subset, RigAnything achieves state-of-the-art results across humanoids, quadrupeds, marine life, insects, and other categories, while delivering rigging in under a few seconds per shape. The approach enhances generalizability, robustness, and efficiency for auto-rigging, enabling scalable pipelines for interactive 3D content creation.

Abstract

We present RigAnything, a novel autoregressive transformer-based model, which makes 3D assets rig-ready by probabilistically generating joints and skeleton topologies and assigning skinning weights in a template-free manner. Unlike most existing auto-rigging methods, which rely on predefined skeleton templates and are limited to specific categories like humanoid, RigAnything approaches the rigging problem in an autoregressive manner, iteratively predicting the next joint based on the global input shape and the previous prediction. While autoregressive models are typically used to generate sequential data, RigAnything extends its application to effectively learn and represent skeletons, which are inherently tree structures. To achieve this, we organize the joints in a breadth-first search (BFS) order, enabling the skeleton to be defined as a sequence of 3D locations and the parent index. Furthermore, our model improves the accuracy of position prediction by leveraging diffusion modeling, ensuring precise and consistent placement of joints within the hierarchy. This formulation allows the autoregressive model to efficiently capture both spatial and hierarchical relationships within the skeleton. Trained end-to-end on both RigNet and Objaverse datasets, RigAnything demonstrates state-of-the-art performance across diverse object types, including humanoids, quadrupeds, marine creatures, insects, and many more, surpassing prior methods in quality, robustness, generalizability, and efficiency. It achieves significantly faster performance than existing auto-rigging methods, completing rigging in under a few seconds per shape. Please check our website for more details: https://www.liuisabella.com/RigAnything

RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets

TL;DR

Abstract

RigAnything: Template-Free Autoregressive Rigging for Diverse 3D Assets

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)