MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning

Xu Han; Yuan Tang; Jinfeng Xu; Xianzhi Li

MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning

Xu Han, Yuan Tang, Jinfeng Xu, Xianzhi Li

TL;DR

MoST presents a reparameterization-based 3D PEFT framework that uses Point Monarch to replace dense update matrices with sparse, locally-aware transformations, preserving inference efficiency while boosting representation learning on irregular 3D point clouds. By introducing K-Rectify based local fusion and a parameter-free multi-layer feature fusion strategy, MoST achieves state-of-the-art results across object- and scene-level tasks with only a small fraction of trainable parameters. It showcases strong generalization across diverse backbones, compatibility with matrix decompositions for further compression, and substantial performance gains over full fine-tuning in many benchmarks, signaling a practical path for efficient large-scale 3D model tuning. The work highlights the importance of local geometric feature capture in 3D PEFT and provides a flexible, hardware-friendly approach that can adapt to various architectures and tasks.

Abstract

We introduce Monarch Sparse Tuning (MoST), the first reparameterization-based parameter-efficient fine-tuning (PEFT) method tailored for 3D representation learning. Unlike existing adapter-based and prompt-tuning 3D PEFT methods, MoST introduces no additional inference overhead and is compatible with many 3D representation learning backbones. At its core, we present a new family of structured matrices for 3D point clouds, Point Monarch, which can capture local geometric features of irregular points while offering high expressiveness. MoST reparameterizes the dense update weight matrices as our sparse Point Monarch matrices, significantly reducing parameters while retaining strong performance. Experiments on various backbones show that MoST is simple, effective, and highly generalizable. It captures local features in point clouds, achieving state-of-the-art results on multiple benchmarks, e.g., 97.5% acc. on ScanObjectNN (PB_50_RS) and 96.2% on ModelNet40 classification, while it can also combine with other matrix decompositions (e.g., Low-rank, Kronecker) to further reduce parameters.

MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning

TL;DR

Abstract

MoST: Efficient Monarch Sparse Tuning for 3D Representation Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)