GFT: Graph Feature Tuning for Efficient Point Cloud Analysis
Manish Dhakal, Venkat R. Dasari, Rajshekhar Sunderraman, Yi Ding
TL;DR
GFT introduces a point-cloud–specific PEFT that learns dynamic graph features from transformer tokens via EdgeConv and injects them sparsely into a pretrained Point Transformer. By combining task-specific prompts, a multi-layer EdgeConv feature pyramid, and selective cross-attention, GFT achieves substantial parameter savings (~0.7M trainable params) while maintaining competitive accuracy on real-world and synthetic datasets. Ablation studies validate the contributions of each component and demonstrate robust parameter-efficiency trade-offs, though pretraining on synthetic data and added inference costs remain limitations. The work highlights the potential of graph-based feature integration for efficient point-cloud analysis and suggests future directions such as LoRA-like latency-free adaptation and improved real-world pretraining.
Abstract
Parameter-efficient fine-tuning (PEFT) significantly reduces computational and memory costs by updating only a small subset of the model's parameters, enabling faster adaptation to new tasks with minimal loss in performance. Previous studies have introduced PEFTs tailored for point cloud data, as general approaches are suboptimal. To further reduce the number of trainable parameters, we propose a point-cloud-specific PEFT, termed Graph Features Tuning (GFT), which learns a dynamic graph from initial tokenized inputs of the transformer using a lightweight graph convolution network and passes these graph features to deeper layers via skip connections and efficient cross-attention modules. Extensive experiments on object classification and segmentation tasks show that GFT operates in the same domain, rivalling existing methods, while reducing the trainable parameters. Code is available at https://github.com/manishdhakal/GFT.
