ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image

Dongyu Luo; Kelin Yu; Amir-Hossein Shahidzadeh; Cornelia Fermüller; Yiannis Aloimonos; Ruohan Gao

ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image

Dongyu Luo, Kelin Yu, Amir-Hossein Shahidzadeh, Cornelia Fermüller, Yiannis Aloimonos, Ruohan Gao

TL;DR

This work tackles the high cost and limited transferability of vision-based tactile data by introducing ControlTac, a two-stage diffusion-based framework that generates realistic tactile images conditioned on a single reference image, a target force $ΔF$, and a target contact position. The first stage builds a force-controlled generation via a diffusion transformer, and the second stage refines the output with a ControlNet conditioned on a contact mask to incorporate position control, yielding physically plausible, diverse tactile images from minimal data. Empirically, ControlTac improves data augmentation for force estimation, pose estimation, and object classification, and enables robust real-world deployment including precise object insertion with high success rates. The modular design supports extending conditioning to additional tactile priors and demonstrates significant practical impact for scalable tactile datasets and robotics manipulation.

Abstract

Vision-based tactile sensing has been widely used in perception, reconstruction, and robotic manipulation. However, collecting large-scale tactile data remains costly due to the localized nature of sensor-object interactions and inconsistencies across sensor instances. Existing approaches to scaling tactile data, such as simulation and free-form tactile generation, often suffer from unrealistic output and poor transferability to downstream tasks. To address this, we propose ControlTac, a two-stage controllable framework that generates realistic tactile images conditioned on a single reference tactile image, contact force, and contact position. With those physical priors as control input, ControlTac generates physically plausible and varied tactile images that can be used for effective data augmentation. Through experiments on three downstream tasks, we demonstrate that ControlTac can effectively augment tactile datasets and lead to consistent gains. Our three real-world experiments further validate the practical utility of our approach. Project page: https://dongyuluo.github.io/controltac.

ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image

TL;DR

Abstract

ControlTac: Force- and Position-Controlled Tactile Data Augmentation with a Single Reference Image

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)