TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment
Adnan Khan, Alireza Choubineh, Mai A. Shaaban, Abbas Akkasi, Majid Komeili
TL;DR
This work tackles the critical need for scalable tactile graphics for visually impaired learners by proposing TactileNet, a domain-specific dataset (1,029 tactile images across 66 classes) and an AI-driven pipeline that generates embossing-ready 2D tactile templates using text-to-image Stable Diffusion fine-tuned with LoRA and DreamBooth. The approach combines a 66-class adapter training strategy with prompt-based editing, enabling both text-only and image-conditioned generation, and is evaluated through a web-based, expert-driven protocol showing high adherence to tactile guidelines ($0.<q>$) and near-human structural fidelity (SSIM around $0.538$–$0.549$). Key findings show strong pose alignment and meaningful silhouette preservation, with generated outputs closely mirroring expert designs while enabling customization (e.g., removing logo or pocket details). The work demonstrates a practical, scalable path to AI-augmented tactile design that accelerates production without replacing human expertise, and commits to publicly releasing code, data, and models to foster further research in accessible education.
Abstract
Tactile graphics are essential for providing access to visual information for the 43 million people globally living with vision loss. Traditional methods for creating these graphics are labor-intensive and cannot meet growing demand. We introduce TactileNet, the first comprehensive dataset and AI-driven framework for generating embossing-ready 2D tactile templates using text-to-image Stable Diffusion (SD) models. By integrating Low-Rank Adaptation (LoRA) and DreamBooth, our method fine-tunes SD models to produce high-fidelity, guideline-compliant graphics while reducing computational costs. Quantitative evaluations with tactile experts show 92.86% adherence to accessibility standards. Structural fidelity analysis revealed near-human design similarity, with an SSIM of 0.538 between generated graphics and expert-designed tactile images. Notably, our method preserves object silhouettes better than human designs (SSIM = 0.259 vs. 0.215 for binary masks), addressing a key limitation of manual tactile abstraction. The framework scales to 32,000 images (7,050 high-quality) across 66 classes, with prompt editing enabling customizable outputs (e.g., adding or removing details). By automating the 2D template generation step-compatible with standard embossing workflows-TactileNet accelerates production while preserving design flexibility. This work demonstrates how AI can augment (not replace) human expertise to bridge the accessibility gap in education and beyond. Code, data, and models will be publicly released to foster further research.
