A Dataset for Mechanical Mechanisms
Farshid Ghezelbash, Amir Hossein Eskandari, Amir J Bidhendi
TL;DR
To accelerate mechanical mechanism design, the paper introduces a curated dataset of 8,994 image–description pairs, including both 2D and 3D sketches. It demonstrates the utility of the dataset by fine-tuning Stable Diffusion 1.6 to generate mechanism designs and BLIP-2 to caption them, highlighting both promise and current limitations. The results show that 3D sketches align more closely with textual prompts, while 2D sketches often lack coherent structure, and captions produced by BLIP-2 are imperfect due to limited training. The work demonstrates the potential of task-specific datasets to enable AI-assisted mechanical design and outlines concrete directions for expanding dataset size, improving models, and validating designs with domain experts.
Abstract
This study introduces a dataset consisting of approximately 9,000 images of mechanical mechanisms and their corresponding descriptions, aimed at supporting research in mechanism design. The dataset consists of a diverse collection of 2D and 3D sketches, meticulously curated to ensure relevance and quality. We demonstrate the application of this dataset by fine-tuning two models: 1) Stable Diffusion (for generating new mechanical designs), and 2) BLIP-2 (for captioning these designs). While the results from Stable Diffusion show promise, particularly in generating coherent 3D sketches, the model struggles with 2D sketches and occasionally produces nonsensical outputs. These limitations underscore the need for further development, particularly in expanding the dataset and refining model architectures. Nonetheless, this work serves as a step towards leveraging generative AI in mechanical design, highlighting both the potential and current limitations of these approaches.
