Creative synthesis of kinematic mechanisms
Jiong Lin, Jialong Ning, Judah Goldfeder, Hod Lipson
TL;DR
This work reframes planar kinematic synthesis as a cross-domain image-generation problem, proposing a shared-latent VAE that jointly models curves and mechanism images to enable bidirectional synthesis and analysis. A new dataset of paired RGB images for $1$-DOF planar linkages, including complex multi-loop mechanisms like Jansen’s, supports scalable learning across simple and complex structures. Empirical results on three dataset families demonstrate that image-based representations can unify trajectory-to-mechanism and mechanism-to-trajectory generation, with ViT-based decoders achieving higher fidelity and color-augmented inputs enhancing performance. The approach lays groundwork for data-driven mechanism design and robotics, highlighting both practical potential and avenues for improvement through larger datasets and richer representations (e.g., video).
Abstract
In this paper, we formulate the problem of kinematic synthesis for planar linkages as a cross-domain image generation task. We develop a planar linkages dataset using RGB image representations, covering a range of mechanisms: from simple types such as crank-rocker and crank-slider to more complex eight-bar linkages like Jansen's mechanism. A shared-latent variational autoencoder (VAE) is employed to explore the potential of image generative models for synthesizing unseen motion curves and simulating novel kinematics. By encoding the drawing speed of trajectory points as color gradients, the same architecture also supports kinematic synthesis conditioned on both trajectory shape and velocity profiles. We validate our method on three datasets of increasing complexity: a standard four-bar linkage set, a mixed set of four-bar and crank-slider mechanisms, and a complex set including multi-loop mechanisms. Preliminary results demonstrate the effectiveness of image-based representations for generative mechanical design, showing that mechanisms with revolute and prismatic joints, and potentially cams and gears, can be represented and synthesized within a unified image generation framework.
