GIFARC: Synthetic Dataset for Leveraging Human-Intuitive Analogies to Elevate AI Reasoning
Woochang Sim, Hyunseok Ryu, Kyungmin Choi, Sungwon Han, Sundong Kim
TL;DR
GIFARC addresses the gap in abstract reasoning by introducing a large-scale, analogy-grounded ARC-style dataset synthesized from GIFs with explicit ground-truth analogies and executable ARC transformations. The authors employ a three-stage pipeline—visual abstraction from GIFs, task sketch formulation, and executable ARC-task generation via LLMs and VLMs with retrieval augmentation—to produce 10,000 tasks and over 100k input–output pairs. Empirical results show high generation fidelity and that analogy cues steer LLM reasoning toward human-like strategies, improving alignment with ground-truth analogies on ARC-style challenges. The work lays groundwork for analogy-informed AI reasoning with potential impact on education, scientific discovery, and decision support, while noting limitations related to single-GIF dependence and future expansion to videos and more diverse analogies.
Abstract
The Abstraction and Reasoning Corpus (ARC) poses a stringent test of general AI capabilities, requiring solvers to infer abstract patterns from only a handful of examples. Despite substantial progress in deep learning, state-of-the-art models still achieve accuracy rates of merely 40-55% on 2024 ARC Competition, indicative of a significant gap between their performance and human-level reasoning. In this work, we seek to bridge that gap by introducing an analogy-inspired ARC dataset, GIFARC. Leveraging large language models (LLMs) and vision-language models (VLMs), we synthesize new ARC-style tasks from a variety of GIF images that include analogies. Each new task is paired with ground-truth analogy, providing an explicit mapping between visual transformations and everyday concepts. By embedding robust human-intuitive analogies into ARC-style tasks, GIFARC guides AI agents to evaluate the task analogically before engaging in brute-force pattern search, thus efficiently reducing problem complexity and build a more concise and human-understandable solution. We empirically validate that guiding LLM with analogic approach with GIFARC affects task-solving approaches of LLMs to align with analogic approach of human.
