ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps

Xingke Song; Xiaoying Yang; Chenglin Yao; Jianfeng Ren; Ruibin Bai; Xin Chen; Xudong Jiang

TL;DR

This work tackles large-scale jigsaw puzzles whose fragments are separated by eroded gaps, using Evolutionary Reinforcement Learning with Multi-head Puzzle Perception (ERL-MPP). A Multi-head Puzzle Perception Network (MPPN) perceives the puzzle status through a shared encoder: multiple puzzlet heads assess the local assembly status, and a discriminator head assesses the puzzle globally. An EvoRL agent built on an actor-critic-evaluator framework then explores a large action space that includes Swap-2, Swap-3, and Swap-Puzzlet actions. The authors report significant improvements over state-of-the-art models on the JPLEG-5 and MIT datasets, indicating that the perception network copes with gaps and that the agent can optimize over a large action space. The results suggest potential use in artifact reconstruction and other combinatorial assembly tasks with eroded information.

Abstract

Solving jigsaw puzzles has been extensively studied. While most existing models focus on solving either small-scale puzzles or puzzles with no gap between fragments, solving large-scale puzzles with gaps presents distinctive challenges in both image understanding and combinatorial optimization. To tackle these challenges, we propose a framework of Evolutionary Reinforcement Learning with Multi-head Puzzle Perception (ERL-MPP) to derive a better set of swapping actions for solving the puzzles. Specifically, to tackle the challenges of perceiving the puzzle with gaps, a Multi-head Puzzle Perception Network (MPPN) with a shared encoder is designed, where multiple puzzlet heads comprehensively perceive the local assembly status, and a discriminator head provides a global assessment of the puzzle. To explore the large swapping action space efficiently, an Evolutionary Reinforcement Learning (EvoRL) agent is designed, where an actor recommends a set of suitable swapping actions from a large action space based on the perceived puzzle status, a critic updates the actor using the estimated rewards and the puzzle status, and an evaluator coupled with evolutionary strategies evolves the actions aligning with the historical assembly experience. The proposed ERL-MPP is comprehensively evaluated on the JPLEG-5 dataset with large gaps and the MIT dataset with large-scale puzzles. It significantly outperforms all state-of-the-art models on both datasets.
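The propose-then-select loop described in the abstract can be illustrated with a toy sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the function names (`swap2`, `score`, `propose`, `solve`) are hypothetical, a uniform random sampler stands in for the learned actor, a ground-truth count of correctly placed pieces stands in for the learned evaluator, and plain best-of-k selection stands in for the evolutionary strategy. Only pairwise swaps are shown.

```python
import random
from itertools import combinations


def swap2(perm, i, j):
    """Return a copy of perm with the pieces at slots i and j exchanged."""
    out = list(perm)
    out[i], out[j] = out[j], out[i]
    return out


def score(perm):
    """Toy evaluator (assumption): count pieces already in their own slot.

    A real evaluator has no access to the ground truth; this oracle only
    keeps the sketch self-contained.
    """
    return sum(1 for slot, piece in enumerate(perm) if slot == piece)


def propose(perm, k, rng):
    """Stand-in for the actor: sample up to k candidate pairwise swaps."""
    pairs = list(combinations(range(len(perm)), 2))
    return rng.sample(pairs, min(k, len(pairs)))


def solve(perm, steps=200, k=8, seed=0):
    """Repeatedly propose k swaps, keep the best one if it does not hurt."""
    rng = random.Random(seed)
    current = list(perm)
    for _ in range(steps):
        if score(current) == len(current):
            break  # every piece is in place
        candidates = propose(current, k, rng)
        best = max(candidates, key=lambda a: score(swap2(current, *a)))
        nxt = swap2(current, *best)
        if score(nxt) >= score(current):
            current = nxt
    return current
```

Calling `solve([3, 0, 4, 1, 2])` returns a rearrangement of the same five pieces whose score is at least that of the input. The sketch only shows why narrowing a large action space to a small candidate set before evaluation keeps each step cheap.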
