Siamese-Discriminant Deep Reinforcement Learning for Solving Jigsaw Puzzles with Large Eroded Gaps

Authors

  • Xingke Song University of Nottingham Ningbo China
  • Jiahuan Jin University of Nottingham Ningbo China
  • Chenglin Yao University of Nottingham Ningbo China
  • Shihe Wang University of Nottingham Ningbo China
  • Jianfeng Ren University of Nottingham Ningbo China
  • Ruibin Bai University of Nottingham Ningbo China

DOI:

https://doi.org/10.1609/aaai.v37i2.25325

Keywords:

CV: Visual Reasoning & Symbolic Representations, CV: Learning & Optimization for CV, ML: Deep Neural Network Algorithms, SO: Other Foundations of Search & Optimization

Abstract

Jigsaw puzzle solving has recently become an emerging research area. The developed techniques have been widely used in applications beyond puzzle solving. This paper focuses on solving Jigsaw Puzzles with Large Eroded Gaps (JPwLEG). We formulate the puzzle reassembly as a combinatorial optimization problem and propose a Siamese-Discriminant Deep Reinforcement Learning (SD2RL) to solve it. A Deep Q-network (DQN) is designed to visually understand the puzzles, which consists of two sets of Siamese Discriminant Networks, one set to perceive the pairwise relations between vertical neighbors and another set for horizontal neighbors. The proposed DQN considers not only the evidence from the incumbent fragment but also the support from its four neighbors. The DQN is trained using replay experience with carefully designed rewards to guide the search for a sequence of fragment swaps to reach the correct puzzle solution. Two JPwLEG datasets are constructed to evaluate the proposed method, and the experimental results show that the proposed SD2RL significantly outperforms state-of-the-art methods.

Downloads

Published

2023-06-26

How to Cite

Song, X., Jin, J., Yao, C., Wang, S., Ren, J., & Bai, R. (2023). Siamese-Discriminant Deep Reinforcement Learning for Solving Jigsaw Puzzles with Large Eroded Gaps. Proceedings of the AAAI Conference on Artificial Intelligence, 37(2), 2303-2311. https://doi.org/10.1609/aaai.v37i2.25325

Issue

Section

AAAI Technical Track on Computer Vision II