Visual Redundancy Removal for Composite Images: A Benchmark Dataset and a Multi-Visual-Effects Driven Incremental Method

Authors

  • Miaohui Wang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, China
  • Rong Zhang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, China
  • Lirong Huang Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, China
  • Yanshan Li Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, China

DOI:

https://doi.org/10.1609/aaai.v38i9.28884

Keywords:

HAI: Applications, HAI: Learning Human Values and Preferences, HAI: Other Foundations of Human Computation & AI

Abstract

Composite images (CIs) typically combine various elements from different scenes, views, and styles, which are a very important information carrier in the era of mixed media such as virtual reality, mixed reality, metaverse, etc. However, the complexity of CI content presents a significant challenge for subsequent visual perception modeling and compression. In addition, the lack of benchmark CI databases also hinders the use of recent advanced data-driven methods. To address these challenges, we first establish one of the earliest visual redundancy prediction (VRP) databases for CIs. Moreover, we propose a multi-visual effect (MVE)-driven incremental learning method that combines the strengths of hand-crafted and data-driven approaches to achieve more accurate VRP modeling. Specifically, we design special incremental rules to learn the visual knowledge flow of MVE. To effectively capture the associated features of MVE, we further develop a three-stage incremental learning approach for VRP based on an encoder-decoder network. Extensive experimental results validate the superiority of the proposed method in terms of subjective, objective, and compression experiments.

Published

2024-03-24

How to Cite

Wang, M., Zhang, R., Huang, L., & Li, Y. (2024). Visual Redundancy Removal for Composite Images: A Benchmark Dataset and a Multi-Visual-Effects Driven Incremental Method. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10189-10197. https://doi.org/10.1609/aaai.v38i9.28884

Issue

Section

AAAI Technical Track on Humans and AI