RL-SeqISP: Reinforcement Learning-Based Sequential Optimization for Image Signal Processing

Xinyu Sun; Zhikun Zhao; Lili Wei; Congyan Lang; Mingxuan Cai; Longfei Han; Juan Wang; Bing Li; Yuxuan Guo

doi:10.1609/aaai.v38i5.28307

Authors

Xinyu Sun Key Laboratory of Big Data & Artificial Intelligence in Transportation (Ministry of Education), School of Computer and Information Technology, Beijing Jiaotong University Institute of Automation, Chinese Academy of Sciences
Zhikun Zhao Key Laboratory of Big Data & Artificial Intelligence in Transportation (Ministry of Education), School of Computer and Information Technology, Beijing Jiaotong University Institute of Automation, Chinese Academy of Sciences
Lili Wei Key Laboratory of Big Data & Artificial Intelligence in Transportation (Ministry of Education), School of Computer and Information Technology, Beijing Jiaotong University
Congyan Lang Key Laboratory of Big Data & Artificial Intelligence in Transportation (Ministry of Education), School of Computer and Information Technology, Beijing Jiaotong University
Mingxuan Cai Shanghai Jiaotong University
Longfei Han Beijing Technology and Business University
Juan Wang Institute of Automation, Chinese Academy of Sciences
Bing Li Institute of Automation, Chinese Academy of Sciences PeopleAI Inc. Beijing, China
Yuxuan Guo Shenzhen Heytap Technology Corp., Ltd

DOI:

https://doi.org/10.1609/aaai.v38i5.28307

Keywords:

CV: Low Level & Physics-based Vision, ML: Reinforcement Learning

Abstract

Hardware image signal processing (ISP), aiming at converting RAW inputs to RGB images, consists of a series of processing blocks, each with multiple parameters. Traditionally, ISP parameters are manually tuned in isolation by imaging experts according to application-specific quality and performance metrics, which is time-consuming and biased towards human perception due to complex interaction with the output image. Since the relationship between any single parameter’s variation and the output performance metric is a complex, non-linear function, optimizing such a large number of ISP parameters is challenging. To address this challenge, we propose a novel Sequential ISP parameter optimization model, called the RL-SeqISP model, which utilizes deep reinforcement learning to jointly optimize all ISP parameters for a variety of imaging applications. Concretely, inspired by the sequential tuning process of human experts, the proposed model can progressively enhance image quality by seamlessly integrating information from both the image feature space and the parameter space. Furthermore, a dynamic parameter optimization module is introduced to avoid ISP parameters getting stuck into local optima, which is able to more effectively guarantee the optimal parameters resulting from the sequential learning strategy. These merits of the RL-SeqISP model as well as its high efficiency are substantiated by comprehensive experiments on a wide range of downstream tasks, including two visual analysis tasks (instance segmentation and object detection), and image quality assessment (IQA), as compared with representative methods both quantitatively and qualitatively. In particular, even using only 10% of the training data, our model outperforms other SOTA methods by an average of 7% mAP on two visual analysis tasks.

RL-SeqISP: Reinforcement Learning-Based Sequential Optimization for Image Signal Processing

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information