Learning a Deep Convolutional Network for Colorization in Monochrome-Color Dual-Lens System


  • Xuan Dong Beijing University
  • Weixin Li Beihang University
  • Xiaojie Wang Beijing University of Posts and Telecommunications
  • Yunhong Wang Beihang University




In the monochrome-color dual-lens system, the gray image captured by the monochrome camera has better quality than the color image from the color camera, but does not have color information. To get high-quality color images, it is desired to colorize the gray image with the color image as reference. Related works usually use hand-crafted methods to search for the best-matching pixel in the reference image for each pixel in the input gray image, and copy the color of the best-matching pixel as the result. We propose a novel deep convolution network to solve the colorization problem in an end-to-end way. Based on our observation that, for each pixel in the input image, there usually exist multiple pixels in the reference image that have the correct colors, our method performs weighted average of colors of the candidate pixels in the reference image to utilize more candidate pixels with correct colors. The weight values between pixels in the input image and the reference image are obtained by learning a weight volume using deep feature representations, where an attention operation is proposed to focus on more useful candidate pixels and a 3-D regulation is performed to learn with context information. In addition, to correct wrongly colorized pixels in occlusion regions, we propose a color residue joint learning module to correct the colorization result with the input gray image as guidance. We evaluate our method on the Scene Flow, Cityscapes, Middlebury, and Sintel datasets. Experimental results show that our method largely outperforms the state-of-the-art methods.




How to Cite

Dong, X., Li, W., Wang, X., & Wang, Y. (2019). Learning a Deep Convolutional Network for Colorization in Monochrome-Color Dual-Lens System. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 8255-8262. https://doi.org/10.1609/aaai.v33i01.33018255



AAAI Technical Track: Vision