Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors

Authors

  • Xuelin Shen, Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)
  • Yitong Wang, College of Computer Science and Software Engineering, Shenzhen University; Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)
  • Silin Zheng, College of Computer Science and Software Engineering, Shenzhen University
  • Kang Xiao, Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ)
  • Wenhan Yang, Peng Cheng Laboratory
  • Xu Wang, College of Computer Science and Software Engineering, Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v39i7.32733

Abstract

In Omni-Directional Image (ODI) Super-Resolution (SR), a unique challenge arises from the non-uniform oversampling caused by EquiRectangular Projection (ERP). Considerable efforts in designing complex spherical convolutions or polyhedron reprojection have yielded significant performance improvements, but at the expense of cumbersome processing procedures and slower inference. To address this, this paper proposes a new ODI-SR model capable of Fast and Arbitrary-scale ODI-SR, denoted FAOR. The key innovation lies in adapting the implicit image function from the planar image domain to the ERP image domain by incorporating spherical geometric priors at both the latent representation and image reconstruction stages, with low overhead. Specifically, at the latent representation stage, we adopt a pair of pixel-wise and semantic-wise sphere-to-planar distortion maps to perform affine transformations on the latent representation, thereby endowing it with spherical properties. Moreover, at the image reconstruction stage, we introduce a geodesic-based resampling strategy that aligns the implicit image function with spherical geometry without introducing additional parameters. As a result, the proposed FAOR outperforms state-of-the-art ODI-SR models with much faster inference. Extensive experiments and ablation studies demonstrate the effectiveness of our design.
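
To make the non-uniform oversampling of ERP concrete, the following is a minimal sketch and not the authors' implementation. It assumes a standard equirectangular layout in which rows map linearly to latitude and columns to longitude on a unit sphere. All function names (erp_latitude, erp_longitude, horizontal_stretch, geodesic_distance) are hypothetical and chosen only for illustration. It shows two quantities related to the priors the abstract mentions: a per-row stretch factor of the kind a pixel-wise distortion map might encode, and the great-circle (geodesic) distance between two pixel centres.

```python
import math


def erp_latitude(row, height):
    """Latitude in radians of the centre of ERP pixel row `row` (top row is north)."""
    return (0.5 - (row + 0.5) / height) * math.pi


def erp_longitude(col, width):
    """Longitude in radians of the centre of ERP pixel column `col`."""
    return ((col + 0.5) / width - 0.5) * 2.0 * math.pi


def horizontal_stretch(row, height):
    """Horizontal oversampling of a row relative to the equator: 1 / cos(latitude)."""
    return 1.0 / math.cos(erp_latitude(row, height))


def geodesic_distance(p, q, height, width):
    """Great-circle distance on the unit sphere between pixel centres p and q.

    p and q are (row, col) tuples; the haversine formula is used for stability.
    """
    lat1, lon1 = erp_latitude(p[0], height), erp_longitude(p[1], width)
    lat2, lon2 = erp_latitude(q[0], height), erp_longitude(q[1], width)
    dlat, dlon = lat2 - lat1, lon2 - lon1
    a = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2.0 * math.asin(min(1.0, math.sqrt(a)))


if __name__ == "__main__":
    H, W = 512, 1024
    # Neighbouring pixels in the same row: far apart at the equator, close near a pole.
    print("equator:", geodesic_distance((256, 100), (256, 101), H, W))
    print("near pole:", geodesic_distance((0, 100), (0, 101), H, W))
    print("stretch at top row:", horizontal_stretch(0, H))
```

Two horizontally adjacent pixels are one pixel apart on the planar grid at every row, yet their geodesic distance shrinks towards the poles. This is the mismatch that a planar implicit image function would ignore unless spherical geometry is taken into account.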

Published

2025-04-11

How to Cite

Shen, X., Wang, Y., Zheng, S., Xiao, K., Yang, W., & Wang, X. (2025). Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 6833–6841. https://doi.org/10.1609/aaai.v39i7.32733

Section

AAAI Technical Track on Computer Vision VI