VITA: A Multi-Source Vicinal Transfer Augmentation Method for Out-of-Distribution Generalization

Authors

  • Minghui Chen Department of Computer Science and Engineering, Southern University of Science and Technology
  • Cheng Wen The University of Sydney
  • Feng Zheng Department of Computer Science and Engineering, Southern University of Science and Technology
  • Fengxiang He JD Explore Academy, JD.com Inc
  • Ling Shao National Center for Artificial Intelligence, Saudi Data and Artificial Intelligence Authority, Riyadh, Saudi Arabia

DOI:

https://doi.org/10.1609/aaai.v36i1.19908

Keywords:

Computer Vision (CV)

Abstract

Invariance to diverse types of image corruption, such as noise, blurring, or colour shifts, is essential to establish robust models in computer vision. Data augmentation has been the major approach in improving the robustness against common corruptions. However, the samples produced by popular augmentation strategies deviate significantly from the underlying data manifold. As a result, performance is skewed toward certain types of corruption. To address this issue, we propose a multi-source vicinal transfer augmentation (VITA) method for generating diverse on-manifold samples. The proposed VITA consists of two complementary parts: tangent transfer and integration of multi-source vicinal samples. The tangent transfer creates initial augmented samples for improving corruption robustness. The integration employs a generative model to characterize the underlying manifold built by vicinal samples, facilitating the generation of on-manifold samples. Our proposed VITA significantly outperforms the current state-of-the-art augmentation methods, demonstrated in extensive experiments on corruption benchmarks.

Downloads

Published

2022-06-28

How to Cite

Chen, M., Wen, C., Zheng, F., He, F., & Shao, L. (2022). VITA: A Multi-Source Vicinal Transfer Augmentation Method for Out-of-Distribution Generalization. Proceedings of the AAAI Conference on Artificial Intelligence, 36(1), 321-329. https://doi.org/10.1609/aaai.v36i1.19908

Issue

Section

AAAI Technical Track on Computer Vision I