TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection

Authors

  • Jiankang Chen School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China Key Laboratory of Machine Intelligence and Advanced Computing, MOE, Guangzhou, China
  • Tong Zhang Peng Cheng Laboratory, Shenzhen, China
  • Wei-Shi Zheng School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China Key Laboratory of Machine Intelligence and Advanced Computing, MOE, Guangzhou, China
  • Ruixuan Wang School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China Peng Cheng Laboratory, Shenzhen, China Key Laboratory of Machine Intelligence and Advanced Computing, MOE, Guangzhou, China

DOI:

https://doi.org/10.1609/aaai.v38i2.27871

Keywords:

CV: Object Detection & Categorization, CV: Language and Vision, CV: Other Foundations of Computer Vision

Abstract

Out-of-distribution (OOD) detection is crucial in many real-world applications. However, intelligent models are often trained solely on in-distribution (ID) data, leading to overconfidence when misclassifying OOD data as ID classes. In this study, we propose a new learning framework which leverage simple Jigsaw-based fake OOD data and rich semantic embeddings (`anchors') from the ChatGPT description of ID knowledge to help guide the training of the image encoder. The learning framework can be flexibly combined with existing post-hoc approaches to OOD detection, and extensive empirical evaluations on multiple OOD detection benchmarks demonstrate that rich textual representation of ID knowledge and fake OOD knowledge can well help train a visual encoder for OOD detection. With the learning framework, new state-of-the-art performance was achieved on all the benchmarks. The code is available at https://github.com/Cverchen/TagFog.

Published

2024-03-24

How to Cite

Chen, J., Zhang, T., Zheng, W.-S., & Wang, R. (2024). TagFog: Textual Anchor Guidance and Fake Outlier Generation for Visual Out-of-Distribution Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(2), 1100-1109. https://doi.org/10.1609/aaai.v38i2.27871

Issue

Section

AAAI Technical Track on Computer Vision I