Object-Aware Domain Generalization for Object Detection

Authors

  • Wooju Lee Urban Robotics Lab, School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Republic of Korea
  • Dasol Hong Urban Robotics Lab, School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Republic of Korea
  • Hyungtae Lim Urban Robotics Lab, School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Republic of Korea
  • Hyun Myung Urban Robotics Lab, School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Republic of Korea

DOI:

https://doi.org/10.1609/aaai.v38i4.28076

Keywords:

CV: Representation Learning for Vision, CV: Vision for Robotics & Autonomous Driving, CV: Object Detection & Categorization, ML: Transfer, Domain Adaptation, Multi-Task Learning

Abstract

Single-domain generalization (S-DG) aims to generalize a model to unseen environments with a single-source domain. However, most S-DG approaches have been conducted in the field of classification. When these approaches are applied to object detection, the semantic features of some objects can be damaged, which can lead to imprecise object localization and misclassification. To address these problems, we propose an object-aware domain generalization (OA-DG) method for single-domain generalization in object detection. Our method consists of data augmentation and training strategy, which are called OA-Mix and OA-Loss, respectively. OA-Mix generates multi-domain data with multi-level transformation and object-aware mixing strategy. OA-Loss enables models to learn domain-invariant representations for objects and backgrounds from the original and OA-Mixed images. Our proposed method outperforms state-of-the-art works on standard benchmarks. Our code is available at https://github.com/WoojuLee24/OA-DG.

Published

2024-03-24

How to Cite

Lee, W., Hong, D., Lim, H., & Myung, H. (2024). Object-Aware Domain Generalization for Object Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(4), 2947–2955. https://doi.org/10.1609/aaai.v38i4.28076

Issue

Section

AAAI Technical Track on Computer Vision III