Decompose and Attribute: Boosting Generalizable Open-Set Object Detection via Objectness Score

Authors

  • Yuxuan Yuan Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, China Institute of Artificial Intelligence, Xiamen University, China
  • Lichen Wei Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, China Institute of Artificial Intelligence, Xiamen University, China
  • Luyao Tang University of Hong Kong
  • Chaoqi Chen Shenzhen University
  • Zheyuan Cai Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, China School of Informatics, Xiamen University, China
  • Yue Huang School of Informatics, Xiamen University, China The National Key Laboratory of Infrared Detection Technologies
  • Xinghao Ding Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University, China School of Informatics, Xiamen University, China

DOI:

https://doi.org/10.1609/aaai.v40i15.38220

Abstract

Open-set object detection (OSOD) aims to recognize known object categories while localizing previously unseen instances. However, real-world scenarios often involve co-occurring domain shifts and novel object categories. Existing OSOD methods typically overlook domain shifts, relying on source-trained representations that entangle domain-specific style with semantic content, thereby hindering generalization to both unseen domains and novel categories. To address this challenge, we propose a unified framework, termed DecOmpose and ATtribute (DOAT), which disentangles domain-specific style from semantic structure, thereby facilitating generalizable object detection. DOAT employs wavelet-based feature decomposition to separate style information from high-frequency structural details, thus enabling an explicit separation of domain and category shifts. To account for domain shift, the low-frequency components are perturbed within a style subspace to simulate diverse domain appearances. For unknown object discovery, the high-frequency components are utilized to estimate objectness scores via an attribution mechanism that fuses wavelet energy with semantic distance to known-category prototypes. Extensive experiments on standard open-set benchmarks have demonstrated the superior generalization performance of DOAT.

Downloads

Published

2026-03-14

How to Cite

Yuan, Y., Wei, L., Tang, L., Chen, C., Cai, Z., Huang, Y., & Ding, X. (2026). Decompose and Attribute: Boosting Generalizable Open-Set Object Detection via Objectness Score. Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12286–12294. https://doi.org/10.1609/aaai.v40i15.38220

Issue

Section

AAAI Technical Track on Computer Vision XII