Data Distribution Distilled Generative Model for Generalized Zero-Shot Recognition
DOI:
https://doi.org/10.1609/aaai.v38i6.28381Keywords:
CV: Learning & Optimization for CV, CV: Representation Learning for Vision, ML: Deep Generative Models & AutoencodersAbstract
In the realm of Zero-Shot Learning (ZSL), we address biases in Generalized Zero-Shot Learning (GZSL) models, which favor seen data. To counter this, we introduce an end-to-end generative GZSL framework called D3GZSL. This framework respects seen and synthesized unseen data as in-distribution and out-of-distribution data, respectively, for a more balanced model. D3GZSL comprises two core modules: in-distribution dual space distillation (ID2SD) and out-of-distribution batch distillation (O2DBD). ID2SD aligns teacher-student outcomes in embedding and label spaces, enhancing learning coherence. O2DBD introduces low-dimensional out-of-distribution representations per batch sample, capturing shared structures between seen and un seen categories. Our approach demonstrates its effectiveness across established GZSL benchmarks, seamlessly integrating into mainstream generative frameworks. Extensive experiments consistently showcase that D3GZSL elevates the performance of existing generative GZSL methods, under scoring its potential to refine zero-shot learning practices. The code is available at: https://github.com/PJBQ/D3GZSL.gitDownloads
Published
2024-03-24
How to Cite
Wang, Y., Hong, M., Huangfu, L., & Huang, S. (2024). Data Distribution Distilled Generative Model for Generalized Zero-Shot Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 38(6), 5695–5703. https://doi.org/10.1609/aaai.v38i6.28381
Issue
Section
AAAI Technical Track on Computer Vision V