F2SST: Frequency-to-Spatial Semantic Transfer for Few-Shot Image Classification
DOI:
https://doi.org/10.1609/aaai.v40i24.39123Abstract
Few-shot image classification (FSIC) aims to recognize novel categories from only a few labeled examples, making it inherently challenging under limited supervision. Existing approaches have attempted to alleviate this issue by incorporating explicit semantics like class names or knowledge graphs to guide learning. However, such methods often encounter semantic ambiguity due to their dependence on either overly simplistic semantic priors or resource-intensive external knowledge sources, which limits their potential. In this paper, we explore the frequency domain as an implicit and task-adaptive source of semantic information. We propose F2SST, a Frequency-to-Spatial Semantic Transfer framework that enhances feature learning by leveraging spectral signals as hidden semantics. Specifically, F2SST applies Fast Fourier Transform (FFT) to extract phase-invariant global frequency descriptors, followed by a lightweight Gated Spectral Attention (GSA) module that selectively emphasizes class-relevant frequency components. These enhanced spectral cues are then integrated into the spatial stream through a class-guided fusion mechanism, enabling more robust and semantically aligned representations. Extensive experiments on four standard benchmarks (miniImageNet, tieredImageNet, CIFAR-FS and FC100) demonstrate that F2SST consistently improves performance, validating the effectiveness of frequency-domain semantics in FSIC.Published
2026-03-14
How to Cite
Chen, X., Wang, B., Fan, J., Zhang, L., & Li, F. (2026). F2SST: Frequency-to-Spatial Semantic Transfer for Few-Shot Image Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 40(24), 20345-20353. https://doi.org/10.1609/aaai.v40i24.39123
Issue
Section
AAAI Technical Track on Machine Learning I