PBECount: Prompt-Before-Extract Paradigm for Class-Agnostic Counting

Authors

  • Canchen Yang Sichuan University
  • Tianyu Geng Sichuan University
  • Jian Peng Sichuan University
  • Chun Xu Sichuan University

DOI:

https://doi.org/10.1609/aaai.v39i9.32989

Abstract

In the field of class-agnostic counting (CAC), counting only objects of interest that are similar to exemplars in multi-class scenarios has been a challenging task. To address this challenge, recent research has proposed the extract-and-match paradigm based on the vision transformer (ViT) architecture. However, although this paradigm can improve the accuracy of exemplar-similar object identification, it overly emphasizes the role of the ViT structure. To address this shortcoming, this work introduces a more generalized prompt-before-extract paradigm on top of the extract-and-match paradigm and designs a pure convolutional neural network (CNN) model named PBECount. In addition, an innovative loss function, a post-processing strategy, and a dynamic threshold method are proposed to enhance the detection performance of the proposed model when the probability maps are used as ground truth during model training. The experimental results on the FSC-147 and CARPK datasets demonstrate that the proposed PBECount can identify whether unknown class objects are similar to exemplars and outperform the state-of-the-art CAC methods in terms of accuracy and generalization.

Published

2025-04-11

How to Cite

Yang, C., Geng, T., Peng, J., & Xu, C. (2025). PBECount: Prompt-Before-Extract Paradigm for Class-Agnostic Counting. Proceedings of the AAAI Conference on Artificial Intelligence, 39(9), 9139–9147. https://doi.org/10.1609/aaai.v39i9.32989

Issue

Section

AAAI Technical Track on Computer Vision VIII