GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images

Authors

  • Chenglong Liu University of Chinese Academy of Sciences BNRist, Tsinghua University
  • Haoran Wei MEGVII Technology
  • Jinze Yang University of Chinese Academy of Sciences
  • Jintao Liu University of Chinese Academy of Sciences
  • Wenxi Li Shanghai Jiao Tong University
  • Yuchen Guo BNRist, Tsinghua University
  • Lu Fang BNRist, Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v38i9.28873

Keywords:

HAI: Applications, DMKM: Mining of Visual, Multimedia & Multimodal Data, HAI: Other Foundations of Human Computation & AI

Abstract

Performing person detection in super-high-resolution images has been a challenging task. For such a task, modern detectors, which usually encode a box using center and width/height, struggle with accuracy due to two factors: 1) Human characteristic: people come in various postures and the center with high freedom is difficult to capture robust visual pattern; 2) Image characteristic: due to vast scale diversity of input (gigapixel-level), distance regression (for width and height) is hard to pinpoint, especially for a person, with substantial scale, who is near the camera. To address these challenges, we propose GigaHumanDet, an innovative solution aimed at further enhancing detection accuracy for gigapixel-level images. GigaHumanDet employs the corner modeling method to avoid the potential issues of a high degree of freedom in center pinpointing. To better distinguish similar-looking persons and enforce instance consistency of corner pairs, an instance-guided learning approach is designed to capture discriminative individual semantics. Further, we devise reliable shape-aware bodyness equipped with a multi-precision strategy as the human corner matching guidance to be appropriately adapted to the single-view large scene. Experimental results on PANDA and STCrowd datasets show the superiority and strong applicability of our design. Notably, our model achieves 82.4% in term of AP, outperforming current state-of-the-arts by more than 10%.

Published

2024-03-24

How to Cite

Liu, C., Wei, H., Yang, J., Liu, J., Li, W., Guo, Y., & Fang, L. (2024). GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10092-10100. https://doi.org/10.1609/aaai.v38i9.28873

Issue

Section

AAAI Technical Track on Humans and AI