High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-identification

Authors

  • Liuxiang Qiu Xiamen University, China Xiamen University of Technology, China
  • Si Chen Xiamen University of Technology, China
  • Yan Yan Xiamen University, China
  • Jing-Hao Xue University College London, UK
  • Da-Han Wang Xiamen University of Technology, China
  • Shunzhi Zhu Xiamen University of Technology, China

DOI:

https://doi.org/10.1609/aaai.v38i5.28259

Keywords:

CV: Image and Video Retrieval, CV: Multi-modal Vision

Abstract

Visible-infrared person re-identification (VI-ReID) aims to retrieve images of the same persons captured by visible (VIS) and infrared (IR) cameras. Existing VI-ReID methods ignore high-order structure information of features while being relatively difficult to learn a reasonable common feature space due to the large modality discrepancy between VIS and IR images. To address the above problems, we propose a novel high-order structure based middle-feature learning network (HOS-Net) for effective VI-ReID. Specifically, we first leverage a short- and long-range feature extraction (SLE) module to effectively exploit both short-range and long-range features. Then, we propose a high-order structure learning (HSL) module to successfully model the high-order relationship across different local features of each person image based on a whitened hypergraph network. This greatly alleviates model collapse and enhances feature representations. Finally, we develop a common feature space learning (CFL) module to learn a discriminative and reasonable common feature space based on middle features generated by aligning features from different modalities and ranges. In particular, a modality-range identity-center contrastive (MRIC) loss is proposed to reduce the distances between the VIS, IR, and middle features, smoothing the training process. Extensive experiments on the SYSU-MM01, RegDB, and LLCM datasets show that our HOS-Net achieves superior state-of-the-art performance. Our code is available at https://github.com/Jaulaucoeng/HOS-Net.

Published

2024-03-24

How to Cite

Qiu, L., Chen, S., Yan, Y., Xue, J.-H., Wang, D.-H., & Zhu, S. (2024). High-Order Structure Based Middle-Feature Learning for Visible-Infrared Person Re-identification. Proceedings of the AAAI Conference on Artificial Intelligence, 38(5), 4596-4604. https://doi.org/10.1609/aaai.v38i5.28259

Issue

Section

AAAI Technical Track on Computer Vision IV