Learning Heterogeneous Dictionary Pair with Feature Projection Matrix for Pedestrian Video Retrieval via Single Query Image

Xiaoke Zhu; Xiao-Yuan Jing; Fei Wu; Yunhong Wang; Wangmeng Zuo; Wei-Shi Zheng

doi:10.1609/aaai.v31i1.11221

Authors

Xiaoke Zhu Wuhan University
Xiao-Yuan Jing Wuhan University
Fei Wu Nanjing University of Posts and Telecommunications
Yunhong Wang Beihang University
Wangmeng Zuo Harbin Institute of Technology
Wei-Shi Zheng Sun Yat-sen University

DOI:

https://doi.org/10.1609/aaai.v31i1.11221

Keywords:

Image to video person re-identification, Feature projection matrix and heterogeneous dictionary pair learning (PHDL), Point-to-set coefficient discriminant term

Abstract

Person re-identification (re-id) plays an important role in video surveillance and forensics applications. In many cases, person re-id needs to be conducted between image and video clip, e.g., re-identifying a suspect from large quantities of pedestrian videos given a single image of him. We call re-id in this scenario as image to video person re-id (IVPR). In practice, image and video are usually represented with different features, and there usually exist large variations between frames within each video. These factors make matching between image and video become a very challenging task. In this paper, we propose a joint feature projection matrix and heterogeneous dictionary pair learning (PHDL) approach for IVPR. Specifically, PHDL jointly learns an intra-video projection matrix and a pair of heterogeneous image and video dictionaries. With the learned projection matrix, the influence of variations within each video to the matching can be reduced. With the learned dictionary pair, the heterogeneous image and video features can be transformed into coding coefficients with the same dimension, such that the matching can be conducted using coding coefficients. Furthermore, to ensure that the obtained coding coefficients have favorable discriminability, PHDL designs a point-to-set coefficient discriminant term. Experiments on the public iLIDS-VID and PRID 2011 datasets demonstrate the effectiveness of the proposed approach.

Learning Heterogeneous Dictionary Pair with Feature Projection Matrix for Pedestrian Video Retrieval via Single Query Image

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information