摘要
A key challenge of visible-infrared person re-identification (VI-ReID) comes from the modality difference between visible and infrared images, which further causes large intra-person and small inter-person distances. Most existing methods design feature extractors and loss functions to bridge the modality gap. However, the unpaired-images constrain the VI-ReID model's ability to learn instance-level alignment features. Different from these methods, in this paper, we propose a pose attention-guided paired-images generation network (PAPG) from the standpoint of data augmentation. PAPG can generate cross-modality paired-images with shape and appearance consistency with the real image to perform instance-level feature alignment by minimizing the distances of every pair of images. Furthermore, our method alleviates data insufficient and reduces the risk of VI-ReID model overfitting. Comprehensive experiments conducted on two publicly available datasets validate the effectiveness and generalizability of PAPG. Especially, on the SYSU-MM01 dataset, our method accomplishes 7.76% and 5.87% gains in Rank-1 and mAP.
原文 | English |
---|---|
頁(從 - 到) | 346-350 |
頁數 | 5 |
期刊 | IEEE Signal Processing Letters |
卷 | 31 |
DOIs | |
出版狀態 | Published - 2024 |