Skip to main content
Erschienen in: Neural Processing Letters 3/2023

22.08.2022

Deep Constraints Space of Medium Modality for RGB-Infrared Person Re-identification

verfasst von: Baojin Huang, Hao Chen, Wencheng Qin

Erschienen in: Neural Processing Letters | Ausgabe 3/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Reducing the gap between modalities is key to RGB-Infrared cross-modality person re-identification. In this paper, we propose an architecture based on the Deep Constrains Space of Medium Modality (DCSMM) for RGB-Infrared person re-identification. Specifically, a Medium Modality Network (MMN) is proposed to extract fused features of RGB and grayscale images, and we combine the fused features with infrared features for constraint. In addition, we also propose a loss function termed Domain Alignment and ID Consistency Loss (DAIC), which constrains the differences between the medium modality and the infrared modality as well as within single-modality in terms of instance level. Finally, in the high-level semantic stage, we also propose a Spatial Barycenter Margin Loss (SBM) based on each identity barycenter to constrain the feature space with different identities. The proposed method is validated on two large-scale datasets SYSU-MM01 and RegDB for cross-modality person re-identification, the results show that it achieves superior performance compared with the state-of-the-art methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
2.
Zurück zum Zitat Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based CNN with improved triplet loss function. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1335–1344 Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based CNN with improved triplet loss function. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1335–1344
3.
Zurück zum Zitat Yang J, Shen X, Tian X, Li H, Huang J, Hua X.-S (2018) Local convolutional neural networks for person re-identification. In: Proceedings of the 26th ACM international conference on multimedia, pp 1074–1082 Yang J, Shen X, Tian X, Li H, Huang J, Hua X.-S (2018) Local convolutional neural networks for person re-identification. In: Proceedings of the 26th ACM international conference on multimedia, pp 1074–1082
4.
Zurück zum Zitat Guo J, Yuan Y, Huang L, Zhang C, Yao J. -G, Han K (2019) Beyond human parts: dual part-aligned representations for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3642–3651 Guo J, Yuan Y, Huang L, Zhang C, Yao J. -G, Han K (2019) Beyond human parts: dual part-aligned representations for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3642–3651
5.
Zurück zum Zitat Zhong Z, Zheng L, Luo Z, Li S, Yang Y (2020) Learning to Adapt Invariance in Memory for Person Re-identification. IEEE Trans Pattern Anal Mach Intell 43(8):2723–2738 Zhong Z, Zheng L, Luo Z, Li S, Yang Y (2020) Learning to Adapt Invariance in Memory for Person Re-identification. IEEE Trans Pattern Anal Mach Intell 43(8):2723–2738
6.
Zurück zum Zitat Wei X, Li D, Hong X, Ke W, Gong Y (2020) Co-attentive lifting for infrared-visible person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 1028–1037 Wei X, Li D, Hong X, Ke W, Gong Y (2020) Co-attentive lifting for infrared-visible person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 1028–1037
7.
Zurück zum Zitat Gao Y, Liang T, Jin Y et al (2021) MSO: multi-feature space joint optimization network for RGB-infrared person re-identification. In: Proceedings of the 29th ACM international conference on multimedia, pp 5257–5265 Gao Y, Liang T, Jin Y et al (2021) MSO: multi-feature space joint optimization network for RGB-infrared person re-identification. In: Proceedings of the 29th ACM international conference on multimedia, pp 5257–5265
8.
Zurück zum Zitat Wu A, Zheng W. -S, Yu H. -X, Gong S, Lai J (2017) RGB-infrared cross-modality person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5380–5389 Wu A, Zheng W. -S, Yu H. -X, Gong S, Lai J (2017) RGB-infrared cross-modality person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5380–5389
9.
Zurück zum Zitat Hao Y, Wang N, Li J et al (2019) HSME: Hypersphere manifold embedding for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 8385–8392 Hao Y, Wang N, Li J et al (2019) HSME: Hypersphere manifold embedding for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 8385–8392
10.
Zurück zum Zitat Ye M, Lan X, Li J, Yuen P (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 7501–7508 Ye M, Lan X, Li J, Yuen P (2018) Hierarchical discriminative learning for visible thermal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 7501–7508
11.
Zurück zum Zitat Ye M, Lan X, Wang Z, Yuen PC (2020) Bi-Directional Center-Constrained Top-Ranking for Visible Thermal Person Re-Identification. IEEE Trans Inf Forensics Secur 15:407–419CrossRef Ye M, Lan X, Wang Z, Yuen PC (2020) Bi-Directional Center-Constrained Top-Ranking for Visible Thermal Person Re-Identification. IEEE Trans Inf Forensics Secur 15:407–419CrossRef
12.
Zurück zum Zitat Wang Z, Wang Z, Zheng Y, Chuang YY, Satoh SI (2019) Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 618–626 Wang Z, Wang Z, Zheng Y, Chuang YY, Satoh SI (2019) Learning to reduce dual-level discrepancy for infrared-visible person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 618–626
13.
Zurück zum Zitat Zhu Y, Yang Z, Wang L, Zhao S, Hu X, Tao D (2020) Hetero-Center loss for cross-modality person re-identification. Neurocomputing 386:97–109CrossRef Zhu Y, Yang Z, Wang L, Zhao S, Hu X, Tao D (2020) Hetero-Center loss for cross-modality person re-identification. Neurocomputing 386:97–109CrossRef
14.
Zurück zum Zitat Ye M, Shen J, J Crandall D, Shao L, Luo J (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: European conference on computer vision, pp 229–247 Ye M, Shen J, J Crandall D, Shao L, Luo J (2020) Dynamic dual-attentive aggregation learning for visible-infrared person re-identification. In: European conference on computer vision, pp 229–247
15.
Zurück zum Zitat Wang GA, Zhang T, Yang Y, Cheng J, Chang J, Liang X (2020) Cross-modality paired-images generation for RGB-infrared person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 12144–12151 Wang GA, Zhang T, Yang Y, Cheng J, Chang J, Liang X (2020) Cross-modality paired-images generation for RGB-infrared person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 12144–12151
16.
Zurück zum Zitat Feng Z, Lai J, Xie X (2019) Learning modality-specific representations for visible-infrared person re-identification. IEEE Trans Image Process 29:579–590MathSciNetCrossRefMATH Feng Z, Lai J, Xie X (2019) Learning modality-specific representations for visible-infrared person re-identification. IEEE Trans Image Process 29:579–590MathSciNetCrossRefMATH
17.
Zurück zum Zitat Alemu L T, Pelillo M, Shah M (2019) Deep constrained dominant sets for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9854–9863 Alemu L T, Pelillo M, Shah M (2019) Deep constrained dominant sets for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9854–9863
18.
Zurück zum Zitat Li W, Zhao R, Xiao T, Wang X (2014) DeepReID: Deep Filter Pairing Neural Network for Person Re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 152–159 Li W, Zhao R, Xiao T, Wang X (2014) DeepReID: Deep Filter Pairing Neural Network for Person Re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 152–159
19.
Zurück zum Zitat Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1249–1258 Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1249–1258
20.
Zurück zum Zitat Liu J, Ni B, Yan Y, Zhou P, Cheng S, Hu J (2018) Pose transferrable person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4099–4108 Liu J, Ni B, Yan Y, Zhou P, Cheng S, Hu J (2018) Pose transferrable person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4099–4108
21.
Zurück zum Zitat Wang G, Yuan Y, Li J, Ge S, Zhou X (2020) Receptive multi-granularity representation for person re-identification. IEEE Trans Image Process 29:6096–6109CrossRefMATH Wang G, Yuan Y, Li J, Ge S, Zhou X (2020) Receptive multi-granularity representation for person re-identification. IEEE Trans Image Process 29:6096–6109CrossRefMATH
22.
Zurück zum Zitat Miao J, Wu Y, Liu P, Ding Y, Yang Y (2019) Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 542–551 Miao J, Wu Y, Liu P, Ding Y, Yang Y (2019) Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 542–551
23.
Zurück zum Zitat Gao S, Wang J, Lu H, Liu Z (2020) Pose-guided visible part matching for occluded person ReID. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11741–11749 Gao S, Wang J, Lu H, Liu Z (2020) Pose-guided visible part matching for occluded person ReID. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11741–11749
24.
Zurück zum Zitat Sohn K (2016) Improved deep metric learning with multi-class N-pair loss objective. In: Proceedings of the 30th international conference on neural information processing systems (NIPS'16), pp 1857–1865 Sohn K (2016) Improved deep metric learning with multi-class N-pair loss objective. In: Proceedings of the 30th international conference on neural information processing systems (NIPS'16), pp 1857–1865
25.
Zurück zum Zitat Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: European conference on computer vision, pp 791–808 Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: European conference on computer vision, pp 791–808
26.
27.
Zurück zum Zitat Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1320–1329 Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1320–1329
28.
Zurück zum Zitat Xiao Q, Luo H, Zhang C (2017) Margin sample mining loss: a deep learning based method for person re-identification. Preprint arXiv:1710.00478 Xiao Q, Luo H, Zhang C (2017) Margin sample mining loss: a deep learning based method for person re-identification. Preprint arXiv:​1710.​00478
29.
Zurück zum Zitat Ling Y, Zhong Z, Luo Z, Rota P, Li S, Sebe N (2020) Class-aware modality mix and center-guided metric learning for visible-thermal person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 889–897 Ling Y, Zhong Z, Luo Z, Rota P, Li S, Sebe N (2020) Class-aware modality mix and center-guided metric learning for visible-thermal person re-identification. In: Proceedings of the 28th ACM international conference on multimedia, pp 889–897
30.
Zurück zum Zitat Li D, Wei X, Hong X, Gong Y (2020) Infrared-visible cross-modal person re-identification with an x modality. In: Proceedings of the AAAI conference on artificial intelligence, pp 4610–4617 Li D, Wei X, Hong X, Gong Y (2020) Infrared-visible cross-modal person re-identification with an x modality. In: Proceedings of the AAAI conference on artificial intelligence, pp 4610–4617
31.
Zurück zum Zitat Kansal K, Subramanyam AV, Wang Z, Satoh SI (2020) SDL: Spectrum-disentangled representation learning for visible-infrared person re-identification. IEEE Trans Circuits Syst Video Technol 30(10):3422–3432CrossRef Kansal K, Subramanyam AV, Wang Z, Satoh SI (2020) SDL: Spectrum-disentangled representation learning for visible-infrared person re-identification. IEEE Trans Circuits Syst Video Technol 30(10):3422–3432CrossRef
32.
Zurück zum Zitat Zhang Z, Jiang S, Huang C, Li Y, Da Xu RY (2020) RGB-IR cross-modality person ReID based on teacher-student GAN model. Pattern Recogn Lett 150:155–161CrossRef Zhang Z, Jiang S, Huang C, Li Y, Da Xu RY (2020) RGB-IR cross-modality person ReID based on teacher-student GAN model. Pattern Recogn Lett 150:155–161CrossRef
33.
Zurück zum Zitat Choi S, Lee S, Kim Y, Kim T, Kim C (2020) Hi-CMD: hierarchical cross-modality disentanglement for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10254–10263 Choi S, Lee S, Kim Y, Kim T, Kim C (2020) Hi-CMD: hierarchical cross-modality disentanglement for visible-infrared person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10254–10263
34.
Zurück zum Zitat Ye M, Lan X, Leng Q, Shen J (2020) Cross-modality person re-identification via modality aware collaborative ensemble learning. IEEE Trans Image Process 29:9387–9399CrossRefMATH Ye M, Lan X, Leng Q, Shen J (2020) Cross-modality person re-identification via modality aware collaborative ensemble learning. IEEE Trans Image Process 29:9387–9399CrossRefMATH
35.
Zurück zum Zitat Zhao YB, Lin JW, Xuan Q, Xi X (2019) HPILN: a feature learning framework for cross-modality person re-identification. IET Image Process 13(14):2897–2904CrossRef Zhao YB, Lin JW, Xuan Q, Xi X (2019) HPILN: a feature learning framework for cross-modality person re-identification. IET Image Process 13(14):2897–2904CrossRef
36.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
37.
Zurück zum Zitat Nguyen DT, Hong HG, Kim KW, Park KR (2017) Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors 17(3):605CrossRef Nguyen DT, Hong HG, Kim KW, Park KR (2017) Person recognition system based on a combination of body images from visible light and thermal cameras. Sensors 17(3):605CrossRef
38.
Zurück zum Zitat Liu H, Cheng J, Wang W, Su Y, Bai H (2020) Enhancing the discriminative feature learning for visible-thermal cross-modality person re-identification. Neurocomputing 398:11–19CrossRef Liu H, Cheng J, Wang W, Su Y, Bai H (2020) Enhancing the discriminative feature learning for visible-thermal cross-modality person re-identification. Neurocomputing 398:11–19CrossRef
39.
Zurück zum Zitat Wang G, Zhang T, Cheng J, Liu S, Yang Y, Hou Z (2019) RGB-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3622–3631 Wang G, Zhang T, Cheng J, Liu S, Yang Y, Hou Z (2019) RGB-infrared cross-modality person re-identification via joint pixel and feature alignment. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3622–3631
40.
Zurück zum Zitat Ye M, Shen J, Shao L (2020) Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans Inf Forensics Security 16:728–739CrossRef Ye M, Shen J, Shao L (2020) Visible-infrared person re-identification via homogeneous augmented tri-modal learning. IEEE Trans Inf Forensics Security 16:728–739CrossRef
41.
Zurück zum Zitat Park H, Lee S, Lee J, Ham B (2021) Learning by aligning: visible-infrared person re-identification using cross-modal correspondences. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 12046–12055. Park H, Lee S, Lee J, Ham B (2021) Learning by aligning: visible-infrared person re-identification using cross-modal correspondences. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 12046–12055.
42.
Zurück zum Zitat Fu C, Hu Y, Wu X, Mei T, He R (2021) CM-NAS: rethinking cross-modality neural architectures for visible-infrared person re-identification. Preprint arXiv:2101.08467 Fu C, Hu Y, Wu X, Mei T, He R (2021) CM-NAS: rethinking cross-modality neural architectures for visible-infrared person re-identification. Preprint arXiv:​2101.​08467
43.
44.
Zurück zum Zitat Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826 Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826
Metadaten
Titel
Deep Constraints Space of Medium Modality for RGB-Infrared Person Re-identification
verfasst von
Baojin Huang
Hao Chen
Wencheng Qin
Publikationsdatum
22.08.2022
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 3/2023
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-022-10995-3

Weitere Artikel der Ausgabe 3/2023

Neural Processing Letters 3/2023 Zur Ausgabe

Neuer Inhalt