skip to main content
research-article

Fully Unsupervised Person Re-Identification via Selective Contrastive Learning

Authors Info & Claims
Published:16 February 2022Publication History
Skip Abstract Section

Abstract

Person re-identification (ReID) aims at searching the same identity person among images captured by various cameras. Existing fully supervised person ReID methods usually suffer from poor generalization capability caused by domain gaps. Unsupervised person ReID has attracted a lot of attention recently, because it works without intensive manual annotation and thus shows great potential in adapting to new conditions. Representation learning plays a critical role in unsupervised person ReID. In this work, we propose a novel selective contrastive learning framework for fully unsupervised feature learning. Specifically, different from traditional contrastive learning strategies, we propose to use multiple positives and adaptively selected negatives for defining the contrastive loss, enabling to learn a feature embedding model with stronger identity discriminative representation. Moreover, we propose to jointly leverage global and local features to construct three dynamic memory banks, among which the global and local ones are used for pairwise similarity computation and the mixture memory bank are used for contrastive loss definition. Experimental results demonstrate the superiority of our method in unsupervised person ReID compared with the state of the art. Our code is available at https://github.com/pangbo1997/Unsup_ReID.git.

REFERENCES

  1. [1] Ye Mang, Shen Jianbing, Lin Gaojie, Xiang Tao, Shao Ling, and Hoi Steven C. H.. 2021. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence. Early access, January 26, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Wang Jingya, Zhu Xiatian, Gong Shaogang, and Li Wei. 2018. Transferable joint attribute-identity deep learning for unsupervised person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 22752284.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Deng Weijian, Zheng Liang, Ye Qixiang, Kang Guoliang, Yang Yi, and Jiao Jianbin. 2018. Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9941003.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Liu Jiawei, Zha Zheng-Jun, Chen Di, Hong Richang, and Wang Meng. 2019. Adaptive transfer network for cross-domain person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 72027211.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Ge Yixiao, Zhu Feng, Chen Dapeng, Zhao Rui, and Li Hongsheng. 2020. Self-paced contrastive learning with hybrid memory for domain adaptive object re-ID. In Advances in Neural Information Processing Systems.Google ScholarGoogle Scholar
  6. [6] Lin Yutian, Dong Xuanyi, Zheng Liang, Yan Yan, and Yang Yi. 2019. A bottom-up clustering approach to unsupervised person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 87388745. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Wang Dongkai and Zhang Shiliang. 2020. Unsupervised person re-identification via multi-label classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1098110990.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Lin Yutian, Xie Lingxi, Wu Yu, Yan Chenggang, and Tian Qi. 2020. Unsupervised person re-identification via softened similarity learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 33903399.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Chen Ting, Kornblith Simon, Norouzi Mohammad, and Hinton Geoffrey. 2020. A simple framework for contrastive learning of visual representations. arXiv preprint arXiv:2002.05709.Google ScholarGoogle Scholar
  10. [10] He Kaiming, Fan Haoqi, Wu Yuxin, Xie Saining, and Girshick Ross. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 97299738.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Sun Yifan, Zheng Liang, Yang Yi, Tian Qi, and Wang Shengjin. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European Conference on Computer Vision (ECCV’18). 480496.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Long J., Shelhamer E., and Darrell T.. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). 34313440.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Zhao Liming, Li Xi, Zhuang Yueting, and Wang Jingdong. 2017. Deeply-learned part-aligned representations for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 32193228.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Jin X., Lan C., Zeng W., Chen Z., and Zhang L.. 2020. Style normalization and restitution for generalizable person re-identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 31403149.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Wu Guile, Zhu Xiatian, and Gong Shaogang. 2020. Tracklet self-supervised learning for unsupervised person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence 34, 7 (2020), 1236212369.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Zhong Zhun, Zheng Liang, Li Shaozi, and Yang Yi. 2018. Generalizing a person retrieval model hetero-and homogeneously. In Proceedings of the European Conference on Computer Vision (ECCV’18). 172188.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Wu Yu, Lin Yutian, Dong Xuanyi, Yan Yan, and Yang Yi. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Wu Yu, Lin Yutian, Dong Xuanyi, Yan Yan, Bian Wei, and Yang Yi. 2019. Progressive learning for person re-identification with one example. IEEE Transactions on Image Processing. Early access, January 10, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Ye Mang, Lan Xiangyuan, and Yuen Pong C.. 2018. Robust anchor embedding for unsupervised video person re-identification in the wild. In Proceedings of the European Conference on Computer Vision (ECCV’18). 170186.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Liao Shengcai, Hu Yang, Zhu Xiangyu, and Li Stan Z.. 2015. Person re-identification by local maximal occurrence representation and metric learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 21972206.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Zheng Liang, Shen Liyue, Tian Lu, Wang Shengjin, Wang Jingdong, and Tian Qi. 2015. Scalable person re-identification: A benchmark. In Proceedings of the IEEE International Conference on Computer Vision. 11161124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Gidaris Spyros, Singh Praveer, and Komodakis Nikos. 2018. Unsupervised representation learning by predicting image rotations. In Proceedings of the International Conference on Learning Representations.Google ScholarGoogle Scholar
  23. [23] Wu Zhirong, Xiong Yuanjun, Yu Stella, and Lin Dahua. 2018. Unsupervised feature learning via non-parametric instance-level discrimination. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Zheng L., Shen L., Tian L., Wang S., Wang J., and Tian Q.. 2015. Scalable person re-identification: A benchmark. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV’15). 11161124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Ristani Ergys, Solera Francesco, Zou Roger, Cucchiara Rita, and Tomasi Carlo. 2016. Performance measures and a data set for multi-target, multi-camera tracking. In Proceedings of the European Computer Vision Workshop on Benchmarking Multi-Target Tracking.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Wei Longhui, Zhang Shiliang, Gao Wen, and Tian Qi. 2018. Person transfer GAN to bridge domain gap for person re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Wu Y., Lin Y., Dong X., Yan Y., Ouyang W., and Yang Y.. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 51775186.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Zheng Liang, Bie Zhi, Sun Yifan, Wang Jingdong, Su Chi, Wang Shengjin, and Tian Qi. 2016. MARS: A video benchmark for large-scale person re-identification. In Computer Vision—ECCV 2016, Leibe Bastian, Matas Jiri, Sebe Nicu, and Welling Max (Eds.). Springer International, Cham, Switzerland, 868884.Google ScholarGoogle Scholar
  29. [29] Ding Guodong, Khan Salman H., and Tang Zhenmin. 2019. Dispersion based clustering for unsupervised person re-identification. In Proceedings of the British Machine Vision Conference (BMVC’19). 264.Google ScholarGoogle Scholar
  30. [30] Zhong Zhun, Zheng Liang, Luo Zhiming, Li Shaozi, and Yang Yi. 2020. Invariance matters: Exemplar memory for domain adaptive person re-identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).Google ScholarGoogle Scholar
  31. [31] Fu Yang, Wei Yunchao, Wang Guanshuo, Zhou Yuqian, Shi Honghui, and Huang Thomas. 2019. Self-similarity grouping: A simple unsupervised cross domain adaptation approach for person re-identification. In Proceedings of the 2019 International Conference on Computer Vision (ICCV’19).Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Wang Dongkai and Zhang Shiliang. 2020. Unsupervised person re-identification via multi-label classification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 10981–10990.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Chen Yanbei, Zhu Xiatian, and Gong Shaogang. 2018. Deep association learning for unsupervised video person re-identification. arXiv preprint arXiv:1808.07301.Google ScholarGoogle Scholar
  34. [34] Wu Yu, Lin Yutian, Dong Xuanyi, Yan Yan, Ouyang Wanli, and Yang Yi. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 51775186.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Fully Unsupervised Person Re-Identification via Selective Contrastive Learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 2
      May 2022
      494 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3505207
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 February 2022
      • Accepted: 1 September 2021
      • Revised: 1 August 2021
      • Received: 1 July 2021
      Published in tomm Volume 18, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format