skip to main content
10.1145/3428658.3430967acmconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
research-article

A Cluster-Matching-Based Method for Video Face Recognition

Published:30 November 2020Publication History

ABSTRACT

Face recognition systems are present in many modern solutions and thousands of applications in our daily lives. However, current solutions are not easily scalable, especially when it comes to the addition of new targeted people. We propose a cluster-matching-based approach for face recognition in video. In our approach, we use unsupervised learning to cluster the faces present in both the dataset and targeted videos selected for face recognition. Moreover, we design a cluster matching heuristic to associate clusters in both sets that is also capable of identifying when a face belongs to a non-registered person. Our method has achieved a recall of 99.435% and a precision of 99.131% in the task of video face recognition. Besides performing face recognition, it can also be used to determine the video segments where each person is present.

References

  1. Y. Adini, Y. Moses, and S. Ullman. 1997. Face recognition: the problem of compensating for changes in illumination direction. IEEE Transactions on Pattern Analysis and Machine Intelligence 19, 7 (1997), 721--732.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Sanjeev Arora, Yuanzhi Li, Yingyu Liang, Tengyu Ma, and Andrej Risteski. 2018. Linear algebraic structure of word senses, with applications to polysemy. Transactions of the Association for Computational Linguistics 6 (2018), 483--495.Google ScholarGoogle ScholarCross RefCross Ref
  3. Qiong Cao, Li Shen, Weidi Xie, Omkar M Parkhi, and Andrew Zisserman. 2018. Vggface2: A dataset for recognising faces across pose and age. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018). IEEE, 67--74.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Harihara Santosh Dadi and GK Mohan Pillutla. 2016. Improved face recognition rate using HOG features and SVM classifier. IOSR Journal of Electronics and Communication Engineering 11, 04 (2016), 34--44.Google ScholarGoogle ScholarCross RefCross Ref
  5. Brendan J Frey and Delbert Dueck. 2007. Clustering by passing messages between data points. science 315, 5814 (2007), 972--976.Google ScholarGoogle Scholar
  6. V. Ghosal, P. Tikmani, and P. Gupta. 2009. Face Classification Using Gabor Wavelets and Random Forest. In 2009 Canadian Conference on Computer and Robot Vision. 68--73.Google ScholarGoogle Scholar
  7. Patrick Grother, Patrick Grother, Mei Ngan, Kayee Hanaoka, Chris Boehnen, and Lars Ericson. 2017. The 2017 IARPA Face Recognition Prize Challenge (FRPC). US Department of Commerce, National Institute of Standards and Technology.Google ScholarGoogle Scholar
  8. Kim Hazelwood, Sarah Bird, David Brooks, Soumith Chintala, Utku Diril, Dmytro Dzhulgakov, Mohamed Fawzy, Bill Jia, Yangqing Jia, Aditya Kalro, et al. 2018. Applied machine learning at facebook: A datacenter infrastructure perspective. In 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA). IEEE, 620--629.Google ScholarGoogle ScholarCross RefCross Ref
  9. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  10. Julia Bell Hirschberg and Andrew Rosenberg. 2007. V-Measure: a conditional entropy-based external cluster evaluation. (2007).Google ScholarGoogle Scholar
  11. Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 7132--7141.Google ScholarGoogle ScholarCross RefCross Ref
  12. Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller. 2007. Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments. Technical Report 07--49. University of Massachusetts, Amherst.Google ScholarGoogle Scholar
  13. Robert Layton, Paul Watters, and Richard Dazeley. 2013. Automated unsupervised authorship analysis using evidence accumulation clustering. Natural Language Engineering 19, 1 (2013), 95--120.Google ScholarGoogle ScholarCross RefCross Ref
  14. Stuart Lloyd. 1982. Least squares quantization in PCM. IEEE transactions on information theory 28, 2 (1982), 129--137.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. I. Masi, Y. Wu, T. Hassner, and P. Natarajan. 2018. Deep Face Recognition: A Survey. In 2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI). 471--478.Google ScholarGoogle Scholar
  16. Iacopo Masi, Yue Wu, Tal Hassner, and Prem Natarajan. 2018. Deep face recognition: A survey. In 2018 31st SIBGRAPI conference on graphics, patterns and images (SIBGRAPI). IEEE, 471--478.Google ScholarGoogle ScholarCross RefCross Ref
  17. Rafael Pena, Felipe Ferreira, Frederico Caroli, Luiz Schirmer, and Hélio Lopes. 2020. Globo Face Stream: A System for Video Meta-data Generation in an Entertainment Industry Setting. 23 International Conference on Enterprise Information Systems (ICEIS) (2020).Google ScholarGoogle ScholarCross RefCross Ref
  18. Yongming Rao, Jiwen Lu, and Jie Zhou. 2017. Attention-aware deep reinforcement learning for video face recognition. In Proceedings of the IEEE international conference on computer vision. 3931--3940.Google ScholarGoogle ScholarCross RefCross Ref
  19. André Lira Rolim and Ed Porto Bezerra. 2008. Um Sistema de Identificação Automática de Faces Para Um Ambiente Virtual de Ensino e Aprendizagem. In Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web (Vila Velha, Espírito Santo, Brazil) (WebMedia '08). Association for Computing Machinery, New York, NY, USA, 129--132. https://doi.org/10.1145/1809980.1810015Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Edith M Ross and Florian Markowetz. 2016. OncoNEM: inferring tumor evolution from single-cell sequencing data. Genome biology 17, 1 (2016), 1--14.Google ScholarGoogle Scholar
  21. Peter J Rousseeuw. 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics 20 (1987), 53--65.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition. 815--823.Google ScholarGoogle ScholarCross RefCross Ref
  23. Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).Google ScholarGoogle Scholar
  24. Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, and Manmohan Chandraker. 2017. Unsupervised domain adaptation for face recognition in unlabeled videos. In Proceedings of the IEEE International Conference on Computer Vision. 3210--3218.Google ScholarGoogle ScholarCross RefCross Ref
  25. Yi Sun, Xiaogang Wang, and Xiaoou Tang. 2014. Deep learning face representation from predicting 10,000 classes. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1891--1898.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, and Lior Wolf. 2014. Deepface: Closing the gap to human-level performance in face verification. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1701--1708.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. CJ Van Rijsbergen. 1979. Information retrieval 2nd edition butterworths. London available on internet (1979).Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Joe H Ward Jr. 1963. Hierarchical grouping to optimize an objective function. Journal of the American statistical association 58, 301 (1963), 236--244.Google ScholarGoogle ScholarCross RefCross Ref
  29. Cameron Whitelam, Emma Taborsky, Austin Blanton, Brianna Maze, Jocelyn Adams, Tim Miller, Nathan Kalka, Anil K Jain, James A Duncan, Kristen Allen, et al. 2017. Iarpa janus benchmark-b face dataset. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. 90--98.Google ScholarGoogle ScholarCross RefCross Ref
  30. Jiaolong Yang, Peiran Ren, Dongqing Zhang, Dong Chen, Fang Wen, Hongdong Li, and Gang Hua. 2017. Neural aggregation network for video face recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4362--4371.Google ScholarGoogle ScholarCross RefCross Ref
  31. K. Zhang, Z. Zhang, Z. Li, and Y. Qiao. 2016. Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Processing Letters 23, 10 (Oct 2016), 1499--1503. https://doi.org/10.1109/LSP.2016.2603342Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Cluster-Matching-Based Method for Video Face Recognition

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        WebMedia '20: Proceedings of the Brazilian Symposium on Multimedia and the Web
        November 2020
        364 pages
        ISBN:9781450381963
        DOI:10.1145/3428658

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 30 November 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        WebMedia '20 Paper Acceptance Rate34of87submissions,39%Overall Acceptance Rate270of873submissions,31%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader