Skip to main content
Log in

Global-view hashing: harnessing global relations in near-duplicate video retrieval

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Multi-view features are often used in video hashing for near-duplicate video retrieval because of their mutual assistance and complementarity. However, most methods only consider the local available information in multiple features, such as individual or pairwise structural relations, which do not fully utilize the dependent nature of multiple features. We thus propose a global-view hashing (GVH) framework to address the above-mentioned issue; our framework harnesses the global relations among samples characterized by multiple features. In the proposed framework, multiple features of all videos are jointly used to explore a common Hamming space, where the hash functions are obtained by comprehensively utilizing the relations from not only intra-view but also inter-view objects. In addition, the hash function obtained from the proposed GVH can learn multi-bit hash codes in a single iteration. Compared to existing video hashing schemes, the GVH not only globally considers the relations to obtain a more precise retrieval with short-length hash codes but also achieves multi-bit learning in a single iteration. We conduct extensive experiments on the CC_WEB_VIDEO and UQ_VIDEO datasets, and the experimental results show that our proposed method outperforms the state-of-the-art methods. As a side contribution, we will release the codes to facilitate other research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8

Similar content being viewed by others

References

  1. Chen, Q., Sun, S.: Hierarchical multi-view fisher discriminant analysis. In: International Conference on Neural Information Processing, pp 289–298. Springer, Berlin (2009)

  2. Chou, C.L., Chen, H.T., Lee, S.Y.: Pattern-based near-duplicate video retrieval and localization on Web-scale videos. IEEE Trans. Multimedia 17(3), 382–395 (2015)

    Article  Google Scholar 

  3. Chung, Y.C., Su, I.F., Lee, C., Liu, P.C.: Multiple k nearest neighbor search. World Wide Web 20(2), 371–398 (2017)

    Article  Google Scholar 

  4. Cirakman, O., Gunsel, B., Sengor, N.S., et al.: Content-based copy detection by a subspace learning based video fingerprinting scheme. Multimedia Tools Appl. 71(3), 1381–1409 (2014)

    Article  Google Scholar 

  5. Cui, B., Tung, A.K., Zhang, C., Zhao, Z.: Multiple feature fusion for social media applications. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp 435–446. ACM, New York (2010)

  6. Dasgupta, S., Littman, M.L., McAllester, D.A.: Pac generalization bounds for co-training. In: Advances in Neural Information Processing Systems, pp 375–382 (2002)

  7. Fu, Y., Cao, L., Guo, G., Huang, T.S.: Multiple feature fusion by subspace learning. In: Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval, pp 127–134. ACM, New York (2008)

  8. Gao, L., Guo, Z., Zhang, H., Xu, X., Shen, H.T.: Video captioning with attention-based lstm and semantic consistency. IEEE Trans. Multimedia 19(9), 2045–2055 (2017)

    Article  Google Scholar 

  9. Hao, Y., Mu, T., Hong, R., Wang, M., An, N., Goulermas, J.Y.: Stochastic multiview hashing for large-scale near-duplicate video retrieval. IEEE Trans. Multimedia 19(1), 1–14 (2017)

    Article  Google Scholar 

  10. Jiang, M.L., Tian, Y.H., Huang, T.J.: Video copy detection using a soft cascade of multimodal features. In: 2012 IEEE International Conference on Multimedia and Expo, pp 374–379. IEEE, Piscataway (2012)

  11. Jolliffe, I.: Principal Component Analysis. Wiley Online Library (2002)

  12. Kan, M., Shan, S., Zhang, H., Lao, S., Chen, X.: Multi-view discriminant analysis. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 188–194 (2016)

    Article  Google Scholar 

  13. Li, M., Vishal, M.: Robust video hashing via multilinear subspace projections. IEEE Trans. Image Process. 21(10), 4397–4409 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  14. Li, Y., Mou, L., Jiang, M., Su, C., Fang, X., Qian, M., Tian, Y., Wang, Y., Huang, T., Gao, W.: Pku-idm@ trecvid 2010: copy detection with visual-audio feature fusion and sequential pyramid matching. online https://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.17.org.html (2010)

  15. Liong, V.E., Lu, J., Tan, Y.P., Zhou, J.: Deep video hashing. IEEE Trans. Multimedia 19(6), 1209–1219 (2017)

    Article  Google Scholar 

  16. Liu, J., Huang, Z., Cai, H., Shen, H.T., Ngo, C.W., Wang, W.: Near-duplicate video retrieval: current research and future trends. ACM Comput. Surv. (CSUR) 45(4), 44 (2013)

    Article  Google Scholar 

  17. Liu, X., Li, Z., Deng, C., Tao, D.: Distributed adaptive binary quantization for fast nearest neighbor search. IEEE Trans. Image Process. A Publ. IEEE Signal Process. Soc. PP(99), 1–1 (2017)

    MATH  Google Scholar 

  18. Mou, L., Huang, T., Tian, Y., et al.: Content-based copy detection through multimodal feature representation and temporal pyramid matching. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 10(1), 5 (2013)

    Google Scholar 

  19. Nie, X., Qiao, J., Liu, J., Sun, J., Li, X., Liu, W.: Lle-Based video hashing for video identification. In: 2010 IEEE 10th International Conference on Signal Processing (ICSP), pp 1837–1840. IEEE, Piscataway (2010)

  20. Nie, L., Wang, M., Zha, Z., Li, G., Chua, T.S.: Multimedia answering: enriching text Qa with media information. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 695–704. ACM, New York (2011)

  21. Nie, L., Yan, S., Wang, M., Hong, R., Chua, T.S.: Harvesting visual concepts for image search with complex queries. In: ACM International Conference on Multimedia, pp 59–68 (2012)

  22. Nie, X., Liu, J., Sun, J., Wang, L., Yang, X.: Robust video hashing based on representative-dispersive frames. Sci. China Inform. Sci. 56(6), 1–11 (2013)

    Article  MathSciNet  Google Scholar 

  23. Nie, X., Yin, Y., Sun, J., Liu, J., Cui, C.: Comprehensive feature-based robust video fingerprinting using tensor model. IEEE Trans. Multimedia 19(4), 785–796 (2017)

    Article  Google Scholar 

  24. Shen, H.T., Zhou, X., Huang, Z., Shao, J., Zhou, X.: Uqlips: a Real-Time Near-Duplicate video clip detection system. In: Proceedings of the 33rd International Conference on Very Large Data Bases, pp. 1374–1377. VLDB Endowment (2007)

  25. Shen, X., Shen, F., Sun, Q.S., Yuan, Y.H.: Multi-View latent hashing for efficient multimedia search. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp 831–834. ACM, New York (2015)

  26. Song, J, Yang, Y, Huang Z, et al.: Effective multiple feature hashing for large-scale near-duplicate video retrieval[J]. IEEE Trans. Multimedia 15(8), 1997–2008 (2013)

    Article  Google Scholar 

  27. Song, J., Gao, L., Nie, F., Shen, H.T., Yan, Y., Sebe, N.: Optimized graph learning using partial tags and multiple features for image and video annotation. IEEE Trans. Image Process. 25(11), 4999–5011 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  28. Song, J, Gao, L, Liu, L, et al.: Quantization-based hashing: a general framework for scalable image and video retrieval[J]. Pattern Recogn. 75, 175–187 (2018)

    Article  Google Scholar 

  29. Wang, J., Zhang, T., Song, J., Sebe, N., Shen, H.T.: A survey on learning to hash. IEEE Trans. Pattern Anal. Mach. Intell. PP(99), 1–1 (2017)

    Google Scholar 

  30. Wang, X, Gao, L, Wang, P, et al.: Two-stream 3d convnet fusion for action recognition in videos with arbitrary size and length[J]. IEEE Transactions on Multimedia (2017)

  31. Wei, S., Zhao, Y., Zhu, C., et al.: Frame fusion for video copy detection. IEEE Trans. Circuits Syst. Video Technol. 21(1), 15–28 (2011)

    Article  Google Scholar 

  32. Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems, pp 1753–1760 (2009)

  33. Wu, X., Hauptmann, A.G., Ngo, C.W.: Practical elimination of near-duplicates from Web video search. In: Proceedings of the 15th ACM International Conference on Multimedia, pp 218–227. ACM, New York (2007)

  34. Yang, G.B., Chen, N., Jiang, Q.: A robust hashing algorithm based on surf for video copy detection. Comput. Secur. 31(1), 33–39 (2012)

    Article  Google Scholar 

  35. Yang, Y., Song, J., Huang, Z., Ma, Z., Sebe, N., Hauptmann, A.G.: Multi-feature fusion via hierarchical regression for multimedia analysis. IEEE Trans. Multimedia 15(3), 572–581 (2013)

    Article  Google Scholar 

  36. Zhang, H., Gao, X., Wu, P., Xu, X.: A cross-media distance metric learning framework based on multi-view correlation mining and matching. World Wide Web 19(2), 181–197 (2016)

    Article  Google Scholar 

  37. Zhang, H., Wang, M., Hong, R., Chua, T.S.: Play and rewind: optimizing binary representations of videos by self-supervised temporal hashing. In: ACM on Multimedia Conference, pp 781–790 (2016)

  38. Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 915–928 (2007)

    Article  Google Scholar 

  39. Zhu, X., Li, X., Zhang, S., Ju, C., Wu, X.: Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans. Neural Netw. Learn. Syst. 28(6), 1263–1275 (2017)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (61671274, 61573219, 61701281, 61701280), China Postdoctoral Science Foundation (2016M592190), Shandong Provincial Key Research and Development Plan (2017CXGC1504), the Fostering Project of Dominant Discipline and Talent Team of Shandong Province Higher Education Institutions, and the Fostering Project of Dominant Discipline and Talent Team of SDUFE.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Xiushan Nie or Yilong Yin.

Additional information

Weizhen Jing and Xiushan Nie equally contributed to this work.

This article belongs to the Topical Collection: Special Issue on Deep vs. Shallow: Learning for Emerging Web-scale Data Computing and Applications Guest Editors: Jingkuan Song, Shuqiang Jiang, Elisa Ricci, and Zi Huang

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jing, W., Nie, X., Cui, C. et al. Global-view hashing: harnessing global relations in near-duplicate video retrieval. World Wide Web 22, 771–789 (2019). https://doi.org/10.1007/s11280-018-0536-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-018-0536-7

Keywords

Navigation