Global-view hashing: harnessing global relations in near-duplicate video retrieval

Jing, Weizhen; Nie, Xiushan; Cui, Chaoran; Xi, Xiaoming; Yang, Gongping; Yin, Yilong

doi:10.1007/s11280-018-0536-7

Global-view hashing: harnessing global relations in near-duplicate video retrieval

Published: 26 February 2018

Volume 22, pages 771–789, (2019)
Cite this article

World Wide Web Aims and scope Submit manuscript

Weizhen Jing¹,
Xiushan Nie^1,2,
Chaoran Cui²,
Xiaoming Xi²,
Gongping Yang¹ &
…
Yilong Yin¹

632 Accesses
9 Citations
Explore all metrics

Abstract

Multi-view features are often used in video hashing for near-duplicate video retrieval because of their mutual assistance and complementarity. However, most methods only consider the local available information in multiple features, such as individual or pairwise structural relations, which do not fully utilize the dependent nature of multiple features. We thus propose a global-view hashing (GVH) framework to address the above-mentioned issue; our framework harnesses the global relations among samples characterized by multiple features. In the proposed framework, multiple features of all videos are jointly used to explore a common Hamming space, where the hash functions are obtained by comprehensively utilizing the relations from not only intra-view but also inter-view objects. In addition, the hash function obtained from the proposed GVH can learn multi-bit hash codes in a single iteration. Compared to existing video hashing schemes, the GVH not only globally considers the relations to obtain a more precise retrieval with short-length hash codes but also achieves multi-bit learning in a single iteration. We conduct extensive experiments on the CC_WEB_VIDEO and UQ_VIDEO datasets, and the experimental results show that our proposed method outperforms the state-of-the-art methods. As a side contribution, we will release the codes to facilitate other research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Application of Perceptual Video Hashing for Near-duplicate Video Retrieval

Efficient Storage Support for Real-Time Near-Duplicate Video Retrieval

Attention-based deep supervised hashing for near duplicate video retrieval

Article 27 December 2023

References

Chen, Q., Sun, S.: Hierarchical multi-view fisher discriminant analysis. In: International Conference on Neural Information Processing, pp 289–298. Springer, Berlin (2009)
Chou, C.L., Chen, H.T., Lee, S.Y.: Pattern-based near-duplicate video retrieval and localization on Web-scale videos. IEEE Trans. Multimedia 17(3), 382–395 (2015)
Article Google Scholar
Chung, Y.C., Su, I.F., Lee, C., Liu, P.C.: Multiple k nearest neighbor search. World Wide Web 20(2), 371–398 (2017)
Article Google Scholar
Cirakman, O., Gunsel, B., Sengor, N.S., et al.: Content-based copy detection by a subspace learning based video fingerprinting scheme. Multimedia Tools Appl. 71(3), 1381–1409 (2014)
Article Google Scholar
Cui, B., Tung, A.K., Zhang, C., Zhao, Z.: Multiple feature fusion for social media applications. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp 435–446. ACM, New York (2010)
Dasgupta, S., Littman, M.L., McAllester, D.A.: Pac generalization bounds for co-training. In: Advances in Neural Information Processing Systems, pp 375–382 (2002)
Fu, Y., Cao, L., Guo, G., Huang, T.S.: Multiple feature fusion by subspace learning. In: Proceedings of the 2008 International Conference on Content-Based Image and Video Retrieval, pp 127–134. ACM, New York (2008)
Gao, L., Guo, Z., Zhang, H., Xu, X., Shen, H.T.: Video captioning with attention-based lstm and semantic consistency. IEEE Trans. Multimedia 19(9), 2045–2055 (2017)
Article Google Scholar
Hao, Y., Mu, T., Hong, R., Wang, M., An, N., Goulermas, J.Y.: Stochastic multiview hashing for large-scale near-duplicate video retrieval. IEEE Trans. Multimedia 19(1), 1–14 (2017)
Article Google Scholar
Jiang, M.L., Tian, Y.H., Huang, T.J.: Video copy detection using a soft cascade of multimodal features. In: 2012 IEEE International Conference on Multimedia and Expo, pp 374–379. IEEE, Piscataway (2012)
Jolliffe, I.: Principal Component Analysis. Wiley Online Library (2002)
Kan, M., Shan, S., Zhang, H., Lao, S., Chen, X.: Multi-view discriminant analysis. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 188–194 (2016)
Article Google Scholar
Li, M., Vishal, M.: Robust video hashing via multilinear subspace projections. IEEE Trans. Image Process. 21(10), 4397–4409 (2012)
Article MathSciNet MATH Google Scholar
Li, Y., Mou, L., Jiang, M., Su, C., Fang, X., Qian, M., Tian, Y., Wang, Y., Huang, T., Gao, W.: Pku-idm@ trecvid 2010: copy detection with visual-audio feature fusion and sequential pyramid matching. online https://www-nlpir.nist.gov/projects/tvpubs/tv.pubs.17.org.html (2010)
Liong, V.E., Lu, J., Tan, Y.P., Zhou, J.: Deep video hashing. IEEE Trans. Multimedia 19(6), 1209–1219 (2017)
Article Google Scholar
Liu, J., Huang, Z., Cai, H., Shen, H.T., Ngo, C.W., Wang, W.: Near-duplicate video retrieval: current research and future trends. ACM Comput. Surv. (CSUR) 45(4), 44 (2013)
Article Google Scholar
Liu, X., Li, Z., Deng, C., Tao, D.: Distributed adaptive binary quantization for fast nearest neighbor search. IEEE Trans. Image Process. A Publ. IEEE Signal Process. Soc. PP(99), 1–1 (2017)
MATH Google Scholar
Mou, L., Huang, T., Tian, Y., et al.: Content-based copy detection through multimodal feature representation and temporal pyramid matching. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 10(1), 5 (2013)
Google Scholar
Nie, X., Qiao, J., Liu, J., Sun, J., Li, X., Liu, W.: Lle-Based video hashing for video identification. In: 2010 IEEE 10th International Conference on Signal Processing (ICSP), pp 1837–1840. IEEE, Piscataway (2010)
Nie, L., Wang, M., Zha, Z., Li, G., Chua, T.S.: Multimedia answering: enriching text Qa with media information. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 695–704. ACM, New York (2011)
Nie, L., Yan, S., Wang, M., Hong, R., Chua, T.S.: Harvesting visual concepts for image search with complex queries. In: ACM International Conference on Multimedia, pp 59–68 (2012)
Nie, X., Liu, J., Sun, J., Wang, L., Yang, X.: Robust video hashing based on representative-dispersive frames. Sci. China Inform. Sci. 56(6), 1–11 (2013)
Article MathSciNet Google Scholar
Nie, X., Yin, Y., Sun, J., Liu, J., Cui, C.: Comprehensive feature-based robust video fingerprinting using tensor model. IEEE Trans. Multimedia 19(4), 785–796 (2017)
Article Google Scholar
Shen, H.T., Zhou, X., Huang, Z., Shao, J., Zhou, X.: Uqlips: a Real-Time Near-Duplicate video clip detection system. In: Proceedings of the 33rd International Conference on Very Large Data Bases, pp. 1374–1377. VLDB Endowment (2007)
Shen, X., Shen, F., Sun, Q.S., Yuan, Y.H.: Multi-View latent hashing for efficient multimedia search. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp 831–834. ACM, New York (2015)
Song, J, Yang, Y, Huang Z, et al.: Effective multiple feature hashing for large-scale near-duplicate video retrieval[J]. IEEE Trans. Multimedia 15(8), 1997–2008 (2013)
Article Google Scholar
Song, J., Gao, L., Nie, F., Shen, H.T., Yan, Y., Sebe, N.: Optimized graph learning using partial tags and multiple features for image and video annotation. IEEE Trans. Image Process. 25(11), 4999–5011 (2016)
Article MathSciNet MATH Google Scholar
Song, J, Gao, L, Liu, L, et al.: Quantization-based hashing: a general framework for scalable image and video retrieval[J]. Pattern Recogn. 75, 175–187 (2018)
Article Google Scholar
Wang, J., Zhang, T., Song, J., Sebe, N., Shen, H.T.: A survey on learning to hash. IEEE Trans. Pattern Anal. Mach. Intell. PP(99), 1–1 (2017)
Google Scholar
Wang, X, Gao, L, Wang, P, et al.: Two-stream 3d convnet fusion for action recognition in videos with arbitrary size and length[J]. IEEE Transactions on Multimedia (2017)
Wei, S., Zhao, Y., Zhu, C., et al.: Frame fusion for video copy detection. IEEE Trans. Circuits Syst. Video Technol. 21(1), 15–28 (2011)
Article Google Scholar
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems, pp 1753–1760 (2009)
Wu, X., Hauptmann, A.G., Ngo, C.W.: Practical elimination of near-duplicates from Web video search. In: Proceedings of the 15th ACM International Conference on Multimedia, pp 218–227. ACM, New York (2007)
Yang, G.B., Chen, N., Jiang, Q.: A robust hashing algorithm based on surf for video copy detection. Comput. Secur. 31(1), 33–39 (2012)
Article Google Scholar
Yang, Y., Song, J., Huang, Z., Ma, Z., Sebe, N., Hauptmann, A.G.: Multi-feature fusion via hierarchical regression for multimedia analysis. IEEE Trans. Multimedia 15(3), 572–581 (2013)
Article Google Scholar
Zhang, H., Gao, X., Wu, P., Xu, X.: A cross-media distance metric learning framework based on multi-view correlation mining and matching. World Wide Web 19(2), 181–197 (2016)
Article Google Scholar
Zhang, H., Wang, M., Hong, R., Chua, T.S.: Play and rewind: optimizing binary representations of videos by self-supervised temporal hashing. In: ACM on Multimedia Conference, pp 781–790 (2016)
Zhao, G., Pietikainen, M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 29(6), 915–928 (2007)
Article Google Scholar
Zhu, X., Li, X., Zhang, S., Ju, C., Wu, X.: Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans. Neural Netw. Learn. Syst. 28(6), 1263–1275 (2017)
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (61671274, 61573219, 61701281, 61701280), China Postdoctoral Science Foundation (2016M592190), Shandong Provincial Key Research and Development Plan (2017CXGC1504), the Fostering Project of Dominant Discipline and Talent Team of Shandong Province Higher Education Institutions, and the Fostering Project of Dominant Discipline and Talent Team of SDUFE.

Author information

Authors and Affiliations

School of Computer Science and Technology, Shandong University, Jinan, Shandong, China
Weizhen Jing, Xiushan Nie, Gongping Yang & Yilong Yin
School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, Shandong, China
Xiushan Nie, Chaoran Cui & Xiaoming Xi

Authors

Weizhen Jing
View author publications
You can also search for this author in PubMed Google Scholar
Xiushan Nie
View author publications
You can also search for this author in PubMed Google Scholar
Chaoran Cui
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoming Xi
View author publications
You can also search for this author in PubMed Google Scholar
Gongping Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yilong Yin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Xiushan Nie or Yilong Yin.

Additional information

Weizhen Jing and Xiushan Nie equally contributed to this work.

This article belongs to the Topical Collection: Special Issue on Deep vs. Shallow: Learning for Emerging Web-scale Data Computing and Applications Guest Editors: Jingkuan Song, Shuqiang Jiang, Elisa Ricci, and Zi Huang

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jing, W., Nie, X., Cui, C. et al. Global-view hashing: harnessing global relations in near-duplicate video retrieval. World Wide Web 22, 771–789 (2019). https://doi.org/10.1007/s11280-018-0536-7

Download citation

Received: 15 August 2017
Revised: 29 January 2018
Accepted: 05 February 2018
Published: 26 February 2018
Issue Date: 15 March 2019
DOI: https://doi.org/10.1007/s11280-018-0536-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Global-view hashing: harnessing global relations in near-duplicate video retrieval

Abstract

Access this article

Similar content being viewed by others

Application of Perceptual Video Hashing for Near-duplicate Video Retrieval

Efficient Storage Support for Real-Time Near-Duplicate Video Retrieval

Attention-based deep supervised hashing for near duplicate video retrieval

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Global-view hashing: harnessing global relations in near-duplicate video retrieval

Abstract

Access this article

Similar content being viewed by others

Application of Perceptual Video Hashing for Near-duplicate Video Retrieval

Efficient Storage Support for Real-Time Near-Duplicate Video Retrieval

Attention-based deep supervised hashing for near duplicate video retrieval

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation