Skip to main content
Log in

An efficient content based video copy detection using the sample based hierarchical adaptive k-means clustering

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

Content-based video copy detection (CBCD) is very important for video copyright protection in view of the growing popularity of video sharing websites, which deals with not only whether a copy occurs in a query video stream but also where the copy is located and where the copy is originated from. In this paper, we present a video copy detection scheme based on local features which can deal with very large databases both in terms of quality and speed. First, we propose a new clustering algorithm to build “visual words” efficiently. Since our CBCD framework performs indexing using tree structures of cluster centers, the proposed clustering algorithm can improve the quality of retrieval and greatly shrink the off-line training time. Then, we introduce a TF-IDF (term frequency–inverse document frequency) weighted BOF (bag-of-features) voting retrieval method for matching video frames, which is robust to significant video distortion and efficient in terms of memory usage and computation time. Furthermore, we present a verification step robustly inspecting the temporal consistency between the query video and the corresponding candidate videos to further improve the accuracy of retrieval. The experimental results show that the proposed video copy detection scheme achieves high localization accuracy, while achieving comparable performance compared with state-of-the-art copy detection methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Al Aghbari, Z. (2012). cTraj: efficient indexing and searching of sequences containing multiple moving objects. Journal of Intelligent Information Systems, 1–28.

  • Anjulan, A., & Canagarajah, N. (2009). A unified framework for object retrieval and mining. IEEE Transactions on Circuits and Systems for Video Technology, 19(1), 63–76.

    Article  Google Scholar 

  • Cha, G.-H. (2012). Capturing contextual relationship for effective media search. Multimedia Tools and Applications, 56(2), 351–364.

    Article  Google Scholar 

  • Chang, S. H., Cosman, P. C., & Milstein, L. B. (2011). Chernoff-type bounds for the gaussian error function. [Article]. IEEE Transactions on Communications, 59(11), 2939–2944.

    Article  Google Scholar 

  • Chen, E. C. H., Pau, L., & Wang, P. (1993). Cluster analysis and related issue. Handbook of Pattern Recognition and Computer Vision, 3–32.

  • Cheung, S. C. S., & Zakhor, A. (2005). Fast similarity search and clustering of video sequences on the world-wide-web. IEEE Transactions on Multimedia, 7(3), 524–537.

    Article  Google Scholar 

  • Chou, J.-K., Yang, C.-K., & Gong, S.-D. (2012). Face-off: Automatic alteration of facial features. Multimedia Tools and Applications, 56(3), 569–596.

    Article  Google Scholar 

  • Datar, M., Immorlica, N., Indyk, P., & Mirrokni, V. S. Locality-sensitive hashing scheme based on p-stable distributions. In Proceedings of the twentieth annual symposium on Computational geometry, 2004 (pp. 253–262): ACM.

  • Douze, M., Jegou, H., & Schmid, C. (2010). An image-based approach to video copy detection with spatio-temporal post-filtering. IEEE Transactions on Multimedia, 12(4), 257–266.

    Article  Google Scholar 

  • Dunn, J. C. (1974). Well-separated clusters and optimal fuzzy partitions. Journal of Cybernetics, 4(1), 95–104.

    Article  MathSciNet  Google Scholar 

  • Feng, T., Wang, C. Y., Guo, W., & Gao, Q. (2009). Linear clustering algorithm by competitive learning suitable for coal mine in wireless sensor networks. [Article]. International Journal of Distributed Sensor Networks, 5(1), 60–60.

    Article  Google Scholar 

  • Ferman, A. M., Tekalp, A. M., & Mehrotra, R. (2002). Robust color histogram descriptors for video segment retrieval and identification. IEEE Transactions on Image Processing, 11(5), 497–508.

    Article  Google Scholar 

  • Ho, Y.-H., Lin, C. W., Chen, J.-F., & Liao, H. Y. M. (2006). Fast coarse-to-fine video retrieval using shot-level spatio-temporal statistics. IEEE Transactions on Circuits and Systems for Video Technology, 16(5), 642–648.

    Article  Google Scholar 

  • Hruschka, E. R., Campello, R. J. G. B., Freitas, A. A., & De Carvalho, A. C. P. L. F. (2009). A survey of evolutionary algorithms for clustering. IEEE Transactions on Systems, Man, and Cybernetics Part C: Applications and Reviews, 39(2), 133–155.

    Article  Google Scholar 

  • Jégou, H., Douze, M., & Schmid, C. On the burstiness of visual elements. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, 2009 (pp. 1169–1176): IEEE.

  • Jegou, H., Douze, M., & Schmid, C. (2010). Improving bag-of-features for large scale image search. International Journal of Computer Vision, 87(3), 316–336.

    Article  Google Scholar 

  • Jianping, F., Elmagarmid, A. K., Xingquan, Z., Aref, W. G., & Lide, W. (2004). ClassView: Hierarchical video shot classification, indexing, and accessing. IEEE Transactions on Multimedia, 6(1), 70–86.

    Article  Google Scholar 

  • Joly, A., Buisson, O., & Frelicot, C. (2007). Content-based copy retrieval using distortion-based probabilistic similarity search. IEEE Transactions on Multimedia, 9(2), 293–306.

    Article  Google Scholar 

  • Kashino, K., Kurozumi, T., & Murase, H. (2003). A quick search method for audio and video signals based on histogram pruning. IEEE Transactions on Multimedia, 5(3), 348–357.

    Article  Google Scholar 

  • Kaufmann, L., & Rousseeuw, P. (1990). Finding groups in data: An introduction to cluster analysis. New York: Wiley series in Probability and Statistics.

    Book  Google Scholar 

  • Ke, Y., & Sukthankar, R. PCA-SIFT: A more distinctive representation for local image descriptors. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004 (Vol. 2, pp. II-506-II-513 Vol. 502): IEEE.

  • Ke, Z., Yi, C., & Yang, S. X. (2010). Self-organizing feature map for cluster analysis in multi-disease diagnosis. Expert Systems with Applications, 37(9), 6359–63676367.

    Article  Google Scholar 

  • Law-To, J., Buisson, O., Gouet-Brunet, V., & Boujemaa, N. Robust voting algorithm based on labels of behavior for video copy detection. In Proceedings of the 14th annual ACM international conference on Multimedia, 2006 (pp. 835–844): ACM.

  • Law-To, J., Chen, L., Joly, A., Laptev, I., Buisson, O., Gouet-Brunet, V., et al. Video copy detection: A comparative study. In Proceedings of the 6th ACM international conference on Image and video retrieval, 2007 (pp. 371–378): ACM.

  • Liu, Z., Liu, T., & Shahraray, B. AT&T Research at TRECVID 2009 Content-based Copy Detection. In 2009 TRECVID Workshop, 2009.

  • Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.

    Article  Google Scholar 

  • Mezaris, V., Kompatsiaris, I., Boulgouris, N. V., & Strintzis, M. G. (2004). Real-time compressed-domain spatiotemporal segmentation and ontologies for video indexing and retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 14(5), 606–621.

    Article  Google Scholar 

  • Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., et al. (2005). A comparison of affine region detectors. International Journal of Computer Vision, 65(1–2), 43–72.

    Article  Google Scholar 

  • Min, H., Kim, S. M., De Neve, W., & Ro, Y. M. Video Copy Detection Using Inclined Video Tomography and Bag-of-Visual-Words. In Multimedia and Expo (ICME), 2012 I.E. International Conference on, 2012 (pp. 562–567): IEEE.

  • Naïja, Y., & Sinaoui, K. B. (2012). Interpretability-based validity methods for clustering results evaluation. Journal of Intelligent Information Systems, 1–31.

  • Ngo, C.-W., Pong, T.-C., & Zhang, H.-J. (2002). On clustering and retrieval of video shots through temporal slices analysis. IEEE Transactions on Multimedia, 4(4), 446–458.

    Article  Google Scholar 

  • Nister, D., & Stewenius, H. (2006). Scalable recognition with a vocabulary tree. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2, 2161–2168.

    Google Scholar 

  • Quack, T., Ferrari, V., & Van Gool, L. (2006). Video mining with frequent itemset configurations. Image and Video Retrieval, 360–369.

  • Sam, I. S., Devaraj, P., & Bhuvaneswaran, R. S. (2012). A novel image cipher based on mixed transformed logistic maps. Multimedia Tools and Applications, 56(2), 315–330.

    Article  Google Scholar 

  • Sarkar, A., Singh, V., Ghosh, P., Manjunath, B. S., & Singh, A. (2010). Efficient and Robust Detection of Duplicate Videos in a Large Database. IEEE Transactions on Circuits and Systems for Video Technology, 20(6), 870–885.

    Article  Google Scholar 

  • Sivic, J., & Zisserman, A. (2003). Video Google: A text retrieval approach to object matching in videos. Proceedings Ninth IEEE International Conference on Computer Vision, 1472, 1470–1477.

    Article  Google Scholar 

  • Sivic, J., & Zisserman, A. (2008). Efficient visual search for objects in videos. Proceedings of the IEEE, 96(4), 548–566.

    Article  Google Scholar 

  • Sivic, J., & Zisserman, A. (2009). Efficient Visual Search of Videos Cast as Text Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(4), 591–606.

    Article  Google Scholar 

  • Snoek, C. G. M., & Worring, M. (2005). Multimedia event-based video indexing using time intervals. IEEE Transactions on Multimedia, 7(4), 638–647.

    Article  Google Scholar 

  • Wei, S., Zhao, Y., Zhu, C., Xu, C., & Zhu, Z. (2011). Frame fusion for video copy detection. IEEE Transactions on Circuits and Systems for Video Technology, 21(1), 15–28.

    Article  Google Scholar 

  • Wichert, A. (2008). Content-based image retrieval by hierarchical linear subspace method. Journal of Intelligent Information Systems, 31(1), 85–107.

    Article  MathSciNet  Google Scholar 

  • Wichert, A., Teixeira, P., Santos, P., & Galhardas, H. (2010). Subspace tree: High dimensional multimedia indexing with logarithmic temporal complexity. Journal of Intelligent Information Systems, 35(3), 495–516.

    Article  Google Scholar 

  • Wu, X., Hauptmann, A. G., & Ngo, C. W. Practical elimination of near-duplicates from web video search. In Proceedings of the 15th international conference on Multimedia, 2007 (pp. 218–227): ACM.

  • Xu, R., & Wunsch, D. (2005). Survey of clustering algorithms. IEEE Transactions on Neural Networks, 16(3), 645–678.

    Article  Google Scholar 

  • Yang, J. B., & Wei, W. (2013). Random sampling in shift invariant spaces. [Article]. Journal of Mathematical Analysis and Applications, 398(1), 26–34.

    Article  MATH  MathSciNet  Google Scholar 

Download references

Acknowledgments

The authors would like to thank the anonymous reviewers for valuable comments. This work was supported by funds from National Natural Science Foundation of China (NSFC Projects No. 61173110 and No. 11272253).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guizhong Liu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liao, K., Liu, G. An efficient content based video copy detection using the sample based hierarchical adaptive k-means clustering. J Intell Inf Syst 44, 133–158 (2015). https://doi.org/10.1007/s10844-014-0332-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-014-0332-5

Keywords

Navigation