skip to main content
10.1145/2578726.2578757acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
tutorial

ISOMER: Informative Segment Observations for Multimedia Event Recounting

Published:01 April 2014Publication History

ABSTRACT

This paper describes a system for multimedia event detection and recounting. The goal is to detect a high level event class in unconstrained web videos and generate event oriented summarization for display to users. For this purpose, we detect informative segments and collect observations for them, leading to our ISOMER system. We combine a large collection of both low level and semantic level visual and audio features for event detection. For event recounting, we propose a novel approach to identify event oriented discriminative video segments and their descriptions with a linear SVM event classifier. User friendly concepts including objects, actions, scenes, speech and optical character recognition are used in generating descriptions. We also develop several mapping and filtering strategies to cope with noisy concept detectors. Our system performed competitively in the TRECVID 2013 Multimedia Event Detection task with near 100,000 videos and was the highest performer in TRECVID 2013 Multimedia Event Recounting task.

References

  1. M.-Y. Chen and A. Hauptmann. Mosift: Recognizing human actions in surveillance videos. Technical Report CMU-CS-09-161, 2009.Google ScholarGoogle Scholar
  2. J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  3. D. Ding, F. Metze, S. Rawat, P. F. Schulam, S. Burger, E. Younessian, L. Bao, M. G. Christel, and A. Hauptmann. Beyond audio and video retrieval: Towards multimedia summarization. In ICMR, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Habibian, K. E. A. van de Sande, and C. G. M. Snoek. Recommendations for video event recognition using concept vocabularies. In ICMR, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Izadinia and M. Shah. Recognizing complex events using large margin joint low-level event model. In ECCV, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Jégou, M. Douze, and C. Schmid. Product quantization for nearest neighbor search. TPAMI, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. I. Laptev. On space-time interest points. IJCV, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Liu, Q. Yu, O. Javed, S. Ali, A. Tamrakar, A. Divakaran, H. Cheng, and H. S. Sawhney. Video event recognition using concept attributes. In WACV, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. K. Myers, R. C. Bolles, Q.-T. Luong, J. A. Herson, and H. Aradhye. Rectification and recognition of text in 3-d scenes. IJDAR, 2005.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. K. Myers, R. Nallapati, J. van Hout, S. Pancoast, R. Nevatia, C. Sun, A. Habibian, D. C. Koelma, K. E. A. van de Sande, A. W. M. Smeulders, and C. G. M. Snoek. Evaluating multimedia features and fusion for example-based event detection. MVA, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. P. Natarajan et al. Bbnviser: Bbn viser trecvid 2012 multimedia event detection and multimedia event recounting systems. In TRECVID, 2012.Google ScholarGoogle Scholar
  13. P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, W. Kraaij, A. F. Smeaton, and G. Queenot. Trecvid 2013 -- an overview of the goals, tasks, data, evaluation mechanisms and metrics. In TRECVID, 2013.Google ScholarGoogle Scholar
  14. J. Sanchez, F. Perronnin, T. Mensink, and J. Verbeek. Image Classification with the Fisher Vector: Theory and Practice. IJCV, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In MIR, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. K. Soomro, A. R. Zamir, and M. Shah. Ucf101: A dataset of 101 human actions classes from videos in the wild. CRCV-TR-12-01.Google ScholarGoogle Scholar
  17. C. Sun and R. Nevatia. Active: Activity concept transitions in video event classification. In ICCV, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. Sun and R. Nevatia. Large-scale web video event classification by use of fisher vectors. In WACV, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. B. T. Truong and S. Venkatesh. Video abstraction: A systematic review and classification. TOMCCAP, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek. Evaluating color descriptors for object and scene recognition. TPAMI, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. H. Wang, A. Kläser, C. Schmid, and C.-L. Liu. Dense trajectories and motion boundary descriptors for action recognition. IJCV, 2013.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. ISOMER: Informative Segment Observations for Multimedia Event Recounting

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICMR '14: Proceedings of International Conference on Multimedia Retrieval
      April 2014
      564 pages
      ISBN:9781450327824
      DOI:10.1145/2578726

      Copyright © 2014 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 April 2014

      Check for updates

      Qualifiers

      • tutorial
      • Research
      • Refereed limited

      Acceptance Rates

      ICMR '14 Paper Acceptance Rate21of111submissions,19%Overall Acceptance Rate254of830submissions,31%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader