skip to main content
10.1145/1178677.1178707acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Video summarization using personal photo libraries

Published:26 October 2006Publication History

ABSTRACT

In this paper, we propose a video summarization system which takes into account users' individual preferences by using their personal photo libraries. Nowadays it is common, especially among people of younger generations, to store thousands of photos inside their PCs and manage them using software such as iPhoto and Picasa. These personal photo libraries contain rich information about the user's tastes, personalities, and lifestyles. Since still photos are in many aspects similar to video as a medium, we assume that these personal photo libraries can be used to estimate users' preferences on video summarization.Our system first divides a movie into short segments, and uses image classification techniques to judge whether each segment is meaningful to the user or not. If many photos with contents similar to the segment can be found in the user's photo library, the segment is judged as being "important" to the user. Conventional image classification techniques use public or commercial photo databases as training data, while our system uses personal photo libraries. This difference leads to the need of several modifications in the classification process.We have implemented a prototype version of our system, and have validated the effectiveness of our approach through evaluating both the accuracy of our image classification algorithm, and users' subjective satisfaction levels of the summarization results.

References

  1. Carson, C., Belongie, S., Greenspan, H. Malik, J. Blobworld: Image segmentation using Expectation-Maximization and its application to image querying. In IEEE Transactions on Pattern Analysis and Machine Intelligence, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Duygulu, P., Barnard, K., Freitas, J. F. G., Forsyth, D. A. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary. In Proceedings of European Conference on Computer Vision, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Jeon, J., Lavrenko, V., Manmatha, R. Automatic Image Annotation and Retrieval using Cross-Media Relevance Models. In SIGIR '03, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Ju, S. X., Black, M. J., Minneman, S., Kimber, D. Summarization of Video-taped Presentations: Automatic Analysis of Motion and Gesture. In IEEE Transactions on Circuits and Systems for Video Technology, 1998.]]Google ScholarGoogle Scholar
  5. Li, J., Wang, J. Z. Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach. In IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Ma, Y., Lu, L., Zhang, H., Li, M. A User Attention Model for Video Summarization. In ACM Multimedia '02, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Mei, T. Hua, X. Zhou, H. Tracking Users' Capture Intention: A Novel Complementary View for Home Video Content Analysis. In ACM Multimedia '05, 2005.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Mori, Y., Takahashi, H., Oka, R. Image-to-word transformation based on dividing and vector quantizing images with words. In First International Workshop on Multimedia Intelligent Storage and Retrieval Management (MISRM 99), 1999.]]Google ScholarGoogle Scholar
  9. Parshin, V., Chen, L. Video Summarization Based on User-Defined Constraints and Preferences. In Proceedings of RIAO 2004, 2004.]]Google ScholarGoogle Scholar
  10. Pass, G., Zabih, R. Comparing Images Using Joint Histograms. In Journal of Multimedia Systems, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Smith, M. A., Kanade, T. Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques. In CMU Technology Report, 1997.]]Google ScholarGoogle ScholarCross RefCross Ref
  12. Yow, D., Yeo, B. L., Yeung, M., Liu, G. Analysis and Presentation of Soccer Highlights from Digital Video. In Proceedings of Second Asian Conference on Computer Vision, 1995.]]Google ScholarGoogle Scholar

Index Terms

  1. Video summarization using personal photo libraries

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MIR '06: Proceedings of the 8th ACM international workshop on Multimedia information retrieval
      October 2006
      344 pages
      ISBN:1595934952
      DOI:10.1145/1178677

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 October 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader