skip to main content
10.1145/1555400.1555414acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

Query parameters for harvesting digital video and associated contextual information

Published:15 June 2009Publication History

ABSTRACT

Video is increasingly important to digital libraries and archives as both primary content and as context for the primary objects in collections. Services like YouTube not only offer large numbers of videos but also usage data such as comments and ratings that may help curators today make selections and aid future generations to interpret those selections. A query-based harvesting strategy is presented and results from daily harvests for six topics defined by 145 queries over a 20-month period are discussed with respect to, query specification parameters, topic, and contribution patterns. The limitations of the strategy and these data are considered and suggestions are offered for curators who wish to use query-based harvesting.

References

  1. Blue Ribbon Task Force on Sustainable Digital Preservation and Access. (2009). Sustaining the Digital Investment: Issues and Challenges of Economically Sustainable Digital Preservation. http://brtf.sdsc.edu/biblio/BRTF_Interim_Report.pdfGoogle ScholarGoogle Scholar
  2. Capra, R., Lee, C., Marchionini, G., Russell, T., Shah, C., & Stutzman, F. (2008). Selection and Context Scoping for Digital Video Collections: An Investigation of YouTube and Blogs. Proceedings of ACM/IEEE JCDL 2008 (Pittsburgh, PA, June 16--20, 2008), 211--220. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Clemens, R., Capra, R., Lee, C., and Sheble, L. (2009). Contextual Information from Blogs in Video Digital Curation. Proceedings of Society of American Archivists 2008 Research Forum.Google ScholarGoogle Scholar
  4. Conway, P. (2000). Overview: Rational for digitization and preservation. In Handbook for digital projects: A management tool for preservation and access. Northeast Document Conservation Center, Andover, MA. http://www.nedcc.org/digital/dman.pdf.Google ScholarGoogle Scholar
  5. Lavoie, B. & Dempsey, L. (2004). Thirteen ways of looking at.digital preservation. D-Lib Magazine, 10(7/8). http://www.dlib.org/dlib/july04/lavoie/07lavoie.html.Google ScholarGoogle Scholar
  6. Lee, C. 2007. "Taking Context Seriously: A Framework for Contextual Information in Digital Collections." UNC SILS TR-2007-04.Google ScholarGoogle Scholar
  7. Maslov, A., Mikeal, A., & Leggett, J. (2009). Cooperation or Control? Web 2.0 and the Digital Library. Journal of Digital Information, 10(1), https://journals.tdl.org/jodi/issue/view/65.Google ScholarGoogle Scholar
  8. Najork, M. Wiener, J. (2001). Breadth-First Crawling Yields High-Quality Pages. In: Proceedings of the 10th International Conference on the World Wide Web (Hong Kong, May 01 -- 05, 2001). WWW '01, 114--118. ACM Press, New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Pant, G., & Srinivasan, P. (2005). Learning to Crawl: Comparing Classification Schemes. ACM Trans. Inf. Syst. 23, 430--462. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Rosset, S., Neumann, E., Eick, U., Vatnik, N., and Idan, Y. 2002. Customer lifetime value modeling and its use for customer retention planning. In Proceedings of the Eighth ACM SIGKDD international Conference on Knowledge Discovery and Data Mining (Edmonton, Alberta, Canada, July 23 -- 26, 2002). KDD '02. ACM, New York, NY, 332--340. DOI= http://doi.acm.org/10.1145/775047.775097 Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Shah, C., and Marchionini, G. 2007. Preserving 2008 US Presidential Election Videos. In the Proceedings of International Web Archiving Workshop (IWAW) 2007.Google ScholarGoogle Scholar

Index Terms

  1. Query parameters for harvesting digital video and associated contextual information

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      JCDL '09: Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
      June 2009
      502 pages
      ISBN:9781605583228
      DOI:10.1145/1555400

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 June 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate415of1,482submissions,28%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader