ABSTRACT
Video is increasingly important to digital libraries and archives as both primary content and as context for the primary objects in collections. Services like YouTube not only offer large numbers of videos but also usage data such as comments and ratings that may help curators today make selections and aid future generations to interpret those selections. A query-based harvesting strategy is presented and results from daily harvests for six topics defined by 145 queries over a 20-month period are discussed with respect to, query specification parameters, topic, and contribution patterns. The limitations of the strategy and these data are considered and suggestions are offered for curators who wish to use query-based harvesting.
- Blue Ribbon Task Force on Sustainable Digital Preservation and Access. (2009). Sustaining the Digital Investment: Issues and Challenges of Economically Sustainable Digital Preservation. http://brtf.sdsc.edu/biblio/BRTF_Interim_Report.pdfGoogle Scholar
- Capra, R., Lee, C., Marchionini, G., Russell, T., Shah, C., & Stutzman, F. (2008). Selection and Context Scoping for Digital Video Collections: An Investigation of YouTube and Blogs. Proceedings of ACM/IEEE JCDL 2008 (Pittsburgh, PA, June 16--20, 2008), 211--220. Google ScholarDigital Library
- Clemens, R., Capra, R., Lee, C., and Sheble, L. (2009). Contextual Information from Blogs in Video Digital Curation. Proceedings of Society of American Archivists 2008 Research Forum.Google Scholar
- Conway, P. (2000). Overview: Rational for digitization and preservation. In Handbook for digital projects: A management tool for preservation and access. Northeast Document Conservation Center, Andover, MA. http://www.nedcc.org/digital/dman.pdf.Google Scholar
- Lavoie, B. & Dempsey, L. (2004). Thirteen ways of looking at.digital preservation. D-Lib Magazine, 10(7/8). http://www.dlib.org/dlib/july04/lavoie/07lavoie.html.Google Scholar
- Lee, C. 2007. "Taking Context Seriously: A Framework for Contextual Information in Digital Collections." UNC SILS TR-2007-04.Google Scholar
- Maslov, A., Mikeal, A., & Leggett, J. (2009). Cooperation or Control? Web 2.0 and the Digital Library. Journal of Digital Information, 10(1), https://journals.tdl.org/jodi/issue/view/65.Google Scholar
- Najork, M. Wiener, J. (2001). Breadth-First Crawling Yields High-Quality Pages. In: Proceedings of the 10th International Conference on the World Wide Web (Hong Kong, May 01 -- 05, 2001). WWW '01, 114--118. ACM Press, New York, NY. Google ScholarDigital Library
- Pant, G., & Srinivasan, P. (2005). Learning to Crawl: Comparing Classification Schemes. ACM Trans. Inf. Syst. 23, 430--462. Google ScholarDigital Library
- Rosset, S., Neumann, E., Eick, U., Vatnik, N., and Idan, Y. 2002. Customer lifetime value modeling and its use for customer retention planning. In Proceedings of the Eighth ACM SIGKDD international Conference on Knowledge Discovery and Data Mining (Edmonton, Alberta, Canada, July 23 -- 26, 2002). KDD '02. ACM, New York, NY, 332--340. DOI= http://doi.acm.org/10.1145/775047.775097 Google ScholarDigital Library
- Shah, C., and Marchionini, G. 2007. Preserving 2008 US Presidential Election Videos. In the Proceedings of International Web Archiving Workshop (IWAW) 2007.Google Scholar
Index Terms
- Query parameters for harvesting digital video and associated contextual information
Recommendations
Digital Preservation, Archival Science and Methodological Foundations for Digital Libraries
Digital libraries, whether commercial, public, or personal, lie at the heart of the information society. Yet, research into their long-term viability and the meaningful accessibility of their contents remains in its infancy. In general, as we have ...
Digital preservation in a box: outreach resources for digital stewardship
JCDL '12: Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries"Digital Preservation in a Box" is a major activity of the National Digital Stewardship Alliance (NDSA) Outreach Working Group. This toolkit of digital stewardship outreach resources can be utilized by diverse communities as a gentle introduction to the ...
Building interoperable digital library services: MARIAN, open archives, and the NDLTD
SIGIR '01: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrievalIn this demonstration, we present interoperable and personalized search services for the Networked Digital Library of Theses and Dissertations (NDLTD). Using standard protocols and software, including those specified by the Open Archives Initiative (OAI)...
Comments