Abstract
As a storage and retrieval unit of user generated web objects, set has been receiving increased attention recently in information retrieval research community. Set search requires relevant sets to be retrieved to meet information needs of users. It is different from individual object search in terms of content granularity. While a web object itself is not divisible and independent with each other, a set consists of separable objects that are related in some aspects. This paper proposes a new approach that can effectively measure topical relevance of sets against a user query by utilizing the tags attached to web objects. The main idea of the proposed approach is to prefer the set which covers as many query related subtopics as possible. In particular, in order to compute the topical relevance while addressing the problem of noisy tags, the notion of tag significance score is introduced based on tag co-occurrence frequency. We consider a problem domain of photo set search at flickr.com where individual photos are annotated with texts such as titles and tags. Experimental results show that our proposed method outperforms the previous approaches for photo set retrieval.
Similar content being viewed by others
References
Bao S, Wu X, Fei B, Xue G, Su Z, Yu Y (2007) Optimizing web search using social annotations. In: Proceedings of the 16th international conference on World Wide Web. ACM, Alberta, Canada, pp 501–510
Carbonell J, Goldstein J (1998) The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval. ACM, Melbourne, Australia, pp 335–336
Dai W, Srihari R (2005) Minimal document set retrieval. In: Proceedings of the 14th ACM international conference on Information and knowledge management. ACM, Bremen, Germany, pp 752–759
Daniyal F, Taj M, Cavallaro A (2010) Content and task-based view selection from multiple video streams. Multimed Tools Appl 46(2–3):235–258
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60
Halpin H, Robu V, Shepherd H (2007) The complex dynamics of collaborative tagging. In: Proceedings of the 16th international conference on World Wide Web. ACM, Alberta, Canada, pp 211–220
Han SK, Shin D, Jung J, Park J (2009) Exploring the relationship between keywords and feed elements in blog post search. World Wide Web 12(4):381–398
Hua G, Tian Q (2009) What can visual content analysis do for text based image search?. In: Proceedings of the 2009 IEEE international conference on Multimedia and Expo, New York, USA, pp 1480–1483
Järvelin K, Kekäläinen J (2002) Cumulated gain-based evaluation of IR techniques. ACM Trans Inform Syst 20(4):422–446
Jin Y, Khan L, Wang L, Awad M (2005) Image annotations by combining multiple evidence & wordnet. In: Proceedings of the 13th annual ACM international conference on Multimedia. ACM, Singapore, pp 706–715
JungWon Y (2009) Towards a user-oriented thesaurus for non-domain-specific image collections. Inform Process Manag 45(4):452–468
Kennedy LS, Naaman M (2008) Generating diverse and representative image search results for landmarks. In: Proceeding of the 17th international conference on World Wide Web. ACM, Beijing, China, pp 297–306
Kherfi ML, Ziou D (2004) Image retrieval from the world wide web: issues, techniques, and systems. ACM Comput Surv 36(1):35–67
Koutrika G, Effendi FA, Gyöngy Z, Heymann P, Garcia-Molina H (2008) Combating spam in tagging systems: an evaluation. ACM Trans Web 2(4):1–34
Lavrenko V, Croft WB (2001) Relevance based language models. In: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, New Orleans, USA, pp 120–127
Lee S, Park J (2009) A scoring function for retrieving photo sets with broad topic coverage. In: Proceedings of 2009 Fifth International Joint Conference on INC, IMS and IDC, Seoul, Korea, pp 1577–1580
Li X, Snoek CGM, Worring M (2009) Learning tag relevance by neighbor voting for social image retrieval. IEEE Trans Multimed 11(7):1310–1322
Lindstaedt S, Mörzinger R, Sorschag R, Pammer V, Thallinger G (2009) Automatic image annotation using visual content and folksonomies. Multimed Tools Appl 42(1):97–113
Manish G, Rui L, Zhijun Y, Jiawei H (2009) Survey on social tagging techniques. SIGKDD Explor 12(1):58–72
Melucci M (2007) On rank correlation in information retrieval evaluation. ACM SIGIR Forum 41(1):18–33
Ogilvie P, Callan J (2001) Experiments using the Lemur toolkit. Proceedings of the 10th Text Retrieval Conference (TREC-10)
Powell AL, French JC (2003) Comparing the performance of collection selection algorithms. ACM Trans Inform Syst 21(4):412–456
Sawant N, Li J, Wang JZ (2011) Automatic image semantic interpretation using social action and tagging data. Multimed Tools Appl 51(1):213–246
Seo J, Croft WB (2008) Blog site search using resource selection. In:. Proceeding of the 17th ACM conference on Information and knowledge management. ACM, Napa Valley, USA, pp 1053–1062
Sevil SG, Kucuktunc O, Duygulu P, Can F (2010) Automatic tag expansion using visual similarity for photo sharing websites. Multimed Tools Appl 49(1):81–99
Sigurbjörnsson B, Zwol RV (2008) Flickr tag recommendation based on collective knowledge. In: Proceeding of the 17th international conference on World Wide Web. ACM, Beijing, China, pp 327–336
Swaminathan A, Mathew CV, Kirovski D (2008). Essential pages. Technical Report MSR-TR-2008-015, Microsoft Research
Wu L, Yang L, Yu N, Hua XS (2009) Learning to tag. In: Proceeding of the 18th international conference on World Wide Web. ACM, Madrid, Spain, pp 361–370
Wu L, Hua X, Yu N, Ma W, Li S (2008) Flickr distance. In: Proceeding of the 16th ACM international conference on Multimedia. ACM, Canada, pp 31–40
Xirong L, Snoek CGM, Worring M (2009) Annotating images by harnessing worldwide user-tagged photos. In: Proceedings of the 2009 IEEE International Conference on Acoustics, speech and signal processing, Taipei, Taiwan, pp 3717–3720
Zhai CX, Cohen WW, Lafferty J (2003) Beyond independent relevance: methods and evaluation metrics for subtopic retrieval. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, Toronto, Canada, pp 10–17
Zhuang J, Hoi SCH (2011) A two-view learning approach for image tag ranking. In: Proceedings of the fourth ACM international conference on Web search and data mining. ACM, Hong Kong, China, pp 625–634
Zwol RV (2007) Flickr: who is looking?. In: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, Silicon Valley, USA, pp 184–190
Zwol RV, Murdock V, Ramirez G (2008). Diversifying image search with user generated content. In: Proceeding of the 1st ACM international conference on Multimedia information retrieval. ACM, Vancouver, Canada, pp 67–74
Acknowledgment
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (No. 2010-0012967) and partly by Engineering Research Institute at Seoul National University.
Author information
Authors and Affiliations
Corresponding author
Additional information
A preliminary version of this work appeared in Proceedings of the 5th International Conference on Digital Content, Multimedia Technology and its Applications.
Rights and permissions
About this article
Cite this article
Lee, S., Park, J. Topic based photo set retrieval using user annotated tags. Multimed Tools Appl 64, 7–26 (2013). https://doi.org/10.1007/s11042-011-0850-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-011-0850-x