Adaptive Model for Integrating Different Types of Associated Texts for Automated Annotation of Web Images

Xu, Hongtao; Zhou, Xiangdong; Lin, Lan; Wang, Mei; Chua, Tat-Seng

doi:10.1007/978-3-540-92892-8_3

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5371)

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

Web images are accompanied by many associated texts on their host pages, such as the image file name, ALT text, and surrounding text. It is well known that the semantics of Web images are well correlated with these associated texts, so the texts can be used to infer image semantics. However, different types of associated texts may play different roles in deriving the semantics of Web content. Most previous work either treats the associated texts as a whole or assigns fixed weights to the different types according to prior knowledge or heuristics. In this paper, we propose a novel linear basic expansion-based approach to automatically annotate Web images based on their associated texts. In particular, we adaptively model the semantic contributions of different types of associated texts using a piecewise penalty weighted regression model. We also demonstrate that social tagging data for Web images, such as Flickr's Related Tags, can be leveraged to enhance annotation performance. Experiments conducted on a real Web image data set demonstrate that our approach can significantly improve the performance of Web image annotation.
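To make the contrast between fixed and learned weights concrete, the following is a minimal Python sketch, not the model from the paper. It assumes three hypothetical text types (`file_name`, `alt`, `surrounding`), uses relative term frequency as the per-type score, and learns one weight per text type by penalized least squares. A plain uniform L2 (ridge) penalty stands in for the piecewise penalty named in the abstract, whose form is not given here. All function names, parameters, and toy data are illustrative assumptions.

```python
# Illustrative sketch only: learns one weight per type of associated text.
# A uniform L2 penalty is used as a stand-in; the paper's piecewise
# penalty is not specified in the abstract and is NOT reproduced here.

TEXT_TYPES = ("file_name", "alt", "surrounding")  # hypothetical type names


def type_scores(texts, term):
    """Relative frequency of `term` in each type of associated text."""
    scores = []
    for text_type in TEXT_TYPES:
        tokens = texts.get(text_type, "").lower().split()
        scores.append(tokens.count(term) / len(tokens) if tokens else 0.0)
    return scores


def fit_weights(X, y, penalty=0.01, lr=0.5, steps=2000):
    """Minimize mean squared error plus an L2 penalty by gradient descent.

    X holds one row of per-type scores per (image, term) pair;
    y holds 1.0 if the term is a correct annotation, else 0.0.
    """
    n = len(X)
    w = [0.0] * len(X[0])
    for _ in range(steps):
        grad = [penalty * wj for wj in w]  # gradient of the penalty term
        for xi, yi in zip(X, y):
            err = sum(wj * xj for wj, xj in zip(w, xi)) - yi
            for j, xj in enumerate(xi):
                grad[j] += err * xj / n
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def annotate(texts, vocabulary, w, top_k=2):
    """Rank candidate terms by the weighted sum of their per-type scores."""
    def score(term):
        return sum(wj * sj for wj, sj in zip(w, type_scores(texts, term)))
    return sorted(vocabulary, key=score, reverse=True)[:top_k]


# Toy training data (made up): only the ALT-text score predicts relevance.
X_train = [[1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 0]]
y_train = [0.0, 1.0, 0.0, 1.0]
weights = fit_weights(X_train, y_train)

page_texts = {
    "file_name": "img 001",
    "alt": "sunset over beach at sunset",
    "surrounding": "click here to buy a camera",
}
labels = annotate(page_texts, ["sunset", "camera", "beach"], weights)
```

On this toy data the learned weight for the ALT text dominates the other two, so terms that appear in the ALT text outrank a term that appears only in the surrounding text. A fixed-weight baseline would correspond to skipping `fit_weights` and passing hand-chosen weights to `annotate`.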

Author information

Authors and Affiliations

School of Computer Science, Fudan University, Shanghai, China
Hongtao Xu & Xiangdong Zhou
National University of Singapore, Singapore
Xiangdong Zhou, Mei Wang & Tat-Seng Chua
Tongji University, Shanghai, China
Lan Lin

Editor information

Editors and Affiliations

Eurécom, 2229, route des crêtes, 06904, Sophia-Antipolis, France
Benoit Huet
Dublin City University, Dublin, Ireland
Alan Smeaton
Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA
Ketan Mayer-Patel
Image, Video and Multimedia Systems Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, 9 Iroon Polytechniou Str., 157 80, Athens, Greece
Yannis Avrithis

About this paper

Cite this paper

Xu, H., Zhou, X., Lin, L., Wang, M., Chua, T.-S. (2009). Adaptive Model for Integrating Different Types of Associated Texts for Automated Annotation of Web Images. In: Huet, B., Smeaton, A., Mayer-Patel, K., Avrithis, Y. (eds) Advances in Multimedia Modeling. MMM 2009. Lecture Notes in Computer Science, vol 5371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92892-8_3

DOI: https://doi.org/10.1007/978-3-540-92892-8_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92891-1
Online ISBN: 978-3-540-92892-8
eBook Packages: Computer Science, Computer Science (R0)
