An Adaptive Image Content Representation and Segmentation Approach to Automatic Image Annotation

Shi, Rui; Feng, Huamin; Chua, Tat-Seng; Lee, Chin-Hui

doi:10.1007/978-3-540-27814-6_64

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3115)

Included in the following conference series:

International Conference on Image and Video Retrieval

Abstract

Automatic image annotation has recently been studied intensively for content-based image retrieval. In this paper, we propose a novel approach to automatic image annotation based on two key components: (a) an adaptive visual feature representation of image contents based on matching pursuit algorithms; and (b) an adaptive two-level segmentation method. Together they address two important issues: segmenting images into meaningful units, and representing the contents of each unit with discriminative visual features. Using a set of about 800 training and testing images, we compare these techniques in image retrieval against other popular segmentation schemes and traditional non-adaptive feature representation methods. Our preliminary results indicate that the proposed approach outperforms competing systems based on the popular Blobworld segmentation scheme and on other prevailing feature representation methods, such as DCT and wavelets. In particular, our system achieves an F1 measure of over 50% for the image annotation task.
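As an illustration of the matching pursuit idea named in component (a), the following is a minimal sketch of the generic greedy algorithm (in the sense of Mallat and Zhang), not the authors' implementation. The abstract does not specify the dictionary, the stopping rule, or how coefficients become image features, so the toy dictionary, the `n_atoms` parameter, and the helper names `dot`, `normalize`, and `matching_pursuit` are all assumptions made for this example.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a vector to unit Euclidean norm."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def matching_pursuit(signal, dictionary, n_atoms):
    """Greedy matching pursuit over a dictionary of unit-norm atoms.

    At each step, pick the atom most correlated with the current
    residual, record its coefficient, and subtract its contribution.
    Returns (list of (atom_index, coefficient), final residual).
    """
    residual = list(signal)
    selected = []
    for _ in range(n_atoms):
        correlations = [dot(residual, atom) for atom in dictionary]
        best = max(range(len(dictionary)), key=lambda k: abs(correlations[k]))
        coeff = correlations[best]
        if abs(coeff) < 1e-12:  # nothing left to explain
            break
        residual = [r - coeff * a for r, a in zip(residual, dictionary[best])]
        selected.append((best, coeff))
    return selected, residual


# Hypothetical toy dictionary: two axis-aligned atoms and two "pair" atoms.
dictionary = [
    normalize(v)
    for v in ([1, 0, 0, 0], [0, 1, 0, 0], [1, 1, 0, 0], [0, 0, 1, 1])
]
signal = [3.0, 3.0, 0.5, 0.5]

selected, residual = matching_pursuit(signal, dictionary, n_atoms=2)
residual_energy = dot(residual, residual)
```

In this toy case the representation is adaptive in the sense that the selected atoms depend on the signal: the two pair atoms are chosen because they explain the most energy, and the residual energy drops to roughly zero after two steps. A fixed transform such as DCT would instead use the same basis for every input.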

Author information

Authors and Affiliations

School of Computing, National University of Singapore, Singapore
Rui Shi, Huamin Feng & Tat-Seng Chua
Beijing Electronic Science & Technology Institute, 100070, China
Huamin Feng
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Chin-Hui Lee

Editor information

Editors and Affiliations

School of Computing, Mathematical and Information Sciences, University of Brighton, UK
Peter Enser
Informatics and Telematics Institute, Centre for Research and Technology-Hellas, 57001, Thessaloniki, Greece
Yiannis Kompatsiaris
Centre for Digital Video Processing, Adaptive Information Cluster, Dublin City University, Ireland
Noel E. O’Connor
Dublin City University, Dublin, Ireland
Alan F. Smeaton
ISLA lab, Informatics Institute, University of Amsterdam, The Netherlands
Arnold W. M. Smeulders

About this paper

Cite this paper

Shi, R., Feng, H., Chua, TS., Lee, CH. (2004). An Adaptive Image Content Representation and Segmentation Approach to Automatic Image Annotation. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_64

DOI: https://doi.org/10.1007/978-3-540-27814-6_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive
