Abstract
This paper describes the automatic assignment of images into classes described by individual keywords provided with the Corel data set. Automatic image annotation technology aims to provide an efficient and effective searching environment for users to query their images more easily, but current image retrieval systems are still not very accurate when assigning images into a large number of keyword classes. Noisy features are the main problem, causing some keywords never to be assigned to their correct images. This paper focuses on improving image classification, first by selection of features to characterise each image, and then the selection of the most suitable feature vectors as training data. A Pixel Density filter (PDfilter) and Information Gain (IG) are proposed to perform these respective tasks. We filter out the noisy features so that groups of images can be represented by their most important values. The experiments use hue, saturation and value (HSV) colour feature space to categorise images according to one of 190 concrete keywords or subsets of these. The study shows that feature selection through the PDfilter and IG can improve the problem of spurious similarity.
Notes
Images can be characterised by texture as well as colour. Texture is typically described by the wavelet transform (Daubechies 1992). However, the emphasis of this paper is on colour.
A software review is given at: http://www.uottawa.ca/academic/cut/options/Nov_96/CorelCDs.htm.
A software review is given at: http://www.gtpcc.org/gtpcc/corelgallery.htm.
For more information about WordNet please visit http://www.wordnet.princeton.edu/.
Available at: http://www.cs.washington.edu/research/imagedatabase/.
References
Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison Wesley, England
Barnard K, Duygulu P, Forsyth D, de Freitas N, Blei DM, Jordan MI (2003) Matching words and pictures. J Mach Learn Res 3:1107–1135
Belew RK (2000) Finding out about: a cognitive perspective on search engine technology and the WWW. Cambridge University Press, Cambridge
Bishop CM (2006) Pattern recognition and machine learning. Springer, New York
Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. CRC Press, Boca Raton
Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Mining Knowl Discov 2(2):121–167
Carneiro G, Chan AB, Moreno PJ, Vasconcelos N (2007) Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans Pattern Anal Mach Intell 29(3):394–410
Daubechies I (1992) Ten lectures on wavelets. Society for Industrial and Applied Mathematics, Philadelphia
Del Bimbo A (1996) Image and video databases: visual browsing, querying and retrieval. J Vis Lang Comput 7(4):353–359
Del Bimbo A (1999) Visual information retrieval. Morgan Kaufmann, San Francisco
Eakins JP, Graham ME (1999) Content-based image retrieval: a report of the JISC technology applications programme. The Joint Information Systems Committee (JISC). http://www.jisc.ac.uk/uploaded_documents/jtap-039.doc (26 January 2007)
Gonzalez RC, Woods RE, Eddins SL (2004) Digital image processing using MATLAB. Pearson Prentice-Hall, Upper Saddle River
Grubinger M, Clough P, Müller H, Deselaers T (2006) The IAPR TC-12 Benchmark—a new evaluation resource for visual information systems. In: Proceedings of the International Workshop OntoImage’2006 Language Resources for Content-Based Image Retrieval, held in conjunction with LREC’06. Genoa, Italy, 22 May 2006, pp 13–23
Gupta A, Santini S, Jain R (1997) In search of information in visual media. Commun ACM 40(12):35–42
Howarth P, Rüger S (2004) Evaluation of texture features for content-based image retrieval. International Conference on Image and Video Retrieval (CIVR), Dublin, pp 326–334
Idris F, Panchanathan S (1997) Review of image and video indexing techniques. J Vis Commun Image Represent 8(2):146–166
Jain AK, Duin RPW, Mao J (2000) Statistical pattern recognition: a review. IEEE Trans Pattern Anal Mach Intell 22(1):4–37
Jeon J, Manmatha R (2004) using maximum entropy for automatic image annotation. In: Proceedings of the International Conference on Image and Video Retrieval, Dublin, Ireland, July 21–23 2004: 24–32
Jin X, French JC (2003) Improving image retrieval effectiveness via multiple queries. In: Proceedings of the First ACM International Workshop on Multimedia Database, New Orleans, LA, USA, pp 86–93
Jörgensen C, Jaimes A, Benitez AB, Chang S (2001) A conceptual framework and research for classifying visual descriptors. J Am Soc Inf Sci 52(11):938–947 Special Issue on Image Access: Bridging Multiple Needs and Multiple Perspectives
Lai T (2000) CHROMA: a photographic image retrieval system. PhD Thesis. University of Sunderland, UK
Lavrenko V, Manmatha R, Jeon J (2003) A model for learning the semantics of pictures. In: Proceedings of the International Conference on Neural Information Processing Systems, Vancouver, Canada, 8–13 December 2003
Lew MS (2001) Principles of visual information retrieval. Springer, London
Li J, Wang JZ (2003) Automatic linguistic indexing of pictures by a statistical modelling approach. IEEE Trans Pattern Anal Mach Intell 25(9):1075–1088
Long F, Zhang H, Feng DD (2003) Fundamentals of content-based image retrieval. In: Feng DD, Siu WC, Zhang H (eds) Multimedia information retrieval and management—technological fundamentals and applications. Springer, Germany
Mandal MK, Idris F, Panchanathan S (1999) A critical evaluation of image and video indexing techniques in the compressed domain. Image Vis Comput 17(7):513–529
Manning CD, Schütze H (1999) Foundations of statistical natural language processing. MIT, London
Mathias E, Conci A (1998) Comparing the influence of color spaces and metrics in content based image retrieval. In: Proceedings of the IEEE International Symposium on Computer Graphics, Image Processing, and Vision. Rio de Janeiro, Brazil, 20–23 October 1998, pp 371–378
Mitchell TM (1997) Machine learning. McGraw Hill, New York
Müller H, Müller W, Marchand-Maillet S, Pun T, Squire DM (2003) A framework for benchmarking in CBIR. Multimedia Tools Appl 21(1):55–73
Oakes MP (1998) Statistics for corpus linguistics. Edinburgh University Press, Edinburgh
Quinlan JR (1986) Induction of decision trees. Mach Learn 1(1):81–106
Shanbehzadeh J, Moghadam AME, Mahmoudi F (2000) Image indexing and retrieval techniques: past, present, and next. In: Proceedings of SPIE, The International Society for Optical Engineering, 3972, pp 461–490
Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J. 27, July and October: 379–423 and 623–656
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905
Smeaton AF, Kraaij W, OverP (2004) The TREC video retrieval evaluation (TRECVID): a case study and status report. In: Proceedings of the RIAO 2004 Conference. Avignon, France, 26–28 April 2004, pp 25–37
Swain MJ, Ballard DH (1991) Color indexing. Int J Comp Vis 7(1):11–32
Tsai C (2005) Automatically annotating images with keywords. PhD Thesis, University of Sunderland, UK
Tsai C, McGarry K, Tait J (2006) Qualitative evaluation of automatic assignment of keywords to images. Inf Process Manage 42(1):136–154
Vailaya A (2000) Semantic classification in image databases. PhD Thesis. Michigan State University, USA
van der Heijden F (1994) Image based measurement systems: object recognition and parameter estimation. Wiley, Chichester
Wu JK, Kankanhalli MS, Lim J, Hong D (2000) Perspectives on content-based multimedia systems. Kluwer Academic Publishers, London
Yang Y, Pedersen JO (1997) A comparative study on feature selection in text categorization. In: Proceedings of the Fourteenth International Conference on Machine Learning. 8–12 July 1997, pp 412–420
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lin, WC., Oakes, M., Tait, J. et al. Improving image annotation via useful representative feature selection. Cogn Process 10, 233–242 (2009). https://doi.org/10.1007/s10339-008-0247-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10339-008-0247-6