ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
Computer Vision and Image Understanding
Volume 94, Issues 1-3, April-June 2004, Pages 295-310
Special Issue: Colour for Image Indexing and Retrieval
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (453 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.cviu.2003.10.007    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2003 Elsevier Inc. All rights reserved.

Classifying offensive sites based on image content

Will Archer Arentz Corresponding Author Contact Information, E-mail The Corresponding Author and Bjørn Olstad E-mail The Corresponding Author

Department of Computer and Information Science, Norwegian University of Science and Technology, NO-7491, Trondheim, Norway

Received 1 December 2002; 
accepted 29 October 2003. 
Available online 23 December 2003.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

This paper proposes a method for helping to identify adult web sites by using the image-content as means of detecting erotic material. The image content is classified by investigating probable skin-regions, and extracting their feature vectors. These feature vectors are based on color-, texture-, contour-, placement-, and relative size-information for a given region. The importance of the different elements in the feature vector is determined by a genetic algorithm. For each picture, the algorithm gives the probability that a certain picture has erotic content. By mapping all the images in a web site, and running the image-based classifier on the whole collection, we were able to set up a histogram of images with regards to the log-likelihood of erotic content for each image. Hence giving a good overview of the web site’s content and at the same time leaving room for errors in the image-based classifier.

The algorithm proved to be quite successful in our tests where all 20 sites where classified correctly. The image-based classifier is able to properly identify 89% of the evaluation images at an average processing speed of 11 images per second.

Although this experiment focused on classifying adult web sites, small alterations to the system can be done, enabling classification of other kinds of images and web sites.

Author Keywords: Author Keywords: Object recognition; Erotica/pornography; Internet; Web site classification; Color; Image retrieval

Article Outline

1. Introduction
2. The image content examiner (ICE)
2.1. The composite object feature vector
2.2. Training by genetic algorithms
3. Web site classification
4. Experimental results
4.1. The image processing
4.2. The site classification
5. Further work
6. Conclusion
Acknowledgements
References











Computer Vision and Image Understanding
Volume 94, Issues 1-3, April-June 2004, Pages 295-310
Special Issue: Colour for Image Indexing and Retrieval
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.