Skip to main content
Log in

Automatic Detection of Human Nudes

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

This paper demonstrates an automatic system for telling whether there are human nudes present in an image. The system marks skin-like pixels using combined color and texture properties. These skin regions are then fed to a specialized grouper, which attempts to group a human figure using geometric constraints on human structure. If the grouper finds a sufficiently complex structure, the system decides a human is present. The approach is shown to be effective for a wide range of shades and colors of skin and human configurations. This approach offers an alternate view of object recognition, where an object model is an organized collection of grouping hints obtained from a combination of constraints on color and texture and constraints on geometric properties such as the structure of individual parts and the relationships between parts. The system demonstrates excellent performance on a test set of 565 uncontrolled images of human nudes, mostly obtained from the internet, and 4289 assorted control images, drawn from a wide variety of sources.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ahmad, S. 1995. Ausable real-time 3d hand tracker. In 28th Asilomar Conference on Signals, Systems and Computers.

  • Akita, K. 1984. Image sequence analysis of realworld human motion. Pattern Recognition, 17(1):73–83.

    Google Scholar 

  • Biederman, I. 1987. Recognition-by-components: A theory of human image understanding. Psych. Review, 94(2):115–147.

    Google Scholar 

  • Binford, T.O. 1971. Visual perception by computer. In Proc. IEEE Conference on Systems and Control.

  • Brady, J.M. and Asada, H. 1984. Smoothed local symmetries and their implementation. International Journal of Robotics Research, 3(3).

  • Bregler, C. and Malik, J. 1998. Tracking people with twists and exponential maps. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 8–15.

  • Brooks, R.A. 1981. Symbolic reasoning among 3-D models and 2-D images. Artificial Intelligence, 17(1–3):285–348.

    Google Scholar 

  • Burl, M.C., Leung, T.K., and Perona, P. 1995. Face localisation via shape statistics. In Int. Workshop on Automatic Face and Gesture Recognition.

  • Canny, J.F. 1986. A computational approach to edge detection. IEEE Trans. Patt. Anal. Mach. Intell., 8(6):679–698.

    Google Scholar 

  • Chen, Z. and Lee, H.-J. 1992. Knowledge-guided visual perception of 3d human gait from a single image sequence. IEEE T. Pattern Analysis and Machine Intelligence, 22(2):336–342.

    Google Scholar 

  • Enser, P.G.B. 1993. Query analysis in a visual information retrieval context. J. Document and Text Management, 1(1):25–52.

    Google Scholar 

  • Faugeras, O.D. and Hebert, M. 1986. The representation, recognition, and locating of 3-D objects. International Journal of Robotics Research, 5(3):27–52, Fall.

    Google Scholar 

  • Fleck, M.M. 1994. Practical edge finding with a robust estimator. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 649–653.

  • Flickner, M., Sawhney, H., Niblack, W., and Ashley, J. 1995. Query by image and video content: the qbic system. Computer, 28(9): 23–32.

    Google Scholar 

  • Forsyth, D.A. and Fleck, M.M. 1997. Body plans. In IEEE Conf. on Computer Vision and Pattern Recognition.

  • Forsyth, D.A., Malik, J., Fleck, M.M., Greenspan, H., Leung, T., Belongie, S., Carson, C., and Bregler, C. 1996. Finding pictures of objects in large collections of images. In Proc. 2'nd International Workshop on Object Representation in Computer Vision.

  • Forsyth, D.A., Mundy, J.L., Zisserman, A.P., Coelho, C., Heller, A., and Rothwell, C.A. 1991. Invariant descriptors for 3d object recognition and pose. PAMI, 13(10):971–991

    Google Scholar 

  • Gavrila, D.M. and Davis, L.S. 1996. 3d model-based tracking of humans in action: A multi-view approach. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 73–80.

  • Gershon, R., Jepson, A.D., and Tsotsos, J.K. 1986. Ambient illumination and the determination of material changes. J. Opt. Soc. America, A-3(10):1700–1707.

    Google Scholar 

  • Gorkani, M.M. and Picard, R.W. 1994.Texture orientation for sorting photos “at a glance”. In Proceedings IAPR International Conference on Pattern Recognition, pp. 459–464.

  • Grimson, W.E.L. and Lozano-Pérez, T. 1987. Localizing overlapping parts by searching the interpretation tree. IEEE Trans. Patt. Anal. Mach. Intell., 9(4):469–482.

    Google Scholar 

  • Hampapur, A., Gupta, A., Horowitz, B., and Shu, C.-F. 1997. Virage video engine. In Storage and Retrieval for Image and Video Databases V—Proceedings of the SPIE, vol. 3022, pp. 188–98.

    Google Scholar 

  • Hoaglin, D.C., Mosteller, F., and Tukey, J.W. (Eds.). 1983. Understanding Robust and Exploratory Data Analysis. John Wiley.

  • Hogg, D. 1983. Model based vision: a program to see a walking person. Image and Vision Computing, 1(1):5–20.

    Google Scholar 

  • Huang, C.-Y., Camps, O.T., and Kanungo, T. 1997. Object recognition using appearance-based parts and relations. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 877–883.

  • Huttenlocher, D.P. and Ullman, S. 1987. Object recognition using alignment. In Proc. Int. Conf. Comp. Vision, London, U.K., pp. 102–111.

  • Jacobs, C.E., Finkelstein, A., and Salesin, D.H. 1995. Fast multiresolution image querying. In Proc SIGGRAPH-95, pp. 277–285.

  • Kelly, P.M., Cannon, M., and Hush, D.R. 1995. Query by image example: The comparison algorithm for navigating digital image databases (candid) approach. In SPIE Proc. Storage and Retrieval for Image and Video Databases III, pp. 238–249.

    Google Scholar 

  • Koenderink, J.J. and Van Doorn, A.J. 1979. The internal representation of solid shape with respect to vision. Biological Cybernetics, 32:211–216.

    Google Scholar 

  • Kriegman, D.J. and Ponce, J. 1990. On recognizing and positioning curved 3D objects from image contours. IEEE Trans. Patt. Anal. Mach. Intell., 12(12):1127–1137.

    Google Scholar 

  • Leung, M.K. and Yang, Y.-H. 1995. First sight: A human body labelling system. IEEE T. Pattern Analysis and Machine Intelligence, 17(4):359–377.

    Google Scholar 

  • Leung, T.K., Burl, M.C., and Perona, P. 1995. Finding faces in cluttered scenes using random labelled graph matching. In Int. Conf. on Computer Vision.

  • Lipson, P., Grimson, W.E.L., and Sinha, P. 1997. Configuration based scene classification and image indexing. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1007–1013.

  • Liu, F. and Picard, R.W. 1996. Periodicity, directionality, and randomness: Wold features for image modeling and retrieval. IEEE T. Pattern Analysis and Machine Intelligence, 18:722–733.

    Google Scholar 

  • Lowe, D. 1987. Three-dimensional object recognition from single two-dimensional images. Artificial Intelligence, 31(3): 355–395.

    Google Scholar 

  • Marr, D. and Nishihara, K. 1978. Representation and recognition of the spatial organization of three-dimensional shapes. Proc. Royal Society London, B-200:269–294.

    Google Scholar 

  • Minka, T.P. and Picard, R.W. 1997. Interactive learning with a “society of models”. Pattern Recognition, 30:465–481.

    Google Scholar 

  • Mundy, J.L. and Zisserman, A. 1992. Geometric Invariance in Computer Vision. MIT Press: Cambridge, Mass.

    Google Scholar 

  • Mundy, J.L., Zisserman, A., and Forsyth, D. 1994. Applications of Invariance in Computer Vision, volume 825 of Lecture Notes in Computer Science, Springer-Verlag.

  • Murase, H. and Nayar, S. 1995. Visual learning and recognition of 3D objects from appearance. Int. J. of Comp. Vision, 14(1):5–24.

    Google Scholar 

  • Nayar, S.K., Nene, S.A., and Murase, H. 1996. Real time 100 object recognition system. In Int. Conf. on Robotics and Automation, pp. 2321–2325.

  • Nevatia, R. and Binford, T.O. 1977. Description and recognition of complex curved objects. Artificial Intelligence, 8:77–98.

    Google Scholar 

  • Ogle, V.E. and Stonebraker, M. 1995. Chabot: retrieval from a relational database of images. Computer, 28:40–48.

    Google Scholar 

  • Oren, M., Papageorgiou, C., Sinha, P., and Osuna, E. 1997. Pedestrian detection using wavelet templates. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 193–199.

  • O'Rourke, J. and Badler, N. 1980. Model-based image analysis of human motion using constraint propagation. IEEE T. Pattern Analysis and Machine Intelligence, 2:522–546.

    Google Scholar 

  • Osuna, E., Freund, R., and Girosi, F. 1997. Training support vector machines: An application to face detection. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 130–136.

  • Pentland, A., Moghaddam, B., and Starner, T. 1994. View-based and modular eigenspaces for face recognition. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 84–91.

  • Pentland, A., Picard, R., and Sclaroff, S. 1996. Photobook: Contentbased manipulation of image databases. Int. J. Computer Vision, 18(3):233–254.

    Google Scholar 

  • Picard, R.W., Kabir, T., and Liu, F. 1993. Real-time recognition with the entire brodatz texture database. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 638–639.

  • Plantinga, H. and Dyer, C. 1990. Visibility, occlusion, and the aspect graph. Int. J. of Comp. Vision, 5(2):137–160.

    Google Scholar 

  • Poggio, T. and Sung, K.-K. 1995. Finding human faces with a gaussian mixture distribution-based face model. In Asian Conf. on Computer Vision, pp. 435–440.

  • Pope, A.R. and Lowe, D.G. 1993. Learning object recognition models from images. In Int. Conf. on Computer Vision, pp. 296–301.

  • Proceedings of the First Int. Workshop on Automatic Face-and Gesture-Recognition, 1995.

  • Proceedings of the Second Int. Workshop on Automatic Face-and Gesture-Recognition, 1996.

  • Rehg, J. and Kanade, T. 1995. Model-based tracking of selfoccluding articulated objects. In ICCV-95, pp. 612–617.

  • Roberts, L.G. 1965. Machine perception of three-dimensional solids. In Optical and Electro-Optical Information Processing, J.T. Tippett et al. (Ed.), MIT Press: Cambridge, pp. 159–197.

    Google Scholar 

  • Rohr, K. 1993. Incremental recognition of pedestrians from image sequences. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 9–13.

  • Rossotti, H. 1983. Colour: Why the World Isn't Grey. Princeton University Press: Princeton, NJ.

    Google Scholar 

  • Rowley, H.A., Baluja, S., and Kanade, T. 1996. Human face detection in visual scenes. In Advances in Neural Information Processing 8, D.S. Touretzky, M.C. Mozer, and M.E. Hasselmo (Eds.), pp. 875–881.

  • Rowley, H.A., Baluja, S., and Kanade, T. 1996. Neural networkbased face detection. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 203–208.

  • Sanger, D., Haneishi, H., and Miyake, Y. 1995. Method for light source discrimination and facial pattern detection from negative colour film. J. Imaging Science and Technology, 2:166–175.

    Google Scholar 

  • Santini, S. and Jain, R. 1996. Similarity queries in image databases. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 646–651.

  • Stricker, M. and Swain, M.J. 1994. The capacity of color histogram indexing. In IEEE Conf. on Computer Vision and Pattern Recognition, pp. 704–708.

  • Swain, M.J. 1993. Interactive indexing into image databases. In Storage and Retrieval for Image and Video Databases—Proceedings of the SPIE, vol. 1908, pp. 95–103.

    Google Scholar 

  • Swain, M.J. and Ballard, D.H. 1991. Color indexing. Int. J. Computer Vision, 7(1):11–32.

    Google Scholar 

  • Thompson, D.W. and Mundy, J.L. 1987. Three-dimensional model matching from an unconstrained viewpoint. In IEEE Int. Conf. on Robotics and Automation, Raleigh, NC, pp. 208–220.

  • Ullman, S. 1996. High-level Vision: Object Recognition and Visual Cognition. MIT Press.

  • Ullman, S. and Basri, R. 1991. Recognition by linear combination of models. IEEE Trans. Patt. Anal. Mach. Intell., 13(10): 992–1006.

    Google Scholar 

  • White, D.A. and Jain, R. 1997. Imagegrep: Fast visual pattern matching in image databases. In Storage and Retrieval for Image and Video Databases V—Proceedings of the SPIE, vol. 3022, pp. 96–107.

    Google Scholar 

  • White house ‘couples’ set off indecency program. Iowa City Press Citizen, 24 Feb., 1996.

  • Wren, C., Azabayejani, A., Darrell, T., and Pentland, A. 1995. Pfinder: Real-time tracking of the human body. MIT Media lab perceptual computing section tr 353, MIT.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Forsyth, D., Fleck, M. Automatic Detection of Human Nudes. International Journal of Computer Vision 32, 63–77 (1999). https://doi.org/10.1023/A:1008145029462

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008145029462

Navigation