Elsevier

Pattern Recognition

Volume 39, Issue 4, April 2006, Pages 695-706
Pattern Recognition

A study of Gaussian mixture models of color and texture features for image classification and segmentation

https://doi.org/10.1016/j.patcog.2005.10.028Get rights and content

Abstract

The aims of this paper are two-fold: to define Gaussian mixture models (GMMs) of colored texture on several feature spaces and to compare the performance of these models in various classification tasks, both with each other and with other models popular in the literature. We construct GMMs over a variety of different color and texture feature spaces, with a view to the retrieval of textured color images from databases. We compare supervised classification results for different choices of color and texture features using the Vistex database, and explore the best set of features and the best GMM configuration for this task. In addition we introduce several methods for combining the ‘color’ and ‘structure’ information in order to improve the classification performances. We then apply the resulting models to the classification of texture databases and to the classification of man-made and natural areas in aerial images. We compare the GMM model with other models in the literature, and show an overall improvement in performance.

Introduction

In many domains of image processing, there is a strong correspondence between entities in the scene and textures1 in the image. This implies that the ability to recognize these textures can furnish important semantic information about the scene. Consequently, the problems of texture description and classification, and the closely related problem of segmentation, have received considerable attention, with numerous approaches being proposed (Refs. [1], [2] and references therein). In particular, in the field of content-based image retrieval, the ability to answer the question: “Is there a significant amount of such-and-such texture in this image?”, can be the basis for many types of query.

Two variations on the problem exist: supervised and unsupervised segmentation. In the former, models of the texture associated with different entities in the scene are assumed known, and are then applied to the image in the hope of segmenting it into regions corresponding to those entities. Clearly this requires a training stage in which human beings group texture exemplars into classes, corresponding to the entities involved, from which the corresponding model parameters are then learnt. In the unsupervised case, no models are known a priori. Instead, the aim is to discover similarities in the data that betray the existence of one or more distinct classes into which the data can be divided. This may or may not involve explicitly learning the model parameters. When the entities in the scene into which the image should be segmented are not decided upon beforehand, as they often are not, unsupervised segmentation is methodologically ill-defined, since no specification of the ideal result is given. In supervised segmentation on the other hand, texture classes necessarily correspond to distinct entities in the scene, and the success or failure of the segmentation can be decided on this basis. In this paper, we consider the supervised texture segmentation problem.

Many kinds of statistical models have been applied to texture classification. These include Bayes classifiers assuming multivariate Gaussian distributions for the features [3], [4], [5], [6]; Fisher transformation [7], [8]; nonparametric nearest-neighbor classification [9], [10], [11], [12]; classification trees [8]; learning vector quantization [13], [14]; feed-forward neural networks [15]; and recently support vector machine [16], [17] and multiple histogram combined with self-organized map [18] . In some earlier cases, the statistical modelling after the feature extraction is just thresholding [19], [20], [21], [22]; or simple extremum picking [23], [24], [25]. Markov random fields, and especially Gaussian Markov random fields have been extensively used for texture modelling and segmentation since the early work in Ref. [26]. For a good review, see the paper by Geman and Graffigne [27]. Li and Gray [28] proposed a 2D hidden Markov model (HMM) for image classification, while a somewhat different model is the noncausal HMM described in Ref. [29].

Another recent class of models uses hidden Markov trees (HMTs) to model the joint statistics of wavelet coefficients. Tree models sacrifice some descriptive power (usually only inter- rather than intra-scale dependencies) to ease of implementation (many algorithms that work in the case of linear graphs, also work on trees, but not on more complicated models). HMT models were first introduced in Ref. [30], and were applied to texture analysis in Refs. [31], [32]. They are typically used, even in texture applications, with binary-valued hidden state variables that switch between high and low variance Gaussian distributions for the wavelet coefficients. This behavior is intended to capture the difference between edges and noisy but otherwise smooth regions in images, an important distinction for ‘edge-preserving denoising’. Indeed, for denoising, HMTs result in state of the art algorithms. It is not clear however, that they remain appropriate for single textures, whose statistics may differ markedly from those for natural images considered as a whole. In particular, the division into ‘edges’ and ‘noise’ seems strange in this context. HMTs are used in Ref. [33], where texture and color are combined in an HMT model. Texture is described using HMTs of grayscale wavelet coefficient magnitudes, while color is described using independent Gaussian distributions at each scale for the colored scaling coefficients.

In recent work [34], we proposed the use of Gaussian mixture models (GMMs) for texture classification, demonstrating improved performance over other, computationally more expensive methods. This paper is an extension of the work presented there. In related but differently directed work, Gray et al. [35] also used GMMs for image classification.

Section snippets

Classification, GMMs, and feature spaces

In this section, we describe in top-down fashion the models we will use. We begin with our general approach to the classification problem, and continue by describing the place of GMMs within that framework. Finally, we describe the various feature spaces on which the GMMs are defined. We assume throughout that we are dealing with N classes, labelled by nN.2

Experimental results

We used several databases in order to evaluate the GMM classification performances, to study more about the GMM properties and to compare the GMM performances with other statistical models. The experiments were conducted on the MIT Vision Texture (VisTex) database, on mosaic images created from the VisTex database, and on the aerial images of the San Francisco Bay area that were used in Refs. [28], [38], [39], [40].

Conclusion

We have described Gaussian mixture models (GMMs) of texture and color features. We have evaluated the classification performances variety of ‘color’ and ‘structure’ features and found which are the most appropriate one. In addition we suggested several methods for combining the ‘color’ and ‘structure’ information and analyzed the influence of the model selection of the GMM and the influence of using a diagonal covariance versus the full covariance of the Gaussian. The advantage of using the

Summary

This work defines Gaussian mixture models (GMM) of colored texture on several feature spaces and compares the performance of these models in various classification tasks, both with each other and with other models popular in the literature. The work evaluates the classification performances on variety of ‘color’ and ‘structure’ features and finds which are the most appropriate one. In addition the work suggests several methods for combining the ‘color’ and ‘structure’ information and analyzes

Acknowledgments

This work was partially supported by EU project MOUMIR (HP-99-108, http://www.moumir.org). The Authors would like to thank Prof. R.M. Gray and Prof. J. Li for providing the ariel images database that is used in the experiments section. They also wish to thank the reviewers for their helpful comments and suggestions.

About the AuthorHAIM PERMUTER received his B.Sc. (summa cum laude) and M.Sc. (summa cum laude) degree in Electrical and Computer Engineering from the Ben-Gurion University, Israel, in 1997 and 2003, respectively. Between 1997 and 2004, he was an officer at a research and development unit of the Israeli Defense Forces. He is currently pursuing his Ph.D. degree in Electrical Engineering at the Stanford University, CA. He is a recipient of both the Fullbright Fellowship and the Stanford Graduate

References (45)

  • T.P. Weldon et al.

    Design of multiple gabor filter for texture segmentation

  • T.P. Weldon et al.

    Integrated approach to texture segmentation using multiple gabor filters

  • J.S. Weszka et al.

    A comparative study of texture measures for terrain classification

    IEEE Trans. Systems Man Cybernet.

    (1976)
  • N. Saito et al.

    Local discriminator bases and their application

    J. Math. Image Vis.

    (1995)
  • T. Randen et al.

    Multichannel filtering for image texture segmentation

    Opt. Eng.

    (1994)
  • T. Randen et al.

    Filtering for texture classification: a comparative study

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1999)
  • F. Farrokhnia, Multi-channel filtering techniques for texture segmentation and surface quality inspection, Ph.D....
  • S. Li, J.T. Kwok, H. Zhu, Y. Wang, Texture Classification Using the Support Vector Machines,...
  • K. Kim et al.

    Support vector machines for texture classification

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2002)
  • M.M. Pietikäinen et al.

    Experiments with texture classification using averages local pattern matches

    IEEE Trans. Systems Man Cybernet.

    (1983)
  • A. Teuner et al.

    Unsupervised texture segmentation of images using tuned matched gabor filters

    IEEE Trans. Image Process.

    (1995)
  • T. Randen, J.H. Husoy, Texture segmentation using filters with optimized energy separation, IEEE Trans. Image Process....
  • Cited by (339)

    • Group benefits instance for data purification

      2024, Computers and Electrical Engineering
    • Suppressing Uncertainty in Gaze Estimation

      2024, Proceedings of the AAAI Conference on Artificial Intelligence
    View all citing articles on Scopus

    About the AuthorHAIM PERMUTER received his B.Sc. (summa cum laude) and M.Sc. (summa cum laude) degree in Electrical and Computer Engineering from the Ben-Gurion University, Israel, in 1997 and 2003, respectively. Between 1997 and 2004, he was an officer at a research and development unit of the Israeli Defense Forces. He is currently pursuing his Ph.D. degree in Electrical Engineering at the Stanford University, CA. He is a recipient of both the Fullbright Fellowship and the Stanford Graduate Fellowship (SGF).

    About the AuthorJOSEPH FRANCOS received his B.Sc. degree in Computer Engineering in 1982, and his D.Sc. degree in Electrical Engineering in 1991, both from the Technion-Israel Institute of Technology. In 1993 he joined the ECE Department, Ben-Gurion University, where he is now an Associate Professor. His current research interests are in parametric modelling and estimation of 2-D random fields, random fields theory, image registration, and texture analysis and synthesis. Dr. Francos served as an Associate Editor for the IEEE Transactions on Signal Processing from 1999 to 2001.

    About the AuthorIAN JERMYN received his B.A. (Physics) from the Oxford University (1986), his Ph.D. (Theoretical Physics) from the Manchester University (1991), and his Ph.D. (Computer Science) from the Courant Institute (2000). He is currently a Senior Research Scientist in the Ariana group at INRIA. His research interests include shape and texture modelling.

    View full text