A study of Gaussian mixture models of color and texture features for image classification and segmentation

doi:10.1016/j.patcog.2005.10.028

Pattern Recognition

Volume 39, Issue 4, April 2006, Pages 695-706

https://doi.org/10.1016/j.patcog.2005.10.028

Abstract

The aims of this paper are two-fold: to define Gaussian mixture models (GMMs) of colored texture on several feature spaces and to compare the performance of these models in various classification tasks, both with each other and with other models popular in the literature. We construct GMMs over a variety of different color and texture feature spaces, with a view to the retrieval of textured color images from databases. We compare supervised classification results for different choices of color and texture features using the VisTex database, and explore the best set of features and the best GMM configuration for this task. In addition, we introduce several methods for combining the ‘color’ and ‘structure’ information in order to improve classification performance. We then apply the resulting models to the classification of texture databases and to the classification of man-made and natural areas in aerial images. We compare the GMM with other models in the literature, and show an overall improvement in performance.

Introduction

In many domains of image processing, there is a strong correspondence between entities in the scene and textures¹ in the image. This implies that the ability to recognize these textures can furnish important semantic information about the scene. Consequently, the problems of texture description and classification, and the closely related problem of segmentation, have received considerable attention, with numerous approaches being proposed (Refs. [1], [2] and references therein). In particular, in the field of content-based image retrieval, the ability to answer the question: “Is there a significant amount of such-and-such texture in this image?”, can be the basis for many types of query.

Two variations on the problem exist: supervised and unsupervised segmentation. In the former, models of the texture associated with different entities in the scene are assumed known, and are then applied to the image in the hope of segmenting it into regions corresponding to those entities. Clearly this requires a training stage in which human beings group texture exemplars into classes, corresponding to the entities involved, from which the corresponding model parameters are then learnt. In the unsupervised case, no models are known a priori. Instead, the aim is to discover similarities in the data that betray the existence of one or more distinct classes into which the data can be divided. This may or may not involve explicitly learning the model parameters. When the entities in the scene into which the image should be segmented are not decided upon beforehand, as they often are not, unsupervised segmentation is methodologically ill-defined, since no specification of the ideal result is given. In supervised segmentation on the other hand, texture classes necessarily correspond to distinct entities in the scene, and the success or failure of the segmentation can be decided on this basis. In this paper, we consider the supervised texture segmentation problem.

Many kinds of statistical models have been applied to texture classification. These include Bayes classifiers assuming multivariate Gaussian distributions for the features [3], [4], [5], [6]; Fisher transformation [7], [8]; nonparametric nearest-neighbor classification [9], [10], [11], [12]; classification trees [8]; learning vector quantization [13], [14]; feed-forward neural networks [15]; and, more recently, support vector machines [16], [17] and multiple histograms combined with a self-organizing map [18]. In some earlier cases, the statistical modelling after feature extraction is just thresholding [19], [20], [21], [22] or simple extremum picking [23], [24], [25]. Markov random fields, and especially Gaussian Markov random fields, have been extensively used for texture modelling and segmentation since the early work in Ref. [26]. For a good review, see the paper by Geman and Graffigne [27]. Li and Gray [28] proposed a 2D hidden Markov model (HMM) for image classification, while a somewhat different model is the noncausal HMM described in Ref. [29].

Another recent class of models uses hidden Markov trees (HMTs) to model the joint statistics of wavelet coefficients. Tree models sacrifice some descriptive power (usually capturing only inter- rather than intra-scale dependencies) for ease of implementation (many algorithms that work in the case of linear graphs also work on trees, but not on more complicated models). HMT models were first introduced in Ref. [30], and were applied to texture analysis in Refs. [31], [32]. They are typically used, even in texture applications, with binary-valued hidden state variables that switch between high- and low-variance Gaussian distributions for the wavelet coefficients. This behavior is intended to capture the difference between edges and noisy but otherwise smooth regions in images, an important distinction for ‘edge-preserving denoising’. Indeed, for denoising, HMTs result in state-of-the-art algorithms. It is not clear, however, that they remain appropriate for single textures, whose statistics may differ markedly from those of natural images considered as a whole. In particular, the division into ‘edges’ and ‘noise’ seems strange in this context. HMTs are used in Ref. [33], where texture and color are combined in an HMT model. Texture is described using HMTs of grayscale wavelet coefficient magnitudes, while color is described using independent Gaussian distributions at each scale for the colored scaling coefficients.
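The two-state behavior of the HMT hidden variables can be written down for a single wavelet coefficient. The following is only a sketch: the zero-mean assumption is a common choice in the HMT literature but is not stated in the text here, and the symbols $w$, $\pi_S$, $\pi_L$, $\sigma_S$ and $\sigma_L$ are introduced purely for illustration.

```latex
% Marginal density of one wavelet coefficient w under a binary hidden state
% (S = low-variance "smooth" state, L = high-variance "edge" state).
% Zero means are an assumption made for this sketch.
p(w) \;=\; \pi_S \,\mathcal{N}\!\left(w;\, 0,\, \sigma_S^2\right)
      \;+\; \pi_L \,\mathcal{N}\!\left(w;\, 0,\, \sigma_L^2\right),
\qquad \pi_S + \pi_L = 1, \quad \sigma_L^2 > \sigma_S^2 .
```

In an HMT the state probabilities are not free per coefficient: the hidden state of each coefficient is tied to the state of its parent at the next coarser scale through a transition matrix, which is what produces the inter-scale dependencies mentioned above.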

In recent work [34], we proposed the use of Gaussian mixture models (GMMs) for texture classification, demonstrating improved performance over other, computationally more expensive methods. This paper is an extension of the work presented there. In related but differently directed work, Gray et al. [35] also used GMMs for image classification.

Section snippets

Classification, GMMs, and feature spaces

In this section, we describe in top-down fashion the models we will use. We begin with our general approach to the classification problem, and continue by describing the place of GMMs within that framework. Finally, we describe the various feature spaces on which the GMMs are defined. We assume throughout that we are dealing with N classes, labelled by $n \in N$.²
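As a rough illustration of how GMMs can sit inside a supervised classification framework, the sketch below scores a block of feature vectors under one GMM per class and picks the class with the highest total log-likelihood. The decision rule actually used in the paper is not shown in this snippet, so several things here are assumptions: the feature vectors in a block are treated as independent, the covariances are diagonal, the priors are equal unless `log_priors` is given, and all parameter values are made up (in practice they would be fitted to training data, for example with EM). The names `gmm_log_likelihood` and `classify` are hypothetical.

```python
import numpy as np


def gmm_log_likelihood(x, weights, means, variances):
    """Log-density of each row of x under a diagonal-covariance GMM.

    x: (T, d) feature vectors; weights: (K,); means, variances: (K, d).
    """
    x = np.atleast_2d(x)
    diff = x[:, None, :] - means[None, :, :]                      # (T, K, d)
    # Log-density of each diagonal Gaussian component at each feature vector.
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)  # (K,)
    log_comp = log_norm - 0.5 * np.sum(diff ** 2 / variances, axis=2)  # (T, K)
    weighted = np.log(weights) + log_comp
    # Log-sum-exp over components, stabilised by subtracting the row maximum.
    m = weighted.max(axis=1, keepdims=True)
    return (m + np.log(np.exp(weighted - m).sum(axis=1, keepdims=True)))[:, 0]


def classify(features, class_models, log_priors=None):
    """Return (label, scores): the class whose GMM gives the highest summed
    log-likelihood over the block of feature vectors, plus optional log-priors."""
    scores = np.array(
        [gmm_log_likelihood(features, *model).sum() for model in class_models]
    )
    if log_priors is not None:
        scores = scores + log_priors
    return int(np.argmax(scores)), scores


# Two hypothetical classes, each a 2-component GMM on a 2-D feature space.
class_models = [
    (np.array([0.5, 0.5]), np.array([[0.0, 0.0], [1.0, 1.0]]), np.ones((2, 2))),
    (np.array([0.3, 0.7]), np.array([[5.0, 5.0], [6.0, 4.0]]), np.ones((2, 2))),
]
block = np.array([[0.2, 0.1], [0.9, 1.2]])  # feature vectors from one image block
label, scores = classify(block, class_models)
print(label, scores)
```

Summing log-likelihoods over a block is one simple way to turn per-pixel features into a per-region decision; a segmentation would apply the same rule to many small blocks or neighborhoods.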

Experimental results

We used several databases to evaluate the classification performance of the GMMs, to study their properties, and to compare their performance with that of other statistical models. The experiments were conducted on the MIT Vision Texture (VisTex) database, on mosaic images created from the VisTex database, and on the aerial images of the San Francisco Bay area that were used in Refs. [28], [38], [39], [40].

Conclusion

We have described Gaussian mixture models (GMMs) of texture and color features. We have evaluated the classification performance of a variety of ‘color’ and ‘structure’ features and identified the most appropriate ones. In addition, we suggested several methods for combining the ‘color’ and ‘structure’ information, and analyzed the influence of GMM model selection and of using a diagonal rather than a full covariance matrix for each Gaussian. The advantage of using the

Summary

This work defines Gaussian mixture models (GMMs) of colored texture on several feature spaces and compares the performance of these models in various classification tasks, both with each other and with other models popular in the literature. The work evaluates classification performance on a variety of ‘color’ and ‘structure’ features and identifies the most appropriate ones. In addition, the work suggests several methods for combining the ‘color’ and ‘structure’ information and analyzes

Acknowledgments

This work was partially supported by EU project MOUMIR (HP-99-108, http://www.moumir.org). The authors would like to thank Prof. R.M. Gray and Prof. J. Li for providing the aerial image database that is used in the experiments section. They also wish to thank the reviewers for their helpful comments and suggestions.

About the Author—HAIM PERMUTER received his B.Sc. (summa cum laude) and M.Sc. (summa cum laude) degrees in Electrical and Computer Engineering from Ben-Gurion University, Israel, in 1997 and 2003, respectively. Between 1997 and 2004, he was an officer at a research and development unit of the Israeli Defense Forces. He is currently pursuing his Ph.D. degree in Electrical Engineering at Stanford University, CA. He is a recipient of both the Fulbright Fellowship and the Stanford Graduate

References (45)

T. Reed et al., A recent review of texture segmentation and feature extraction techniques, CVGIP Image Understanding (1993)

M. Unser, Local linear transforms for texture measurements, Signal Processing (1986)

J. Strand et al., Local frequency features for texture classification, Pattern Recognition (1994)

T. Ojala et al., A comparative study of texture measures with classification based on feature distributions, Pattern Recognition (1996)

P.P. Ohanian et al., Performance evaluation of four classes of textural features, Pattern Recognition (1992)

S. Singh et al., Nearest neighbour classifiers in natural scene analysis, Pattern Recognition (2001)

M. Pietikäinen et al., View-based recognition of real-world textures, Pattern Recognition (2004)

D.A. Reynolds et al., Speaker verification using adapted Gaussian mixture models, Digital Signal Process. (2000)

M. Tuceryan et al., Handbook of Pattern Recognition and Computer Vision (1993)

M. Unser, Texture classification and segmentation using wavelet frame, IEEE Trans. Image Process. (1995)

T.P. Weldon et al., Design of multiple Gabor filter for texture segmentation

T.P. Weldon et al., Integrated approach to texture segmentation using multiple Gabor filters

J.S. Weszka et al., A comparative study of texture measures for terrain classification, IEEE Trans. Systems Man Cybernet. (1976)

N. Saito et al., Local discriminator bases and their application, J. Math. Image Vis. (1995)

T. Randen et al., Multichannel filtering for image texture segmentation, Opt. Eng. (1994)

T. Randen et al., Filtering for texture classification: a comparative study, IEEE Trans. Pattern Anal. Mach. Intell. (1999)

F. Farrokhnia, Multi-channel filtering techniques for texture segmentation and surface quality inspection, Ph.D....

S. Li, J.T. Kwok, H. Zhu, Y. Wang, Texture Classification Using the Support Vector Machines,...

K. Kim et al., Support vector machines for texture classification, IEEE Trans. Pattern Anal. Mach. Intell. (2002)

M.M. Pietikäinen et al., Experiments with texture classification using averages local pattern matches, IEEE Trans. Systems Man Cybernet. (1983)

A. Teuner et al., Unsupervised texture segmentation of images using tuned matched Gabor filters, IEEE Trans. Image Process. (1995)

T. Randen, J.H. Husoy, Texture segmentation using filters with optimized energy separation, IEEE Trans. Image Process....

About the Author—JOSEPH FRANCOS received his B.Sc. degree in Computer Engineering in 1982, and his D.Sc. degree in Electrical Engineering in 1991, both from the Technion-Israel Institute of Technology. In 1993 he joined the ECE Department, Ben-Gurion University, where he is now an Associate Professor. His current research interests are in parametric modelling and estimation of 2-D random fields, random fields theory, image registration, and texture analysis and synthesis. Dr. Francos served as an Associate Editor for the IEEE Transactions on Signal Processing from 1999 to 2001.

About the Author—IAN JERMYN received his B.A. (Physics) from Oxford University (1986), his Ph.D. (Theoretical Physics) from Manchester University (1991), and his Ph.D. (Computer Science) from the Courant Institute (2000). He is currently a Senior Research Scientist in the Ariana group at INRIA. His research interests include shape and texture modelling.
