Multiscale texture classification using dual-tree complex wavelet transform

doi:10.1016/j.patrec.2008.10.006

Pattern Recognition Letters

Volume 30, Issue 3, 1 February 2009, Pages 331-339

https://doi.org/10.1016/j.patrec.2008.10.006

Abstract

This paper presents a multiscale texture classifier that exploits the Gabor-like properties of the dual-tree complex wavelet transform, namely approximate shift invariance and six directional subbands at each scale. Its feature vector comprises the variance and the entropy of each directional subband at each scale. Experimental results demonstrate that it is robust against noise and achieves a higher classification accuracy than a classifier based on the discrete wavelet transform.

Introduction

Numerous methods have been proposed for texture feature extraction and classification (Tuceryan and Jain, 1993). A comparative study (Randen and Husoy, 1999) suggests that for texture classification it is preferable to extract texture features by learning discriminative texture features from texture samples. Most approaches to texture analysis use a hybrid of different methodologies, making it difficult to categorize them. By considering the main methodology used in texture analysis, we can loosely classify them into three main categories (Kim and Kang, 2007).

The statistical approach to texture analysis is motivated by the finding that the human visual system recognizes textured objects based on the statistical distribution of their image grey-levels via first-order, second-order, or higher-order statistics (Julesz et al., 1973, Julesz, 1962). The most commonly used method is the grey-level co-occurrence matrix (Haralick et al., 1973), which estimates texture properties related to second-order statistics. It is worth pointing out that although statistical textural features are usually used to classify texture regions, most of them are extracted, explicitly or implicitly, from a statistical representation of the texture.
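To make the second-order statistics concrete, the following minimal sketch builds a grey-level co-occurrence matrix for a single pixel offset and derives the Haralick contrast from it. It is an illustration only: the function names, the four-level toy images and the choice of a non-symmetric matrix for one offset are assumptions, not details taken from the cited works.

```python
from collections import Counter


def glcm(image, dx=1, dy=0, levels=4):
    """Normalised grey-level co-occurrence matrix for one pixel offset (dx, dy)."""
    counts = Counter()
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                # Count the pair (grey level here, grey level at the offset pixel).
                counts[(image[r][c], image[r2][c2])] += 1
    total = sum(counts.values())
    return [[counts[(i, j)] / total for j in range(levels)] for i in range(levels)]


def contrast(p):
    """Haralick contrast: sum over (i, j) of (i - j)^2 * p(i, j)."""
    n = len(p)
    return sum((i - j) ** 2 * p[i][j] for i in range(n) for j in range(n))


# Toy images with grey levels 0..3 (assumed for illustration).
stripes = [[0, 3, 0, 3] for _ in range(4)]  # vertical stripes
flat = [[1, 1, 1, 1] for _ in range(4)]     # uniform patch

contrast_stripes = contrast(glcm(stripes))
contrast_flat = contrast(glcm(flat))
```

With a horizontal offset of one pixel, the striped patch yields a large contrast and the uniform patch yields zero, which is the kind of second-order property such features are meant to capture.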

The model-based approach to texture analysis represents textures by mathematical, perceptually motivated image models. The key problems with this approach are how to choose a suitable model for characterizing the selected textures and how to estimate the parameters of that model according to some criterion. Another concern is that intensive computation is usually required to determine the model parameters. The derived parameters are used as features that capture the perceived essential qualities of the texture. Commonly used texture models include the autoregressive (AR) model (Randen and Husoy, 1999, Cariou and Chehdi, 2008), the Markov random field (MRF) (Cohen et al., 1991), and the Wold decomposition model (Liu and Picard, 1996). The AR model is a linear model derived from the training samples via least-mean-squares fitting. Using cliques, an MRF attempts to describe the relationship between texture pixels within a region of interest; its optimization is based on maximizing the a posteriori probability. The Wold decomposition model is a perceptual texture model that decomposes textures into deterministic and non-deterministic fields, which correspond to the regular and random textural components, respectively. Usually, these models can capture the local contextual information in a texture image. However, the model parameters are optimized based on an image's representative features rather than its discriminative features.

The signal processing approach to texture analysis includes multichannel Gabor filters, wavelet transforms and finite impulse response (FIR) filters. The Gabor filter is appealing because of its simplicity and its support from neurophysiological experiments (Faugeras, 1978). Gabor filters have been used for texture segmentation despite being based on texture reconstruction (Jain and Farrokhnia, 1991, Arivazhagan et al., 2006). A general filter bank is often too large because it is designed to capture general texture properties. However, textures can be classified with only a small set of filters, which gives rise to the filter selection problem. For example, a neural network system has been used to select a minimum set of Gabor filters for texture discrimination while keeping the performance at an acceptable level compared to the case without filter selection (Jain and Karu, 1996). In these filtering methods, texture images are usually decomposed into several feature images by projecting them onto a set of selected filters. These filters are often designed for representation, so that textures can be reconstructed with minimum information loss. In contrast, our proposed approach extracts features that maximize the separation or discrimination among different textures. The wavelet-based methods are similar to the Gabor-based methods, with the Gabor filters replaced by the discrete wavelet transform (DWT) (Wang et al., 1998, Laine and Fan, 1993, Arivazhagan and Ganesan, 2003a, Arivazhagan and Ganesan, 2003b, Muneeswarana et al., 2005, Kim and Kang, 2007, Kokare et al., 2007, Hiremath and Shivashankar, 2008). Since the DWT is shift variant, a shift in the signal degrades the performance of DWT-based classifiers.

For the purpose of pattern discrimination, the linear Fisher discriminant (Duda et al., 2001) can incorporate feature extraction, dimensionality reduction and discrimination. However, the optimality of its linear solution depends heavily on the assumption that the input patterns have equal covariance matrices, which is usually not true for real data. To overcome this limitation, a kernel version of the Fisher discriminant has recently been developed for non-linear discriminative feature extraction. The optimal solution of the kernel Fisher discriminant corresponds to the optimal Bayesian classifier, which minimizes the classification (Bayesian) error rate (Schölkopf and Smola, 2002, Mika et al., 1999). However, these methods are computationally expensive.

Since the dual-tree complex wavelet transform (DT-CWT) (Kingsbury, 2001) is a special case of the Gabor filters with complex coefficients, it has the directional advantages of Gabor filters but requires less computation. Furthermore, it improves on the DWT because it is approximately shift invariant and has good directional selectivity in two dimensions. Thus, we propose a computationally efficient multiscale texture classifier using DT-CWT which exploits these advantages. The proposed multiscale texture classifier utilises the benefits of the multiresolution structure of DT-CWT for multiscale feature extraction.

This paper is organized as follows. Section 2 presents DT-CWT. Section 3 presents the proposed multiscale texture classifier, and the learning and classification of texture features for different classes. The experimental results and discussions are presented in Section 4. Finally, Section 5 concludes the paper.

Section snippets

Dual-tree complex wavelet transform

Standard DWTs suffer from shift variance, i.e., the distribution of image energy between the levels of a multiscale decomposition can vary significantly if the original image is shifted prior to decomposition. In order to address the problem of shift variance, complex wavelets have been proposed. A complex wavelet is a pair of real wavelets with a 90° phase difference.
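The shift variance described above can be seen in a very small one-dimensional example. The sketch below uses a single-level Haar DWT, chosen here only for brevity and not taken from the paper, and compares the detail-band energy of a step edge with that of the same edge delayed by one sample.

```python
import math


def haar_detail_energy(signal):
    """Energy of the level-1 Haar DWT detail coefficients of an even-length signal."""
    details = [(signal[i] - signal[i + 1]) / math.sqrt(2)
               for i in range(0, len(signal), 2)]
    return sum(d * d for d in details)


step = [0.0] * 4 + [1.0] * 4     # step edge between samples 3 and 4
shifted = [0.0] * 5 + [1.0] * 3  # the same edge shifted right by one sample

energy_original = haar_detail_energy(step)   # edge falls between two sample pairs
energy_shifted = haar_detail_energy(shifted)  # edge falls inside one sample pair
```

In the first case the edge lies on a pair boundary and the detail band carries no energy, whereas after the one-sample shift the detail band carries energy 0.5. A feature computed from subband energies would therefore change with the position of the texture in the window, which is the behaviour that complex wavelets are intended to reduce.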

DT-CWT is obtained by filtering an image separably: two trees are used for the rows of the image and two trees for the columns

The proposed texture classifier

We propose a texture classifier which comprises a texture training stage and a texture classification stage. In the texture training stage, an S-level DT-CWT is applied to the training texture samples from different texture classes. The subbands of the S-level DT-CWT are used to form a discriminative feature vector for the multiscale texture classifier. The mean feature vector of the extracted feature vectors for each texture class is calculated and stored in a database for texture classification. In
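The training stage outlined above might be sketched as follows. This is an illustrative sketch under stated assumptions, not the implementation used in the paper: the subbands are toy lists of complex numbers rather than real DT-CWT output, the entropy is assumed to be the Shannon entropy of the normalised coefficient energies, and the nearest-mean Euclidean rule in `classify` is a hypothetical stand-in because the classification stage is not described in this excerpt.

```python
import math


def subband_features(coeffs):
    """Variance and entropy of the magnitudes of one subband's complex coefficients."""
    mags = [abs(c) for c in coeffs]
    mean = sum(mags) / len(mags)
    variance = sum((m - mean) ** 2 for m in mags) / len(mags)
    # Assumed entropy definition: Shannon entropy of the normalised energies.
    energy = [m * m for m in mags]
    total = sum(energy)
    entropy = 0.0
    if total > 0:
        entropy = -sum((e / total) * math.log2(e / total) for e in energy if e > 0)
    return [variance, entropy]


def feature_vector(subbands):
    """Concatenate [variance, entropy] over every subband (all scales and directions)."""
    return [f for band in subbands for f in subband_features(band)]


def class_mean(vectors):
    """Mean feature vector of one texture class, as stored in the training database."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def classify(vector, database):
    """Hypothetical rule: choose the class whose mean vector is nearest (Euclidean)."""
    return min(database, key=lambda label: math.dist(vector, database[label]))


# Toy stand-ins for subbands: two subbands per sample instead of 6 directions x S scales.
smooth_a = [[1 + 0j, 1 + 0j, 1 + 0j, 1 + 0j], [0.5j, 0.5j, 0.5j, 0.5j]]
smooth_b = [[1 + 0j, 1.1 + 0j, 0.9 + 0j, 1 + 0j], [0.5j, 0.5j, 0.6j, 0.4j]]
rough_a = [[4 + 0j, 0j, 0j, 0j], [0j, 3j, 0j, 0j]]
rough_b = [[0j, 5 + 0j, 0j, 0j], [0j, 0j, 0j, 2j]]

database = {
    "smooth": class_mean([feature_vector(smooth_a), feature_vector(smooth_b)]),
    "rough": class_mean([feature_vector(rough_a), feature_vector(rough_b)]),
}

query = [[0j, 0j, 4.5 + 0j, 0j], [2.5j, 0j, 0j, 0j]]
predicted = classify(feature_vector(query), database)
```

With six directional subbands at each of S scales, the same `feature_vector` function would return 12S values per texture sample, two per subband.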

Experimental results and discussion

The effectiveness of the proposed texture feature extraction approach to texture classification is evaluated by performing supervised classification of several test images with varying texture complexities from two commonly used natural texture image databases: the MIT VisTex database (MIT Vision and Modelling Group, 1998) and the Brodatz album (Brodatz, 1966). Each texture image has a size of $512 \times 512$, with 256 grey-levels. Each image is globally histogram equalized and normalized to [−1, 1] to ensure
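The preprocessing step mentioned above could look like the following sketch. The exact equalization formula and the way the result is mapped to [−1, 1] are not given in this excerpt, so the standard cumulative-histogram mapping and the linear rescaling used here are assumptions, as is the function name.

```python
def equalize_and_normalize(image, levels=256):
    """Globally histogram-equalise an integer grey-level image, then map it to [-1, 1]."""
    pixels = [p for row in image for p in row]
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    # Cumulative histogram of the grey levels.
    cdf, running = [], 0
    for count in hist:
        running += count
        cdf.append(running)
    cdf_min = next(c for c in cdf if c > 0)
    span = len(pixels) - cdf_min

    def remap(p):
        # Equalised level in [0, 1]; a constant image (span == 0) maps to mid-grey.
        u = (cdf[p] - cdf_min) / span if span else 0.5
        return 2.0 * u - 1.0

    return [[remap(p) for p in row] for row in image]


# Tiny 2 x 3 example instead of a 512 x 512 texture image.
normalized = equalize_and_normalize([[10, 10, 200], [50, 50, 200]])
```

In this toy image the three occupied grey levels are spread evenly over the output range, so the darkest level maps to −1, the middle one to 0 and the brightest to 1.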

Conclusion

In this paper, we propose a novel algorithm for texture classification using DT-CWT. The proposed texture classifier achieves an average correct classification rate of 100% and an average false classification rate of 0% for $S = 3$ on two sets of texture samples of varying complexities and a hybrid test set. This performance shows that the proposed variance and entropy as features of the magnitude of the DT-CWT complex coefficients in six directional subbands for three scales are good candidates

Acknowledgement

The authors would like to thank Warwick University Vice Chancellor Scholarship for providing the funds for this research.

References (27)

S. Arivazhagan et al., Texture classification using wavelet transform, Pattern Recognition Lett. (2003)

S. Arivazhagan et al., Texture segmentation using wavelet transform, Pattern Recognition Lett. (2003)

S. Arivazhagan et al., Texture classification using Gabor wavelets based rotation invariant features, Pattern Recognition Lett. (2006)

C. Cariou et al., Unsupervised texture segmentation/classification using 2-D autoregressive modeling and the stochastic expectation–maximization algorithm, Pattern Recognition Lett. (2008)

P.S. Hiremath et al., Wavelet based co-occurrence histogram features for texture classification with an application to script identification in a document image, Pattern Recognition Lett. (2008)

A. Jain et al., Unsupervised texture segmentation using Gabor filters, Pattern Recognition (1991)

S.C. Kim et al., Texture classification and segmentation using wavelet packet frame and Gaussian mixture model, Pattern Recognition (2007)

N.G. Kingsbury, Complex wavelets for shift invariant analysis and filtering of signals, Journal of Applied and Computational Harmonic Analysis (2001)

M. Kokare et al., Texture image retrieval using rotated wavelet filters, Pattern Recognition Lett. (2007)

J.W. Wang et al., Texture classification using non-separable two-dimensional wavelets, Pattern Recognition Lett. (1998)

P. Brodatz, Textures: A Photographic Album for Artists and Designers (1966)

F. Cohen et al., Classification of rotated and scaled textured images using Gaussian Markov random field models, IEEE Trans. Pattern Anal. Machine Intell. (1991)

R.O. Duda et al., Pattern Classification (2001)

Cited by (81)

Magnetic Resonance Imaging, texture analysis and regression techniques to non-destructively predict the quality characteristics of meat pieces
2019, Engineering Applications of Artificial Intelligence
The quality of meat products is traditionally assessed by chemical or sensorial analysis, which are time consuming, need specialized technicians and destroy the products. The development of new technologies to monitor meat pieces using non-destructive methods in order to establish their quality is earning importance in the last years. An increasing number of studies have been carried out on meat pieces combining Magnetic Resonance Imaging (MRI), texture descriptors and regression techniques to predict several physico-chemical or sensorial attributes of the meat, mainly different types of pig ham and loins. In spite of the importance of the problem, the conclusions of these works are still preliminary because they only use the most classical texture descriptors and regressors instead of stronger methods, and because the methodology used to measure the performance is optimistic. In this work, we test a wide range of texture analysis techniques and regression methods using a realistic methodology to predict several physico-chemical and sensorial attributes of different meat pieces of Iberian pigs. The texture descriptors include statistical techniques, like Haralick descriptors, local binary patterns, fractal features and frequential descriptors, like Gabor or wavelet features. The regression techniques include linear regressors, neural networks, deep learning, support vector machines, regression trees, ensembles, boosting machines and random forests, among others. We developed experiments using 15 texture feature vectors, 28 regressors over 4 datasets of Iberian pig meat pieces to predict 39 physico-chemical and sensorial attributes, summarizing 16,380 experiments. There is not any combination of texture vector and regressor which provides the best result for all attributes tested.
Nevertheless, all these experiments provided the following conclusions: (1) the regressor performance, measured using the squared correlation ( $R^{2}$ ), is from good to excellent (above 0.5625) for 29 out of 39 attributes tested; (2) the WAPE (Weighted Absolute Percent Error) is lower than 2% for 32 out of 37 attributes; (3) the dispersion in computer predictions around the true attributes is lower or similar than the dispersion in the labeling expert’s for the majority of attributes (85%); and (4) differences between predicted and true values are not statistically significant for 29 out of 37 attributes using the Wilcoxon ranksum statistical test. We can conclude that these results provide a high reliability for an automatic system to predict the quality of meat pieces, which may operate on-line in the meat industries in the future.
Local Neighborhood Intensity Pattern–A new texture feature descriptor for image retrieval
2018, Expert Systems with Applications
In this paper, a new texture descriptor based on the local neighborhood intensity difference is proposed for content based image retrieval (CBIR). For computation of texture features like Local Binary Pattern (LBP), the center pixel in a 3 × 3 window of an image is compared with all the remaining neighbors, one pixel at a time to generate a binary bit pattern. It ignores the effect of the adjacent neighbors of a particular pixel for its binary encoding and also for texture description. The proposed method is based on the concept that neighbors of a particular pixel hold significant amount of texture information that can be considered for efficient texture representation for CBIR. The main impact of utilizing the mutual relationship among adjacent neighbors is that we do not rely on the sign of the intensity difference between central pixel and one of its neighbors (I_i) only, rather we take into account the sign of difference values between I_i and its adjacent neighbors along with the central pixels and same set of neighbors of I_i. This makes our pattern more resistant to illumination changes. Moreover, most of the local patterns including LBP concentrates mainly on the sign information and thus ignores the magnitude. The magnitude information which plays an auxiliary role to supply complementary information of texture descriptor, is integrated in our approach by considering the mean of absolute deviation about each pixel I_i from its adjacent neighbors. Taking this into account, we develop a new texture descriptor, named as Local Neighborhood Intensity Pattern (LNIP) which considers the relative intensity difference between a particular pixel and the center pixel by considering its adjacent neighbors and generate a sign and a magnitude pattern. Finally, the sign pattern (LNIP_S) and the magnitude pattern (LNIP_M) are concatenated into a single feature descriptor to generate a more effective feature descriptor. 
The proposed descriptor has been tested for image retrieval on four databases, including three texture image databases - Brodatz texture image database, MIT VisTex database and Salzburg texture database and one face database - AT&T face database. The precision and recall values observed on these databases are compared with some state-of-art local patterns. The proposed method showed a significant improvement over many other existing methods.
Co-occurrence of adjacent sparse local ternary patterns: A feature descriptor for texture and face image retrieval
2018, Optik
Searching for similar images in large databases is a time-consuming work and an efficient image retrieval system is valuable in this situation. Texture is a prominent feature of image which can extract contents of image, hence a good texture feature extractor is necessary for content based image retrieval process. In this paper, a new texture descriptor is developed which is a combination of Local Ternary Pattern (LTP) and gray level co-occurrence matrix (GLCM). This feature descriptor which is named as CoALTP, inherits the attributes of both LTP and GLCM. First LTPs of pixels are obtained and then using GLCM in four directions, co-relations between pixel pairs are calculated as features. Two texture databases and a face images dataset are used for evaluation of proposed descriptor and results are compared with other local feature descriptors.
Trabecular bone characterization using circular parametric models
2017, Biomedical Signal Processing and Control
Texture analysis of radiographic bone X-ray images presents a major challenge for pattern recognition and medical applications. Classifying such textures from osteoporotic and healthy subjects is a difficult task. In this paper, we propose a new approach combining wavelet decomposition and parametric circular models to capture the statistical behavior of phase coefficients. We demonstrate that, unlike the magnitude components, the wavelet phase coefficients convey local and structural information across scales and orientations which are of great interest for the study of trabecular bone texture. To assess how well the proposed circular models fit phase coefficients, the statistical test of Kuiper and graphical analysis Quantile–Quantile plots were used. The Support Vector Machine (SVM) and the Neural Network (NN) classifiers were used to evaluate the efficiency of the proposed models to classify two populations composed of osteoporotic patients and control subjects. Using Gabor filters and the Wrapped Cauchy model, an Area Under Curve (AUC) rate of 96.45% was achieved with the SVM classifier. To compare the performance of the proposed parametric approach to other non-parametric texture analysis techniques, the Receiver Operating Characteristic (ROC) analysis was performed. Results have proven that the proposed approach provides the best performance in terms of ROC curves.
Influence of normalization and color space to color texture classification
2017, Pattern Recognition
Color texture classification has recently attracted significant attention due to its multiple applications. The color texture images depend on the texture surface and its albedo, the illumination, the camera and its viewing position. A key problem to get an acceptable performance is the ambient illumination, which can vary the perceived structures in the surface. Given a color texture classification problem, it would be desirable to know which is the best approach to solve the problem making the minimal assumptions about the illumination conditions. The present work does an exhaustive evaluation of the state-of-the-art color texture classification methods, considering 5 different color spaces, 12 normalization methods to achieve illumination invariances, 19 texture feature vectors and 23 pure color feature vectors. Our experiments allow to conclude that parallel approaches are better than integrative approaches for color texture classification achieving the first positions in the Friedman ranking. Multiresolution Local Binary Patterns (MLBP) are the best intensity texture features, followed by wavelet and Gabor filters combined with luminance–chrominance color spaces (Lab and Lab2000_HL), and for pure color classification the best are First Order Statistics (FOS) calculated in RGB color space. For intensity texture features, the learning methods work better on the four smallest datasets, although they could not be tested in other four bigger datasets due to its huge computational cost, nor with color texture classification. Normalization and color spaces slightly increase the average accuracy of color texture classification, although the differences achieved using normalization are not statistically significant in a paired T-Test. Lab2000_HL and RGB are the best color spaces, but the former is the slowest one. 
Regarding elapsed time, the best vector features MLBP for intensity texture, Daub4 (Daubechies filters using mean and variance statistics) for color texture and FOS, for pure color are nearly the fastest or are in the middle interval of all the tested methods.
Feature extraction using dual-tree complex wavelet transform and gray level co-occurrence matrix
2016, Neurocomputing
This paper introduces a new feature extraction method for texture classification application. In the proposed method, dual-tree complex wavelet transform is first performed on the original image to obtain sub-images at six directions. After that gray level co-occurrence matrix of each sub-image is calculated and the corresponding statistical values are used to construct the final feature vector. The experimental results demonstrate that our proposed method has the property of robustness, and can achieve higher texture classification accuracy rate than the conventional methods.
