An efficient computation method for the texture browsing descriptor of MPEG-7

https://doi.org/10.1016/j.imavis.2004.12.002Get rights and content

Abstract

In this paper, an efficient computation method for computing the texture browsing descriptor of MPEG-7 is provided. Texture browsing descriptor is used to characterize a texture's regularity, directionality and coarseness. To compute the regularity of textures, Fourier transform is first performed. To get more discriminative features for regularity computation, the Fourier spectrum is treated as an image and the Fourier transform is performed again to produce an enhanced Fourier spectrum. A regularity measure based on the variance of the radial wedge distribution is then calculated to determine the regularity of textures. For regular textures, the texture primitives are assumed to be parallelograms, the two dominant directions are extracted by Hough transform. A scale computation method is then provided to determine the scales corresponding to the two dominant directions. In addition, principal component analysis is provided to detect textures with only one dominant direction. Experiments of texture browsing, coarse classification of textures and similarity-based image-to-image matching are performed on the texture images of Brodatz album and Corel Gallery image database to demonstrate the efficiency and effectiveness of the proposed method. The proposed method can be used in the applications of texture browsing and texture retrieval.

Introduction

Texture almost presents everywhere in natural and real world images. Texture, therefore, has long been an important research topic in image processing. Successful applications of texture analysis methods have been widely found in industrial, biomedical and remote sensing areas. Thus, appropriate descriptors for textures could provide powerful means in applications of texture browsing, coarse texture classification and texture retrieval. Three texture descriptors have been specified in MEPG-7 [1]. The Homogeneous Texture Descriptor (HTD) provides quantitative characterization of texture patterns and is useful for images with homogeneous textural properties. HTD can be used for similarity-based image-to-image matching. To provide descriptions for non-homogeneous textures, Edge Histogram Descriptor (EHD) is provided. The spatial distribution of edges is extracted and is useful for image-to-image matching when the underlying texture is not homogeneous. Texture Browsing Descriptor also named Texture Browsing Component (TBC) relates to the perceptual characterization of texture, in terms of regularity, directionality and coarseness. The coarseness is related to image scale or resolution. This descriptor is useful for browsing type applications and coarse classification of textures.

Approaches for estimating the dominant directions of textures in spatial domain [2] or in frequency domain [3] have been proposed. Liu and Picard [4] proposed a texture model addressing the perceptual characteristics of textures mentioned above. The texture model proposed emphasizes the perceptually most salient periodic information. To determine the prominence of periodic structures in a texture, the energy distribution of image autocovariance function is examined. The image autocovariance is computed as the inverse DFT of the image power spectrum. In addition, a computation method of TBC has been recommended in [5], [6]. In this method, an image is filtered using a bank of scale and orientation selective band-pass filters called Gabor wavelet [7]. To compute the dominant directions, directional histograms are constructed from the filtered images at different scales and then the two histogram peaks with the highest contrasts are considered as the two dominant directions. To compute the scale and regularity, the filtered images are projected along the two dominant directions to form two sets of projections. For each set of projections, the autocorrelation function is evaluated. The scale and regularity are then determined based on the peaks and valleys of the autocorrelation function. In addition, consistency checks based on the neighboring relationship of the projections are provided to make the method more robust. As it involves the computation of applying Gabor wavelet filters and autocorrelation function, the method is relatively time-consuming.

In this paper, an efficient computation method of TBC will be provided. The method is based on the fact that for a directional texture image, the magnitudes of its Fourier spectrum will concentrate on a certain direction; for regular, on several directions; for irregular, not on any direction [8], [9]. To compute the regularity of textures, Fourier transform is first performed. The Fourier spectrum is then smoothed to reduce noises. The smoothed Fourier spectrum is treated as an image and the Fourier transform is performed again to produce an enhanced Fourier spectrum. A regularity measure based on the variance of the radial wedge distribution is then calculated to determine the regularity of textures. For regular textures, the texture primitives are assumed to be parallelograms, the two dominant directions and their associated scales of the primitives are determined by Hough transform [10]. In addition, principal component analysis is provided to detect textures with only one dominant direction. Experiments of texture browsing, coarse classification of textures and texture retrieval are performed to test the texture images of Brodatz album [11] and Corel Gallery image database to demonstrate the efficiency and effectiveness of the proposed method.

The rest of the paper is organized as follows. The proposed computation method is presented in Section 2. In Section 3, experimental results and discussion are described. Finally, in Section 4, we give a conclusion.

Section snippets

The proposed method

TBC characterizes perceptual attributes in terms of regularity, directionality and coarseness. We will base on the following properties of Fourier spectrum to measure these three perceptual textural attributes: (1) for regular textures, the Fourier spectrum consists of significant peaks scattering out regularly on some directions of the frequency plane, (2) for textures with strong directionality, the directionality will be preserved in the Fourier spectrum, and the high spectral values of

Experimental results

Texture images of Brodatz album and Corel Gallery image databases are used to test the proposed method. To build up the Brodatz album database, eight patches of the 112 textures in Brodatz album are scanned and 896 texture images are obtained for experiments. Four out of the eight patches of each texture are used as training set to obtain the empirical values for thresholds used, while the remaining images are used as testing set. To build up the Corel Gallery database, 1896 natural color

Conclusion

In this paper, an efficient computation method for computing the texture browsing descriptor specified in MPEG-7 is provided. The eigenvalue ratio obtained by performing principal component analysis on the Fourier spectrum of the texture image is used to detect directional textures. To compute the regularity, Fourier transform is applied to the Fourier spectrum image to produce an enhanced Fourier spectrum. A spectral measure based on the variance of the radial wedge distribution is then

Ling-Hwei Chen was born in Changhua, Taiwan, Republic of China on February 18, 1954. She received the BS degree in Mathematics and the MS degree in Applied Mathematics from National Tsing Hua University, Hsinchu, Taiwan in 1975 and 1977, respectively, and the PhD degree in Computer Engineering from National Chiao Tung University, Hsinchu, Taiwan in 1987. From August 1977 to April 1979 she worked as a research assistant in the Chung-Shan Institute of Science and Technology, Taoyan, Taiwan. From

References (13)

There are more references available in the full text version of this article.

Cited by (25)

  • Lattice estimation from images of patterns that exhibit translational symmetry

    2014, Image and Vision Computing
    Citation Excerpt :

    There are relatively few papers concerned with discriminating regular texture images with translational symmetry from images not exhibiting this regularity. Lee and Chen proposed a Fourier transform-based measure called Radial Wedge Distribution Variance (RWDV) for this purpose [3,4]. Others have proposed texture descriptors related to texture regularity without attempting to infer texel geometry.

  • Comparative study of global color and texture descriptors for web image retrieval

    2012, Journal of Visual Communication and Image Representation
    Citation Excerpt :

    Considering the complexities of distance functions, it is predominant the use of linear distance functions, which shows that the most laborious step related to texture description is the features extraction phase. From the analysis of feature vectors sizes, we highlight the compact feature vectors generated by the descriptors TBDI [57], TBD [26,35], EHD [26], and LBP [34]. Table 4 shows the validation environments of each texture descriptor.

  • A review on automatic image annotation techniques

    2012, Pattern Recognition
    Citation Excerpt :

    In spectral texture feature extraction techniques, an image is transformed into frequency domain and then feature is calculated from the transformed image. The common spectral techniques include Fourier transform (FT) [98,102], discrete cosine transform (DCT) [99], wavelet [8,24,58], and Gabor filters [82,100,101,113]. FT and DCT are very fast to compute but are not scale and rotation invariant.

View all citing articles on Scopus

Ling-Hwei Chen was born in Changhua, Taiwan, Republic of China on February 18, 1954. She received the BS degree in Mathematics and the MS degree in Applied Mathematics from National Tsing Hua University, Hsinchu, Taiwan in 1975 and 1977, respectively, and the PhD degree in Computer Engineering from National Chiao Tung University, Hsinchu, Taiwan in 1987. From August 1977 to April 1979 she worked as a research assistant in the Chung-Shan Institute of Science and Technology, Taoyan, Taiwan. From May 1979 to February 1981 she worked as a research associate in the Electronic Research and Service Organization, Industry Technology Research Institute, Hsinchu, Taiwan. From March 1981 to August 1983 she worked as an engineer in the Institute of Information Industry, Taipei, Taiwan. She is now a Professor of the Department of Computer and information Science at the National Chiao Tung University. Her current research interests include image processing, pattern recognition, data hiding, image compression, and image cryptography.

Kuen-Long Lee received the BS, MS and PhD degrees in Computer and Information Science from National Chiao Tung University, Taiwan, in 1988, 1990 and 2002, respectively. He is currently the department manager of the e-commerce application department of Macronix, a semiconductor company. His research interests include image processing, pattern recognition, texture analysis and image retrieval.

This research was supported in part by the National Science Council of ROC under contract NSC-87-2213-E-009-060.

View full text