Differential diagnosis of CT focal liver lesions using texture features, feature selection and ensemble driven classifiers

https://doi.org/10.1016/j.artmed.2007.05.002Get rights and content

Summary

Objectives

The aim of the present study is to define an optimally performing computer-aided diagnosis (CAD) architecture for the classification of liver tissue from non-enhanced computed tomography (CT) images into normal liver (C1), hepatic cyst (C2), hemangioma (C3), and hepatocellular carcinoma (C4). To this end, various CAD architectures, based on texture features and ensembles of classifiers (ECs), are comparatively assessed.

Materials and methods

Number of regions of interests (ROIs) corresponding to C1–C4 have been defined by experienced radiologists in non-enhanced liver CT images. For each ROI, five distinct sets of texture features were extracted using first order statistics, spatial gray level dependence matrix, gray level difference method, Laws’ texture energy measures, and fractal dimension measurements. Two different ECs were constructed and compared. The first one consists of five multilayer perceptron neural networks (NNs), each using as input one of the computed texture feature sets or its reduced version after genetic algorithm-based feature selection. The second EC comprised five different primary classifiers, namely one multilayer perceptron NN, one probabilistic NN, and three k-nearest neighbor classifiers, each fed with the combination of the five texture feature sets or their reduced versions. The final decision of each EC was extracted by using appropriate voting schemes, while bootstrap re-sampling was utilized in order to estimate the generalization ability of the CAD architectures based on the available relatively small-sized data set.

Results

The best mean classification accuracy (84.96%) is achieved by the second EC using a fused feature set, and the weighted voting scheme. The fused feature set was obtained after appropriate feature selection applied to specific subsets of the original feature set.

Conclusions

The comparative assessment of the various CAD architectures shows that combining three types of classifiers with a voting scheme, fed with identical feature sets obtained after appropriate feature selection and fusion, may result in an accurate system able to assist differential diagnosis of focal liver lesions from non-enhanced CT images.

Introduction

One of the most common and robust imaging techniques for the detection of hepatic lesions is computed tomography (CT) [1]. Although the quality of CT images has been significantly improved during the last years, it is difficult in some cases, even for experienced doctors, to make a 100% accurate diagnosis. In these cases, the diagnosis has to be confirmed by administration of contrast agents, which is related with renal toxicity and allergic reactions, or invasive procedures (biopsies). During the last years, along with the developments in image processing and artificial intelligence, computer-aided diagnosis (CAD) systems, aiming at the characterization of liver tissue, attract much attention, since they can provide diagnostic assistance to clinicians, and contribute to reduction of the number of required biopsies.

Various approaches, most of them using ultrasound B-scan and CT images, have been proposed based on different image characteristics, such as texture features, estimated from first- and second-order gray level statistics, and fractal dimension estimators combined with various classifiers [2], [3], [4], [5]. Texture analysis of liver CT images based on spatial gray level dependence matrix (SGLDM), gray level run length method (GLRLM), and gray level difference method (GLDM) has been proposed by Mir et al. [6], in order to discriminate normal from malignant hepatic tissue. Chen et al. [7] have applied SGLDM texture features to a probabilistic neural network (P-NN) for the characterization of hepatic tissue (hepatoma and hemangioma) from CT images. Additionally, SGLDM-based texture features fed to a system of three sequentially placed neural networks (NNs) have been used by Gletsos et al. [8] for the classification of hepatic tissue into four categories. Although a lot of effort has been devoted to liver tissue characterization, the developed systems are most of the times limited to two or three classes of liver tissue and/or do not gain from the interaction of different texture characterization methods, or the combination of different classifiers.

In order to select the most robust characteristics from an initial high-dimensional feature set, that might be derived from different feature extraction techniques, feature selection methods can be applied. Deterministic, or stochastic feature selection methods decrease the feature extraction costs of the classification system, and may also enhance its performance [9]. During the last years, an increasing number of researchers are using genetic algorithms (GAs) for dimensionality reduction. The use of GAs for feature selection was first introduced in 1989 [10], and since then, GAs have been successfully applied to a broad spectrum of dimensionality reduction studies [11], [12], [13]. According to Ref. [8], the use of GAs results in more robust feature vectors as compared to other deterministic feature selection techniques, in problems related to liver tissue classification from CT images.

In the last decade, the use of multiple classifier systems has been proposed in order to optimize the performance of CAD systems. A set of classifiers whose individual predictions are fused through a combining strategy, usually a voting scheme, to classify new examples constitutes an ensemble of classifiers (EC) [14]. The attraction that this topic exerts on machine learning and diagnostic decision support research is based on the premise that ECs are often much more accurate than any individual classifier of the set [15], [16]. Early diagnosis of melanoma has been facilitated by combining three types of classifiers, namely linear discriminant analysis (LDA), k-nearest neighbor (k-NN), and a decision tree, with a voting scheme [17]. A multiple classification system based on a committee of NNs, trained by the Levenberg–Marquardt algorithm, along with a voting scheme across the NN outputs has been used by Jerebko et al. [18] for the detection of colonic polyps in CT colonography data. Furthermore, a novel system for diabetes diagnosis has been proposed [19], which is based on retinal images fractal characteristics and applies a voting scheme across the outputs of an EC consisting of a back-propagation trained NN, a radial basis function NN, and a GA-based classifier. The use of texture features and shape parameters along with a multi-classifier modular architecture composed from a self-organizing map (SOM) and/or k-NN classifiers has been recently proposed by Christodoulou et al. [15], aiming at the characterization of carotid plaques for the identification of individuals with asymptomatic carotid stenosis at risk of stroke.

Although previous studies [20], [21] have shown that CAD systems based on various texture features and ECs can enhance the diagnosis efficiency of CT focal liver lesions, the evaluation of the proposed methods on small-sized samples constitutes a significant drawback for these studies. To address this drawback, re-sampling methods like cross-validation, jack-knife, and bootstrap can be applied [22], [23], [24]. In Ref. [18], cross-validation has been applied for sensitivity estimation of a colonic polyp detection system, while in Ref. [25] the bootstrap method has been used for the development of a diagnosis system able to differentiate benign and malignant tumors from breast ultrasound images. The bootstrap method, which was introduced by Efron [22] as an approach to calculate confidence intervals for parameters where standard methods cannot be applied, is based on re-sampling with replacement. The comparative assessment of various re-sampling methods has shown that the bootstrap method provides less biased and more consistent results than the jack-knife method [26].

The principal aim of the present paper is to assess the potential of ECs in the development of a CAD system able to discriminate four hepatic tissue types (normal liver, hepatic cyst, hemangioma, and hepatocellular carcinoma) from non-enhanced CT images. Furthermore, the use of a variety of texture features as input to the CAD system is examined while the application of feature selection based on a GA is investigated aiming at improving the resulting classification performance. In order to overcome problems with small data sets and biased classification performances, the bootstrap method is applied. In this framework, five different CAD architectures based on the above design concepts are comparatively assessed.

The rest of the paper is organized as follows: in Section 2, the generic system design concepts are presented, including description of the data used, the methodology of feature extraction and selection, the ECs and the applied voting schemes, as well as the five alternative architectures of the CAD system. In Section 3, the experimental results of the five CAD system architectures are presented and compared, followed by conclusions presented in Section 4.

Section snippets

Methodology

The generic design of a CAD system aiming at the classification of CT liver tissue into one of the four classes: normal liver (C1), hepatic cyst (C2), hemangioma (C3), and hepatocellular carcinoma (C4), is presented in Fig. 1. Firstly, regions of interest (ROIs) drawn by an experienced radiologist on CT images were driven to a feature extraction module, where five different texture feature sets were obtained using first order statistics (FOS), spatial gray level dependence matrices (SGLDM),

Results and discussion

The classification accuracies achieved by CAD1, …, CAD5 using the 50 groups of sets (training, validation and testing set), obtained through bootstrap re-sampling, were estimated. For each architecture, the results are presented in terms of mean values and standard deviations of the classification accuracy of the primary classifiers, as well as the total CAD system performance with use of either plurality or weighted voting scheme.

The results for CAD1 and CAD2 are presented in Table 4. It can

Conclusion

The aim of the present paper was to define a CAD system architecture able to accurately classify hepatic tissue from non-enhanced CT images as normal liver, hepatic cyst, hemangioma, and hepatocellular carcinoma.

The system design was based on the use of texture features, feature selection techniques, and ECs. For each CT liver ROI, five types of texture feature sets, based on first order statistics, spatial gray level dependence matrices, gray level difference matrices, Laws’ texture energy

References (40)

  • Y.M. Kadah et al.

    Classification algorithms for quantitative tissue characterization of diffuse liver disease from ultrasound images

    IEEE Trans Med Imaging

    (1996)
  • Y.N. Sun et al.

    Ultrasonic image analysis for liver diagnosis

    IEEE Eng Med Biol

    (1996)
  • Ch.-M. Wu et al.

    Texture features for classification of ultrasonic liver images

    IEEE Trans Med Imaging

    (1992)
  • A.H. Mir et al.

    Texture analysis of CT images

    IEEE Eng Med Biol

    (1995)
  • E.L. Chen et al.

    An automatic diagnostic system for CT liver image classification

    IEEE Trans Biomed Eng

    (1998)
  • M. Gletsos et al.

    A computer-aided diagnostic system to characterize CT focal liver lesions: design and optimization of a neural network classifier

    IEEE Trans Inf Technol B

    (2003)
  • A.K. Jain et al.

    Statistical pattern recognition: a review

    IEEE Trans Pattern Anal

    (2000)
  • A.P. Dhawan et al.

    Analysis of mammographic microcalcifications using gray-level image structure features

    IEEE Trans Med Imaging

    (1996)
  • F. Roli et al.

    Methods for designing multiple classifier systems

  • C.I. Christodoulou et al.

    Texture-based classification of atherosclerotic carotid plaques

    IEEE Trans Med Imaging

    (2003)
  • Cited by (96)

    • Computer-aided diagnosis of liver lesions using CT images: A systematic review

      2020, Computers in Biology and Medicine
      Citation Excerpt :

      Most of the aforementioned segmentation methods were coupled with morphological operations to refine the segmentation results. Most of the authors preferred to manually delineate the lesions [14,15,26,28,33–35,40,42,43,48,51,78]. FCM with three clusters corresponding to liver, lesion and background was used in Refs. [13,16,18,22,30,38,65,70,79] and was the next most popular method.

    View all citing articles on Scopus
    1

    Tel.: +30 210 772 2968; fax: +30 210 772 3557.

    2

    Tel.: +30 210 722 7488; fax: +30 210 729 2280.

    View full text