Segmentation of magnetic resonance images using a combination of neural networks and active contour models

doi:10.1016/S1350-4533(03)00137-1

Medical Engineering & Physics

Volume 26, Issue 1, January 2004, Pages 71-86

https://doi.org/10.1016/S1350-4533(03)00137-1 Get rights and content

Abstract

Segmentation of medical images is very important for clinical research and diagnosis, leading to a requirement for robust automatic methods. This paper reports on the combined use of a neural network (a multilayer perceptron, MLP) and active contour model (‘snake’) to segment structures in magnetic resonance (MR) images. The perceptron is trained to produce a binary classification of each pixel as either a boundary or a non-boundary point. Subsequently, the resulting binary (edge-point) image forms the external energy function for a snake, used to link the candidate boundary points into a continuous, closed contour. We report here on the segmentation of the lungs from multiple MR slices of the torso; lung-specific constraints have been avoided to keep the technique as general as possible. In initial investigations, the inputs to the MLP were limited to normalised intensity values of the pixels from an (7×7) window scanned across the image. The use of spatial coordinates as additional inputs to the MLP is then shown to provide an improvement in segmentation performance as quantified using the effectiveness measure (a weighted product of precision and recall). Training sets were first developed using a lengthy iterative process. Thereafter, a novel cost function based on effectiveness is proposed for training that allows us to achieve dramatic improvements in segmentation performance, as well as faster, non-iterative selection of training examples. The classifications produced using this cost function were sufficiently good that the binary image produced by the MLP could be post-processed using an active contour model to provide an accurate segmentation of the lungs from the multiple slices in almost all cases, including unseen slices and subjects.

Introduction

Segmentation has been defined [1:p. 347] as the process of: “dividing the image into regions that … correspond to structural units in the scene or distinguish objects of interest”. It is a necessary first step in the visualisation and interpretation of many complex images, such as those typically encountered in medical imaging. In this area, fully automatic and robust segmentation techniques would have an enormous beneficial impact on clinical practice and research, by decreasing dramatically the manual effort which must otherwise be devoted to this task. Not only are medical images themselves inherently complex, but acquisition must also recognise practical needs to limit radiation dose, scan time, etc., so that image quality is often compromised. Given this, deployment of conventional image-processing techniques has not so far led to a robust fully automatic solution usable in a range of clinical settings, although semi-automatic systems do exist.

Semi-automatic segmentation has been used extensively in nuclear medicine based on thresholding or gradient techniques: both two- and three-dimensional techniques have been described [2]. Medical image-processing systems such as ANALYZE (from the Mayo Foundation, Rochester, MN) also include tools for partially automating manual segmentation. Fully automatic segmentation is possible in specific instances, such as thresholding to identify bone in computer tomography (CT) images [3]. Various new approaches look to have considerable potential for automatic segmentation in more general applications (e.g. [4], [5]) although their use in clinical practice has yet to be proven. Thus, segmentation remains the “image-processing bottleneck” [6].

The method proposed here is developed and illustrated on the practical problem of segmenting lung outlines from magnetic resonance (MR) images of the thorax. It consists of two stages. First, a neural network (multilayer perceptron, MLP) trained in supervised fashion is used to classify each pixel of the MR image into boundary and non-boundary classes, so producing a binary, edge-point image. Second, to compensate for classification errors, the edge-point images are then post-processed using an active contour model, or ‘snake’ [7], [8]. In this way, the edge-point image acts as the external energy function for the snake. A similar combination of MLP classifier and active contour model has previously been used to locate the interior contour of the brain from MR images of the head [9]. However, the initial classification achieved by the neural network in that work was relatively poor and required a rather complex model-based active contour technique (using a stochastic decision mechanism based on a Gibbs sampler) to extract the final boundary. In this work, we aim to produce a sufficiently good initial classification to be able to use a simple and standard snake as the post-processor.

Early results for the classification stage, using data from a single subject and a restricted number of slices, were reported by Middleton and Damper [10], and showed good segmentation of the lung boundaries in a given MR image of the torso. Unfortunately, however, generalisation to other (unseen) slices and subjects was very much worse. We have since shown that an elastic net [11] modified to give robustness against initial classification errors, can be used to extract the region of the lungs very effectively from some of these classifications [12]. However, many of the results from the classification stage were too poor for the lungs to be accurately identified in this way. In the present work, several modifications and improvements have been made to our earlier work, which allow a much more accurate classification to be achieved from unseen MR data from different slices and different subjects. In particular, a novel cost function is proposed that simplifies the process of selecting the training data. Further, automatic exclusion of some pixels from the training set leads to dramatic improvement in classification. Consequently, the lungs can now be successfully segmented from the vast majority of available images using a standard active contour model, for which purpose we use the Cohen snake [13].

The remainder of this paper is structured as follows. The next two sections describe the images used (Section 2) and review alternative segmentation techniques (Section 3). The purpose of the latter section is to illustrate the difficulties in segmenting MR images using conventional image-processing techniques and to motivate the use of a neural network. The initial approach to classification using an MLP is then described in Section 4. We then detail our method of quantifying the quality of the segmentation (Section 5). Section 6 presents preliminary results of MLP classification using the standard squared-error cost function in training the network, and details the steps that were found necessary to achieve reasonable results. In Section 7, we define a new cost function for training based on the measure used to quantify segmentation performance. Section 8 describes MLP training using the new cost function, and Section 9 presents classification results using this new function. Post-processing using the snake and the results of the final segmentation are given in Section 10, and Section 11 concludes.

Section snippets

Image data and labelling

MR imaging is a non-invasive method of acquiring precise anatomical information in the form of three-dimensional data sets (see [14], [15] for good introductory treatments). The data used here consist of transverse slices of the thorax obtained from a 0.5 T MR machine from 13 subjects. All were healthy volunteers, identified hereafter by a two letter abbreviation of the subject’s name. Fig. 1 shows two examples of slices from subject AC. The lungs are clearly visible in these images as two

Alternative segmentation techniques

Difficulties such as those described above mean that standard image-processing techniques are often unable to segment MR images satisfactorily (unlike some other imaging modalities). For example, it is well documented that (unlike CT images) MR images cannot be segmented using histogram-based thresholding because of the non-uniform nature of the data [18], [25]. To justify the approach taken here, various other standard image-processing techniques have been investigated for MR image

Initial classification using a neural network

Classification of each pixel of the image as either a boundary or non-boundary edge-point uses a multi-layer perceptron (MLP) trained on error back-propagation [30], [31]. Since back-propagation is a supervised, gradient-descent technique, it requires labelled training data and some cost function which is differentiable, to give gradient information used in the search for a minimum-cost configuration of network connection weights. Initially, we have used the standard squared-error cost

Quantifying segmentation performance

To assess results, a method of measuring the accuracy of the segmentation techniques is required. This is a common problem in medical image segmentation [18]. Visual inspection is sometimes used to evaluate performance “since ‘perfect’ segmentations cannot be defined” [36:p. 341]. For instance, Brown et al. [37] assess the quality of chest CT segmentations through visual inspection by an experienced thoracic radiologist. Also, in the work of Chiou and Hwang [9], results are assessed

Results using squared-error cost function

Initially, the MLP was trained to identify the lung boundaries on a single slice (S=1) of an MR image of the torso using the squared-error cost function (1). This showed that the method of selecting the training data as described in Section 4.2, using all lung-boundary examples plus an equal number of randomly selected non-boundary examples, led to a very poor classification [10]. In classifying the interior contour of the brain with a similar MLP to that used here, Chiou and Hwang [9] also

Defining a new cost function

A potential advantage of the MLP classifier trained on squared error (Eq. (1)) or cross-entropy cost functions is that its output can be interpreted as an estimate of posterior probability [32:pp. 245–247]. However, the discussion in Section 4.3 indicated that this potential advantage is of limited value here, because of the imbalance of positive and negative examples in the test data. This suggests that there might be advantage to minimising during training a cost function more directly

MLP training with the new cost function

In theory, the precision and recall should be recalculated each time the network is modified during training. For incremental learning, this would impose a severe computational burden, since weights are updated for each training example. If batch training was used, precision and recall would only have to be recalculated once per epoch. However, rather poor results were obtained using this batch method. Better results were obtained using incremental learning and approximate values of precision

Classification results with the new cost function

Table 4 shows typical segmentation performance of a network trained using the E cost function with the (non-iterative) method of training set selection just described. To allow a fair comparison with earlier results, the network here was trained (with spatial inputs) on the same slices from subjects AC, CB and LP as used in producing the training data for the squared-error cost function (see Section 6.1).

These new results indicate that performance is comparable to that obtained using the

Post-processing with an active contour model

The initial segmentation by an MLP classifier produces an edge-point image of candidate boundary points which can never realistically give an acceptable closed contour. False negatives will lead to gaps in the contour (especially where the great blood vessels join the lungs and image evidence for a boundary is low or absent) and false positives will arise and need to be eliminated. Therefore, post-processing is required to close these gaps and to distinguish false positives from true positives.

Conclusions

MR image segmentation is an important but inherently difficult problem in medical image processing. In general, it cannot be solved using straightforward, conventional image-processing techniques. The solution proposed here is to use a multilayer perceptron to form the external energy function for an active contour model (‘snake’). Initial work used the conventional squared-error cost function for training the MLP. This showed that the MLP could classify the lung boundaries in MR images of the

Acknowledgements

We are grateful to Dr. Liz Moore and Prof. John Fleming for supplying the MR images used here and for valuable advice in connection with this work. Liz Moore provided the semi-automatic labellings of the lung outlines.

References (49)

L.D. Cohen
On active contour models and balloons
Comput Vision Graphics Image Process
(1991)
L.P. Clarke et al.
MRI segmentation: methods and applications
Magn Reson Imaging
(1995)
J.C. Rajapakse et al.
A technique for single-channel MR brain tissue segmentation: application to a pediatric sample
Magn Reson Imaging
(1996)
D.H. Ballard
Generalizing the Hough transform to detect arbitrary shapes
Pattern Recog
(1981)
J. Illingworth et al.
A survey of the Hough transform
Comput Vision Graphics Image Process
(1988)
V.F. Leavers
Which Hough transform?
Comput Vision Graphics Image Process
(1993)
A.A. Kassim et al.
A comparative study of efficient generalised Hough transform techniques
Image Vision Comput
(1999)
S. Haring et al.
Kohonen networks for multiscale image segmentation
Image Vision Comput
(1994)
T. McInerney et al.
Deformable models in medical image analysis: a survey
Med Image Anal
(1996)
T. McInerney et al.
T-snakes: topology adaptive snakes
Med Image Anal
(2000)

X.M. Pardo et al.

A snake for CT image segmentation integrating region and edge information

Image Vision Comput

(2001)

I. Cohen et al.

Using deformable surfaces to segment 3-D images and infer differential structures

Comput Vision Graphics Image Process

(1992)

J.C. Russ

The image processing handbook

(1995)

J.S. Fleming

Quantitative measurements for gamma camera images

M.W. Vannier et al.

Craniosynotosis: diagnostic value of 3D CT reconstructions

Radiology

(1989)

M.X.H. Yan et al.

An adaptive Bayesian approach to three-dimensional MR brain segmentation

T.F. Cootes et al.

The use of active shape models for locating structures in medical images

H.S. Stiehl

3D image understanding in radiology

IEEE Eng Med Biol Mag

(1990)

M. Kass et al.

Snakes: active contour models

Int J Comput Vision

(1987)

A. Blake et al.

Active contours: the application of techniques from graphics, control theory and statistics to visual tracking of shapes in motion

(1998)

G.I. Chiou et al.

A neural network-based stochastic active model (NNS-SNAKE) for contour finding of distinct features

IEEE Trans Image Process

(1995)

I. Middleton et al.

Segmentation of magnetic resonance images of the thorax by back-propagation

R. Durbin et al.

An analogue approach to the travelling salesman problem using an elastic net method

Nature

(1987)

R.I. Damper et al.

A semi-localized elastic net for surface reconstruction of objects from multislice images

Int J Neural Syst

(2002)

Cited by (85)

Deep learning in biomedical informatics
2022, Intelligent Nanotechnology: Merging Nanoscience and Artificial Intelligence
With the massive influx of multimodal data in the last decade, the role of data analytics in health informatics has grown rapidly. Deep learning (DL) is defined as a technology based on artificial neural networks (ANNs), which have recently emerged as a powerful ML tool. The rapid increase in computing power, fast data storage, and parallelization, as well as predictive capabilities and the ability to generate automatically optimized advanced functions and semantic interpretation from input data, all contribute to the rapid adoption of this technology. This chapter introduces the latest developments in the use of deep learning in health informatics and makes an important analysis of the relative advantages, potential shortcomings, and future prospects of the technology. This chapter mainly focuses on the key applications of DL in the fields of computational biology, drug design, medical imaging, pervasive sensing, medical informatics, and public health. Artificial intelligence (AI) has become a trend in recent years, opening up an entirely new era of research in various fields. The demand for AI in healthcare has increased in both the academia and industry; therefore, the potential benefits of its applications have been proven. Previous studies have attempted to implement AI methods on medical images, electronic health records, molecular characteristics, and a variety of lifestyles. Researchers used data aggregated from multiple data sources to train models that mimic what clinicians do when they see patients and help in decision-making through results and interpretations. It included how to read clinical images, predict results, discover the connection between genotype and phenotype or phenotype and disease, analyze treatment responses, and track lesions or structural changes (e.g., hippocampal volume reduction). In addition, predictive research (e.g., disease or readmission predictions) and correlation and pattern recognition research have been extended to early warning systems with risk scores and overall pattern research and population care (such as predictive care for an entire population).
Evaluation of clinical applicability of automated liver parenchyma segmentation of multi-center magnetic resonance images
2022, European Journal of Radiology Open
Automated algorithms for liver parenchyma segmentation can be used to create patient-specific models (PSM) that assist clinicians in surgery planning. In this work, we analyze the clinical applicability of automated deep learning methods together with level set post-processing for liver segmentation in contrast-enhanced T1-weighted magnetic resonance images.
UNet variants with/without attention gate, multiple loss functions, and level set post-processing were used in the workflow. A multi-center, multi-vendor dataset from Oslo laparoscopic versus open liver resection for colorectal liver metastasis clinical trial is used in our study. The dataset of 150 volumes is divided as 81:25:25:19 corresponding to train:validation:test:clinical evaluation respectively. We evaluate the clinical use, time to edit automated segmentation, tumor regions, boundary leakage, and over-and-under segmentations of predictions.
The deep learning algorithm shows a mean Dice score of 0.9696 in liver segmentation, and we also examined the potential of post-processing to improve the PSMs. The time to create clinical use segmentations of level set post-processed predictions shows a median time of 16 min which is 2 min less than deep learning inferences. The intra-observer variations between manually corrected deep learning and level set post-processed segmentations show a 3% variation in the Dice score. The clinical evaluation shows that 7 out of 19 cases of both deep learning and level set post-processed segmentations contain all required anatomy and pathology, and hence these results could be used without any manual corrections.
The level set post-processing reduces the time to create clinical standard segmentations, and over-and-under segmentations to a certain extent. The time advantage greatly supports clinicians to spend their valuable time with patients.
Model-based segmentation using neural network-based boundary detectors: Application to prostate and heart segmentation in MR images
2021, Machine Learning with Applications
Model-based segmentation (MBS) is a variant of active surfaces and active shape models that has successfully been used to segment anatomical structures such as the heart or the brain. We propose to integrate neural networks (NNs) into MBS for boundary detection. We formulate boundary detection as a regression task and use a NN to predict the distances between a surface mesh and the corresponding boundary points. The proposed approach has been applied to two tasks — prostate segmentation in MR images and the segmentation of the left and right ventricle in MR images. For the first task, data from the Prostate MR Image Segmentation 2012 (PROMISE12) challenge has been used. For the second task, a diverse database with cardiac MR images from six clinical sites has been used. We compare the results to the popular U-net approaches using the nnU-net implementation that is among the top performing segmentation algorithms in various challenges. In cross-validation experiments, the mean Dice scores are very similar and no statistically significant difference is observed. On the PROMISE12 test set, nnU-net Dice scores are significantly better. This is achieved by using an ensemble of 2D and 3D U-nets to generate the final segmentation, a concept that may also be adapted to NN-based boundary detection in the future. While the U-net provides a voxel labeling, our approach provides a 3D surface mesh with pre-defined mesh topology, establishes correspondences with respect to the reference mesh, avoids isolated falsely segmented regions and ensures proper connectivity of different regions.
Fractal and multifractal analysis of atherosclerotic plaque in ultrasound images of the carotid artery
2019, Chaos, Solitons and Fractals
Citation Excerpt :
The options for treating atherosclerosis are medical therapy and carotid endarterectomy or surgery [5,6]. The important factors that are taken into account when taking the decision for surgery are the thickness of the vessel wall, that is, the intima-media thickness as shown in Fig. 1, the degree of stenosis of the carotid artery and the existence or non-existence of symptoms [7-11]. The characteristics of plaque have diverse impacts on human life.
Stroke is the cause of death following ischemic heart disease in 87% of cases. The chance of stroke rises with the severity of carotid stenosis and the thickening of the carotid artery due to the deposition of plaque. This study analyses the non-linear parameters of ultrasound images of the plaque in the carotid artery and classifies the images based on the textural features. The non-linear analysis is implemented via fractal and multifractal methods. The fractal dimensions and the lacunarity differ significantly for symptomatic and asymptomatic plaques. The occurrence of multifractal spectra and the scaling exponent function, i.e., interleaving sets of singularity strength, proves the multi-scaling property. The multifractal characteristics quantify the heterogeneity in the textural features, and this could be used for improving the classification of symptomatic and asymptomatic plaque. The results show the significance of fractal parameters of plaque in deciding the severity of plaque and hence, aiding the diagnostic process.
A gentle introduction to deep learning in medical image processing
2019, Zeitschrift fur Medizinische Physik
This paper tries to give a gentle introduction to deep learning in medical image processing, proceeding from theoretical foundations to applications. We first discuss general reasons for the popularity of deep learning, including several major breakthroughs in computer science. Next, we start reviewing the fundamental basics of the perceptron and neural networks, along with some fundamental theory that is often omitted. Doing so allows us to understand the reasons for the rise of deep learning in many application domains. Obviously medical image processing is one of these areas which has been largely affected by this rapid progress, in particular in image detection and recognition, image segmentation, image registration, and computer-aided diagnosis. There are also recent trends in physical simulation, modeling, and reconstruction that have led to astonishing results. Yet, some of these approaches neglect prior knowledge and hence bear the risk of producing implausible results. These apparent weaknesses highlight current limitations of deep ()learning. However, we also briefly discuss promising approaches that might be able to resolve these problems in the future.
Assessment of despeckle filtering algorithms for segmentation of breast tumours from ultrasound images
2019, Biocybernetics and Biomedical Engineering
Citation Excerpt :
Several algorithms have been developed by many researchers for the segmentation of medical images [52–57]. Out of all the segmentation techniques, active contour method of segmentation has been widely used in case of medical images [57–74]. The active contour models can be classified to be either edge-based or region-based.
In the present work, the performance assessment of despeckle filtering algorithms has been carried out for (a) noise reduction in breast ultrasound images and (b) segmentation of benign and malignant tumours from breast ultrasound images. The despeckle filtering algorithms are broadly classified into eight categories namely local statistics based filters, fuzzy filters, Fourier filters, multiscale filters, non-linear iterative filters, total variation filters, non-local mean filters and hybrid filters. Total 100 breast ultrasound images (40 benign and 60 malignant) are processed using 42 despeckle filtering algorithms. A despeckling filter is considered to be appropriate if it preserves edges and features/structures of the image. Edge preservation capability of a despeckling filter is measured by beta metric (β) and feature/structure preservation capability is quantified using image quality index (IQI). It is observed that out of 42 filters, six filters namely Lee Sigma, FI, FB, HFB, BayesShrink and DPAD yield more clinically acceptable images in terms of edge and feature/structure preservation. The qualitative assessment of these images has been done on the basis of grades provided by the experienced participating radiologist. The pre-processed images are then fed to a segmentation module for segmenting the benign or malignant tumours from ultrasound images. The performance assessment of segmentation algorithm has been done quantitatively using the Jaccard index. The results of both quantitative and qualitative assessment by the radiologist indicate that the DPAD despeckle filtering algorithm yields more clinically acceptable images and results in better segmentation of benign and malignant tumours from breast ultrasound images.

View all citing articles on Scopus

View full text

Segmentation of magnetic resonance images using a combination of neural networks and active contour models

Abstract

Introduction

Section snippets

Image data and labelling

Alternative segmentation techniques

Initial classification using a neural network

Quantifying segmentation performance

Results using squared-error cost function

Defining a new cost function

MLP training with the new cost function

Classification results with the new cost function

Post-processing with an active contour model

Conclusions

Acknowledgements

Comput Vision Graphics Image Process

Magn Reson Imaging

Magn Reson Imaging

Pattern Recog

Comput Vision Graphics Image Process

Comput Vision Graphics Image Process

Image Vision Comput

Image Vision Comput

Med Image Anal

Med Image Anal

Image Vision Comput

Comput Vision Graphics Image Process

The image processing handbook

Quantitative measurements for gamma camera images

Craniosynotosis: diagnostic value of 3D CT reconstructions

Radiology

An adaptive Bayesian approach to three-dimensional MR brain segmentation

The use of active shape models for locating structures in medical images

3D image understanding in radiology

IEEE Eng Med Biol Mag

Snakes: active contour models

Int J Comput Vision

Active contours: the application of techniques from graphics, control theory and statistics to visual tracking of shapes in motion

A neural network-based stochastic active model (NNS-SNAKE) for contour finding of distinct features

IEEE Trans Image Process

Segmentation of magnetic resonance images of the thorax by back-propagation

An analogue approach to the travelling salesman problem using an elastic net method

Nature

A semi-localized elastic net for surface reconstruction of objects from multislice images

Int J Neural Syst