Human visual system inspired multi-modal medical image fusion framework

https://doi.org/10.1016/j.eswa.2012.09.011

Abstract

Multi-modal medical image fusion, a powerful tool for clinical applications, has developed with the advent of various imaging modalities in medical imaging. Its main motivation is to capture the most relevant information from the source images in a single output, which plays an important role in medical diagnosis. In this paper, a novel framework for medical image fusion based on the framelet transform is proposed, designed around the characteristics of the human visual system (HVS). The core idea is to decompose all source images by the framelet transform and to combine the resulting coefficients with two HVS-inspired fusion rules, one for the low-frequency coefficients and one for the high-frequency coefficients: the former is based on a visibility measure, the latter on texture information. Finally, the fused image is constructed by applying the inverse framelet transform to the composite coefficients. Experiments on several multi-modal medical image pairs demonstrate the efficiency and suitability of the proposed framework, and comparison with existing algorithms confirms its enhanced performance.

Highlights

► This paper presents a novel multi-modal medical image fusion framework for better diagnosis.
► The proposed framework relies on the framelet transform and human visual system characteristics.
► Two new fusion rules are proposed to fuse the low- and high-frequency bands.
► The efficiency of the framework is demonstrated by experiments on CT/MRI and PET/MRI images.
► The superiority of the framework is established by comparison with state-of-the-art algorithms.

Introduction

With rapid development in high-tech and advanced instrumentation, medical imaging has become a vital component of a large number of applications, including diagnosis, research, and treatment. This development has enabled radiologists to quickly acquire images of the human body and its internal structures with effective resolution and realism. These images come from multiple modalities, such as X-ray, computed tomography (CT), magnetic resonance imaging (MRI), magnetic resonance angiography (MRA), and positron emission tomography (PET) (Maes, Vandermeulen, & Suetens, 2003). Multi-modality medical images usually provide complementary, and occasionally conflicting, information. For example, X-ray and CT can depict dense structures such as bones and implants with little distortion, but they cannot detect physiological changes (Aguilar & Garrett, 2001). Similarly, normal and pathological soft tissue is better visualized by MRI, whereas PET provides better information on blood flow and metabolic activity, albeit with low spatial resolution. For medical diagnosis, treatment planning, and evaluation, the complementary information obtained from different modality images is needed. For example, combined PET/CT imaging can concurrently visualize anatomical and physiological characteristics of the human body and can be used in oncology to view tumor activity in conjunction with anatomical references. Combined PET/CT imaging is also very useful in organ diagnosis where tumor boundaries are difficult to discern (Baum et al., 2008; Kamman et al., 1989).

Hence, the fusion of multi-modal medical images is necessary, and it has become a promising and very challenging research area. Image fusion can be defined as the process in which the important features of multiple input images are combined into a single image with minimal loss of information. Medical image fusion aims at integrating complementary as well as redundant information from multiple modality images to obtain a more complete and accurate description of the same object. It provides easy access to the PET/CT/MRI images at a single reading location, allowing radiologists to report PET/CT/MRI studies quickly and efficiently (Rojas, Raff, Quintana, Huete, & Hutchinson, 2007).

So far, many effective image fusion techniques have been proposed in the literature, especially for medical images. The simplest are pixel-by-pixel methods (Bhatnagar et al., 2010; Jiang & Tian, 2011; Xydeas & Petrovic, 2002; Yang & Li, 2012), i.e., gray-level averaging or selection among the source images, but these lead to undesirable side effects such as reduced contrast. Other categories include fusion methods based on statistical and numerical techniques (Cardinali & Nason, 2005; Wang & Ma, 2008), intensity-hue-saturation (IHS) (Daneshvar & Ghassemian, 2010; Tu et al., 2001), principal component analysis (PCA) (Chavez & Kwarteng, 1989), independent component analysis (ICA) (McKeown et al., 1998), the contrast pyramid (Burt & Kolczynski, 1993), the gradient pyramid (Petrovic & Xydeas, 2004), and multiresolution analysis (Boussion et al., 2008; Bhatnagar & Raman, 2009; Bhatnagar & Wu, 2012; Guihong et al., 2001; Li & Wang, 2011; Li et al., 1995; Miao et al., 2011; Nemec et al., 2010; Wu et al., 2012; Yang et al., 2008; Yang et al., 2010; Zhang & Guo, 2009). Statistical and numerical methods involve heavy floating-point computation and are therefore time- and memory-consuming. The IHS method represents the low-spatial-resolution image in the IHS system and then substitutes the intensity component with a high-resolution image. In the PCA/ICA methods, the original images are transformed into uncorrelated components, which are fused by choosing the maximum value among them; PCA/ICA is frequently used for fusion because of its ability to compact redundant data into fewer bands. In pyramid- and multiresolution-based methods, the source images are decomposed by a pyramid or wavelet transform, and fusion is performed on the transform coefficients. These methods produce very good results in less computation time and with less memory than the others; however, they often produce undesirable side effects such as block artifacts and reduced contrast, which can result in a wrong diagnosis (Li, Yang, & Hu, 2011).
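To make the PCA-based category concrete, the sketch below (not from the paper; `img_a` and `img_b` are assumed to be pre-registered, same-size grayscale NumPy arrays) derives fusion weights from the dominant eigenvector of the 2 × 2 covariance of the two images, one common textbook formulation:

```python
import numpy as np

def pca_fuse(img_a: np.ndarray, img_b: np.ndarray) -> np.ndarray:
    """Fuse two images with weights taken from the dominant principal component."""
    data = np.stack([img_a.ravel(), img_b.ravel()])  # 2 x N observations
    eigvals, eigvecs = np.linalg.eigh(np.cov(data))  # eigenvalues in ascending order
    pc = np.abs(eigvecs[:, -1])                      # dominant eigenvector
    w = pc / pc.sum()                                # normalize into fusion weights
    return w[0] * img_a + w[1] * img_b
```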

This paper attempts to rectify the drawbacks of multiresolution transforms in medical image fusion. For this purpose, the framelet transform is used in the proposed framework. First, all input images are decomposed into low- and high-frequency bands, and fusion is carried out with regard to the physical meaning of each band. To this end, two new HVS-based fusion rules are developed for fusing the low- and high-frequency bands, respectively. The low-frequency bands are fused using visibility as the fusion measure, instead of the simple averaging used in existing schemes, whereas the high-frequency bands are fused based on texture information obtained from an HVS model. The HVS model used is the Smallest Univalue Segment Assimilating Nucleus (SUSAN), a feature extraction operator. The fused medical image produced by this scheme presents a visually better representation than the input images. Moreover, the compatibility and superiority of the proposed method can be judged by comparison with existing methods. The salient contributions of the proposed framework over existing methods are summarized below; a minimal code sketch of the overall pipeline follows the list.

  • This paper proposes a new image fusion framework for multi-modal medical images, which relies on human visual system characteristics in the framelet domain.

  • Two different fusion rules are proposed for combining low- and high-frequency coefficients considering their physical meaning.

  • For fusing the low-frequency coefficients, a visibility-based process is used. The main benefit of visibility is that it selects and combines the low-frequency coefficients from the focused parts of the images.

  • In contrast, the texture information obtained from the SUSAN feature extractor is used to combine the high-frequency coefficients. Using SUSAN features, the most prominent texture and edge information is selected from the high-frequency coefficients and combined into the fused ones.

  • The efficiency of the proposed framework is demonstrated by experiments on medical images of different modalities, whereas its superiority is established by comparison with state-of-the-art algorithms.

  • Further, two clinical case studies, of patients affected by Alzheimer's disease and by a tumor, are also presented for a more elaborate performance comparison.
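The following minimal sketch outlines the overall pipeline under stated assumptions: PyWavelets' `dwt2`/`idwt2` with a Daubechies wavelet stands in for the framelet transform, the `visibility` function is one common formulation from the fusion literature (the paper's exact definition may differ), and local energy stands in for the SUSAN-based texture measure used in the paper:

```python
import numpy as np
import pywt
from scipy.ndimage import uniform_filter

def visibility(band: np.ndarray, size: int = 3) -> np.ndarray:
    """Local visibility: mean absolute deviation from the local mean,
    normalized by that mean (one common formulation; the paper's exact
    definition may differ)."""
    mu = uniform_filter(band, size)
    return uniform_filter(np.abs(band - mu), size) / (np.abs(mu) + 1e-9)

def fuse(img_a: np.ndarray, img_b: np.ndarray, wavelet: str = "db2") -> np.ndarray:
    """One-level decompose / fuse / reconstruct pipeline."""
    cA_a, highs_a = pywt.dwt2(img_a, wavelet)
    cA_b, highs_b = pywt.dwt2(img_b, wavelet)

    # Low-frequency rule: keep the coefficient with the higher local
    # visibility (instead of plain averaging).
    cA = np.where(visibility(cA_a) >= visibility(cA_b), cA_a, cA_b)

    # High-frequency rule: keep the coefficient with the stronger local
    # texture; local energy stands in for the SUSAN-based measure.
    fused_highs = tuple(
        np.where(uniform_filter(da * da, 3) >= uniform_filter(db * db, 3), da, db)
        for da, db in zip(highs_a, highs_b)
    )
    return pywt.idwt2((cA, fused_highs), wavelet)
```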

The rest of the paper is organized as follows. The framelet transform and its benefits in image fusion are described in Sections 2 and 3, respectively. The proposed multi-modal medical image fusion framework is explained in Section 4, followed by experimental results and discussion in Section 5. Finally, concluding remarks are given in Section 6.

Section snippets

Framelet transform

The framelet transform (Chui & He, 2000; Selesnick, 2001; Selesnick & Abdelnour, 2004; Selesnick & Sendur, 2000) is very similar to the wavelet transform but has some important differences. In particular, the framelet transform has one scaling function ϕ(t) and two wavelet functions ψ₁(t) and ψ₂(t), whereas the wavelet transform has one scaling function ϕ(t) and one wavelet function ψ(t).

Let us suppose the low-pass and high-pass filters associated with ϕ(t), ψ₁(t), and ψ₂(t) are h₀(n), h₁(n), and h₂(n)
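As one concrete instance of such a filter triple, the sketch below uses the well-known piecewise-linear B-spline framelet filters (a standard construction; the paper's filters may differ) and numerically verifies, for the undecimated one-level transform, that applying the transpose (adjoint) of the analysis operator reconstructs the signal exactly, which is the tight-frame property discussed in the next section:

```python
import numpy as np

# Piecewise-linear B-spline framelet filters: one low-pass h0 and two
# high-pass filters h1, h2, satisfying |H0|^2 + |H1|^2 + |H2|^2 = 1.
h0 = np.array([1.0, 2.0, 1.0]) / 4.0
h1 = np.array([1.0, 0.0, -1.0]) * np.sqrt(2.0) / 4.0
h2 = np.array([-1.0, 2.0, -1.0]) / 4.0

def analysis(x: np.ndarray) -> list:
    """Undecimated one-level framelet analysis: three subbands per signal."""
    return [np.convolve(x, h, mode="full") for h in (h0, h1, h2)]

def synthesis(bands: list, n: int) -> np.ndarray:
    """Synthesis with the transpose of analysis: correlate each subband
    with its own filter and sum; the tight-frame property makes this the
    exact inverse."""
    out = np.zeros(n)
    for y, h in zip(bands, (h0, h1, h2)):
        full = np.convolve(y, h[::-1], mode="full")   # correlation via reversal
        out += full[len(h) - 1 : len(h) - 1 + n]      # undo the 'full' padding
    return out

x = np.random.randn(64)
assert np.allclose(synthesis(analysis(x), x.size), x)  # perfect reconstruction
```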

Benefits of framelet transform in image fusion

The framelet transform has useful properties that wavelet-like transforms lack; indeed, it is these properties that allow it to overcome the drawbacks of existing wavelet and related transforms in image fusion. The following properties motivated its use for image fusion here.

  • The framelet transform builds wavelet tight frames from iterated oversampled filter banks. The main benefit of tight frames is that the signal is reconstructed with the transpose

Proposed fusion technique

The aim of this paper is to transfer the most relevant information available in the source images into a new composite image with the least amount of processing. A new, efficient framework that combines the advantages of the framelet transform and the human visual system is developed. Before presenting the proposed framework, the human visual system models it uses are described first.
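For reference, here is a minimal sketch of the SUSAN response of Smith and Brady, on which the texture-based rule builds; the similarity threshold `t`, the mask radius, and the geometric threshold g = 3/4 of the mask area are the customary illustrative defaults, not necessarily the paper's settings:

```python
import numpy as np

def susan_response(img: np.ndarray, t: float = 27.0, radius: int = 3) -> np.ndarray:
    """SUSAN edge-strength map (after Smith & Brady). For each pixel, the
    'nucleus', accumulate a smooth similarity score over a circular mask
    (the USAN area); the response is g minus the area, clipped at zero."""
    imgf = img.astype(float)
    h, w = imgf.shape
    yy, xx = np.mgrid[-radius:radius + 1, -radius:radius + 1]
    offsets = [(dy, dx) for dy, dx in zip(yy.ravel(), xx.ravel())
               if dy * dy + dx * dx <= radius * radius]
    pad = np.pad(imgf, radius, mode="edge")
    area = np.zeros((h, w))
    for dy, dx in offsets:
        shifted = pad[radius + dy:radius + dy + h, radius + dx:radius + dx + w]
        area += np.exp(-(((shifted - imgf) / t) ** 6))  # smooth comparison
    g = 0.75 * len(offsets)           # geometric threshold (edge variant)
    return np.maximum(0.0, g - area)  # large where the USAN is small
```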

Experimental results and discussions

In medical diagnostics, imaging methods such as computed tomography (CT), ultrasound, positron emission tomography (PET), and nuclear magnetic resonance (NMR) assist the physician in localizing abnormal masses and give an easy overview of anatomic details. All these imaging methods have their own characteristics and drawbacks. For example, excellent views of bones and other dense structures are given by CT images, whereas excellent views of soft tissues are given perfectly

Conclusions

In this paper, a novel multi-modal medical image fusion algorithm based on the framelet transform is proposed. Considering human visual system characteristics, two different fusion rules are proposed to fuse the low- and high-frequency sub-bands, respectively. The proposed algorithm preserves more information in the fused image with improved quality. The human visual system models, encompassing image visibility and the SUSAN feature extractor, are adopted as the fusion measures for coefficient’s

Acknowledgments

This work was supported by the Canada Research Chair program and a Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant.

References (44)

  • I.W. Selesnick et al., Symmetric wavelet tight frames with two generators, Applied and Computational Harmonic Analysis (2004).
  • W. Shi et al., Wavelet-based image fusion and quality assessment, International Journal of Applied Earth Observation and Geoinformation (2005).
  • T.M. Tu et al., A new look at IHS-like image fusion methods, Information Fusion (2001).
  • Z.B. Wang et al., Medical image fusion using m-PCNN, Information Fusion (2008).
  • C. Wu et al., Ultrasonic liver tissue characterization by feature fusion, Expert Systems with Applications (2012).
  • L. Yang et al., Multimodality medical image fusion based on multiscale geometric analysis of contourlet transform, Neurocomputing (2008).
  • B. Yang et al., Pixel-level image fusion with simultaneous orthogonal matching pursuit, Information Fusion (2012).
  • S. Yang et al., Image fusion based on a new contourlet packet, Information Fusion (2010).
  • Q. Zhang et al., Multifocus image fusion using the nonsubsampled contourlet transform, Signal Processing (2009).
  • Aguilar, M., & Garrett, A. L. (2001). Neuro-physiologically motivated sensor fusion for visualization and...
  • K.G. Baum et al., Fusion viewer: a new tool for fusion and visualization of multimodal medical data sets, Journal of Digital Imaging (2008).
  • Bhatnagar, G., Jonathan Wu, Q. M., & Raman, B. (2010). Real time human visual system based framework for image fusion....