CBIR of spine X-ray images on inter-vertebral disc space and shape profiles using feature ranking and voting consensus

doi:10.1016/j.datak.2009.07.008

Data & Knowledge Engineering

Volume 68, Issue 12, December 2009, Pages 1359-1369

https://doi.org/10.1016/j.datak.2009.07.008

Abstract

Little published research applies content-based image retrieval (CBIR) techniques to digitized spine X-ray images in a way that combines inter-vertebral disc space and vertebral shape profiles. This paper describes a novel technique for retrieving vertebra pairs that exhibit a specified disc space narrowing (DSN) and inter-vertebral disc shape. DSN is characterized using spatial and geometrical features between two adjacent vertebrae. To obtain the best retrieval result, all selected features are ranked and assigned a weight that indicates their importance in computing the final similarity measure. Using a two-phase algorithm, initial retrieval results are clustered and used to construct a voting committee that retrieves the vertebra pairs with the highest DSN similarity. The overall retrieval accuracy is validated by a radiologist and shows that the selected features combined with voting consensus are effective for DSN-based spine X-ray image retrieval.

Introduction

Osteoarthritis affects a significant portion of the elderly population in the United States [1]. Osteophytes, disc space narrowing (DSN), subluxation, and spondylolisthesis are typical radiographic hallmarks of this condition in the spine. The ability to retrieve spine X-ray images based on these conditions could be very valuable to clinicians (radiologists), researchers of arthritis and musculoskeletal diseases, and educators. This paper focuses on the retrieval of digitized X-ray images of the spine based on disc space narrowing coupled with vertebral shape content analysis.

Manually finding reference images in a large image database is a tedious and error-prone process. An automatic CBIR system can significantly ease the retrieval of relevant images with a specified DSN. Content-based image retrieval (CBIR) techniques have been studied for nearly two decades. The techniques have been used for searching images in digital libraries, on the World Wide Web, and in other applications such as trademark search [2]. Research on medical image retrieval, however, has been fairly recent [3], [4], [5], [6], [7], [8]. These efforts can be broadly categorized into two themes: (i) retrieval of biomedical images from a heterogeneous collection (images of different anatomy, modality, and detail) with little importance given to localized pathology, and (ii) retrieval of images from a homogeneous collection (images of a single modality, anatomy, and detail) with particular focus on the localized pathology. Our research [9] has been of the latter category.

Retrieval of medical images started with text-based retrieval and has grown to include content-based retrieval as the acquisition and use of biomedical images have grown explosively. Mao and Chu [10] studied the vector space model (VSM) to automatically retrieve medical documents. Following the development of image processing and computer vision techniques, indexing and retrieval of medical images based on content analysis became possible. Muller et al. [11] gave an overview of the available literature on content-based access to medical image data and on the technologies used in this field.

The Lister Hill National Center for Biomedical Communications, an intramural R&D division of the National Library of Medicine (NLM) at the National Institutes of Health (NIH), maintains an archive of digitized spine X-rays collected from the second National Health and Nutrition Examination Survey (NHANES II) [12], which can serve as a reference collection for the study of DSN. A prior study [13] proposed the use of four scale-invariant, distance transform-based features to characterize the spacing between adjacent vertebrae. K-means clustering and a self-organizing map (SOM) were used to classify the inter-vertebral disc space and assign it a degree of DSN severity, with an overall accuracy of 82.1%. This approach has proven robust for automatic classification of severity level, but its shortcoming for shape-based CBIR is the lack of disc shape profiles. DSN severity level classification alone is insufficient for shape-based CBIR, which is the focus of this work.

Vertebral shape is valuable in expressing the spine conditions described earlier. Fig. 1a shows two adjacent vertebrae outlined on a lumbar spine X-ray. As seen in the sagittal view, the inferior and superior edges of the vertebrae adjacent to the disc can serve as the disc shape profile. Experienced radiologists use several criteria when evaluating DSN similarity between a candidate case and references, for example from an atlas. These criteria include the top-to-bottom size of the inter-vertebral gap, the length of the gap, and its configuration, i.e., whether there are spurs, concavities, convexities, irregularities, etc. Many of these disc space characteristics can be computed from disc shape profiles. In this paper, we present an approach that combines the disc shape profile with computed inter-vertebral disc space features and uses voting consensus for finding similar images. In addressing this important problem, this effort advances the state of the art in CBIR by taking advantage of clustering ensemble-based machine learning methods [1], [14], [15], [16], [17].

The proposed algorithm and DSN similarity measures are discussed in Section 2. Feature ranking for weight computation of extracted features is presented in Section 3. Section 4 introduces the proposed voting consensus mechanism. Experimental results and analysis are presented in Section 5. Conclusions and future directions are described in Section 6.

Section snippets

DSN features selected for similarity measurement

X-ray images from the NHANES II data set used for this study are segmented using an active contour segmentation method, and the resulting 9-point and 36-point contour shapes are validated by a board-certified radiologist. Examples of these contours are shown in Fig. 1b and c. Fig. 1b shows the 9-point model commonly used by radiologists. The left side (Points 8–7–9) of the vertebra is the anterior edge and the right side (Points 1–4) is the posterior edge. Fig. 1c shows a vertebra contour
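As a rough illustration of how distance statistics could be derived from contour points, the sketch below computes the mean, standard deviation, and skewness of point-wise gaps between two edge profiles. It is not the paper's feature extraction: the function name `disc_space_stats`, the (N, 2) input layout, and the pairing of points by index are all assumptions made for this example.

```python
import numpy as np


def disc_space_stats(upper_edge, lower_edge):
    """Return (mean, std, skewness) of point-wise gaps between two edges.

    upper_edge, lower_edge: (N, 2) sequences of (x, y) points, assumed to be
    sampled at matching positions along the inferior edge of the upper
    vertebra and the superior edge of the lower vertebra (hypothetical layout).
    """
    upper = np.asarray(upper_edge, dtype=float)
    lower = np.asarray(lower_edge, dtype=float)
    if upper.shape != lower.shape or upper.ndim != 2 or upper.shape[1] != 2:
        raise ValueError("edges must be matching (N, 2) arrays")

    # Euclidean distance between corresponding points on the two edges.
    gaps = np.linalg.norm(upper - lower, axis=1)

    mean = gaps.mean()
    std = gaps.std()  # population standard deviation
    # Third standardized moment; taken as 0 when every gap is identical.
    skew = 0.0 if std == 0 else float(np.mean(((gaps - mean) / std) ** 3))
    return float(mean), float(std), skew
```

A positive skewness here would indicate that a few sample points have a much wider gap than the rest, which is one way an asymmetric disc space could show up in such statistics.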

Feature ranking for assigning feature weights

Feature selection has been used in recent years to assign weights to features in CBIR [23], [24], [25]. Relevance feedback has been used to assign feature weights according to the user's judgment [23], [24]. Relevant features are selected and assigned greater weights [25]. The six features extracted in Section 2 differ in importance for measuring DSN similarity. They are categorized into two feature sets. Of these features, mean and standard deviation of distance, skewness, and I(R_d,2) are
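The role of feature weights in a similarity measure can be sketched as follows. This is a minimal example under stated assumptions, not the paper's measure: it uses a weighted Euclidean distance, assumes the features have already been scaled to comparable ranges, and the names `weighted_distance` and `rank_candidates` are hypothetical.

```python
import numpy as np


def weighted_distance(query, candidates, weights):
    """Weighted Euclidean distance from one query vector to each candidate row.

    weights: non-negative importance values, one per feature (assumed to come
    from some feature-ranking step); they are normalised to sum to 1.
    """
    q = np.asarray(query, dtype=float)
    c = np.asarray(candidates, dtype=float)
    w = np.asarray(weights, dtype=float)
    if np.any(w < 0) or w.sum() == 0:
        raise ValueError("weights must be non-negative and not all zero")
    w = w / w.sum()
    return np.sqrt((((c - q) ** 2) * w).sum(axis=1))


def rank_candidates(query, candidates, weights):
    """Return candidate indices ordered from most to least similar."""
    distances = weighted_distance(query, candidates, weights)
    return np.argsort(distances, kind="stable")
```

With two candidates that each differ from the query in a single feature, raising the weight of one feature pushes the candidate that differs in that feature further down the ranking.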

Voting consensus

Generally, a CBIR system retrieves images by comparing the query image against images in the database using similarity measures. Voting consensus has shown success in clustering ensembles [14], object classification [15], and information extraction [16]. In [17], the authors used a clustering algorithm to retrieve clusters of images that are in the vicinity of the query image. These clusters can be deemed semantic groups. This paper presents a voting consensus mechanism to achieve a similar task.
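The general idea of clustering an initial result list and letting a committee vote can be sketched as below. The details are assumptions for illustration only and should not be read as the paper's algorithm: the committee is taken to be the cluster of initial results whose centroid is nearest the query, each member votes for its own nearest database items, and `vote_retrieve`, `top_k`, `n_clusters`, and `votes_per_member` are hypothetical names.

```python
import numpy as np


def _kmeans(points, k, iters=20):
    """Tiny k-means with deterministic initialisation (evenly spaced rows)."""
    k = min(k, len(points))
    idx = np.linspace(0, len(points) - 1, k).astype(int)
    centroids = points[idx].copy()
    labels = np.zeros(len(points), dtype=int)
    for _ in range(iters):
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members):  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    return labels, centroids


def vote_retrieve(query, database, top_k=6, n_clusters=2, votes_per_member=3):
    """Two-phase retrieval sketch: initial ranking, then committee voting."""
    db = np.asarray(database, dtype=float)
    q = np.asarray(query, dtype=float)
    d_query = np.linalg.norm(db - q, axis=1)

    # Phase 1: initial retrieval of the top_k items nearest the query.
    initial = np.argsort(d_query, kind="stable")[:top_k]

    # Cluster the initial results; the committee is the non-empty cluster
    # whose centroid lies closest to the query.
    labels, centroids = _kmeans(db[initial], n_clusters)
    sizes = np.bincount(labels, minlength=len(centroids))
    c_dist = np.linalg.norm(centroids - q, axis=1)
    c_dist[sizes == 0] = np.inf
    committee = initial[labels == c_dist.argmin()]

    # Phase 2: each committee member votes for its nearest database items.
    votes = np.zeros(len(db), dtype=int)
    for member in committee:
        d_member = np.linalg.norm(db - db[member], axis=1)
        votes[np.argsort(d_member, kind="stable")[:votes_per_member]] += 1

    # Rank by vote count (descending), breaking ties by distance to the query.
    order = np.lexsort((d_query, -votes))
    return [int(i) for i in order if votes[i] > 0]
```

In this sketch an item that enters the initial list only because nothing closer was available tends to fall into a different cluster from the query's neighbours and therefore receives no votes.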

Results

A set of 801 cervical and 972 lumbar vertebral outlines (shapes) segmented from a total of 400 digitized spine X-ray images was used for performance evaluation. Ten disc pairs of both cervical and lumbar shapes were selected randomly as queries. From the set of cervical vertebrae outlines, pairs of adjacent vertebrae were used to identify discs. Three discs from the C3–C4 pair, 2 discs from the C4–C5 pair, 3 discs from the C5–C6 pair, and 2 discs from the C6–C7 vertebrae pair were selected as

Conclusion

This paper presents a novel approach for content-based retrieval of vertebra pairs using spatial, geometrical, and shape constraints applied to inter-vertebral disc space and using both the 9-point model familiar to radiologists and bone morphometrists and the computationally meaningful 36-point vertebral shape profiles. The mean and standard deviation of disc space distances and skewness measures are used as the spatial and geometrical properties of DSN. Furthermore, features such as

Acknowledgement

This research was supported by the National Library of Medicine (NLM) under Contract No. HHSN276200700335P and the intramural research funds of the Lister Hill National Center for Biomedical Communications, the National Library of Medicine (NLM), and the National Institutes of Health (NIH).

Dah-Jye Lee received his B.S.E.E. from National Taiwan University of Science and Technology in 1984, M.S. and Ph.D. degrees in electrical engineering from Texas Tech University in 1987 and 1990, respectively. He also received his MBA degree from Shenandoah University, Winchester, Virginia in 1999.

References (34)

W. Mao et al.
The phrase-based vector space model for automatic retrieval of free-text medical documents
Data and Knowledge Engineering
(2007)

H. Muller et al.
A review of content-based image retrieval systems in medical applications – clinical benefits and future directions
International Journal of Medical Informatics
(2004)

P. Chamarthy et al.
Image analysis techniques for characterizing disc space narrowing in cervical vertebrae interface
Computerized Medical Imaging and Graphics
(2004)

Fact Sheet: Osteoarthritis. American College of Rheumatology, Atlanta, Georgia,...

A.W.M. Smeulders et al.
Content-based image retrieval at the end of the early years
IEEE Transactions on Pattern Analysis and Machine Intelligence
(2000)

I. El-Naqa et al.
A similarity learning approach to content-based image retrieval: application to digital mammography
IEEE Transactions on Medical Imaging
(2004)

J. Kim et al.
A new way for multidimensional medical data management: volume of interest (VOI)-based retrieval of medical images with visual and functional features
IEEE Transactions on Information Technology in Biomedicine
(2006)

A. Mojsilovic, J. Gomes, Semantic based categorization, browsing and retrieval in medical image databases,...

H. Greenspan et al.
Medical image categorization and retrieval for PACS using the GMM-KL framework
IEEE Transactions on Information Technology in Biomedicine
(2007)

T.M. Lehmann et al.
A monohierarchical multiaxial classification code for medical images in content-based retrieval
IEEE International Symposium on Biomedical Imaging
(2002)

W. Liu, Q.Y. Tong, Medical image retrieval using salient point detector, in: IEEE-EMBS 27th Annual International...

X.Q. Xu et al.
A spine X-ray image retrieval system using partial shape matching
IEEE Transactions on Information Technology in Biomedicine
(2008)

L.R. Long, G.R. Thoma, Image query and indexing for digital X-rays, in: SPIE Conference on Storage and Retrieval for...

H.G. Ayad et al.
Cumulative voting consensus method for partitions with variable number of clusters
IEEE Transactions on Pattern Analysis and Machine Intelligence
(2008)

T.L. Berg et al.
Animals on the web
IEEE International Conference on Computer Vision and Pattern Recognition
(2006)

G. Sigletos et al.
Combining information extraction systems using voting and stacked generalization
Journal of Machine Learning Research
(2005)

Y. Chen et al.
CLUE: clustering-based retrieval of image by unsupervised learning
IEEE Transactions on Image Processing
(2005)


Dah-Jye Lee is currently a Professor in the Department of Electrical and Computer Engineering at Brigham Young University. He worked in the machine vision industry for eleven years prior to joining BYU in 2001. His research focuses on medical informatics and imaging, shape-based pattern recognition, hardware implementation of real-time 3-D vision algorithms, and machine vision applications.

He is a senior member of IEEE and a member of SPIE. He has actively served the research community as a paper and proposal reviewer and conference organizer.

Sameer Antani received his B.E. (Computer) from University of Pune in 1994, M.E. and Ph.D. degrees in Computer Science and Engineering from the Pennsylvania State University in 1998 and 2001, respectively.

He is currently a Staff Scientist with the Lister Hill National Center for Biomedical Communications, an intramural R&D division of the National Library of Medicine, which is an institute within the US National Institutes of Health. His research interests are in data management of, and retrieval from, large biomedical multimedia archives. His research includes content-based indexing and retrieval of biomedical images, combining image and text retrieval, and next-generation documents that are enriched with interconnections to data sets and multimedia content.

He is a member of IEEE and IEEE Computer Society. He serves on the steering committee for IEEE Symposium for Computer Based Medical Systems (CBMS). He is a reviewer for several journals including various IEEE transactions.

Yuchou Chang was born in Hunan, China in 1980. He received his B.S. degree from the Department of Automatic Control at Northwestern Polytechnical University, Xi’an, China, in 2003 and his M.S. degree from the Institute of Image Processing and Pattern Recognition at Shanghai Jiao Tong University, Shanghai, China, in 2006. He worked in the Robotic Vision Laboratory in the Electrical and Computer Engineering Department at Brigham Young University as a research assistant from September 2006 to December 2008.

His research interests include machine learning-assisted multimedia analysis, image segmentation, image and video semantic content description, and content-based multimedia indexing and retrieval. He is an IEEE student member.

Kent Gledhill was born and raised in Utah. He attended undergraduate and medical school at the University of Utah in Salt Lake City, graduating in 1992. He completed his Radiology residency and Neuroradiology fellowship at the University of New Mexico in Albuquerque. He currently practices diagnostic and neuroradiology in Utah County. He serves on the Board of Directors of the Central Utah Clinic in Provo and has been the medical staff president and a board member at Timpanogos Regional Hospital in Orem. He lives in Provo, Utah with his wife and four children.

L. Rodney Long received his B.A. and M.A. degrees in mathematics from the University of Texas in 1971 and 1976, respectively, and M.A. degree in applied mathematics from the University of Maryland in 1987.

Since 1990 he has been an Electronics Engineer in the Communications Engineering Branch of the National Library of Medicine. He previously worked for 14 years in industry as a programmer and engineer. His research work is concerned with Content-Based Image Retrieval, image processing, and image databases for biomedical applications.

He has been a member of IEEE since 1986 and has served as co-chair of the IEEE International Symposium of Computer-Based Medical Systems.

Paul Christensen is currently an undergraduate student in the Computer Science Department at Brigham Young University. He is expecting to receive his B.S. degree in December 2009. He served a two-year church mission for The Church of Jesus Christ of Latter-day Saints in Edinburgh, Scotland from 2004 to 2006. He worked as a research assistant in the Robotic Vision Lab at BYU from 2006 to 2008 and is currently an intern at Intel in Hillsboro, OR.
