Elsevier

Data & Knowledge Engineering

Volume 68, Issue 12, December 2009, Pages 1359-1369
Data & Knowledge Engineering

CBIR of spine X-ray images on inter-vertebral disc space and shape profiles using feature ranking and voting consensus

https://doi.org/10.1016/j.datak.2009.07.008Get rights and content

Abstract

Very limited research is published in the literature that applies content-based image retrieval (CBIR) techniques to retrieval of digitized spine X-ray images that combines inter-vertebral disc space and vertebral shape profiles. This paper describes a novel technique for retrieving vertebra pairs that exhibit a specified disc space narrowing (DSN) and inter-vertebral disc shape. DSN is characterized using spatial and geometrical features between two adjacent vertebrae. In order to obtain the best retrieval result, all selected features are ranked and assigned a weight to indicate their importance in the computation of the final similarity measure. Using a two phase algorithm, initial retrieval results are clustered and used to construct a voting committee to retrieve vertebra pairs with the highest DSN similarity. The overall retrieval accuracy is validated by a radiologist and proves that selected features combined with voting consensus are effective for DSN-based spine X-ray image retrieval.

Introduction

Osteoarthritis affects a significant portion of the elderly population in the United States [1]. Osteophytes, disc space narrowing (DSN), subluxation and spondylolisthesis are typical radiographic hallmarks characterizing this condition on the spine. The ability to retrieve spine X-ray images on these conditions could be very valuable to clinicians (radiologists), researchers of arthritis and musculoskeletal diseases, and educators. This paper focuses on the problem of retrieval of digitized X-ray images of the spine based on disk space narrowing coupled with vertebral shape content analysis.

Manually finding reference images from a large image database is a tedious and error prone process. An automatic CBIR system can significantly alleviate the problem of retrieving relevant images with specified DSN. Content-Based Image Retrieval (CBIR) techniques have been studied for nearly two decades. The techniques have been used for searching images in digital libraries, on the World Wide Web, and other applications such as trademark search [2]. Research on medical image retrieval, however, has been fairly recent [3], [4], [5], [6], [7], [8]. These efforts can be broadly categorized into two themes: (i) retrieval of biomedical images from a heterogeneous collection (images of different anatomy, modality, and detail) with little importance given to localized pathology, and (ii) retrieval of images from a homogenous collection (images of single modality, anatomy, and detail) with particular focus on the localized pathology. Our research [9] has been of the latter category.

Retrieval of medical images started with text-based retrieval and has grown to include image content-based retrieval with explosive growth in the acquisition and use of biomedical images. Mao and Chu [10] studied the vector space model (VSM) to automatically retrieve medical documents. Following the development of image processing and computer vision techniques, indexing and retrieval of the medical images based on content analysis became possible. Muller et al. [11] gave an overview of available literature in the field of content-based access to medical image data and on the technologies used in this field.

The Lister Hill National Center for Biomedical Communications, an intramural R&D division of the National Library of Medicine (NLM) at the National Institutes of Health (NIH), maintains an archive of digitized spine X-rays collected from the second National Health and Nutrition Examination Surveys (NHANES II) [12] which can serve as a reference collection for study of DSN. A prior study [13] proposed use of four scale-invariant, distance transform-based features to characterize spacing between adjacent vertebrae. K-means clustering and self-organizing map (SOM) were used to classify inter-vertebral disc space and assigned it a degree of DSN severity with an overall accuracy of 82.1%. A shortcoming of using this approach which has proven to be robust for automatically classification of severity level for shape-based CBIR is the lack of disc shape profiles. Using DSN severity level classification alone is insufficient for shape-based CBIR which is the focus of this work.

Vertebral shape is valuable in expressing the spine conditions described earlier. Fig. 1a shows two adjacent vertebrae outlined on a lumbar spine X-ray. As seen in the sagittal view, the inferior and superior edges of vertebrae adjacent to the disc can serve as the disc shape profile. Experienced radiologists use several criteria when evaluating DSN similarity between a candidate case and references from an atlas, for example. These criteria include the top to bottom size of the inter-vertebral gap, the length of the gap, and its configuration, i.e., whether there are spurs, concavities, convexities, irregularities, etc. Many of these disc space characteristics can be computed from disc shape profiles. In this paper, we present an approach that combines disc shape profile with computed inter-vertebral disc space features and uses voting consensus for finding similar images. In addressing this important problem, this effort makes advances the state of the art in CBIR taking advantage of clustering ensemble based machine learning methods [1], [14], [15], [16], [17].

The proposed algorithm and DSN similarity measures are discussed in Section 2. Feature ranking for weight computation of extracted features is presented in Section 3. Section 4 introduces the proposed voting consensus mechanism. Experimental results and analysis are presented in Section 5. Conclusions and future directions are described in Section 6.

Section snippets

DSN features selected for similarity measurement

X-ray images from the NHANES II data set used for this study are segmented using active contour segmentation method and the resulting 9-point and 36-point contour shapes are validated by a board certified radiologist. Examples of these contours are shown in Fig. 1b and c. Fig. 1b shows the 9-point model commonly used by radiologists. The left side (Points 8–7–9) of the vertebra is the anterior edge and the right side (Points 1–4) is the posterior edge. Fig. 1c shows a vertebra contour

Feature ranking for assigning feature weights

Feature selection has been used for assigning weights to features in CBIR in recent years [23], [24], [25]. Relevance feedback has been used to assign feature weights according to user’s judgment [23], [24]. Relevant features are selected and assigned greater weights [25]. Six features extracted in Section 2 are of different importance for measuring DSN similarity. They are categorized into two feature sets. Of these features, mean and standard deviation of distance, skewness, and I(Rd,2) are

Voting consensus

Generally, CBIR system retrieves images by comparing the query image against images in the database using similarity measures. Voting consensus has shown success in clustering ensemble [14], object classification [15], and information extraction [16]. In [17], authors used a clustering algorithm to retrieve clusters of images that are in the vicinity of the query image. These clusters can be deemed as semantic groups. This paper presents a voting consensus mechanism to achieve a similar task.

Results

A set of 801 cervical and 972 lumbar vertebral outlines (shapes) segmented from a total of 400 digitized spine X-ray images was used for performance evaluation. Ten disc pairs of both cervical and lumbar shapes were selected randomly as queries. From the set of cervical vertebrae outlines, pairs of adjacent vertebrae were used to identify discs. Three discs from the C3–C4 pair, 2 discs from the C4–C5 pair, 3 discs from the C5–C6 pair, and 2 discs from the C6–C7 vertebrae pair were selected as

Conclusion

This paper presents a novel approach for content-based retrieval of vertebra pairs using spatial, geometrical, and shape constraints applied to inter-vertebral disc space and using both the 9-point model familiar to radiologists and bone morphometrists and the computationally meaningful 36-point vertebral shape profiles. The mean and standard deviation of disc space distances and skewness measures are used as the spatial and geometrical properties of DSN. Furthermore, features such as

Acknowledgement

This work research was supported by the National Library of Medicine (NLM) under Contract No. HHSN276200700335P and the intramural research funds of the Lister Hill National Center for Biomedical Communications, the National Library of Medicine (NLM), and the National Institutes of Health (NIH).

Dah-Jye Lee received his B.S.E.E. from National Taiwan University of Science and Technology in 1984, M.S. and Ph.D. degrees in electrical engineering from Texas Tech University in 1987 and 1990, respectively. He also received his MBA degree from Shenandoah University, Winchester, Virginia in 1999.

He is currently a Professor in the Department of Electrical and Computer Engineering at Brigham Young University. He worked in the machine vision industry for eleven years prior to joining BYU in 2001.

References (34)

  • W. Mao et al.

    The phrase-based vector space model for automatic retrieval of free-text medical documents

    Data and Knowledge Engineering

    (2007)
  • H. Muller et al.

    A review of content-based image retrieval systems in medical applications – clinical benefits and future directions

    International Journal of Medical Informatics

    (2004)
  • P. Chamarthy et al.

    Image analysis techniques for characterizing disc space narrowing in cervical vertebrae interface

    Computerized medical Imaging and Graphics

    (2004)
  • Fact Sheet: Osteoarthritis. American College of Rheumatology, Atlanta, Georgia,...
  • A.W.M. Smeulders et al.

    Content-based image retrieval at the end of the early years

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2000)
  • I. El-Naqa et al.

    A similarity learning approach to content-based image retrieval: application to digital mammography

    IEEE Transactions on Medical Imaging

    (2004)
  • J. Kim et al.

    A new way for multidimensional medical data management: volume of interest (VOI)-based retrieval of medical images with visual and functional features

    IEEE Transactions on Information Technology in Biomedicine

    (2006)
  • A. Mojsilovic, J. Gomes, Semantic based categorization, browsing and retrieval in medical image databases,...
  • H. Greenspan et al.

    Medical image categorization and retrieval for PACS using the GMM-KL framework

    IEEE Transactions on Information Technology in Biomedicine

    (2007)
  • T.M. Lehmann et al.

    A monohierarchical multiaxial classification code for medical images in content-based retrieval

    IEEE International Symposium on Biomedical Imaging

    (2002)
  • W. Liu, Q.Y. Tong, medical image retrieval using salient point detector, in: IEEE-EMBS 27th Annual International...
  • X.Q. Xu et al.

    A spine X-ray image retrieval system using partial shape matching

    IEEE Transactions on Information Technology in Biomedicine

    (2008)
  • L.R. Long, G.R. Thoma, Image query and indexing for digital X-rays, in: SPIE Conference on Storage and Retrieval for...
  • H.G. Avad et al.

    Cumulative voting consensus method for partitions with variable number of clusters

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2008)
  • T.L. Berg et al.

    Animal on the web

    IEEE International Conference on Computer Vision and Pattern Recognition

    (2006)
  • G. Sigletos et al.

    Combining information extraction systems using voting and stacked generalization

    Journal of Machine Learning Research

    (2005)
  • Y. Chen et al.

    CLUE: clustering-based retrieval of image by unsupervised learning

    IEEE Transactions on Image Processing

    (2005)
  • Cited by (21)

    • Content-based large-scale medical image retrieval

      2019, Biomedical Information Technology
    • Feature selection in multimedia: The state-of-the-art review

      2017, Image and Vision Computing
      Citation Excerpt :

      Bhanu and Lin [92] and Yu and Srinath [72] validated the robustness of one algorithm on many databases. Nine papers validated one algorithm on a single database [52,71,78,79,83,85,88,93,94]. Elguebaly and Bouguila [14], Elguebaly and Bouguila [15], Yang et al. [20], Wang et al. [21], and Sun et al. [61] tested their algorithms against databases with different applications.

    • Multimodal medical imaging (CT and dynamic MRI) data and computer-graphics multi-physical model for the estimation of patient specific lumbar spine muscle forces

      2015, Data and Knowledge Engineering
      Citation Excerpt :

      Knowledge extraction from biomedical data is one of the most challenging topics in the health engineering [1–3]. From biomedical informatics point of view, knowledge could be derived from complex multimodal and multidimensional data [4–8]. In the context of biomedical applications, there are many approaches to extract the knowledge such as machine learning or physics-based simulation [9].

    • A graph-based approach for the retrieval of multi-modality medical images

      2014, Medical Image Analysis
      Citation Excerpt :

      In the medical domain, Huang et al. (2012) calculated the spatial relationships between adjacent objects through the intensity profile of the local neighbourhood of tumour ROIs. Spine X-ray retrieval has been achieved using partial shape matching (Xu et al., 2008), and the spatial and geometric features between adjacent vertebrae (Lee et al., 2009). Graphs are a more general approach to represent relational data (Bunke and Riesen, 2011).

    • A Lattice-Computing ensemble for reasoning based on formal fusion of disparate data types, and an industrial dispensing application

      2014, Information Fusion
      Citation Excerpt :

      In the domain of Soft Computing or, equivalently, Computational Intelligence, the term “hybrid (system/algorithm)” frequently denotes an integration of different techniques/technologies including artificial neural networks, fuzzy systems, evolutionary/swarm computing, etc. towards improving an index of performance in real-world applications [1,15]; the term “intelligence” is pertinent to decision-making, e.g. in pattern classification/recognition [82]; moreover, the term “(intelligent) fusion” may signify an aggregate intelligence towards improving decision-making [48]. In the aforementioned sense, a “hybrid intelligent fusion system” may be a Multiple Classifier System (MCS) [46,49] also known in the literature as Classifier Ensemble [16,59,65] or Committee [21,80] or Voting Consensus [5,51]. Note that a number of MCS architectures/strategies including applications have been reported [22,29,30,47,50,52,55,56,70,71,74,81,85,86].

    • A Content Based Image Retrieval Approach based on Multiple Multimedia Features Descriptors in E-health Environment

      2020, IEEE Medical Measurements and Applications, MeMeA 2020 - Conference Proceedings
    View all citing articles on Scopus

    Dah-Jye Lee received his B.S.E.E. from National Taiwan University of Science and Technology in 1984, M.S. and Ph.D. degrees in electrical engineering from Texas Tech University in 1987 and 1990, respectively. He also received his MBA degree from Shenandoah University, Winchester, Virginia in 1999.

    He is currently a Professor in the Department of Electrical and Computer Engineering at Brigham Young University. He worked in the machine vision industry for eleven years prior to joining BYU in 2001. His research work focuses on Medical informatics and imaging, shape-based pattern recognition, hardware implementation of real-time 3-D vision algorithms and machine vision applications.

    He is a senior member of IEEE and a member of SPIE. He has actively served the research community as a paper and proposal reviewer and conference organizer.

    Sameer Antani received his B.E. (Computer) from University of Pune in 1994, M.E. and Ph.D. degrees in Computer Science and Engineering from the Pennsylvania State University in 1998 and 2001, respectively.

    He is currently a Staff Scientist with the Lister Hill National Center for Biomedical Communications an intramural R&D division of the National Library of Medicine which is an institute within the US National Institutes of Health. His research interests are in data management of and retrieval from, large biomedical multimedia archives. His research includes content-based indexing, and retrieval of biomedical images, combining image and text retrieval, and next-generation documents that are enriched with interconnections to data sets and multimedia content.

    He is a member of IEEE and IEEE Computer Society. He serves on the steering committee for IEEE Symposium for Computer Based Medical Systems (CBMS). He is a reviewer for several journals including various IEEE transactions.

    Yuchou Chang was born in Hunan, China in 1980. He received his B.S. degree in automatic control department from Northwestern Polytechnical University, Xi’An, China, in 2003 and M.S. degree in institute of image processing and pattern recognition from Shanghai Jiao Tong University, Shanghai, China, in 2006. He worked in the Robotic Vision Laboratory in the Electrical and Computer Engineering Department at Brigham Young University as a research assistant from September 2006 to December 2008.

    His research interest includes machine learning-assisted multimedia analysis, image segmentation, image and video semantic content description, content-based multimedia indexing and retrieval. He is an IEEE student member.

    Kent Gledhill was born and raised in Utah. He attended undergraduate and medical school at the University of Utah in Salt Lake City, graduating in 1992. He completed his Radiology residency and Neuroradiology fellowship at the University of New Mexico in Albuquerque. He currently practices diagnostic and neuroradiology in Utah County. He serves on the Board of Directors of the Central Utah Clinic in Provo and has been the medical staff president and a board member at Timpanogos Regional Hospital in Orem. He lives in Provo, Utah with his wife and four children.

    L. Rodney Long received his B.A. and M.A. degrees in mathematics from the University of Texas in 1971 and 1976, respectively, and M.A. degree in applied mathematics from the University of Maryland in 1987.

    Since 1990 he has been an Electronics Engineer in the Communications Engineering Branch of the National Library of Medicine. He previously worked for 14 years in industry as a programmer and engineer. His research work is concerned with Content-Based Image Retrieval, image processing, and image databases for biomedical applications.

    He has been a member of IEEE since 1986 and has served as co-chair of the IEEE International Symposium of Computer-Based Medical Systems.

    Paul Christensen is currently an undergraduate student in the Computer Science Department at Brigham Young University. He is expecting to receive his B.S. degree in December 2009. He served a two-year church mission for The Church of Jesus Christ of Latter-day Saints in Edinburgh, Scotland from 2004 to 2006. He worked as a research assistant in the Robotic Vision Lab at BYU from 2006 to 2008 and is currently an intern at Intel in Hillsboro, OR.

    View full text