Skip to main content
Top
Published in: Neural Processing Letters 3/2020

06-04-2020

Md-Net: Multi-scale Dilated Convolution Network for CT Images Segmentation

Authors: Haiying Xia, Weifan Sun, Shuxiang Song, Xiangwei Mou

Published in: Neural Processing Letters | Issue 3/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Accurate CT image segmentation is of great importance to the clinical diagnosis. Due to the high similarity of gray values in CT image, the segmented areas are easily affected by their surroundings, which leads to the loss of semantic information. In this paper, we propose a multi-scale dilated convolution network (Md-Net) for CT image segmentation with superior segmentation performance compared with state-of-the-art methods. Specifically, our Md-Net utilizes the dilated convolutions with different sizes to form feature pyramids for extracting the semantic information. Moreover, we use a weighted Diceloss to accelerate the convergence in training process. Meanwhile, the bilinear interpolation and multiple convolutions are taken to reduce the computational cost. Experiment results show that our proposed Md-Net outperforms the representative medical image segmentation methods, including Unet, Unet++, MaskRcnn and CE-Net, in terms of sensitivity, accuracy and area under curve both on lung dataset and Bladder dataset.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Al-Kofahi Y, Lassoued W, Lee W, Roysam B (2009) Improved automatic detection and segmentation of cell nuclei in histopathology images. IEEE Trans Biomed Eng 57(4):841CrossRef Al-Kofahi Y, Lassoued W, Lee W, Roysam B (2009) Improved automatic detection and segmentation of cell nuclei in histopathology images. IEEE Trans Biomed Eng 57(4):841CrossRef
2.
go back to reference Ronneberger O, Fischer P, Brox TN (2015) Convolutional networks for biomedical image segmentation. In: Paper presented at international conference on medical image computing and computer-assisted intervention Ronneberger O, Fischer P, Brox TN (2015) Convolutional networks for biomedical image segmentation. In: Paper presented at international conference on medical image computing and computer-assisted intervention
3.
go back to reference Song TH, Sanchez V, EIDaly H, Rajpoot NM (2017) Dual-channel active contour model for megakaryocytic cell segmentation in bone marrow trephine histology images. IEEE Trans Biomed Eng 64(12):2913CrossRef Song TH, Sanchez V, EIDaly H, Rajpoot NM (2017) Dual-channel active contour model for megakaryocytic cell segmentation in bone marrow trephine histology images. IEEE Trans Biomed Eng 64(12):2913CrossRef
4.
go back to reference Shen W, Zhou M, Yang F, Dong D, Yang C, Zang Y, Tian J (2016) Learning from experts: developing transferable deep features for patient-level lung cancer prediction. In: International conference on medical image computing and computer-assisted intervention. Springer, Berlin, pp 124–131 Shen W, Zhou M, Yang F, Dong D, Yang C, Zang Y, Tian J (2016) Learning from experts: developing transferable deep features for patient-level lung cancer prediction. In: International conference on medical image computing and computer-assisted intervention. Springer, Berlin, pp 124–131
5.
go back to reference Hu S, Hoffman EA, Reinhardt JM (2001) Automatic lung segmentation for accurate quantitation of volumetric X-ray CT images. IEEE Trans Med Imaging 20(6):490CrossRef Hu S, Hoffman EA, Reinhardt JM (2001) Automatic lung segmentation for accurate quantitation of volumetric X-ray CT images. IEEE Trans Med Imaging 20(6):490CrossRef
6.
go back to reference Sluimer I, Schilham A, Prokop M, Van Ginneken B (2006) Computer analysis of computed tomography scans of the lung: a survey. IEEE Trans Med Imaging 25(4):385CrossRef Sluimer I, Schilham A, Prokop M, Van Ginneken B (2006) Computer analysis of computed tomography scans of the lung: a survey. IEEE Trans Med Imaging 25(4):385CrossRef
7.
go back to reference Prasad MN, Brown MS, Ahmad S, Abtin F, Allen J, da Costa I, Kim HJ, McNitt-Gray MF, Goldin JG (2008) Automatic segmentation of lung parenchyma in the presence of diseases based on curvature of ribs. Acad Radiol 15(9):1173CrossRef Prasad MN, Brown MS, Ahmad S, Abtin F, Allen J, da Costa I, Kim HJ, McNitt-Gray MF, Goldin JG (2008) Automatic segmentation of lung parenchyma in the presence of diseases based on curvature of ribs. Acad Radiol 15(9):1173CrossRef
8.
go back to reference Wang S, Zhou M, Liu Z, Liu Z, Gu D, Zang Y, Dong D, Gevaert O, Tian J (2017) Central focused convolutional neural networks: developing a data-driven model for lung nodule segmentation. Med Image Anal 40:172CrossRef Wang S, Zhou M, Liu Z, Liu Z, Gu D, Zang Y, Dong D, Gevaert O, Tian J (2017) Central focused convolutional neural networks: developing a data-driven model for lung nodule segmentation. Med Image Anal 40:172CrossRef
9.
go back to reference Makropoulos A, Gousias IS, Ledig C, Aljabar P, Serag A, Hajnal JV, Edwards AD, Counsell SJ, Rueckert D (2014) Automatic whole brain MRI segmentation of the developing neonatal brain. IEEE Trans Med Imaging 33(9):1818CrossRef Makropoulos A, Gousias IS, Ledig C, Aljabar P, Serag A, Hajnal JV, Edwards AD, Counsell SJ, Rueckert D (2014) Automatic whole brain MRI segmentation of the developing neonatal brain. IEEE Trans Med Imaging 33(9):1818CrossRef
10.
go back to reference Menze BH, Jakab A, Bauer S, Kalpathy-Cramer J, Farahani K, Kirby J, Burren Y, Porz N, Slotboom J, Wiest R et al (2014) The multimodal brain tumor image segmentation benchmark (BRATS). IEEE Trans Med Imaging 34(10):1993CrossRef Menze BH, Jakab A, Bauer S, Kalpathy-Cramer J, Farahani K, Kirby J, Burren Y, Porz N, Slotboom J, Wiest R et al (2014) The multimodal brain tumor image segmentation benchmark (BRATS). IEEE Trans Med Imaging 34(10):1993CrossRef
11.
go back to reference Cherukuri V, Ssenyonga P, Warf BC, Kulkarni AV, Monga V, Schiff SJ (2017) Learning based segmentation of ct brain images: application to postoperative hydrocephalic scans. IEEE Trans Biomed Eng 65(8):1871 Cherukuri V, Ssenyonga P, Warf BC, Kulkarni AV, Monga V, Schiff SJ (2017) Learning based segmentation of ct brain images: application to postoperative hydrocephalic scans. IEEE Trans Biomed Eng 65(8):1871
12.
go back to reference Huang M, Yang W, Wu Y, Jiang J, Chen W, Feng Q (2014) Brain tumor segmentation based on local independent projection-based classification. IEEE Trans Biomed Eng 61(10):2633CrossRef Huang M, Yang W, Wu Y, Jiang J, Chen W, Feng Q (2014) Brain tumor segmentation based on local independent projection-based classification. IEEE Trans Biomed Eng 61(10):2633CrossRef
13.
go back to reference Kass M, Witkin A, Terzopoulos D (1988) Snakes: active contour models. Int J Comput Vis 1(4):321CrossRef Kass M, Witkin A, Terzopoulos D (1988) Snakes: active contour models. Int J Comput Vis 1(4):321CrossRef
14.
go back to reference Adams R, Bischof L (1994) Seeded region growing. IEEE Trans Pattern Anal Mach Intell 16(6):641CrossRef Adams R, Bischof L (1994) Seeded region growing. IEEE Trans Pattern Anal Mach Intell 16(6):641CrossRef
15.
go back to reference Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440 Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
16.
go back to reference Lin TY, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125 Lin TY, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
18.
go back to reference LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278CrossRef LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278CrossRef
19.
go back to reference Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Proceedings of the advances in neural information processing systems, pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Proceedings of the advances in neural information processing systems, pp 1097–1105
20.
go back to reference Yu J, Zhu C, Zhang J, Huang Q, Tao D (2019) Spatial pyramid-enhanced NetVLAD with weighted triplet loss for place recognition. IEEE Trans Neural Netw Learn Syst 31(2):661–674 CrossRef Yu J, Zhu C, Zhang J, Huang Q, Tao D (2019) Spatial pyramid-enhanced NetVLAD with weighted triplet loss for place recognition. IEEE Trans Neural Netw Learn Syst 31(2):661–674 CrossRef
21.
go back to reference Zhang J, Yu J, Tao D (2018) Local deep-feature alignment for unsupervised dimension reduction. IEEE Trans Image Process 27(5):2420MathSciNetCrossRef Zhang J, Yu J, Tao D (2018) Local deep-feature alignment for unsupervised dimension reduction. IEEE Trans Image Process 27(5):2420MathSciNetCrossRef
22.
go back to reference Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99 Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
24.
go back to reference Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708 Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
25.
go back to reference Tian Z, Shen C, Chen H, He T (2019) Fcos: Fully convolutional one-stage object detection. In: Proceedings of the IEEE international conference on computer vision, pp 9627–9636 Tian Z, Shen C, Chen H, He T (2019) Fcos: Fully convolutional one-stage object detection. In: Proceedings of the IEEE international conference on computer vision, pp 9627–9636
26.
go back to reference Yu J, Li J, Yu Z, Huang Q (2019) Multimodal transformer with multi-view visual representation for image captioning. arXiv preprint arXiv:1905.07841 Yu J, Li J, Yu Z, Huang Q (2019) Multimodal transformer with multi-view visual representation for image captioning. arXiv preprint arXiv:​1905.​07841
28.
go back to reference Hong C, Yu J, Zhang J, Jin X, Lee KH (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Ind Inf 15(7):3952CrossRef Hong C, Yu J, Zhang J, Jin X, Lee KH (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Ind Inf 15(7):3952CrossRef
29.
go back to reference Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659MathSciNetCrossRef Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659MathSciNetCrossRef
30.
go back to reference Hong C, Yu J, Chen X (2013) Image-based 3D human pose recovery with locality sensitive sparse retrieval. In: 2013 IEEE international conference on systems, man, and cybernetics (IEEE), pp 2103–2108 Hong C, Yu J, Chen X (2013) Image-based 3D human pose recovery with locality sensitive sparse retrieval. In: 2013 IEEE international conference on systems, man, and cybernetics (IEEE), pp 2103–2108
31.
go back to reference Yu J, Tao D, Wang M, Rui Y (2014) Learning to rank using user clicks and visual features for image retrieval. IEEE Trans Cybern 45(4):767CrossRef Yu J, Tao D, Wang M, Rui Y (2014) Learning to rank using user clicks and visual features for image retrieval. IEEE Trans Cybern 45(4):767CrossRef
32.
go back to reference Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39(12):2481CrossRef Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39(12):2481CrossRef
33.
go back to reference Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890 Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2881–2890
34.
go back to reference Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834CrossRef Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834CrossRef
35.
go back to reference Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J (2018) CE-Net: context encoder network for 2D medical image segmentation. In: Deep learning in medical image analysis and multimodal learning for clinical decision support. Springer, Berlin, pp 3–11 Zhou Z, Siddiquee MMR, Tajbakhsh N, Liang J (2018) CE-Net: context encoder network for 2D medical image segmentation. In: Deep learning in medical image analysis and multimodal learning for clinical decision support. Springer, Berlin, pp 3–11
36.
go back to reference Gu Z, Cheng J, Fu H, Zhou K, Hao H, Zhao Y, Zhang T, Gao S, Liu J (2019) Ssd: single shot multibox detector. IEEE Trans Med Imaging 38(10):2281CrossRef Gu Z, Cheng J, Fu H, Zhou K, Hao H, Zhao Y, Zhang T, Gao S, Liu J (2019) Ssd: single shot multibox detector. IEEE Trans Med Imaging 38(10):2281CrossRef
37.
go back to reference Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Encoder–decoder with atrous separable convolution for semantic image segmentation. In: European conference on computer vision. Springer, Berlin, pp 21–37 Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Encoder–decoder with atrous separable convolution for semantic image segmentation. In: European conference on computer vision. Springer, Berlin, pp 21–37
38.
go back to reference Chen LC, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Network in network. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818 Chen LC, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Network in network. In: Proceedings of the European conference on computer vision (ECCV), pp 801–818
40.
go back to reference Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Spatial transformer networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9 Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Spatial transformer networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
41.
go back to reference Jaderberg M, Simonyan K, Zisserman A et al (2015) V-net: fully convolutional neural networks for volumetric medical image segmentation. In: Proceedings of the advances in neural information processing systems, pp 2017–2025 Jaderberg M, Simonyan K, Zisserman A et al (2015) V-net: fully convolutional neural networks for volumetric medical image segmentation. In: Proceedings of the advances in neural information processing systems, pp 2017–2025
42.
go back to reference Milletari F, Navab N, Ahmadi SA (2016) Unet++: a nested u-net architecture for medical image segmentation In: 2016 fourth international conference on 3D vision (3DV). IEEE, pp 565–571 Milletari F, Navab N, Ahmadi SA (2016) Unet++: a nested u-net architecture for medical image segmentation In: 2016 fourth international conference on 3D vision (3DV). IEEE, pp 565–571
43.
go back to reference He K, Gkioxari G, Dollár P, Girshick R (2017) Deep residual learning for image recognition. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969 He K, Gkioxari G, Dollár P, Girshick R (2017) Deep residual learning for image recognition. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969
44.
go back to reference He K, Zhang X, Ren S, Sun J (2016) Mask r-cnn. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 He K, Zhang X, Ren S, Sun J (2016) Mask r-cnn. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Metadata
Title
Md-Net: Multi-scale Dilated Convolution Network for CT Images Segmentation
Authors
Haiying Xia
Weifan Sun
Shuxiang Song
Xiangwei Mou
Publication date
06-04-2020
Publisher
Springer US
Published in
Neural Processing Letters / Issue 3/2020
Print ISSN: 1370-4621
Electronic ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-020-10230-x

Other articles of this Issue 3/2020

Neural Processing Letters 3/2020 Go to the issue