Skip to main content
Erschienen in: Neural Computing and Applications 5/2023

11.05.2021 | S.I. : Deep Geospatial Data Understanding

Joint image and feature adaptative attention-aware networks for cross-modality semantic segmentation

verfasst von: Qihuang Zhong, Fanzhou Zeng, Fei Liao, Juhua Liu, Bo Du, Jedi S. Shang

Erschienen in: Neural Computing and Applications | Ausgabe 5/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep learning-based methods are widely used for the task of semantic segmentation in recent years. However, due to the difficulty and labor cost of collecting pixel-level annotations, it is hard to acquire sufficient training images for a certain imaging modality, which greatly hinders the performance of these methods. The intuitive solution to this issue is to train a pre-trained model on label-rich imaging modality (source domain) and then apply the pre-trained model to the label-poor imaging modality (target domain). Unsurprisingly, since the severe domain shift between different modalities, the pre-trained model would perform poorly on the target imaging modality. To this end, we propose a novel unsupervised domain adaptation framework, called Joint Image and Feature Adaptive Attention-aware Networks (JIFAAN), to alleviate the domain shift for cross-modality semantic segmentation. The proposed framework mainly consists of two procedures. The first procedure is image adaptation, which transforms the source domain images into target-like images using the adversarial learning with cycle-consistency constraint. For further bridging the gap between transformed images and target domain images, the second procedure employs feature adaptation to extract the domain-invariant features and thus aligns the distribution in feature space. In particular, we introduce an attention module in the feature adaptation to focus on noteworthy regions and generate attention-aware results. Lastly, we combine two procedures in an end-to-end manner. Experiments on two cross-modality semantic segmentation datasets demonstrate the effectiveness of our proposed framework. Specifically, JIFAAN surpasses the cutting-edge domain adaptation methods and achieves the state-of-the-art performance.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhang, Qiming, et al (2019) "Category anchor-guided unsupervised domain adaptation for semantic segmentation." Advances in Neural Information Processing Systems. 433--443. Zhang, Qiming, et al (2019) "Category anchor-guided unsupervised domain adaptation for semantic segmentation." Advances in Neural Information Processing Systems. 433--443.
2.
Zurück zum Zitat Gros, Charley, et al (2019) "Automatic segmentation of the spinal cord and intramedullary multiple sclerosis lesions with convolutional neural networks." Neuroimage 184; 901–915. Gros, Charley, et al (2019) "Automatic segmentation of the spinal cord and intramedullary multiple sclerosis lesions with convolutional neural networks." Neuroimage 184; 901–915.
3.
Zurück zum Zitat Lee, Yongbum, et al (2001) "Automated detection of pulmonary nodules in helical CT images based on an improved template-matching technique." IEEE Transactions on medical imaging 20.7; 595–604. Lee, Yongbum, et al (2001) "Automated detection of pulmonary nodules in helical CT images based on an improved template-matching technique." IEEE Transactions on medical imaging 20.7; 595–604.
4.
Zurück zum Zitat Coupé, Pierrick, et al (2011) "Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation." NeuroImage 54.2; 940–954. Coupé, Pierrick, et al (2011) "Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation." NeuroImage 54.2; 940–954.
5.
Zurück zum Zitat Litjens, Geert, et al (2017) "A survey on deep learning in medical image analysis." Medical image analysis 42; 60–88. Litjens, Geert, et al (2017) "A survey on deep learning in medical image analysis." Medical image analysis 42; 60–88.
6.
Zurück zum Zitat Long, Jonathan (2015) Evan Shelhamer and Trevor Darrell. "Fully convolutional networks for semantic segmentation." Proceedings of the IEEE conference on computer vision and pattern recognition. Long, Jonathan (2015) Evan Shelhamer and Trevor Darrell. "Fully convolutional networks for semantic segmentation." Proceedings of the IEEE conference on computer vision and pattern recognition.
7.
Zurück zum Zitat Çiçek, Özgün, et al (2016) "3D U-Net: learning dense volumetric segmentation from sparse annotation." International conference on medical image computing and computer-assisted intervention. Springer, Cham. Çiçek, Özgün, et al (2016) "3D U-Net: learning dense volumetric segmentation from sparse annotation." International conference on medical image computing and computer-assisted intervention. Springer, Cham.
8.
Zurück zum Zitat Ronneberger, Olaf, Philipp Fischer and Thomas Brox. "U-net: Convolutional networks for biomedical image segmentation." International Conference on Medical image computing and computer-assisted intervention. Springer, Cham. (2015). Ronneberger, Olaf, Philipp Fischer and Thomas Brox. "U-net: Convolutional networks for biomedical image segmentation." International Conference on Medical image computing and computer-assisted intervention. Springer, Cham. (2015).
9.
Zurück zum Zitat Pham DL, Chenyang Xu, Prince JL (2000) Current methods in medical image segmentation. Annu Rev Biomed Eng 2(1):315–337CrossRef Pham DL, Chenyang Xu, Prince JL (2000) Current methods in medical image segmentation. Annu Rev Biomed Eng 2(1):315–337CrossRef
10.
Zurück zum Zitat Kaus, Michael R., et al (2001) "Automated segmentation of MR images of brain tumors." Radiology 218.2; 586–591. Kaus, Michael R., et al (2001) "Automated segmentation of MR images of brain tumors." Radiology 218.2; 586–591.
11.
Zurück zum Zitat Chen, Xu, et al (2020) "Anatomy-Regularized Representation Learning for Cross-Modality Medical Image Segmentation." IEEE Transactions on Medical Imaging 40.1; 274–285. Chen, Xu, et al (2020) "Anatomy-Regularized Representation Learning for Cross-Modality Medical Image Segmentation." IEEE Transactions on Medical Imaging 40.1; 274–285.
12.
Zurück zum Zitat Zhang, Zizhao, Lin Yang and Yefeng Zheng (2018) "Translating and segmenting multimodal medical volumes with cycle-and shape-consistency generative adversarial network." Proceedings of the IEEE conference on computer vision and pattern recognition. Zhang, Zizhao, Lin Yang and Yefeng Zheng (2018) "Translating and segmenting multimodal medical volumes with cycle-and shape-consistency generative adversarial network." Proceedings of the IEEE conference on computer vision and pattern recognition.
13.
Zurück zum Zitat Huo, Yuankai, et al (2018) "Synseg-net: Synthetic segmentation without target modality ground truth." IEEE transactions on medical imaging 38.4; 1016–1025. Huo, Yuankai, et al (2018) "Synseg-net: Synthetic segmentation without target modality ground truth." IEEE transactions on medical imaging 38.4; 1016–1025.
14.
Zurück zum Zitat Chen, Cheng, et al (2019) "Synergistic image and feature adaptation: Towards cross-modality domain adaptation for medical image segmentation." Proceedings of the AAAI Conference on Artificial Intelligence. 33. Chen, Cheng, et al (2019) "Synergistic image and feature adaptation: Towards cross-modality domain adaptation for medical image segmentation." Proceedings of the AAAI Conference on Artificial Intelligence. 33.
15.
Zurück zum Zitat Xu, Yonghao, et al (2019) "Self-ensembling attention networks: Addressing domain shift for semantic segmentation." Proceedings of the AAAI Conference on Artificial Intelligence. 33. Xu, Yonghao, et al (2019) "Self-ensembling attention networks: Addressing domain shift for semantic segmentation." Proceedings of the AAAI Conference on Artificial Intelligence. 33.
16.
Zurück zum Zitat Zhu, Jun-Yan, et al (2017) "Unpaired image-to-image translation using cycle-consistent adversarial networks." Proceedings of the IEEE international conference on computer vision. Zhu, Jun-Yan, et al (2017) "Unpaired image-to-image translation using cycle-consistent adversarial networks." Proceedings of the IEEE international conference on computer vision.
17.
Zurück zum Zitat Chen, Cheng, et al (2018) "Semantic-aware generative adversarial nets for unsupervised domain adaptation in chest x-ray segmentation." International workshop on machine learning in medical imaging. Springer, Cham. Chen, Cheng, et al (2018) "Semantic-aware generative adversarial nets for unsupervised domain adaptation in chest x-ray segmentation." International workshop on machine learning in medical imaging. Springer, Cham.
18.
Zurück zum Zitat Bousmalis, Konstantinos, et al (2017) "Unsupervised pixel-level domain adaptation with generative adversarial networks." Proceedings of the IEEE conference on computer vision and pattern recognition. Bousmalis, Konstantinos, et al (2017) "Unsupervised pixel-level domain adaptation with generative adversarial networks." Proceedings of the IEEE conference on computer vision and pattern recognition.
19.
Zurück zum Zitat Hoffman, Judy, et al (2018) "Cycada: Cycle-consistent adversarial domain adaptation." International conference on machine learning. PMLR. Hoffman, Judy, et al (2018) "Cycada: Cycle-consistent adversarial domain adaptation." International conference on machine learning. PMLR.
20.
Zurück zum Zitat Tsai, Yi-Hsuan, et al (2018) "Learning to adapt structured output space for semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Tsai, Yi-Hsuan, et al (2018) "Learning to adapt structured output space for semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
21.
Zurück zum Zitat Dou, Qi, et al (2016) "Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss." IJCAI. 2018. Ganin, Yaroslav, et al. "Domain-adversarial training of neural networks." The Journal of Machine Learning Research 17.1; 2096–2030. Dou, Qi, et al (2016) "Unsupervised Cross-Modality Domain Adaptation of ConvNets for Biomedical Image Segmentations with Adversarial Loss." IJCAI. 2018. Ganin, Yaroslav, et al. "Domain-adversarial training of neural networks." The Journal of Machine Learning Research 17.1; 2096–2030.
22.
Zurück zum Zitat Menze, Bjoern H., et al (2014) "The multimodal brain tumor image segmentation benchmark (BRATS)." IEEE transactions on medical imaging 34.10; 1993–2024. Menze, Bjoern H., et al (2014) "The multimodal brain tumor image segmentation benchmark (BRATS)." IEEE transactions on medical imaging 34.10; 1993–2024.
23.
Zurück zum Zitat Zhuang X, Shen J (2016) Multi-scale patch and multi-modality atlases for whole heart segmentation of MRI. Med Image Anal 31:77–87CrossRef Zhuang X, Shen J (2016) Multi-scale patch and multi-modality atlases for whole heart segmentation of MRI. Med Image Anal 31:77–87CrossRef
24.
Zurück zum Zitat Zhang, Wenlu, et al (2015) "Deep convolutional neural networks for multi-modality isointense infant brain image segmentation." NeuroImage 108; 214–224. Zhang, Wenlu, et al (2015) "Deep convolutional neural networks for multi-modality isointense infant brain image segmentation." NeuroImage 108; 214–224.
25.
Zurück zum Zitat Bar, Yaniv, et al (2015) "Deep learning with non-medical training used for chest pathology identification." Medical Imaging 2015: Computer-Aided Diagnosis. Vol. 9414. International Society for Optics and Photonics. Bar, Yaniv, et al (2015) "Deep learning with non-medical training used for chest pathology identification." Medical Imaging 2015: Computer-Aided Diagnosis. Vol. 9414. International Society for Optics and Photonics.
26.
Zurück zum Zitat Bergamo A, Torresani L, Fitzgibbon A (2011) Picodes: Learning a compact code for novel-category recognition. Adv Neural Inf Process Syst 24:2088–2096 Bergamo A, Torresani L, Fitzgibbon A (2011) Picodes: Learning a compact code for novel-category recognition. Adv Neural Inf Process Syst 24:2088–2096
27.
Zurück zum Zitat Prasoon, Adhish, et al (2013) "Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network." International conference on medical image computing and computer-assisted intervention. Springer, Berlin, Heidelberg. Prasoon, Adhish, et al (2013) "Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network." International conference on medical image computing and computer-assisted intervention. Springer, Berlin, Heidelberg.
28.
Zurück zum Zitat Roth, Holger R., et al (2014) "A new 2.5 D representation for lymph node detection using random sets of deep convolutional neural network observations." International conference on medical image computing and computer-assisted intervention. Springer, Cham. Roth, Holger R., et al (2014) "A new 2.5 D representation for lymph node detection using random sets of deep convolutional neural network observations." International conference on medical image computing and computer-assisted intervention. Springer, Cham.
29.
Zurück zum Zitat Dou, Qi, et al (2017) "3D deeply supervised network for automated segmentation of volumetric medical images." Medical image analysis 41; 40–54. Dou, Qi, et al (2017) "3D deeply supervised network for automated segmentation of volumetric medical images." Medical image analysis 41; 40–54.
30.
Zurück zum Zitat Kamnitsas, Konstantinos, et al (2015) "Multi-scale 3D convolutional neural networks for lesion segmentation in brain MRI." Ischemic stroke lesion segmentation 13; 46. Kamnitsas, Konstantinos, et al (2015) "Multi-scale 3D convolutional neural networks for lesion segmentation in brain MRI." Ischemic stroke lesion segmentation 13; 46.
31.
Zurück zum Zitat Perone, Christian S., et al (2019) "Unsupervised domain adaptation for medical imaging segmentation with self-ensembling." NeuroImage 194; 1–11. Perone, Christian S., et al (2019) "Unsupervised domain adaptation for medical imaging segmentation with self-ensembling." NeuroImage 194; 1–11.
32.
Zurück zum Zitat Gordienko, Yu, et al (2018) "Deep learning with lung segmentation and bone shadow exclusion techniques for chest x-ray analysis of lung cancer." International Conference on Computer Science, Engineering and Education Applications. Springer, Cham. Gordienko, Yu, et al (2018) "Deep learning with lung segmentation and bone shadow exclusion techniques for chest x-ray analysis of lung cancer." International Conference on Computer Science, Engineering and Education Applications. Springer, Cham.
33.
Zurück zum Zitat Zeng, Guodong, et al (2017) "3D U-net with multi-level deep supervision: fully automatic segmentation of proximal femur in 3D MR images." International workshop on machine learning in medical imaging. Springer, Cham. Zeng, Guodong, et al (2017) "3D U-net with multi-level deep supervision: fully automatic segmentation of proximal femur in 3D MR images." International workshop on machine learning in medical imaging. Springer, Cham.
34.
Zurück zum Zitat Richter, Stephan R., et al (2016) "Playing for data: Ground truth from computer games." European conference on computer vision. Springer, Cham. Richter, Stephan R., et al (2016) "Playing for data: Ground truth from computer games." European conference on computer vision. Springer, Cham.
35.
Zurück zum Zitat Ros, German, et al (2016) "The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes." Proceedings of the IEEE conference on computer vision and pattern recognition. Ros, German, et al (2016) "The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes." Proceedings of the IEEE conference on computer vision and pattern recognition.
36.
Zurück zum Zitat Cordts, Marius, et al (2016) "The cityscapes dataset for semantic urban scene understanding." Proceedings of the IEEE conference on computer vision and pattern recognition. Cordts, Marius, et al (2016) "The cityscapes dataset for semantic urban scene understanding." Proceedings of the IEEE conference on computer vision and pattern recognition.
37.
Zurück zum Zitat Zhao, Can, et al (2017) "Whole brain segmentation and labeling from CT using synthetic MR images." International Workshop on Machine Learning in Medical Imaging. Springer, Cham. Zhao, Can, et al (2017) "Whole brain segmentation and labeling from CT using synthetic MR images." International Workshop on Machine Learning in Medical Imaging. Springer, Cham.
38.
Zurück zum Zitat Ghifary, Muhammad, et al (2016) "Deep reconstruction-classification networks for unsupervised domain adaptation." European Conference on Computer Vision. Springer, Cham. Ghifary, Muhammad, et al (2016) "Deep reconstruction-classification networks for unsupervised domain adaptation." European Conference on Computer Vision. Springer, Cham.
39.
Zurück zum Zitat Sankaranarayanan, Swami, et al (2018) "Learning from synthetic data: Addressing domain shift for semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Sankaranarayanan, Swami, et al (2018) "Learning from synthetic data: Addressing domain shift for semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
40.
Zurück zum Zitat Zhang, Yiheng, et al (2018) "Fully convolutional adaptation networks for semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Zhang, Yiheng, et al (2018) "Fully convolutional adaptation networks for semantic segmentation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
41.
Zurück zum Zitat Chen, et al (2016) "Attention to scale: Scale-aware semantic image segmentation." Proceedings of the IEEE conference on computer vision and pattern recognition. Chen, et al (2016) "Attention to scale: Scale-aware semantic image segmentation." Proceedings of the IEEE conference on computer vision and pattern recognition.
42.
Zurück zum Zitat Wang, et al (2017) "Residual attention network for image classification." Proceedings of the IEEE conference on computer vision and pattern recognition. Wang, et al (2017) "Residual attention network for image classification." Proceedings of the IEEE conference on computer vision and pattern recognition.
43.
Zurück zum Zitat Ma, Benteng, et al (2020) "Auto Learning Attention." Advances in Neural Information Processing Systems 33. Ma, Benteng, et al (2020) "Auto Learning Attention." Advances in Neural Information Processing Systems 33.
44.
Zurück zum Zitat Chen, Cheng, et al (2020) "Unsupervised bidirectional cross-modality adaptation via deeply synergistic image and feature alignment for medical image segmentation." IEEE transactions on medical imaging 39.7; 2494–2505. Chen, Cheng, et al (2020) "Unsupervised bidirectional cross-modality adaptation via deeply synergistic image and feature alignment for medical image segmentation." IEEE transactions on medical imaging 39.7; 2494–2505.
45.
Zurück zum Zitat Isola, Phillip, et al (2017) "Image-to-image translation with conditional adversarial networks." Proceedings of the IEEE conference on computer vision and pattern recognition. Isola, Phillip, et al (2017) "Image-to-image translation with conditional adversarial networks." Proceedings of the IEEE conference on computer vision and pattern recognition.
46.
Zurück zum Zitat Yu, Fisher (2017) Vladlen Koltun and Thomas Funkhouser. "Dilated residual networks." Proceedings of the IEEE conference on computer vision and pattern recognition. Yu, Fisher (2017) Vladlen Koltun and Thomas Funkhouser. "Dilated residual networks." Proceedings of the IEEE conference on computer vision and pattern recognition.
Metadaten
Titel
Joint image and feature adaptative attention-aware networks for cross-modality semantic segmentation
verfasst von
Qihuang Zhong
Fanzhou Zeng
Fei Liao
Juhua Liu
Bo Du
Jedi S. Shang
Publikationsdatum
11.05.2021
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 5/2023
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-021-06064-w

Weitere Artikel der Ausgabe 5/2023

Neural Computing and Applications 5/2023 Zur Ausgabe

Premium Partner