1 Introduction
2 Related work
2.1 Medical image segmentation
2.2 Feature alignment
2.3 Attention mechanism
3 Method
3.1 PVT-based encoder
3.2 Feature-aligned local enhancement module
3.3 Progressive local-induced decoder
3.4 Mutual information loss
Method | Seen dataset | Unseen dataset | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
CVC-ClinicDB | Kvasir | CVC-ColonDB | ETIS- LaribPolypDB | CVC-300 | ||||||
mDice | mIoU | mDice | mIoU | mDice | mIoU | mDice | mIoU | mDice | mIoU | |
FCN [5] | 0.825 | 0.747 | 0.775 | 0.686 | 0.578 | 0.481 | 0.379 | 0.313 | 0.660 | 0.558 |
U-Net [7] | 0.842 | 0.775 | 0.818 | 0.746 | 0.512 | 0.444 | 0.398 | 0.335 | 0.710 | 0.627 |
UNet++ [30] | 0.846 | 0.774 | 0.821 | 0.743 | 0.599 | 0.499 | 0.456 | 0.375 | 0.707 | 0.624 |
AttentionU-Net [11] | 0.809 | 0.744 | 0.782 | 0.694 | 0.614 | 0.524 | 0.440 | 0.360 | 0.686 | 0.580 |
DCRNet [58] | 0.896 | 0.844 | 0.886 | 0.825 | 0.704 | 0.631 | 0.556 | 0.496 | 0.856 | 0.788 |
SegNet [8] | 0.915 | 0.857 | 0.878 | 0.814 | 0.647 | 0.570 | 0.612 | 0.529 | 0.841 | 0.773 |
SFA [1] | 0.700 | 0.607 | 0.723 | 0.611 | 0.469 | 0.347 | 0.297 | 0.217 | 0.467 | 0.329 |
PraNet [59] | 0.899 | 0.849 | 0.898 | 0.840 | 0.709 | 0.640 | 0.628 | 0.567 | 0.871 | 0.797 |
ACSNet [39] | 0.912 | 0.858 | 0.907 | 0.850 | 0.709 | 0.643 | 0.609 | 0.537 | 0.862 | 0.784 |
EU-Net [60] | 0.902 | 0.846 | 0.908 | 0.854 | 0.756 | 0.681 | 0.687 | 0.609 | 0.837 | 0.765 |
SANet [20] | 0.916 | 0.859 | 0.904 | 0.847 | 0.753 | 0.670 | 0.750 | 0.654 | 0.888 | 0.815 |
BLE-Net [61] | 0.926 | 0.878 | 0.905 | 0.854 | 0.731 | 0.658 | 0.673 | 0.594 | 0.879 | 0.805 |
CaraNet [22] | 0.936 | 0.887 | 0.918 | 0.865 | 0.773 | 0.689 | 0.747 | 0.672 | 0.903 | 0.838 |
SETR-PUP [18] | 0.934 | 0.885 | 0.911 | 0.854 | 0.773 | 0.690 | 0.726 | 0.646 | 0.889 | 0.814 |
TransUnet [43] | 0.935 | 0.887 | 0.913 | 0.857 | 0.781 | 0.699 | 0.731 | 0.660 | 0.893 | 0.824 |
LET-Net(Ours) | 0.945 | 0.899 | 0.926 | 0.876 | 0.795 | 0.717 | 0.773 | 0.698 | 0.907 | 0.839 |
4 Experiments
4.1 Experimental setup
4.1.1 Implementation details
4.1.2 Datasets
4.1.3 Evaluation metrics
4.2 Experimental results
4.2.1 Polyp segmentation
Method | Benign lesion | Malignant lesion | ||||||
---|---|---|---|---|---|---|---|---|
Accuracy | Jaccard | Precision | Dice | Accuracy | Jaccard | Precision | Dice | |
U-Net [7] | 0.966 | 0.615 | 0.750 | 0.705 | 0.901 | 0.511 | 0.650 | 0.635 |
STAN [21] | 0.969 | 0.643 | 0.744 | 0.723 | 0.910 | 0.511 | 0.647 | 0.626 |
AttentionU-Net [11] | 0.969 | 0.650 | 0.752 | 0.733 | 0.912 | 0.511 | 0.616 | 0.630 |
Abraham et al. [68] | 0.969 | 0.667 | 0.767 | 0.748 | 0.915 | 0.541 | 0.675 | 0.658 |
UNet++ [30] | 0.971 | 0.683 | 0.759 | 0.756 | 0.915 | 0.540 | 0.655 | 0.655 |
UNet3+ [69] | 0.971 | 0.676 | 0.756 | 0.751 | 0.916 | 0.548 | 0.658 | 0.662 |
SegNet [8] | 0.972 | 0.679 | 0.770 | 0.755 | 0.922 | 0.549 | 0.638 | 0.659 |
PraNet [59] | 0.972 | 0.691 | 0.799 | 0.763 | 0.925 | 0.582 | 0.763 | 0.698 |
CPF-Net [29] | 0.973 | 0.699 | 0.801 | 0.766 | 0.927 | 0.605 | 0.755 | 0.716 |
C-Net [2] | 0.975 | 0.724 | 0.827 | 0.794 | 0.926 | 0.597 | 0.757 | 0.699 |
LET-Net(Ours) | 0.977 | 0.740 | 0.835 | 0.815 | 0.930 | 0.615 | 0.772 | 0.727 |
4.2.2 Breast lesion segmentation
Method | Seen dataset | Unseen dataset | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
CVC-ClinicDB | Kvasir | CVC-ColonDB | ETIS- LaribPolypDB | CVC-300 | ||||||
mDice | mIoU | mDice | mIoU | mDice | mIoU | mDice | mIoU | mDice | mIoU | |
w/o FLE | 0.936 | 0.887 | 0.918 | 0.871 | 0.779 | 0.698 | 0.751 | 0.674 | 0.884 | 0.816 |
w/o LRR | 0.940 | 0.894 | 0.910 | 0.859 | 0.790 | 0.711 | 0.759 | 0.681 | 0.890 | 0.821 |
LET-Net | 0.945 | 0.899 | 0.926 | 0.876 | 0.795 | 0.717 | 0.773 | 0.698 | 0.907 | 0.839 |
Loss setting | Seen dataset | Unseen dataset | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
CVC-ClinicDB | Kvasir | CVC-ColonDB | ETIS- LaribPolypDB | CVC-300 | ||||||
mDice | mIoU | mDice | mIoU | mDice | mIoU | mDice | mIoU | mDice | mIoU | |
w/o \({L_{\textrm{PPA}}}\) | 0.937 | 0.888 | 0.917 | 0.864 | 0.782 | 0.697 | 0.737 | 0.663 | 0.885 | 0.812 |
w/o \({L_{\textrm{VSD}}}\) | 0.940 | 0.892 | 0.923 | 0.872 | 0.785 | 0.702 | 0.762 | 0.688 | 0.895 | 0.826 |
w/o \({L_{\textrm{PPA}}}\, \& \,{L_{\textrm{VSD}}}\) | 0.934 | 0.882 | 0.914 | 0.861 | 0.772 | 0.692 | 0.716 | 0.648 | 0.879 | 0.807 |
LET-Net | 0.945 | 0.899 | 0.926 | 0.876 | 0.795 | 0.717 | 0.773 | 0.698 | 0.907 | 0.839 |