Skip to main content
Erschienen in: The Journal of Supercomputing 7/2024

02.12.2023

AAR:Attention Remodulation for Weakly Supervised Semantic Segmentation

verfasst von: Yu-e Lin, Houguo Li, Xingzhu Liang, Mengfan Li, Huilin Liu

Erschienen in: The Journal of Supercomputing | Ausgabe 7/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Weakly Supervised Semantic Segmentation is a crucial task in computer vision. However, existing methods that utilize Class Activation Maps (CAMs) with classification tasks can only identify a small part of the region. To address this limitation, we propose a novel Attention Activation Remodulation (AAR) scheme that leverages traditional CAMs and the remodulation branch to obtain weighted CAMs for recalibrated supervision. The AAR scheme re-arranges important features’ distribution from the channel and space perspectives, which regulates segmentation-oriented activation responses. In addition, we propose a Feature Pixel Extraction Module (FPEM) that utilizes contextual information to improve pixel prediction. Furthermore, the proposed scheme can be combined with other methods to improve overall performance. Extensive experiments on the PASCAL VOC 2012 dataset demonstrate the effectiveness of the AAR mechanism and FPEM module.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Dai J, He K, Sun J (2015) Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1635–1643 Dai J, He K, Sun J (2015) Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1635–1643
2.
Zurück zum Zitat Li S, Liu Y, Zhang Y et al (2023) Adaptive generation of weakly supervised semantic segmentation for object detection. Neural Process Lett 55(1):657–670CrossRef Li S, Liu Y, Zhang Y et al (2023) Adaptive generation of weakly supervised semantic segmentation for object detection. Neural Process Lett 55(1):657–670CrossRef
3.
Zurück zum Zitat Vernaza P, Chandraker M (2017) Learning random-walk label propagation for weakly-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7158–7166 Vernaza P, Chandraker M (2017) Learning random-walk label propagation for weakly-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7158–7166
4.
Zurück zum Zitat Qian R, Wei Y, Shi H et al (2019) Weakly supervised scene parsing with point-based distance metric learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 8843–8850 Qian R, Wei Y, Shi H et al (2019) Weakly supervised scene parsing with point-based distance metric learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 8843–8850
5.
Zurück zum Zitat Bearman A, Russakovsky O, Ferrari V et al (2016) What’s the point: semantic segmentation with point supervision. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, Proceedings, Part VII 14, Springer, pp 549–565 Bearman A, Russakovsky O, Ferrari V et al (2016) What’s the point: semantic segmentation with point supervision. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, Proceedings, Part VII 14, Springer, pp 549–565
6.
Zurück zum Zitat Jiang PT, Yang Y, Hou Q et al (2022) L2g: a simple local-to-global knowledge transfer framework for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 16886–16896 Jiang PT, Yang Y, Hou Q et al (2022) L2g: a simple local-to-global knowledge transfer framework for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 16886–16896
7.
Zurück zum Zitat Cao Z, Gao Y, Zhang J (2022) Scale-aware attention network for weakly supervised semantic segmentation. Neurocomputing 492:34–49CrossRef Cao Z, Gao Y, Zhang J (2022) Scale-aware attention network for weakly supervised semantic segmentation. Neurocomputing 492:34–49CrossRef
8.
Zurück zum Zitat Zhou L, Gong C, Liu Z et al (2020) Sal: selection and attention losses for weakly supervised semantic segmentation. IEEE Transact Multimed 23:1035–1048CrossRef Zhou L, Gong C, Liu Z et al (2020) Sal: selection and attention losses for weakly supervised semantic segmentation. IEEE Transact Multimed 23:1035–1048CrossRef
9.
Zurück zum Zitat Jiang PT, Han LH, Hou Q et al (2021) Online attention accumulation for weakly supervised semantic segmentation. IEEE Transact Pattern Anal Mach Intell 44(10):7062–7077CrossRef Jiang PT, Han LH, Hou Q et al (2021) Online attention accumulation for weakly supervised semantic segmentation. IEEE Transact Pattern Anal Mach Intell 44(10):7062–7077CrossRef
10.
Zurück zum Zitat Wang Y, Zhang J, Kan M et al (2020) Self-supervised equivariant attention mechanism for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 12275–12284 Wang Y, Zhang J, Kan M et al (2020) Self-supervised equivariant attention mechanism for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 12275–12284
11.
Zurück zum Zitat Wei Y, Feng J, Liang X et al (2017) Object region mining with adversarial erasing: A simple classification to semantic segmentation approach. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1568–1576 Wei Y, Feng J, Liang X et al (2017) Object region mining with adversarial erasing: A simple classification to semantic segmentation approach. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1568–1576
12.
Zurück zum Zitat Ahn J, Kwak S (2018) Learning pixel-level semantic affinity with image-level supervision for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4981–4990 Ahn J, Kwak S (2018) Learning pixel-level semantic affinity with image-level supervision for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4981–4990
13.
Zurück zum Zitat Chen Q, Yang L, Lai JH et al (2022) Self-supervised image-specific prototype exploration for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4288–4298 Chen Q, Yang L, Lai JH et al (2022) Self-supervised image-specific prototype exploration for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4288–4298
14.
Zurück zum Zitat Fan J, Zhang Z (2022) Memory-based cross-image contexts for weakly supervised semantic segmentation. IEEE transactions on pattern analysis and machine intelligence Fan J, Zhang Z (2022) Memory-based cross-image contexts for weakly supervised semantic segmentation. IEEE transactions on pattern analysis and machine intelligence
15.
Zurück zum Zitat Jo S, Yu IJ, Kim K (2023) Mars: model-agnostic biased object removal without additional supervision for weakly-supervised semantic segmentation. ArXiv Preprint ArXiv:2304.09913 Jo S, Yu IJ, Kim K (2023) Mars: model-agnostic biased object removal without additional supervision for weakly-supervised semantic segmentation. ArXiv Preprint ArXiv:​2304.​09913
16.
Zurück zum Zitat He J, Cheng L, Fang C et al (2023) Mitigating undisciplined over-smoothing in transformer for weakly supervised semantic segmentation. ArXiv Preprint ArXiv:2305.03112 He J, Cheng L, Fang C et al (2023) Mitigating undisciplined over-smoothing in transformer for weakly supervised semantic segmentation. ArXiv Preprint ArXiv:​2305.​03112
17.
Zurück zum Zitat Xu R, Wang C, Sun J et al (2023) Self correspondence distillation for end-to-end weakly-supervised semantic segmentation. ArXiv Preprint ArXiv:2302.13765 Xu R, Wang C, Sun J et al (2023) Self correspondence distillation for end-to-end weakly-supervised semantic segmentation. ArXiv Preprint ArXiv:​2302.​13765
18.
Zurück zum Zitat Guo MH, Xu TX, Liu JJ et al (2022) Attention mechanisms in computer vision: a survey. Comput Vis Med 8(3):331–368CrossRef Guo MH, Xu TX, Liu JJ et al (2022) Attention mechanisms in computer vision: a survey. Comput Vis Med 8(3):331–368CrossRef
19.
Zurück zum Zitat Niu Z, Zhong G, Yu H (2021) A review on the attention mechanism of deep learning. Neurocomputing 452:48–62CrossRef Niu Z, Zhong G, Yu H (2021) A review on the attention mechanism of deep learning. Neurocomputing 452:48–62CrossRef
20.
Zurück zum Zitat Ma WX (2022) Nonlocal integrable MKDV equations by two nonlocal reductions and their soliton solutions. J Geom Phys 177:104522MathSciNetCrossRef Ma WX (2022) Nonlocal integrable MKDV equations by two nonlocal reductions and their soliton solutions. J Geom Phys 177:104522MathSciNetCrossRef
21.
Zurück zum Zitat Zhu Z, Xu M, Bai S et al (2019) Asymmetric non-local neural networks for semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 593–602 Zhu Z, Xu M, Bai S et al (2019) Asymmetric non-local neural networks for semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 593–602
22.
Zurück zum Zitat Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7132–7141 Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7132–7141
23.
Zurück zum Zitat Wang Q, Wu B, Zhu P et al (2020) Eca-net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 11534–11542 Wang Q, Wu B, Zhu P et al (2020) Eca-net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 11534–11542
24.
Zurück zum Zitat Woo S, Park J, Lee JY et al (2018) Cbam: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 3–19 Woo S, Park J, Lee JY et al (2018) Cbam: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 3–19
25.
Zurück zum Zitat Cao Y, Xu J, Lin S et al (2019) Gcnet: Non-local networks meet squeeze-excitation networks and beyond. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 0–0 Cao Y, Xu J, Lin S et al (2019) Gcnet: Non-local networks meet squeeze-excitation networks and beyond. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 0–0
26.
Zurück zum Zitat Wei Y, Feng J, Liang X et al (2017) Object region mining with adversarial erasing: a simple classification to semantic segmentation approach. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1568–1576 Wei Y, Feng J, Liang X et al (2017) Object region mining with adversarial erasing: a simple classification to semantic segmentation approach. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 1568–1576
27.
Zurück zum Zitat Xie J, Xiang J, Chen J et al (2022) C2am: contrastive learning of class-agnostic activation map for weakly supervised object localization and semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 989–998 Xie J, Xiang J, Chen J et al (2022) C2am: contrastive learning of class-agnostic activation map for weakly supervised object localization and semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 989–998
28.
Zurück zum Zitat Jiang PT, Hou Q, Cao Y et al (2019) Integral object mining via online attention accumulation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2070–2079 Jiang PT, Hou Q, Cao Y et al (2019) Integral object mining via online attention accumulation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2070–2079
29.
Zurück zum Zitat Wang X, Girshick R, Gupta A et al (2018) Non-local neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7794–7803 Wang X, Girshick R, Gupta A et al (2018) Non-local neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 7794–7803
30.
Zurück zum Zitat He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 770–778 He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 770–778
31.
Zurück zum Zitat Yang Y, Wan F, Ye Q et al (2022) Weakly supervised learning of instance segmentation with confidence feedback. In: Part I (ed) Artificial Intelligence: Second CAAI International Conference, CICAI 2022, Beijing, China, Revised Selected Papers. Springer, pp 392–403 Yang Y, Wan F, Ye Q et al (2022) Weakly supervised learning of instance segmentation with confidence feedback. In: Part I (ed) Artificial Intelligence: Second CAAI International Conference, CICAI 2022, Beijing, China, Revised Selected Papers. Springer, pp 392–403
32.
Zurück zum Zitat Chen LC, Papandreou G, Kokkinos I et al (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFS. IEEE Transact Pattern Anal Mach Intell 40(4):834–848CrossRef Chen LC, Papandreou G, Kokkinos I et al (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFS. IEEE Transact Pattern Anal Mach Intell 40(4):834–848CrossRef
33.
Zurück zum Zitat Ahn J, Cho S, Kwak S (2019) Weakly supervised learning of instance segmentation with inter-pixel relations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2209–2218 Ahn J, Cho S, Kwak S (2019) Weakly supervised learning of instance segmentation with inter-pixel relations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2209–2218
34.
Zurück zum Zitat Fan J, Zhang Z, Tan T et al (2020) Cian: cross-image affinity net for weakly supervised semantic segmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 10762–10769 Fan J, Zhang Z, Tan T et al (2020) Cian: cross-image affinity net for weakly supervised semantic segmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 10762–10769
35.
Zurück zum Zitat Shimoda W, Yanai K (2019) Self-supervised difference detection for weakly-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 5208–5217 Shimoda W, Yanai K (2019) Self-supervised difference detection for weakly-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 5208–5217
36.
Zurück zum Zitat Yao Y, Chen T, Xie GS et al (2021) Non-salient region object mining for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2623–2632 Yao Y, Chen T, Xie GS et al (2021) Non-salient region object mining for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 2623–2632
37.
Zurück zum Zitat Lee J, Kim E, Yoon S (2021) Anti-adversarially manipulated attributions for weakly and semi-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4071–4080 Lee J, Kim E, Yoon S (2021) Anti-adversarially manipulated attributions for weakly and semi-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4071–4080
38.
Zurück zum Zitat Kim B, Han S, Kim J (2021) Discriminative region suppression for weakly-supervised semantic segmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 1754–1761 Kim B, Han S, Kim J (2021) Discriminative region suppression for weakly-supervised semantic segmentation. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp 1754–1761
39.
Zurück zum Zitat Zhang B, Xiao J, Wei Y et al (2022) End-to-end weakly supervised semantic segmentation with reliable region mining. Pattern Recogn 128:108663CrossRef Zhang B, Xiao J, Wei Y et al (2022) End-to-end weakly supervised semantic segmentation with reliable region mining. Pattern Recogn 128:108663CrossRef
40.
Zurück zum Zitat Mai J, Zhang F, Ye J et al (2023) Exploit cam by itself: Complementary learning system for weakly supervised semantic segmentation. ArXiv Preprint ArXiv:2303.02449 Mai J, Zhang F, Ye J et al (2023) Exploit cam by itself: Complementary learning system for weakly supervised semantic segmentation. ArXiv Preprint ArXiv:​2303.​02449
41.
Zurück zum Zitat Zhou L, Gong C, Liu Z et al (2020) Sal: selection and attention losses for weakly supervised semantic segmentation. IEEE Transact Multimed 23:1035–1048CrossRef Zhou L, Gong C, Liu Z et al (2020) Sal: selection and attention losses for weakly supervised semantic segmentation. IEEE Transact Multimed 23:1035–1048CrossRef
42.
Zurück zum Zitat Chen Z, Wang T, Wu X et al (2022) Class re-activation maps for weakly-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 969–978 Chen Z, Wang T, Wu X et al (2022) Class re-activation maps for weakly-supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 969–978
43.
Zurück zum Zitat Xie J, Hou X, Ye K et al (2022) Clims: Cross language image matching for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4483–4492 Xie J, Hou X, Ye K et al (2022) Clims: Cross language image matching for weakly supervised semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 4483–4492
Metadaten
Titel
AAR:Attention Remodulation for Weakly Supervised Semantic Segmentation
verfasst von
Yu-e Lin
Houguo Li
Xingzhu Liang
Mengfan Li
Huilin Liu
Publikationsdatum
02.12.2023
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 7/2024
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-023-05786-z

Weitere Artikel der Ausgabe 7/2024

The Journal of Supercomputing 7/2024 Zur Ausgabe

Premium Partner