The Journal of Supercomputing

Published: 22-09-2023

Saliency-based dual-attention network for unsupervised video object segmentation

Authors: Guifang Zhang, Hon-Cheng Wong

Published in: The Journal of Supercomputing | Issue 4/2024

Abstract

This paper addresses unsupervised video object segmentation (UVOS), the task of segmenting the objects of interest throughout an entire video without any annotation. Many UVOS methods have been proposed in recent years. Although these methods perform well, they rely on heavyweight networks, which often leads to large model sizes. To reduce model size while keeping performance competitive, we propose a saliency-based dual-attention (SDA) method for UVOS. Our method takes video frames and optical flow as inputs, extracting appearance information from the video frames and motion information from the optical flow. We design a two-branch network, with one branch for appearance information and one for motion information. The information from the two branches is fused via a saliency-based dual-attention module to segment the primary object in one path. This module is composed of saliency attention and saliency-based reverse attention. To demonstrate the effectiveness of our network, we tested it on the DAVIS-2016 and SegtrackV2 datasets. Experimental results show that our method achieves competitive results in terms of accuracy and model size.
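The abstract names the two components of the fusion module but does not specify their form. As a rough illustration only, the sketch below assumes a common formulation: saliency attention weights features by a sigmoid of a saliency map, and reverse attention weights them by one minus that sigmoid. The function names, the element-wise sum used to combine the two branches, and the toy inputs are all hypothetical and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Logistic function mapping a saliency logit to a weight in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def combine_branches(appearance, motion):
    """Assumed fusion step: element-wise sum of the two branch feature maps."""
    return [[a + m for a, m in zip(a_row, m_row)]
            for a_row, m_row in zip(appearance, motion)]


def saliency_attention(features, saliency_logits):
    """Emphasise locations the saliency map marks as likely foreground."""
    return [[f * sigmoid(s) for f, s in zip(f_row, s_row)]
            for f_row, s_row in zip(features, saliency_logits)]


def reverse_attention(features, saliency_logits):
    """Emphasise the complement: locations the saliency map has not highlighted."""
    return [[f * (1.0 - sigmoid(s)) for f, s in zip(f_row, s_row)]
            for f_row, s_row in zip(features, saliency_logits)]


# Toy 2x2 feature maps (hypothetical values, single channel).
appearance = [[1.0, 2.0], [3.0, 4.0]]
motion = [[0.5, 0.5], [0.5, 0.5]]
saliency_logits = [[0.0, 2.0], [-2.0, 0.0]]

fused = combine_branches(appearance, motion)
attended = saliency_attention(fused, saliency_logits)
reversed_attended = reverse_attention(fused, saliency_logits)
```

Under this assumed formulation the two weights sum to one at every location, so the attended and reverse-attended maps partition the fused features between salient and non-salient regions. A real network would apply these weights to multi-channel tensors and learn how to recombine the two outputs.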



Title: Saliency-based dual-attention network for unsupervised video object segmentation
Authors: Guifang Zhang, Hon-Cheng Wong
Publication date: 22-09-2023
Publisher: Springer US
Published in: The Journal of Supercomputing / Issue 4/2024
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI: https://doi.org/10.1007/s11227-023-05637-x
