Published in: Multimedia Systems 3/2023

01-12-2022 | Regular Paper

A multi-level feature weight fusion model for salient object detection

Authors: Zhang Shanqing, Chen Yujie, Meng Yiheng, Lu Jianfeng, Li Li, Bai Rui

Abstract

Although Fully Convolutional Neural Networks (FCNs) have achieved good performance in salient object detection, problems remain, such as blurred boundaries and unsatisfactory performance in complex scenes. How to better integrate multi-level convolutional features therefore requires further investigation. This paper proposes a salient object detection algorithm that uses the Gram matrix and its Frobenius (F) norm to weigh the importance of each multi-level feature map, and then uses these weights to recursively fuse the multi-level prediction results into the final saliency map. The algorithm evaluates the importance of multi-level feature maps at different depths by calculating the F norm of the Gram matrix of the feature tensor slices. The multi-level feature maps are then fused according to these weights, which reduces the loss of multi-level prediction results during fusion and preserves spatial details. In addition, to obtain more accurate boundaries, deep supervision is used to optimize the salient feature maps: pixel-level supervision information from the ground truth guides each layer's prediction. Experiments on five benchmark data sets demonstrate that the proposed method performs well in various scenes, especially complex ones.
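
The abstract names the ingredients (Gram matrix, F norm, weighted recursive fusion, deep supervision) but not their exact formulation. The following is a minimal NumPy sketch of how these pieces could fit together, not the authors' implementation. All function names are hypothetical, and several choices are assumptions: the Gram matrix is normalized by the spatial size, the weights are the F norms normalized to sum to one, the recursion is a running weighted average from the deepest level to the shallowest, and the deep supervision term is a per-level binary cross-entropy.

```python
import numpy as np


def gram_frobenius_norm(feature):
    """Frobenius norm of the Gram matrix of a (C, H, W) feature map."""
    c, h, w = feature.shape
    flat = feature.reshape(c, h * w)
    # Assumption: normalize by the number of spatial positions so that
    # levels with different resolutions stay comparable.
    gram = flat @ flat.T / (h * w)
    return np.linalg.norm(gram, ord="fro")


def level_weights(features):
    """One weight per level; assumed to be the F norms scaled to sum to 1."""
    norms = np.array([gram_frobenius_norm(f) for f in features])
    return norms / norms.sum()


def fuse_recursively(predictions, weights):
    """Fuse (H, W) predictions, ordered shallow to deep, from deep to shallow.

    Assumption: each step is a running weighted average of the map fused
    so far and the prediction of the next shallower level.
    """
    fused = predictions[-1]
    acc = weights[-1]
    for pred, w in zip(predictions[:-1][::-1], weights[:-1][::-1]):
        fused = (acc * fused + w * pred) / (acc + w)
        acc = acc + w
    return fused


def deep_supervision_loss(predictions, ground_truth, eps=1e-7):
    """Sum of per-level binary cross-entropy losses against the ground truth.

    Assumption: the abstract does not name the loss; BCE is a common choice.
    """
    total = 0.0
    for pred in predictions:
        p = np.clip(pred, eps, 1.0 - eps)
        bce = -(ground_truth * np.log(p) + (1 - ground_truth) * np.log(1 - p))
        total += bce.mean()
    return total
```

With weights that sum to one, this particular recursion reduces to the plain weighted average of the per-level predictions. A learned or non-linear fusion step, as the paper may well use, would not.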

Metadata
Title
A multi-level feature weight fusion model for salient object detection
Authors
Zhang Shanqing
Chen Yujie
Meng Yiheng
Lu Jianfeng
Li Li
Bai Rui
Publication date
01-12-2022
Publisher
Springer Berlin Heidelberg
Published in
Multimedia Systems / Issue 3/2023
Print ISSN: 0942-4962
Electronic ISSN: 1432-1882
DOI
https://doi.org/10.1007/s00530-022-01018-1
