
2023 | Original Paper | Book Chapter

PSR-Net: A Dual-Branch Pyramid Semantic Reasoning Network for Segmentation of Remote Sensing Images

Authors: Lijun Wang, Bicao Li, Bei Wang, Chunlei Li, Jie Huang, Mengxing Song

Published in: Artificial Neural Networks and Machine Learning – ICANN 2023

Publisher: Springer Nature Switzerland


Abstract

Long-range context information plays an important role in improving the performance of semantic segmentation networks for remote sensing images (RSIs). In large RSIs, however, the interaction between local and global information is limited. To address this problem, we propose PSR-Net, a dual-branch pyramid semantic reasoning segmentation network consisting of a global branch and a local branch. The global branch employs a conventional CNN, while the local branch introduces a lightweight multi-scale hierarchical feature aggregation (MHFA) module. In addition, we propose a Feature Semantic Reasoning (FSR) module that enhances valuable features and suppresses useless ones to improve the semantic representation of RSIs, after which a dual-branch transformer is embedded. An ablation experiment on the Beijing Land-Use (BLU) dataset demonstrates the effectiveness of the added modules, and comparisons with other conventional networks indicate that the proposed network achieves better segmentation accuracy on large-scale RSI datasets.
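
The abstract does not describe how the FSR module works internally, so the following is only an illustrative sketch of the general idea of strengthening some feature channels and weakening others. It uses a simple, non-learned channel gate in the style of squeeze-and-excitation. The function names `sigmoid` and `channel_gate`, the fixed sigmoid-of-mean weighting, and the nested-list feature layout are all assumptions made for this example and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Map a real number to the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_gate(features):
    """Reweight each channel of a C x H x W feature map (nested lists).

    Hypothetical stand-in for feature reweighting: a trained module would
    learn its gate, whereas this sketch derives the gate directly from the
    channel mean.
    """
    gated, weights = [], []
    for channel in features:
        values = [v for row in channel for v in row]
        mean = sum(values) / len(values)      # "squeeze": global average
        weight = sigmoid(mean)                # gate value in (0, 1)
        weights.append(weight)
        gated.append([[v * weight for v in row] for row in channel])
    return gated, weights


if __name__ == "__main__":
    # Two 2x2 channels: one strongly activated, one weakly activated.
    feats = [
        [[2.0, 2.0], [2.0, 2.0]],
        [[-2.0, -2.0], [-2.0, -2.0]],
    ]
    out, w = channel_gate(feats)
    print("gate weights:", w)
```

In this toy gate, channels with a high average activation keep most of their magnitude while channels with a low average are scaled toward zero. A learned gate would instead decide which channels count as valuable from training data.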

Metadata
Title
PSR-Net: A Dual-Branch Pyramid Semantic Reasoning Network for Segmentation of Remote Sensing Images
Authors
Lijun Wang
Bicao Li
Bei Wang
Chunlei Li
Jie Huang
Mengxing Song
Copyright Year
2023
DOI
https://doi.org/10.1007/978-3-031-44210-0_47
