Skip to main content
Top

2020 | OriginalPaper | Chapter

Segmentation of Aerial Image with Multi-scale Feature and Attention Model

Authors : Shiyu Hu, Qian Ning, Bingcai Chen, Yinjie Lei, Xinzhi Zhou, Hua Yan, Chengping Zhao, Tiantian Tang, Ruiheng Hu

Published in: Artificial Intelligence in China

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Aerial image labeling plays an important part in the mapping of maps with high precision. The knowledge about the range and intensive degree of aerial building segmentation is necessary for urban planning. Fully convolutional networks (FCNs) have recently shown state-of-the-art performance in image segmentation. In order to get better aerial images segmentation performance, we use a method of combing FCNs with multi-scale features and attention model in order to carry out segmentation automatically in aerial images. Attention model gives each scale feature added extra supervision to achieve better segmentation. Here, U-net and FCN-8s are used as original semantic segmentation model to train with multi-scale images and attention models. The datasets use different proportions of Inria Aerial Image Labeling Dataset, including two semantic classes: building and not building. The results show that the semantic segmentation model combined with multi-scale features and attention model has higher segmentation accuracy and better performance.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Lagrange A, Le Saux B, Beaupere A et al (2015) Benchmarking classification of earth-observation data: from learning explicit features to convolutional networks. In: Geoscience and remote sensing symposium (IGARSS). IEEE, Milan, pp 4173–4176 Lagrange A, Le Saux B, Beaupere A et al (2015) Benchmarking classification of earth-observation data: from learning explicit features to convolutional networks. In: Geoscience and remote sensing symposium (IGARSS). IEEE, Milan, pp 4173–4176
2.
go back to reference Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, Boston, pp 3431–3440 Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, Boston, pp 3431–3440
3.
go back to reference Badrinarayanan V, Kendall A, Cipolla R (2015) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. arXiv preprint arXiv:1511.00561 Badrinarayanan V, Kendall A, Cipolla R (2015) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. arXiv preprint arXiv:​1511.​00561
4.
go back to reference Chen LC, Papandreou G, Kokkinos I et al (2014) Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv preprint arXiv:1412.7062 Chen LC, Papandreou G, Kokkinos I et al (2014) Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv preprint arXiv:​1412.​7062
5.
go back to reference Chen LC, Papandreou G, Kokkinos I et al (2018) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848CrossRef Chen LC, Papandreou G, Kokkinos I et al (2018) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell 40(4):834–848CrossRef
6.
go back to reference Chen LC, Papandreou G, Schroff F et al (2017) Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 Chen LC, Papandreou G, Schroff F et al (2017) Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:​1706.​05587
7.
go back to reference Arbelaez P, Maire M, Fowlkes C et al (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33(5):898–916CrossRef Arbelaez P, Maire M, Fowlkes C et al (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33(5):898–916CrossRef
8.
9.
go back to reference Eigen D, Fergus R (2015) Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In: Proceedings of the IEEE international conference on computer vision. IEEE, Santiago, pp 2650–2658 Eigen D, Fergus R (2015) Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In: Proceedings of the IEEE international conference on computer vision. IEEE, Santiago, pp 2650–2658
10.
go back to reference Cao C, Liu X, Yang Y et al (2015) Look and think twice: capturing top-down visual attention with feedback convolutional neural networks. In: Proceedings of the IEEE international conference on computer vision. IEEE, Santiago, pp 2956–2964 Cao C, Liu X, Yang Y et al (2015) Look and think twice: capturing top-down visual attention with feedback convolutional neural networks. In: Proceedings of the IEEE international conference on computer vision. IEEE, Santiago, pp 2956–2964
12.
go back to reference Chen LC, Yang Y, Wang J et al (2016) Attention to scale: scale-aware semantic image segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, Las Vegas, pp 3640–3649 Chen LC, Yang Y, Wang J et al (2016) Attention to scale: scale-aware semantic image segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, Las Vegas, pp 3640–3649
14.
go back to reference Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, Munich, pp 234–241 Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, Munich, pp 234–241
Metadata
Title
Segmentation of Aerial Image with Multi-scale Feature and Attention Model
Authors
Shiyu Hu
Qian Ning
Bingcai Chen
Yinjie Lei
Xinzhi Zhou
Hua Yan
Chengping Zhao
Tiantian Tang
Ruiheng Hu
Copyright Year
2020
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-0187-6_7