Skip to main content
Erschienen in: Earth Science Informatics 3/2022

04.07.2022 | Research Article

Performance evaluation of shallow and deep CNN architectures on building segmentation from high-resolution images

verfasst von: Batuhan Sariturk, Dursun Zafer Seker, Ozan Ozturk, Bulent Bayram

Erschienen in: Earth Science Informatics | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Building extraction from high-resolution images has been studied extensively for its great importance in obtaining geographical information. As an advanced machine learning technique, deep learning has achieved great progress along with developments in hardware and larger datasets. In this study, the performance evaluation of convolutional neural network architectures in building segmentation from high-resolution images was investigated. Four U-Net based architectures were generated and their performances were compared with each other and to the U-Net. Models were trained and tested on datasets that were prepared using the Inria Aerial Image Labelling Dataset and the Massachusetts Buildings Dataset. On the INRIA test dataset, Deeper 1 architecture provided 0.79 F1 and 0.66 IoU scores. Deeper 1 was followed by Deeper 2 and U-Net architectures, both with an F1 score of 0.78 and an IoU score of 0.65. On the Massachusetts test dataset, the U-Net architecture provided 0.79 F1 and 0.66 IoU scores. This architecture was followed by Deeper 2 with 0.78 F1 score and 0.65 IoU score, and Shallower 1 and Deeper 1 architectures both with 0.77 F1 score and 0.64 IoU score. The successful results of Deeper 1 and Deeper 2 architectures show that deeper architectures can provide better results even if there is not too much data. Also, Shallower 1 architecture appears to have a performance not far behind deep architectures, with less computational cost, and this shows usefulness for geographic applications.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39:2481–2495CrossRef Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39:2481–2495CrossRef
Zurück zum Zitat Blaschke T (2010) Object based image analysis for remote sensing. ISPRS J Photogramm Remote Sens Blaschke T (2010) Object based image analysis for remote sensing. ISPRS J Photogramm Remote Sens
Zurück zum Zitat Cheng G, Han J (2016) A survey on object detection in optical remote sensing images. ISPRS J Photogramm Remote Sens Cheng G, Han J (2016) A survey on object detection in optical remote sensing images. ISPRS J Photogramm Remote Sens
Zurück zum Zitat Cordts M, Omran M, Ramos S et al (2016) The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition Cordts M, Omran M, Ramos S et al (2016) The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition
Zurück zum Zitat Deepsense.ai (2021) Deep learning for satellite imagery via image segmentation. deepsense.ai/deep-learning-for-satellite-imagery-via-image-segmentation/. Accessed 12 May 2021 Deepsense.ai (2021) Deep learning for satellite imagery via image segmentation. deepsense.ai/deep-learning-for-satellite-imagery-via-image-segmentation/. Accessed 12 May 2021
Zurück zum Zitat Demir I, Koperski K, Lindenbaum D, et al. (2018) Deepglobe 2018: a challenge to parse the earth through satellite images. In: IEEE computer society conference on computer vision and pattern recognition workshops Demir I, Koperski K, Lindenbaum D, et al. (2018) Deepglobe 2018: a challenge to parse the earth through satellite images. In: IEEE computer society conference on computer vision and pattern recognition workshops
Zurück zum Zitat Dumoulin V, Visin F (2018) A guide to convolution arithmetic for deep learning. pp 1–31 Dumoulin V, Visin F (2018) A guide to convolution arithmetic for deep learning. pp 1–31
Zurück zum Zitat Ghanea M, Moallem P, Momeni M (2016) Building extraction from high-resolution satellite images in urban areas: recent methods and strategies against significant challenges. Int J Remote Sens Ghanea M, Moallem P, Momeni M (2016) Building extraction from high-resolution satellite images in urban areas: recent methods and strategies against significant challenges. Int J Remote Sens
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition
Zurück zum Zitat Ivanovsky L, Khryashchev V, Pavlov V, Ostrovskaya A (2019) Building detection on aerial images using u-NET neural networks. In: Conference of Open Innovation Association, FRUCT Ivanovsky L, Khryashchev V, Pavlov V, Ostrovskaya A (2019) Building detection on aerial images using u-NET neural networks. In: Conference of Open Innovation Association, FRUCT
Zurück zum Zitat Jia D, Wei D, Socher R et al (2009) ImageNet: a large-scale hierarchical image database Jia D, Wei D, Socher R et al (2009) ImageNet: a large-scale hierarchical image database
Zurück zum Zitat Kingma DP, Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015 - conference track proceedings Kingma DP, Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015 - conference track proceedings
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems
Zurück zum Zitat Li Y, Wu H (2008) Adaptive building edge detection by combining LiDAR data and aerial images. Int Arch Photogramm Remote Sens Spat Inf Sci Li Y, Wu H (2008) Adaptive building edge detection by combining LiDAR data and aerial images. Int Arch Photogramm Remote Sens Spat Inf Sci
Zurück zum Zitat Lin TY, Maire M, Belongie S, et al. (2014) Microsoft COCO: common objects in context. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) Lin TY, Maire M, Belongie S, et al. (2014) Microsoft COCO: common objects in context. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics)
Zurück zum Zitat Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. Proc IEEE Conf Comput Vis Pattern Recognit:3431–3440 Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. Proc IEEE Conf Comput Vis Pattern Recognit:3431–3440
Zurück zum Zitat Mnih V (2013) Machine learning for aerial image labeling. Dissertation, University of Toronto Mnih V (2013) Machine learning for aerial image labeling. Dissertation, University of Toronto
Zurück zum Zitat Mnih V, Hinton G (2012) Learning to label aerial images from noisy data. In: Proceedings of the 29th international conference on machine learning, ICML Mnih V, Hinton G (2012) Learning to label aerial images from noisy data. In: Proceedings of the 29th international conference on machine learning, ICML
Zurück zum Zitat Mnih V, Hinton GE (2010) Learning to detect roads in high-resolution aerial images. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) Mnih V, Hinton GE (2010) Learning to detect roads in high-resolution aerial images. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics)
Zurück zum Zitat Nogueira K, Miranda WO, Santos JAD (2015) Improving spatial feature representation from aerial scenes by using convolutional networks. In: Brazilian symposium of computer graphic and image processing Nogueira K, Miranda WO, Santos JAD (2015) Improving spatial feature representation from aerial scenes by using convolutional networks. In: Brazilian symposium of computer graphic and image processing
Zurück zum Zitat Patterson J, Gibson A (2017) Deep learning - A Practitioner’s Approach Patterson J, Gibson A (2017) Deep learning - A Practitioner’s Approach
Zurück zum Zitat Rezatofighi H, Tsoi N, Gwak J, et al. (2019) Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition Rezatofighi H, Tsoi N, Gwak J, et al. (2019) Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition
Zurück zum Zitat Rosebrock A (2017) Deep Learning for Computer Vision with Python. PyImageSearch Rosebrock A (2017) Deep Learning for Computer Vision with Python. PyImageSearch
Zurück zum Zitat Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd International conference on learning representations, ICLR 2015 - conference track proceedings Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd International conference on learning representations, ICLR 2015 - conference track proceedings
Zurück zum Zitat Szegedy C, Liu W, Jia Y, et al. (2015) Going deeper with convolutions. Proc IEEE Conf Comput Vis Pattern Recognit:1–9 Szegedy C, Liu W, Jia Y, et al. (2015) Going deeper with convolutions. Proc IEEE Conf Comput Vis Pattern Recognit:1–9
Zurück zum Zitat Zhong C, Xu Q, Yang F, Hu L (2015) Building change detection for high-resolution remotely sensed images based on a semantic dependency. In: International geoscience and remote sensing symposium (IGARSS) Zhong C, Xu Q, Yang F, Hu L (2015) Building change detection for high-resolution remotely sensed images based on a semantic dependency. In: International geoscience and remote sensing symposium (IGARSS)
Metadaten
Titel
Performance evaluation of shallow and deep CNN architectures on building segmentation from high-resolution images
verfasst von
Batuhan Sariturk
Dursun Zafer Seker
Ozan Ozturk
Bulent Bayram
Publikationsdatum
04.07.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Earth Science Informatics / Ausgabe 3/2022
Print ISSN: 1865-0473
Elektronische ISSN: 1865-0481
DOI
https://doi.org/10.1007/s12145-022-00840-5

Weitere Artikel der Ausgabe 3/2022

Earth Science Informatics 3/2022 Zur Ausgabe

Premium Partner