nach oben

Earth Science Informatics

Erschienen in:

04.07.2022 | Research Article

Performance evaluation of shallow and deep CNN architectures on building segmentation from high-resolution images

verfasst von: Batuhan Sariturk, Dursun Zafer Seker, Ozan Ozturk, Bulent Bayram

Erschienen in: Earth Science Informatics | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Building extraction from high-resolution images has been studied extensively for its great importance in obtaining geographical information. As an advanced machine learning technique, deep learning has achieved great progress along with developments in hardware and larger datasets. In this study, the performance evaluation of convolutional neural network architectures in building segmentation from high-resolution images was investigated. Four U-Net based architectures were generated and their performances were compared with each other and to the U-Net. Models were trained and tested on datasets that were prepared using the Inria Aerial Image Labelling Dataset and the Massachusetts Buildings Dataset. On the INRIA test dataset, Deeper 1 architecture provided 0.79 F1 and 0.66 IoU scores. Deeper 1 was followed by Deeper 2 and U-Net architectures, both with an F1 score of 0.78 and an IoU score of 0.65. On the Massachusetts test dataset, the U-Net architecture provided 0.79 F1 and 0.66 IoU scores. This architecture was followed by Deeper 2 with 0.78 F1 score and 0.65 IoU score, and Shallower 1 and Deeper 1 architectures both with 0.77 F1 score and 0.64 IoU score. The successful results of Deeper 1 and Deeper 2 architectures show that deeper architectures can provide better results even if there is not too much data. Also, Shallower 1 architecture appears to have a performance not far behind deep architectures, with less computational cost, and this shows usefulness for geographic applications.

Vorheriger Artikel Geometric and radiometric evaluation of remote sensing information in virtual platforms

Nächster Artikel A homogeneous approach in modeling a coastal karst aquifer

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Alshehhi R, Marpu PR, Woon WL, Mura MD (2017) Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks. ISPRS J Photogramm Remote Sens. https://doi.org/10.1016/j.isprsjprs.2017.05.002

Audebert N, Le Saux B, Lefèvre S (2017) Segment-before-detect: vehicle detection and classification through semantic segmentation of aerial images. Remote Sens. https://doi.org/10.3390/rs9040368

Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans Pattern Anal Mach Intell 39:2481–2495CrossRef

Bai Y, Mas E, Koshimura S (2018) Towards operational satellite-based damage-mapping using U-net convolutional network: a case study of 2011 Tohoku Earthquake-Tsunami. Remote Sens. https://doi.org/10.3390/rs10101626

Bauerle A, Van Onzenoodt C, Ropinski T (2021) Net2vis - a visual grammar for automatically generating publication-ready cnn architecture visualizations. IEEE Trans Vis Comput Graph:1–1, https://doi.org/10.1109/TVCG.2021.3057483

Bischke B, Helber P, Folz J, et al. (2019) Multi-task learning for segmentation of building footprints with deep neural networks. Proc - Int Conf Image Process ICIP 2019-Septe:1480–1484, https://doi.org/10.1109/ICIP.2019.8803050

Blaschke T (2010) Object based image analysis for remote sensing. ISPRS J Photogramm Remote Sens

Cao R, Zhu J, Tu W et al (2018) Integrating aerial and street view images for urban land use classification. Remote Sens, https://doi.org/10.3390/rs10101553

Chen LC, Papandreou G, Kokkinos I et al (2018) DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2017.2699184

Chen Q, Wang L, Wu Y, et al. (2019) Aerial imagery for roof segmentation: a large-scale dataset towards automatic mapping of buildings. ISPRS J Photogramm Remote Sens 147:42–55. https://doi.org/10.1016/j.isprsjprs.2018.11.011CrossRef

Cheng G, Han J (2016) A survey on object detection in optical remote sensing images. ISPRS J Photogramm Remote Sens

Cheng G, Wang Y, Xu S et al (2017) Automatic road detection and centerline extraction via cascaded end-to-end convolutional neural network. IEEE Trans Geosci Remote Sens. https://doi.org/10.1109/TGRS.2017.2669341

Cordts M, Omran M, Ramos S et al (2016) The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition

Deepsense.ai (2021) Deep learning for satellite imagery via image segmentation. deepsense.ai/deep-learning-for-satellite-imagery-via-image-segmentation/. Accessed 12 May 2021

Demir I, Koperski K, Lindenbaum D, et al. (2018) Deepglobe 2018: a challenge to parse the earth through satellite images. In: IEEE computer society conference on computer vision and pattern recognition workshops

Dumoulin V, Visin F (2018) A guide to convolution arithmetic for deep learning. pp 1–31

Fu G, Liu C, Zhou R et al (2017) Classification for high resolution remote sensing imagery using a fully convolutional network. Remote Sens, https://doi.org/10.3390/rs9050498

Ghanea M, Moallem P, Momeni M (2016) Building extraction from high-resolution satellite images in urban areas: recent methods and strategies against significant challenges. Int J Remote Sens

Ghorbanzadeh O, Blaschke T, Gholamnia K et al (2019) Evaluation of different machine learning methods and deep-learning convolutional neural networks for landslide detection. Remote Sens 11, https://doi.org/10.3390/rs11020196

Girisha S, Pai MMM, Verma U, Pai RM (2019) Performance analysis of semantic segmentation algorithms for finely annotated new UAV aerial video dataset (manipaluavid). IEEE Access 7:136239–136253. https://doi.org/10.1109/ACCESS.2019.2941026CrossRef

Gorokhovatskyi O, Peredrii O (2018) Shallow convolutional neural networks for pattern recognition problems. Proc 2018 IEEE 2nd Int Conf Data Stream Min Process DSMP 2018:459–463. https://doi.org/10.1109/DSMP.2018.8478540CrossRef

He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition

Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. Proc - 30th IEEE Conf Comput Vis Pattern Recognition CVPR 2017 2017-Janua:2261–2269, https://doi.org/10.1109/CVPR.2017.243

Hui J, Du M, Ye X, et al. (2019) Effective building extraction from High-Resolution remote sensing images with multitask driven deep neural network. IEEE Geosci Remote Sens Lett 16:786–790. https://doi.org/10.1109/LGRS.2018.2880986CrossRef

Inria (2021) Inria Aerial Image Labeling Dataset. https://project.inria.fr/aerialimagelabeling/. Accessed 12 May 2021

Ivanovsky L, Khryashchev V, Pavlov V, Ostrovskaya A (2019) Building detection on aerial images using u-NET neural networks. In: Conference of Open Innovation Association, FRUCT

Ji S, Wei S, Lu M (2019) Fully convolutional networks for multisource building extraction from an open aerial and satellite imagery data set. IEEE Trans Geosci Remote Sens 57:574–586. https://doi.org/10.1109/TGRS.2018.2858817CrossRef

Jia D, Wei D, Socher R et al (2009) ImageNet: a large-scale hierarchical image database

Kim DE, Gofman M (2018) Comparison of shallow and deep neural networks for network intrusion detection. 2018 IEEE 8th Annu Comput Commun Work Conf CCWC 2018 2018-Janua:204–208. https://doi.org/10.1109/CCWC.2018.8301755

Kim JH, Lee H, Hong SJ, et al. (2019) Objects segmentation from high-resolution aerial images using U-Net with pyramid pooling layers. IEEE Geosci Remote Sens Lett 16:115–119. https://doi.org/10.1109/LGRS.2018.2868880CrossRef

Kingma DP, Ba JL (2015) Adam: a method for stochastic optimization. In: 3rd International conference on learning representations, ICLR 2015 - conference track proceedings

Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Advances in neural information processing systems

LeCun Y, Boser B, Denker JS et al (1989) Backpropagation Applied to Handwritten Zip Code Recognition. Neural Comput, https://doi.org/10.1162/neco.1989.1.4.541

Lei F, Liu X, Dai Q, Ling BWK (2020) Shallow convolutional neural network for image classification. SN Appl Sci 2, https://doi.org/10.1007/s42452-019-1903-4

Li L, Liang J, Weng m, Zhu H (2018) A multiple-feature reuse network to extract buildings from remote sensing imagery. Remote Sens, https://doi.org/10.3390/rs10091350

Li W, He C, Fang J et al (2019) Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data. Remote Sens:11, https://doi.org/10.3390/rs11040403

Li X, Chen H, Qi X, et al. (2018c) H-DenseUNet: hybrid Densely Connected UNet for Liver and Tumor Segmentation from CT Volumes. IEEE Trans Med Imaging 37:2663–2674. https://doi.org/10.1109/TMI.2018.2845918CrossRef

Li Y, Wu H (2008) Adaptive building edge detection by combining LiDAR data and aerial images. Int Arch Photogramm Remote Sens Spat Inf Sci

Lin TY, Maire M, Belongie S, et al. (2014) Microsoft COCO: common objects in context. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics)

Liu W, Yang MY, Xie M et al (2019) Accurate building extraction from fused DSM and UAV images using a chain fully convolutional neural network. Remote Sens, https://doi.org/10.3390/rs11242912

Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. Proc IEEE Conf Comput Vis Pattern Recognit:3431–3440

Ma L, Li M, Ma X, et al. (2017) A review of supervised object-based land-cover image classification. ISPRS J Photogramm Remote Sens 130:277–293. https://doi.org/10.1016/j.isprsjprs.2017.06.001CrossRef

Maggiori E, Tarabalka Y, Charpiat G, Alliez P (2017a) Convolutional neural networks for large-scale remote-sensing image classification. IEEE Trans Geosci Remote Sens 55:645–657. https://doi.org/10.1109/TGRS.2016.2612821CrossRef

Maggiori E, Tarabalka Y, Charpiat G, Alliez P (2017b) Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark. Int Geosci Remote Sens Symp 2017-July:3226–3229. https://doi.org/10.1109/IGARSS.2017.8127684

Milosavljević A (2020) Automated processing of remote sensing imagery using deep semantic segmentation: a building footprint extraction case. ISPRS Int J Geo-Information:9, https://doi.org/10.3390/ijgi9080486

Mnih V (2013) Machine learning for aerial image labeling. Dissertation, University of Toronto

Mnih V, Hinton G (2012) Learning to label aerial images from noisy data. In: Proceedings of the 29th international conference on machine learning, ICML

Mnih V, Hinton GE (2010) Learning to detect roads in high-resolution aerial images. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics)

Nagi J, Ducatelle F, Di Caro GA, et al. (2011) Max-pooling convolutional neural networks for vision-based hand gesture recognition. IEEE Int Conf Signal Image Process Appl ICSIPA 2011:342–347. https://doi.org/10.1109/ICSIPA.2011.6144164CrossRef

Nogueira K, Miranda WO, Santos JAD (2015) Improving spatial feature representation from aerial scenes by using convolutional networks. In: Brazilian symposium of computer graphic and image processing

Nogueira K, Penatti OAB, Dos Santos JA (2017) Towards better exploiting convolutional neural networks for remote sensing scene classification. Pattern Recognit, https://doi.org/10.1016/j.patcog.2016.07.001

Noh H, Hong S, Han B (2015) Learning deconvolution network for semantic segmentation. Proc IEEE Int Conf Comput Vis 2015 Inter:1520–1528. https://doi.org/10.1109/ICCV.2015.178

Ohleyer S (2018) Building Segmentation on Satellite Images. https://project.inria.fr/aerialimagelabeling/files/2018/01/fp_ohleyer_compressed.pdf. Accessed 12 May 2021

Patterson J, Gibson A (2017) Deep learning - A Practitioner’s Approach

Rezatofighi H, Tsoi N, Gwak J, et al. (2019) Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition

Ronneberger O, Fischer P, Brox T (2015) U-net: convolutional networks for biomedical image segmentation. Lect Notes Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinformatics) 9351:234–241. https://doi.org/10.1007/978-3-319-24574-4_28CrossRef

Rosebrock A (2017) Deep Learning for Computer Vision with Python. PyImageSearch

Scikit-Learn (2021) 3.1. Cross-validation: evaluating estimator performance. https://scikit-learn.org/stable/modules/cross_validation.html. Accessed 12 May 2021

Shi Y, Li Q, Zhu XX (2020) Building segmentation through a gated graph convolutional neural network with deep structured feature embedding. ISPRS J Photogramm Remote Sens 159:184–197. https://doi.org/10.1016/j.isprsjprs.2019.11.004CrossRef

Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd International conference on learning representations, ICLR 2015 - conference track proceedings

Sirmaçek B, Ünsalan C (2008) Building detection from aerial images using invariant color features and shadow information. In: 2008 23Rd international symposium on computer and information sciences, ISCIS, pp 1–5. https://doi.org/10.1109/ISCIS.2008.4717854

Szegedy C, Liu W, Jia Y, et al. (2015) Going deeper with convolutions. Proc IEEE Conf Comput Vis Pattern Recognit:1–9

Wen Q, Jiang K, Wang W, et al. (2019) Automatic building extraction from google earth images under complex backgrounds based on deep instance segmentation network. Sensors (Switzerland) 19:1–16. https://doi.org/10.3390/s19020333CrossRef

Wu G, Shao X, Guo Z, et al. (2018) Automatic building segmentation of aerial imagery usingmulti-constraint fully convolutional networks. Remote Sens 10:1–18. https://doi.org/10.3390/rs10030407CrossRef

Xu X, Lim W, Ran Q et al (2018a) Multisource remote sensing data classification based on convolutional neural network. IEEE Trans Geosci Remote Sens, https://doi.org/10.1109/TGRS.2017.2756851

Xu Y, Wu L, Xie Z, Chen Z (2018b) Building extraction in very high resolution remote sensing imagery using deep learning and guided filters. Remote Sens:10, https://doi.org/10.3390/rs10010144

Xu Y, Xie Z, Feng y, Chen Z (2018c) Road extraction from high-resolution remote sensing imagery using deep learning. Remote Sens, https://doi.org/10.3390/rs10091461

Yang X, Li X, Ye Y, et al. (2019) Road detection and centerline extraction via deep recurrent convolutional neural network U-Net. IEEE Trans Geosci Remote Sens 57:7209–7220. https://doi.org/10.1109/TGRS.2019.2912301CrossRef

Yang X, Ye Y, Li X et al (2018) Hyperspectral image classification with deep learning models. IEEE Trans Geosci Remote Sens, https://doi.org/10.1109/TGRS.2018.2815613

Ye Z, Fu Y, Gan M, et al. (2019) Building extraction from very high resolution aerial imagery using joint attention deep neural network. Remote Sens 11:1–21. https://doi.org/10.3390/rs11242970CrossRef

Yuan J (2018) Learning building extraction in aerial scenes with convolutional networks. IEEE Trans Pattern Anal Mach Intell 40:2793–2798. https://doi.org/10.1109/TPAMI.2017.2750680CrossRef

Zhang C, Sargent I, Pan X et al (2018) An object-based convolutional neural network (OCNN) for urban land use classification. Remote Sens Environ, https://doi.org/10.1016/j.rse.2018.06.034

Zhang L, Zhang L, Du B (2016) Deep learning for remote sensing data: a technical tutorial on the state of the art. IEEE Geosci Remote Sens Mag, https://doi.org/10.1109/MGRS.2016.2540798

Zhang Y (1999) Optimisation of building detection in satellite images by combining multispectral classification and texture filtering. ISPRS J Photogramm Remote Sens, https://doi.org/10.1016/s0924-2716(98)00027-6. Zhang, Z, Liu, Q, Wang Y, 2018, Road Extraction by Deep Residual U-Net. IEEE Geosci Remote Sens Lett, https://doi.org/10.1109/LGRS.2018.2802944

Zhao W, Du S (2016) Learning multiscale and deep representations for classifying remotely sensed imagery. ISPRS J Photogramm Remote Sens, https://doi.org/10.1016/j.isprsjprs.2016.01.004

Zhao W, Du S, Wang Q, Emery W J (2017) Contextually guided very-high-resolution imagery classification with semantic segments. ISPRS J Photogramm Remote Sens 132:48–60. https://doi.org/10.1016/j.isprsjprs.2017.08.011CrossRef

Zhong C, Xu Q, Yang F, Hu L (2015) Building change detection for high-resolution remotely sensed images based on a semantic dependency. In: International geoscience and remote sensing symposium (IGARSS)

Titel: Performance evaluation of shallow and deep CNN architectures on building segmentation from high-resolution images
verfasst von: Batuhan Sariturk
Dursun Zafer Seker
Ozan Ozturk
Bulent Bayram
Publikationsdatum: 04.07.2022
Verlag: Springer Berlin Heidelberg
Erschienen in: Earth Science Informatics / Ausgabe 3/2022
Print ISSN: 1865-0473
Elektronische ISSN: 1865-0481
DOI: https://doi.org/10.1007/s12145-022-00840-5

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Weitere Artikel der Ausgabe 3/2022

Study on failure characteristics of surrounding rock of benching excavation in loess overburden soil-rock contact zone

Proposing a Hybrid Genetic Algorithm based Parsimonious Random Forest Regression (H-GAPRFR) technique for solar irradiance forecasting with feature selection and parameter optimization

OpenAltimetry - rapid analysis and visualization of Spaceborne altimeter data

Automated machine learning pipeline for geochemical analysis

Evaluation of deep machine learning-based models of soil cumulative infiltration

Geometric and radiometric evaluation of remote sensing information in virtual platforms

Premium Partner