Skip to main content
Top

2022 | OriginalPaper | Chapter

Transformers with YOLO Network for Damage Detection in Limestone Wall Images

Authors : Koubouratou Idjaton, Xavier Desquesnes, Sylvie Treuillet, Xavier Brunetaud

Published in: Image Analysis and Processing. ICIAP 2022 Workshops

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Cultural heritage buildings damage detection is of a great significance for planning restoration operations. However, the buildings analysis is generally performed by experts through on-site qualitative visual assessments. A highly time-consuming task, hardly possible at the scale of large historical buildings.
This paper proposes a new neural network architecture for automatic detection of spalling zones in limestone walls with color images. This architecture consists of the latest YOLO network, enhanced with layers of transformers encoder providing more comprehensive features. The performances of the proposed network improve significantly those of the YOLO core network on our dataset of over 1000 high resolution images from the Renaissance style Château de Chaumont in the Loire Valley (France).

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference ICOMOS-ISCS: Illustrated glossary on stone deterioration patterns = icomos-iscs: Glossaire illustré sur les formes d’altération de la pierre (2008) ICOMOS-ISCS: Illustrated glossary on stone deterioration patterns = icomos-iscs: Glossaire illustré sur les formes d’altération de la pierre (2008)
2.
go back to reference Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020) Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:​2010.​11929 (2020)
3.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)CrossRef He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)CrossRef
4.
go back to reference Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017 Hosang, J., Benenson, R., Schiele, B.: Learning non-maximum suppression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), July 2017
7.
go back to reference Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 1097–1105 (2012)
8.
go back to reference Kwon, D., Yu, J.: Automatic damage detection of stone cultural property based on deep learning algorithm. Int. Arch. Photogram. Remote Sens. Spatial Inf. Sci. 42(2/W15) (2019) Kwon, D., Yu, J.: Automatic damage detection of stone cultural property based on deep learning algorithm. Int. Arch. Photogram. Remote Sens. Spatial Inf. Sci. 42(2/W15) (2019)
10.
go back to reference Manferdini, A.M., Baroncini, V., Corsi, C.: An integrated and automated segmentation approach to deteriorated regions recognition on 3d reality-based models of cultural heritage artifacts. J. Cult. Herit. 13(4), 371–378 (2012)CrossRef Manferdini, A.M., Baroncini, V., Corsi, C.: An integrated and automated segmentation approach to deteriorated regions recognition on 3d reality-based models of cultural heritage artifacts. J. Cult. Herit. 13(4), 371–378 (2012)CrossRef
11.
go back to reference Pierrot, D.M.: Producing orthomosaic with a free open source software (micmac), application to the archeological survey of meremptah’s tomb. In: Workshop Digital Specimen, Berlin, pp. 8–12, September 2014 Pierrot, D.M.: Producing orthomosaic with a free open source software (micmac), application to the archeological survey of meremptah’s tomb. In: Workshop Digital Specimen, Berlin, pp. 8–12, September 2014
12.
go back to reference Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., Savarese, S.: Generalized intersection over union: A metric and a loss for bounding box regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019) Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., Savarese, S.: Generalized intersection over union: A metric and a loss for bounding box regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019)
13.
go back to reference Smith, L.N.: A disciplined approach to neural network hyper-parameters: Part 1-learning rate, batch size, momentum, and weight decay. arXiv preprint arXiv:1803.09820 (2018) Smith, L.N.: A disciplined approach to neural network hyper-parameters: Part 1-learning rate, batch size, momentum, and weight decay. arXiv preprint arXiv:​1803.​09820 (2018)
14.
go back to reference Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision And Pattern Recognition, pp. 1–9 (2015) Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision And Pattern Recognition, pp. 1–9 (2015)
15.
go back to reference Valero, E., Forster, A., Bosché, F., Hyslop, E., Wilson, L., Turmel, A.: Automated defect detection and classification in ashlar masonry walls using machine learning. Autom. Construct. 106, 102846 (2019) Valero, E., Forster, A., Bosché, F., Hyslop, E., Wilson, L., Turmel, A.: Automated defect detection and classification in ashlar masonry walls using machine learning. Autom. Construct. 106, 102846 (2019)
16.
go back to reference Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017) Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
17.
go back to reference Wang, C.Y., Liao, H.Y.M., Wu, Y.H., Chen, P.Y., Hsieh, J.W., Yeh, I.H.: CspNet: a new backbone that can enhance learning capability of CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 390–391 (2020) Wang, C.Y., Liao, H.Y.M., Wu, Y.H., Chen, P.Y., Hsieh, J.W., Yeh, I.H.: CspNet: a new backbone that can enhance learning capability of CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 390–391 (2020)
18.
go back to reference Wang, N., Zhao, Q., Li, S., Zhao, X., Zhao, P.: Damage classification for masonry historic structures using convolutional neural networks based on still images. Comput. Aid. Civil Infrastruct. Eng. 33(12), 1073–1089, (2018) Wang, N., Zhao, Q., Li, S., Zhao, X., Zhao, P.: Damage classification for masonry historic structures using convolutional neural networks based on still images. Comput. Aid. Civil Infrastruct. Eng. 33(12), 1073–1089, (2018)
19.
go back to reference Wang, N., Zhao, X., Zhao, P., Zhang, Y., Zou, Z., Ou, J.: Automatic damage detection of historic masonry buildings based on mobile deep learning. Autom. Construct. 103, 53–66 (2019) Wang, N., Zhao, X., Zhao, P., Zhang, Y., Zou, Z., Ou, J.: Automatic damage detection of historic masonry buildings based on mobile deep learning. Autom. Construct. 103, 53–66 (2019)
20.
go back to reference Zhang, Z., Lu, X., Cao, G., Yang, Y., Jiao, L., Liu, F.: ViT-YOLO: transformer-based yolo for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2799–2808 (2021) Zhang, Z., Lu, X., Cao, G., Yang, Y., Jiao, L., Liu, F.: ViT-YOLO: transformer-based yolo for object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2799–2808 (2021)
21.
go back to reference Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., Ren, D.: Distance-IoU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12993–13000 (2020) Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., Ren, D.: Distance-IoU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12993–13000 (2020)
Metadata
Title
Transformers with YOLO Network for Damage Detection in Limestone Wall Images
Authors
Koubouratou Idjaton
Xavier Desquesnes
Sylvie Treuillet
Xavier Brunetaud
Copyright Year
2022
DOI
https://doi.org/10.1007/978-3-031-13324-4_26

Premium Partner