Skip to main content

2019 | OriginalPaper | Buchkapitel

14. Deep Learning Architect: Classification for Architectural Design Through the Eye of Artificial Intelligence

verfasst von : Yuji Yoshimura, Bill Cai, Zhoutong Wang, Carlo Ratti

Erschienen in: Computational Urban Planning and Management for Smart Cities

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper applies state-of-the-art techniques in deep learning and computer vision to measure visual similarities between architectural designs by different architects. Using a dataset consisting of web-scraped images and an original collection of images of architectural works, we first train a deep convolutional neural network (DCNN) model capable of achieving 73% accuracy in classifying works belonging to 34 different architects. By examining the weights in the trained DCNN model, we are able to quantitatively measure the visual similarities between architects that are implicitly learned by our model. Using this measure, we cluster architects that are identified to be similar and compare our findings to conventional classification made by architectural historians and theorists. Our clustering of architectural designs remarkably corroborates conventional views in architectural history, and the learned architectural features also cohere with the traditional understanding of architectural designs.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abdel-Hamid O, Mohamed AR, Jiang H, Penn G (2012) Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. In: 2012 IEEE international conference on acoustics, speech and signal processing (ICASSP), Kyoto international conference center, Kyoto, 25–30 March 2012 Abdel-Hamid O, Mohamed AR, Jiang H, Penn G (2012) Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. In: 2012 IEEE international conference on acoustics, speech and signal processing (ICASSP), Kyoto international conference center, Kyoto, 25–30 March 2012
Zurück zum Zitat Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. ArXiv Preprint ArXiv:1409.0473 Bahdanau D, Cho K, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. ArXiv Preprint ArXiv:​1409.​0473
Zurück zum Zitat Cai BY, Li X, Seiferling I, Ratti C (2018) Treepedia 2.0: applying deep learning for large-scale quantification of urban tree cover. In: 2018 IEEE international congress on big data (BigData Congress), Seattle, 25–30 June 2018 Cai BY, Li X, Seiferling I, Ratti C (2018) Treepedia 2.0: applying deep learning for large-scale quantification of urban tree cover. In: 2018 IEEE international congress on big data (BigData Congress), Seattle, 25–30 June 2018
Zurück zum Zitat Doersch C, Singh S, Gupta A, Sivic J Efros AA (2012) What makes paris look like paris? In: ACM transactions on graphics (SIGGRAPH 2012), vol 31(4). ACM Press, New YorkCrossRef Doersch C, Singh S, Gupta A, Sivic J Efros AA (2012) What makes paris look like paris? In: ACM transactions on graphics (SIGGRAPH 2012), vol 31(4). ACM Press, New YorkCrossRef
Zurück zum Zitat Forty A (2000) Words and buildings: a vocabulary of modern architecture. Thames & Hudson, New York Forty A (2000) Words and buildings: a vocabulary of modern architecture. Thames & Hudson, New York
Zurück zum Zitat Frampton K (1992) Modern architecture: A critical history (3rd edn, Revised and Enlarged). London: Thames and Hudson Frampton K (1992) Modern architecture: A critical history (3rd edn, Revised and Enlarged). London: Thames and Hudson
Zurück zum Zitat Gatys LA, Ecker AS, Bethge M (2016) Image style transfer using convolutional neural networks. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas, 26 June to 1 July 2016 Gatys LA, Ecker AS, Bethge M (2016) Image style transfer using convolutional neural networks. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas, 26 June to 1 July 2016
Zurück zum Zitat Girshick R, Donahue J, Darrell T, Berkeley UC Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE conference on computer vision and pattern recognition, Columbus, 24–27 June 2014 Girshick R, Donahue J, Darrell T, Berkeley UC Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: 2014 IEEE conference on computer vision and pattern recognition, Columbus, 24–27 June 2014
Zurück zum Zitat He K, Zhang X, Ren S, Jian S (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas, 26 June to 1 July 2016 He K, Zhang X, Ren S, Jian S (2016) Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR), Las Vegas, 26 June to 1 July 2016
Zurück zum Zitat Hitchcock HR, Johnson P (1932) The international style: architecture since 1922. W.W. Norton & Company, New York Hitchcock HR, Johnson P (1932) The international style: architecture since 1922. W.W. Norton & Company, New York
Zurück zum Zitat Johnson P, Wigley M (1988) Deconstructivist architecture: the Museum of Modern Art. Museum of Modern Art, New York Johnson P, Wigley M (1988) Deconstructivist architecture: the Museum of Modern Art. Museum of Modern Art, New York
Zurück zum Zitat Jolliffe IT (2002) Principal component analysis, 2nd edn. Springer-Verlag, New York Jolliffe IT (2002) Principal component analysis, 2nd edn. Springer-Verlag, New York
Zurück zum Zitat Kant I (1952) The critique of judgment (1790). (trans: Meredith JC). Clarendon Press, Oxford Kant I (1952) The critique of judgment (1790). (trans: Meredith JC). Clarendon Press, Oxford
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Advances In Neural Information Processing Systems, pp. 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Advances In Neural Information Processing Systems, pp. 1097–1105
Zurück zum Zitat Kron J, Slesin S (1984) High-tech: the industrial style and source book for the home. Clarkson Potter, New York Kron J, Slesin S (1984) High-tech: the industrial style and source book for the home. Clarkson Potter, New York
Zurück zum Zitat Lee S, Maisonneuve N, Crandall D, Efros AA Sivic J (2015) Linking past to present: discovering style in two centuries of architecture. In: 2015 IEEE international conference on computational photography (ICCP), Houston, 24–26 April 2015 Lee S, Maisonneuve N, Crandall D, Efros AA Sivic J (2015) Linking past to present: discovering style in two centuries of architecture. In: 2015 IEEE international conference on computational photography (ICCP), Houston, 24–26 April 2015
Zurück zum Zitat Li J, Yao L, Hendriks E, Wang JZ (2012) Rhythmic brushstrokes distinguish van gogh from his contemporaries: findings via automated brushstroke extraction. IEEE Trans Pattern Anal Mach Intell 34(6):1159–1176CrossRef Li J, Yao L, Hendriks E, Wang JZ (2012) Rhythmic brushstrokes distinguish van gogh from his contemporaries: findings via automated brushstroke extraction. IEEE Trans Pattern Anal Mach Intell 34(6):1159–1176CrossRef
Zurück zum Zitat Llamas J, Lerones PM, Medina R, Zalama E, Gómez-García-Bermejo J (2017) Classification of architectural heritage images using deep learning techniques. Appl Sci 7(10):992CrossRef Llamas J, Lerones PM, Medina R, Zalama E, Gómez-García-Bermejo J (2017) Classification of architectural heritage images using deep learning techniques. Appl Sci 7(10):992CrossRef
Zurück zum Zitat Obeso AM, Vázquez GMS, Acosta AAR, Benois-Pineau J (2017) Connoisseur: classification of styles of Mexican architectural heritage with deep learning and visual attention prediction. In: 15th international workshop on content-based multimedia indexing (CBMI), Florence, 19–21 June 2017 Obeso AM, Vázquez GMS, Acosta AAR, Benois-Pineau J (2017) Connoisseur: classification of styles of Mexican architectural heritage with deep learning and visual attention prediction. In: 15th international workshop on content-based multimedia indexing (CBMI), Florence, 19–21 June 2017
Zurück zum Zitat Onians J (1988) Bearers of meaning: the classical orders in antiquity, the middle ages, and the renaissance. Princeton University Press, Princeton Onians J (1988) Bearers of meaning: the classical orders in antiquity, the middle ages, and the renaissance. Princeton University Press, Princeton
Zurück zum Zitat Saleh B, Abe K, Arora RS, Elgammal A (2016) Toward automated discovery of artistic influence. Multimed Tools Appl 75(7):3565–3591CrossRef Saleh B, Abe K, Arora RS, Elgammal A (2016) Toward automated discovery of artistic influence. Multimed Tools Appl 75(7):3565–3591CrossRef
Zurück zum Zitat Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2017) Grad-CAM: visual explanations from deep networks via gradient-based localization. In: 2017 IEEE international conference on computer vision (ICCV), Venice, 22–29 October 2017 Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2017) Grad-CAM: visual explanations from deep networks via gradient-based localization. In: 2017 IEEE international conference on computer vision (ICCV), Venice, 22–29 October 2017
Zurück zum Zitat Shalunts G, Haxhimusa Y, Sablatnig R (2011) Architectural style classification of building facade windows. In: Bebis G, Boyle R, Parvin B, Koracin D, Fowlkes C, Wang S, Choi M-H, Mantler S, Schulze J, Acevedo D, Mueller K, Michael P (eds) Advances in visual computing, ISVC 2011. Lecture Notes in Computer Science, vol 6939. Springer, Berlin, Heidelberg, New York, pp 280–289CrossRef Shalunts G, Haxhimusa Y, Sablatnig R (2011) Architectural style classification of building facade windows. In: Bebis G, Boyle R, Parvin B, Koracin D, Fowlkes C, Wang S, Choi M-H, Mantler S, Schulze J, Acevedo D, Mueller K, Michael P (eds) Advances in visual computing, ISVC 2011. Lecture Notes in Computer Science, vol 6939. Springer, Berlin, Heidelberg, New York, pp 280–289CrossRef
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. ArXiv preprint ArXiv:1409.1556 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. ArXiv preprint ArXiv:​1409.​1556
Zurück zum Zitat Stern RAM (1993) Frank O. Gehry: architecture with a serious smile. In: Futagawa Y (ed) Frank O. Gehry. GA Architect 10. A.D.A. Edita, Tokyo, pp 8–9 Stern RAM (1993) Frank O. Gehry: architecture with a serious smile. In: Futagawa Y (ed) Frank O. Gehry. GA Architect 10. A.D.A. Edita, Tokyo, pp 8–9
Zurück zum Zitat Wolfflin H (1950) Principles of art history: the problem of the development of style in later art (1915). (7th Edition Trans. Hottinger MD) Dover, New York Wolfflin H (1950) Principles of art history: the problem of the development of style in later art (1915). (7th Edition Trans. Hottinger MD) Dover, New York
Zurück zum Zitat Zhang F, Zhou B, Liu L, Liu Y, Fung HH, Lin H, Ratti C (2018) Measuring human perceptions of a large-scale urban region using machine learning. Landsc Urban Plan 180:148–160CrossRef Zhang F, Zhou B, Liu L, Liu Y, Fung HH, Lin H, Ratti C (2018) Measuring human perceptions of a large-scale urban region using machine learning. Landsc Urban Plan 180:148–160CrossRef
Zurück zum Zitat Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ (eds) Advances in neural information processing systems 27 (NIPS 2014), Curran Associates, Inc, pp 487–495 Zhou B, Lapedriza A, Xiao J, Torralba A, Oliva A (2014) Learning deep features for scene recognition using places database. In: Ghahramani Z, Welling M, Cortes C, Lawrence ND, Weinberger KQ (eds) Advances in neural information processing systems 27 (NIPS 2014), Curran Associates, Inc, pp 487–495
Zurück zum Zitat Zoph B, Shlens J (2018) Learning transferable architectures for scalable image recognition. ArXiv Preprint ArXiv:1707.07012 Zoph B, Shlens J (2018) Learning transferable architectures for scalable image recognition. ArXiv Preprint ArXiv:​1707.​07012
Metadaten
Titel
Deep Learning Architect: Classification for Architectural Design Through the Eye of Artificial Intelligence
verfasst von
Yuji Yoshimura
Bill Cai
Zhoutong Wang
Carlo Ratti
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-19424-6_14

Premium Partner