Skip to main content
Top

2021 | OriginalPaper | Chapter

Automatic Design of Deep Neural Networks Applied to Image Segmentation Problems

Authors : Ricardo Lima, Aurora Pozo, Alexander Mendiburu, Roberto Santana

Published in: Genetic Programming

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A U-Net is a convolutional neural network mainly used for image segmentation domains such as medical image analysis. As other deep neural networks, the U-Net architecture influences the efficiency and accuracy of the network. We propose the use of a grammar-based evolutionary algorithm for the automatic design of deep neural networks for image segmentation tasks. The approach used is called Dynamic Structured Grammatical Evolution (DSGE), which employs a grammar to define the building blocks that are used to compose the networks, as well as the rules that help build them. We perform a set of experiments on the BSDS500 and ISBI12 datasets, designing networks tuned to image segmentation and edge detection. Subsequently, by using image similarity metrics, the results of our best performing networks are compared with the original U-Net. The results show that the proposed approach is able to design a network that is less complex in the number of trainable parameters, while also achieving slightly better results than the U-Net with a more consistent training.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Al-Amri, S.S., Kalyankar, N., Khamitkar, S.: Image segmentation by using edge detection. Int. J. Comput. Sci. Eng. 2(3), 804–807 (2010) Al-Amri, S.S., Kalyankar, N., Khamitkar, S.: Image segmentation by using edge detection. Int. J. Comput. Sci. Eng. 2(3), 804–807 (2010)
3.
go back to reference Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 898–916 (2010)CrossRef Arbelaez, P., Maire, M., Fowlkes, C., Malik, J.: Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 898–916 (2010)CrossRef
4.
go back to reference Assunçao, F., Lourenço, N., Machado, P., Ribeiro, B.: Towards the evolution of multi-layered neural networks: a dynamic structured grammatical evolution approach. In: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 393–400. ACM (2017) Assunçao, F., Lourenço, N., Machado, P., Ribeiro, B.: Towards the evolution of multi-layered neural networks: a dynamic structured grammatical evolution approach. In: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 393–400. ACM (2017)
6.
go back to reference Bertasius, G., Shi, J., Torresani, L.: DeepEdge: a multi-scale bifurcated deep network for top-down contour detection. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 4380–4389 (2015) Bertasius, G., Shi, J., Torresani, L.: DeepEdge: a multi-scale bifurcated deep network for top-down contour detection. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 4380–4389 (2015)
7.
go back to reference Bertasius, G., Shi, J., Torresani, L.: High-for-low and low-for-high: efficient boundary detection from deep object features and its applications to high-level vision. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (CVPR 2015), pp. 504–512 (2015) Bertasius, G., Shi, J., Torresani, L.: High-for-low and low-for-high: efficient boundary detection from deep object features and its applications to high-level vision. In: Proceedings of the 2015 IEEE International Conference on Computer Vision (CVPR 2015), pp. 504–512 (2015)
9.
go back to reference Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 6, 679–698 (1986)CrossRef Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 6, 679–698 (1986)CrossRef
10.
go back to reference Dollár, P., Zitnick, C.L.: Structured forests for fast edge detection. In: Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV 2013), pp. 1841–1848 (2013) Dollár, P., Zitnick, C.L.: Structured forests for fast edge detection. In: Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV 2013), pp. 1841–1848 (2013)
12.
go back to reference Hallman, S., Fowlkes, C.C.: Oriented edge forests for boundary detection. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 1732–1740 (2015) Hallman, S., Fowlkes, C.C.: Oriented edge forests for boundary detection. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 1732–1740 (2015)
13.
go back to reference Kivinen, J., Williams, C., Heess, N.: Visual boundary prediction: a deep neural prediction network and quality dissection. In: Artificial Intelligence and Statistics, pp. 512–521 (2014) Kivinen, J., Williams, C., Heess, N.: Visual boundary prediction: a deep neural prediction network and quality dissection. In: Artificial Intelligence and Statistics, pp. 512–521 (2014)
14.
go back to reference Kumar, A., Murthy, O.N., Ghosal, P., Mukherjee, A., Nandi, D., et al.: A dense U-Net architecture for multiple sclerosis lesion segmentation. In: Proceedings of the 2019 IEEE Region 10 Conference (TENCON 2019), pp. 662–667. IEEE (2019) Kumar, A., Murthy, O.N., Ghosal, P., Mukherjee, A., Nandi, D., et al.: A dense U-Net architecture for multiple sclerosis lesion segmentation. In: Proceedings of the 2019 IEEE Region 10 Conference (TENCON 2019), pp. 662–667. IEEE (2019)
15.
go back to reference Lima, R.H.R., Pozo, A.T.R.: A study on auto-configuration of multi-objective particle swarm optimization algorithm. In: Proceedings of the 2017 IEEE Congress on Evolutionary Computation (CEC 2017), pp. 718–725. IEEE (2017) Lima, R.H.R., Pozo, A.T.R.: A study on auto-configuration of multi-objective particle swarm optimization algorithm. In: Proceedings of the 2017 IEEE Congress on Evolutionary Computation (CEC 2017), pp. 718–725. IEEE (2017)
16.
go back to reference Lima, R.H.R., Pozo, A.T.R.: Evolving convolutional neural networks through grammatical evolution. In: Proceedings of the 2019 Genetic and Evolutionary Computation Conference (GECCO 2019), pp. 179–180. ACM (2019) Lima, R.H.R., Pozo, A.T.R.: Evolving convolutional neural networks through grammatical evolution. In: Proceedings of the 2019 Genetic and Evolutionary Computation Conference (GECCO 2019), pp. 179–180. ACM (2019)
17.
go back to reference Lima, R.H.R., Pozo, A.T.R., Mendiburu, A., Santana, R.: A Symmetric grammar approach for designing segmentation models. In: Proceedings of the 2020 IEEE Congress on Evolutionary Computation (CEC 2020), pp. 1–8. IEEE (2020) Lima, R.H.R., Pozo, A.T.R., Mendiburu, A., Santana, R.: A Symmetric grammar approach for designing segmentation models. In: Proceedings of the 2020 IEEE Congress on Evolutionary Computation (CEC 2020), pp. 1–8. IEEE (2020)
18.
go back to reference Liu, Y., Cheng, M.M., Hu, X., Wang, K., Bai, X.: Richer convolutional features for edge detection. In: Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 3000–3009 (2017) Liu, Y., Cheng, M.M., Hu, X., Wang, K., Bai, X.: Richer convolutional features for edge detection. In: Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 3000–3009 (2017)
20.
go back to reference Lourenço, N., Pereira, F., Costa, E.: Evolving evolutionary algorithms. In: Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation (GECCO 2012), pp. 51–58. ACM (2012) Lourenço, N., Pereira, F., Costa, E.: Evolving evolutionary algorithms. In: Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation (GECCO 2012), pp. 51–58. ACM (2012)
22.
go back to reference Martin, D.R., Fowlkes, C.C., Malik, J.: Learning to detect natural image boundaries using local brightness, color, and texture cues. Proc. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 26(5), 530–549 (2004) Martin, D.R., Fowlkes, C.C., Malik, J.: Learning to detect natural image boundaries using local brightness, color, and texture cues. Proc. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 26(5), 530–549 (2004)
23.
go back to reference Milletari, F., Navab, N., Ahmadi, S.: V-Net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV 2016), pp. 565–571 (2016) Milletari, F., Navab, N., Ahmadi, S.: V-Net: fully convolutional neural networks for volumetric medical image segmentation. In: 2016 Fourth International Conference on 3D Vision (3DV 2016), pp. 565–571 (2016)
24.
go back to reference Mirunalini, P., Aravindan, C., Nambi, A.T., Poorvaja, S., Priya, V.P.: Segmentation of coronary arteries from CTA axial slices using deep learning techniques. In: Proceedings of the 2019 IEEE Region 10 Conference (TENCON 2019), pp. 2074–2080. IEEE (2019) Mirunalini, P., Aravindan, C., Nambi, A.T., Poorvaja, S., Priya, V.P.: Segmentation of coronary arteries from CTA axial slices using deep learning techniques. In: Proceedings of the 2019 IEEE Region 10 Conference (TENCON 2019), pp. 2074–2080. IEEE (2019)
25.
go back to reference Oktay, O., et al.: Anatomically constrained neural networks (ACNNs): application to cardiac image enhancement and segmentation. IEEE Trans. Med. Imaging 37(2), 384–395 (2018)CrossRef Oktay, O., et al.: Anatomically constrained neural networks (ACNNs): application to cardiac image enhancement and segmentation. IEEE Trans. Med. Imaging 37(2), 384–395 (2018)CrossRef
26.
go back to reference Prewitt, J.M.: Object enhancement and extraction. Picture Process. Psychopictorics 10(1), 15–19 (1970) Prewitt, J.M.: Object enhancement and extraction. Picture Process. Psychopictorics 10(1), 15–19 (1970)
27.
go back to reference Roberts, L.G.: Machine perception of three-dimensional solids. Ph.D. thesis, Massachusetts Institute of Technology (1963) Roberts, L.G.: Machine perception of three-dimensional solids. Ph.D. thesis, Massachusetts Institute of Technology (1963)
29.
go back to reference Ryan, C., Collins, J.J., Neill, M.O.: Grammatical evolution: evolving programs for an arbitrary language. In: Banzhaf, W., Poli, R., Schoenauer, M., Fogarty, T.C. (eds.) EuroGP 1998. LNCS, vol. 1391, pp. 83–96. Springer, Heidelberg (1998). https://doi.org/10.1007/BFb0055930CrossRef Ryan, C., Collins, J.J., Neill, M.O.: Grammatical evolution: evolving programs for an arbitrary language. In: Banzhaf, W., Poli, R., Schoenauer, M., Fogarty, T.C. (eds.) EuroGP 1998. LNCS, vol. 1391, pp. 83–96. Springer, Heidelberg (1998). https://​doi.​org/​10.​1007/​BFb0055930CrossRef
30.
go back to reference Sabarinathan, D., Beham, M.P., Roomi, S., et al.: Hyper vision net: kidney tumor segmentation using coordinate convolutional layer and attention unit. arXiv preprint arXiv:1908.03339 (2019) Sabarinathan, D., Beham, M.P., Roomi, S., et al.: Hyper vision net: kidney tumor segmentation using coordinate convolutional layer and attention unit. arXiv preprint arXiv:​1908.​03339 (2019)
31.
go back to reference Sagar, A., Soundrapandiyan, R.: Semantic segmentation with multi scale spatial attention for self driving cars. arXiv preprint arXiv:2007.12685 (2020) Sagar, A., Soundrapandiyan, R.: Semantic segmentation with multi scale spatial attention for self driving cars. arXiv preprint arXiv:​2007.​12685 (2020)
32.
go back to reference Shapiro, L.G., Stockman, G.C.: Computer Vision. Prentice-Hall, New Jersey (2001) Shapiro, L.G., Stockman, G.C.: Computer Vision. Prentice-Hall, New Jersey (2001)
33.
go back to reference Shen, W., Wang, X., Wang, Y., Bai, X., Zhang, Z.: DeepContour: a deep convolutional feature learned by positive-sharing loss for contour detection. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 3982–3991 (2015) Shen, W., Wang, X., Wang, Y., Bai, X., Zhang, Z.: DeepContour: a deep convolutional feature learned by positive-sharing loss for contour detection. In: Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2015), pp. 3982–3991 (2015)
34.
go back to reference Sobel, I.: Camera models and machine perception. Technical report, Computer Science Department, Technion (1972) Sobel, I.: Camera models and machine perception. Technical report, Computer Science Department, Technion (1972)
35.
go back to reference Sun, W., You, S., Walker, J., Li, K., Barnes, N.: Structural edge detection: a dataset and benchmark. In: Proceedings of the 2018 Digital Image Computing: Techniques and Applications (DICTA 2018), pp. 1–8. IEEE (2018) Sun, W., You, S., Walker, J., Li, K., Barnes, N.: Structural edge detection: a dataset and benchmark. In: Proceedings of the 2018 Digital Image Computing: Techniques and Applications (DICTA 2018), pp. 1–8. IEEE (2018)
36.
go back to reference Umbaugh, S.E.: Digital Image Processing and Analysis: Human and Computer Vision Applications with CVIPtools. CRC Press, Boca Raton (2010) Umbaugh, S.E.: Digital Image Processing and Analysis: Human and Computer Vision Applications with CVIPtools. CRC Press, Boca Raton (2010)
37.
go back to reference Wang, G., et al.: Interactive medical image segmentation using deep learning with image-specific fine tuning. IEEE Trans. Med. Imaging 37(7), 1562–1573 (2018)CrossRef Wang, G., et al.: Interactive medical image segmentation using deep learning with image-specific fine tuning. IEEE Trans. Med. Imaging 37(7), 1562–1573 (2018)CrossRef
Metadata
Title
Automatic Design of Deep Neural Networks Applied to Image Segmentation Problems
Authors
Ricardo Lima
Aurora Pozo
Alexander Mendiburu
Roberto Santana
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-72812-0_7

Premium Partner