Skip to main content
Top

2018 | OriginalPaper | Chapter

Semantic Segmentation by Integrating Classifiers for Different Difficulty Levels

Authors : Daisuke Matsuzuki, Kazuhiro Hotta

Published in: Advances in Visual Computing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Semantic segmentation assigns class labels to all pixels in an input image. In general, when the number of classes is large or when the appearance of each class frequency changes, the segmentation accuracy decreases drastically. In this paper, we propose to divide a classification task into sub-tasks according to the difficulty of classes. Our proposed method consists of 2 parts; training a network for each sub-task and training an integration network. Difficulty level depends on the number of pixels. By training the network for each difficulty level, we obtain probability maps for each sub-task. Then we train the integration network from those maps. In experiments, we evaluate the segmentation accuracy on the CamVid dataset which contains 11 classes. We divide all classes to 3 classes; easy, normal, and difficult classes. We compared our method with conventional method using all classes. We confirmed that the proposed method outperformed the conventional method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015) Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
2.
go back to reference Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. In: Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 2481–2495 (2017)CrossRef Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. In: Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 2481–2495 (2017)CrossRef
4.
go back to reference Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015) Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
5.
go back to reference He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016) He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
6.
go back to reference Xie, S., Girshick, R., Dollár, P., Tu, Z.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5987–5995 (2017) Xie, S., Girshick, R., Dollár, P., Tu, Z.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5987–5995 (2017)
8.
go back to reference Jégou, S., Drozdzal, M., Vazquez, D.: The one hundred layers tiramisu: fully convolutional densenets for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 1175–1183 (2017) Jégou, S., Drozdzal, M., Vazquez, D.: The one hundred layers tiramisu: fully convolutional densenets for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 1175–1183 (2017)
10.
go back to reference Zhao, H., Shi, J., Qi, X., Wang, X.: Pyramid scene parsing network. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890 (2017) Zhao, H., Shi, J., Qi, X., Wang, X.: Pyramid scene parsing network. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2881–2890 (2017)
11.
go back to reference Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.-L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40, 834–848 (2018)CrossRef Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.-L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40, 834–848 (2018)CrossRef
12.
go back to reference Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587 (2017) Chen, L.C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:​1706.​05587 (2017)
13.
go back to reference Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, no. 2, p. 5 (2017) Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, no. 2, p. 5 (2017)
14.
go back to reference Hiramatsu, Y., Hotta, K., Matsuda, M., Terai, K., Imanishi, A.: Cell image segmentation by integrating multiple CNNs. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 2205–2211 (2018) Hiramatsu, Y., Hotta, K., Matsuda, M., Terai, K., Imanishi, A.: Cell image segmentation by integrating multiple CNNs. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 2205–2211 (2018)
15.
go back to reference Bengio, Y., Louradour, J., Collobert, R., Weston, J.: Curriculum learning. In: International Conference on Machine Learning, pp. 1–127. ACM (2009) Bengio, Y., Louradour, J., Collobert, R., Weston, J.: Curriculum learning. In: International Conference on Machine Learning, pp. 1–127. ACM (2009)
16.
go back to reference Li, X., Liu, Z., Luo, P., Loy, C.C., Tang, X.: Not all pixels are equal: difficulty-aware semantic segmentation via deep layer cascade. In: IEEE Conference on Computer Vision and Pattern Recognition, p. 2 (2017) Li, X., Liu, Z., Luo, P., Loy, C.C., Tang, X.: Not all pixels are equal: difficulty-aware semantic segmentation via deep layer cascade. In: IEEE Conference on Computer Vision and Pattern Recognition, p. 2 (2017)
17.
go back to reference Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inception-v4, Inception-ResNet and the impact of residual connections on learning. In: AAAI Conference on Artificial Intelligence, p. 12 (2017) Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inception-v4, Inception-ResNet and the impact of residual connections on learning. In: AAAI Conference on Artificial Intelligence, p. 12 (2017)
19.
go back to reference Hayashi, Y., et al.: Automated adherent cell elimination by a high-speed laser mediated by a light-responsive polymer. bioRxiv 280487 (2018) Hayashi, Y., et al.: Automated adherent cell elimination by a high-speed laser mediated by a light-responsive polymer. bioRxiv 280487 (2018)
Metadata
Title
Semantic Segmentation by Integrating Classifiers for Different Difficulty Levels
Authors
Daisuke Matsuzuki
Kazuhiro Hotta
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-03801-4_53

Premium Partner