Skip to main content
Top

2020 | OriginalPaper | Chapter

DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search

Authors : Wenguang Wang, Zhouhui Lian, Yingmin Tang, Jianguo Xiao

Published in: MultiMedia Modeling

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Glyphs in many writing systems (e.g., Chinese) are composed of a sequence of strokes written in a specific order. Glyph structure interpreting (i.e., stroke extraction) is one of the most important processing steps in many tasks including aesthetic quality evaluation, handwriting synthesis, character recognition, etc. However, existing methods that rely heavily on accurate shape matching are not only time-consuming but also unsatisfactory in stroke extraction performance. In this paper, we propose a novel method based on semantic segmentation and tabu search to interpret the structure of Chinese glyphs. Specifically, we first employ an improved Fully Convolutional Network (FCN), DeepStroke, to extract strokes, and then use the tabu search to obtain the order how these strokes are drawn. We also build the Chinese Character Stroke Segmentation Dataset (CCSSD) consisting of 67630 character images that can be equally classified into 10 different font styles. This dataset provides a benchmark for both stroke extraction and semantic segmentation tasks. Experimental results demonstrate the effectiveness and efficiency of our method and validate its superiority against the state of the art.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. Comput. Sci. 4, 357–361 (2014) Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. Comput. Sci. 4, 357–361 (2014)
2.
go back to reference Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. arXiv preprint arXiv:1606.00915 (2016) Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. arXiv preprint arXiv:​1606.​00915 (2016)
3.
go back to reference Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation (2018)CrossRef Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation (2018)CrossRef
4.
go back to reference Chen, X., Lian, Z., Tang, Y., Xiao, J.: A benchmark for stroke extraction of Chinese characters. Acta Scientiarum Naturalium Universitatis Pekinensis 2, 4 (2016) Chen, X., Lian, Z., Tang, Y., Xiao, J.: A benchmark for stroke extraction of Chinese characters. Acta Scientiarum Naturalium Universitatis Pekinensis 2, 4 (2016)
5.
go back to reference Lian, Z., Xiao, J.: Automatic shape morphing for Chinese characters. In: SIGGRAPH Asia 2012 Technical Briefs, p. 2. ACM (2012) Lian, Z., Xiao, J.: Automatic shape morphing for Chinese characters. In: SIGGRAPH Asia 2012 Technical Briefs, p. 2. ACM (2012)
6.
go back to reference Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015) Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
7.
go back to reference Mottaghi, R., et al.: The role of context for object detection and semantic segmentation in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 891–898 (2014) Mottaghi, R., et al.: The role of context for object detection and semantic segmentation in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 891–898 (2014)
8.
go back to reference Shelhamer, E., Long, J., Darrell, T.: Fully convolutional models for semantic segmentation. TPAMI (2016) Shelhamer, E., Long, J., Darrell, T.: Fully convolutional models for semantic segmentation. TPAMI (2016)
9.
go back to reference Sun, Y., Qian, H., Xu, Y.: A geometric approach to stroke extraction for the Chinese calligraphy robot. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 3207–3212. IEEE (2014) Sun, Y., Qian, H., Xu, Y.: A geometric approach to stroke extraction for the Chinese calligraphy robot. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), pp. 3207–3212. IEEE (2014)
10.
go back to reference Wang, C., Lian, Z., Tang, Y., Xiao, J.: Automatic correspondence finding for Chinese characters using graph matching. In: Seventh International Conference on Image and Graphics, pp. 545–550 (2013) Wang, C., Lian, Z., Tang, Y., Xiao, J.: Automatic correspondence finding for Chinese characters using graph matching. In: Seventh International Conference on Image and Graphics, pp. 545–550 (2013)
11.
go back to reference Wang, X., Liang, X., Sun, L., Liu, M.: Triangular mesh based stroke segmentation for Chinese calligraphy. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1155–1159. IEEE (2013) Wang, X., Liang, X., Sun, L., Liu, M.: Triangular mesh based stroke segmentation for Chinese calligraphy. In: 2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 1155–1159. IEEE (2013)
12.
go back to reference Yang, M., Yu, K., Zhang, C., Li, Z., Yang, K.: DenseASPP for semantic segmentation in street scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3684–3692 (2018) Yang, M., Yu, K., Zhang, C., Li, Z., Yang, K.: DenseASPP for semantic segmentation in street scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3684–3692 (2018)
13.
go back to reference Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N.: BiSeNet: bilateral segmentation network for real-time semantic segmentation. arXiv preprint arXiv:1808.00897 (2018) Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N.: BiSeNet: bilateral segmentation network for real-time semantic segmentation. arXiv preprint arXiv:​1808.​00897 (2018)
14.
go back to reference Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network (2016) Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network (2016)
Metadata
Title
DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search
Authors
Wenguang Wang
Zhouhui Lian
Yingmin Tang
Jianguo Xiao
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-37731-1_29