Skip to main content
Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) 3/2020

18.06.2020 | Original Paper

Model-based Persian calligraphy synthesis via learning to transfer templates to personal styles

verfasst von: Amirhossein Ahmadian, Kazim Fouladi, Babak Nadjar Araabi

Erschienen in: International Journal on Document Analysis and Recognition (IJDAR) | Ausgabe 3/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Current software tools for computer generation of Persian calligraphy can be mostly described as conventional fonts and typesetting software, which basically neglect the ‘variations’ of real calligraphy performed by hand, in terms of personalization to different calligraphers’ styles, as well as their statistical characteristics. In this paper, we address the problem of natural-looking Persian calligraphy synthesis via a machine learning based approach, at the level of subwords. Given images of samples written by a calligrapher, we train a parametric model to imitate the style. The core idea is to make use of templates (fonts) as a source of background knowledge, and learn a probabilistic mapping from them to personal styles of calligraphers, which is posed as transformation of attributed graphs using neural networks with sliding windows. This can be understood as adding ‘naturalness’ to a Persian calligraphy font, in essence. We report both objective and subjective evaluations, including the model performance in writer (calligrapher) identification task and Visual Turing Test. The results of the latter suggest that humans are unable to distinguish the calligraphy synthesized by our approach from real calligraphy in many cases.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
4.
Zurück zum Zitat Aksan, E., Pece, F., Hilliges, O.: DeepWriting: Making digital ink editable via deep generative modeling. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, p. 205. ACM (2018) Aksan, E., Pece, F., Hilliges, O.: DeepWriting: Making digital ink editable via deep generative modeling. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, p. 205. ACM (2018)
5.
Zurück zum Zitat Berio, D., Akten, M., Leymarie, F.F., Grierson, M., Plamondon, R.: Calligraphic stylisation learning with a physiologically plausible model of movement and recurrent neural networks. In: Proceedings of the 4th International Conference on Movement Computing, p. 25. ACM (2017) Berio, D., Akten, M., Leymarie, F.F., Grierson, M., Plamondon, R.: Calligraphic stylisation learning with a physiologically plausible model of movement and recurrent neural networks. In: Proceedings of the 4th International Conference on Movement Computing, p. 25. ACM (2017)
6.
Zurück zum Zitat Bishop, C.M.: Mixture density networks, Technical Report, Aston University, Birmingham (1994) Bishop, C.M.: Mixture density networks, Technical Report, Aston University, Birmingham (1994)
7.
Zurück zum Zitat Bulacu, M., Schomaker, L.: Text-independent writer identification and verification using textural and allographic features. IEEE Trans. Pattern Anal. Mach. Intell. 29(4), 701–717 (2007)CrossRef Bulacu, M., Schomaker, L.: Text-independent writer identification and verification using textural and allographic features. IEEE Trans. Pattern Anal. Mach. Intell. 29(4), 701–717 (2007)CrossRef
8.
Zurück zum Zitat Clouse, D.S., Giles, C.L., Horne, B.G., Cottrell, G.W.: Time-delay neural networks: representation and induction of finite-state machines. IEEE Trans. Neural Netw. 8(5), 1065–1070 (1997)CrossRef Clouse, D.S., Giles, C.L., Horne, B.G., Cottrell, G.W.: Time-delay neural networks: representation and induction of finite-state machines. IEEE Trans. Neural Netw. 8(5), 1065–1070 (1997)CrossRef
9.
Zurück zum Zitat Dinges, L., Al-Hamadi, A., Elzobi, M.: An approach for arabic handwriting synthesis based on active shape models. In: Proceedings of the international conference on document analysis and recognition, ICDAR, pp. 1260–1264 (2013) Dinges, L., Al-Hamadi, A., Elzobi, M.: An approach for arabic handwriting synthesis based on active shape models. In: Proceedings of the international conference on document analysis and recognition, ICDAR, pp. 1260–1264 (2013)
10.
Zurück zum Zitat Dolinský, J., Takagi, H.: Analysis and modeling of naturalness in handwritten characters. IEEE Trans. Neural Netw. 20(10), 1540–1553 (2009)CrossRef Dolinský, J., Takagi, H.: Analysis and modeling of naturalness in handwritten characters. IEEE Trans. Neural Netw. 20(10), 1540–1553 (2009)CrossRef
11.
Zurück zum Zitat Elarian, Y., Abdel-Aal, R., Ahmad, I., Parvez, M.T., Zidouri, A.: Handwriting synthesis: classifications and techniques. Int. J. Document Anal. Recogn. 17(4), 455–469 (2014)CrossRef Elarian, Y., Abdel-Aal, R., Ahmad, I., Parvez, M.T., Zidouri, A.: Handwriting synthesis: classifications and techniques. Int. J. Document Anal. Recogn. 17(4), 455–469 (2014)CrossRef
12.
Zurück zum Zitat Fouladi, K., Araabi, B.N.: Toward automatic development of handwritten personal Farsi/Arabic OpenType\({\textregistered }\) fonts. Int. J. Document Anal. Recogn. 18(3), 249–262 (2015) Fouladi, K., Araabi, B.N.: Toward automatic development of handwritten personal Farsi/Arabic OpenType\({\textregistered }\) fonts. Int. J. Document Anal. Recogn. 18(3), 249–262 (2015)
13.
Zurück zum Zitat Fouladi, K., Araabi, B.N., Kabir, E.: A fast and accurate contour-based method for writer-dependent offline handwritten Farsi/Arabic subwords recognition. Int. J. Document Anal. Recogn. 17(2), 181–203 (2014)CrossRef Fouladi, K., Araabi, B.N., Kabir, E.: A fast and accurate contour-based method for writer-dependent offline handwritten Farsi/Arabic subwords recognition. Int. J. Document Anal. Recogn. 17(2), 181–203 (2014)CrossRef
14.
Zurück zum Zitat Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice-Hall, Upper Saddle (2008) Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Prentice-Hall, Upper Saddle (2008)
16.
Zurück zum Zitat Huszár, F.: How (not) to train your generative model: Scheduled sampling, likelihood, adversary? arXiv preprint arXiv:1511.05101 (2015) Huszár, F.: How (not) to train your generative model: Scheduled sampling, likelihood, adversary? arXiv preprint arXiv:​1511.​05101 (2015)
17.
Zurück zum Zitat Lake, B.M., Salakhutdinov, R., Tenenbaum, J.B.: Human-level concept learning through probabilistic program induction. Science 350(6266), 1332–1338 (2015)MathSciNetMATHCrossRef Lake, B.M., Salakhutdinov, R., Tenenbaum, J.B.: Human-level concept learning through probabilistic program induction. Science 350(6266), 1332–1338 (2015)MathSciNetMATHCrossRef
18.
Zurück zum Zitat Lyu, P., Bai, X., Yao, C., Zhu, Z., Huang, T., Liu, W.: Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis. In: Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on, vol. 1, pp. 1095–1100. IEEE (2017) Lyu, P., Bai, X., Yao, C., Zhu, Z., Huang, T., Liu, W.: Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis. In: Document Analysis and Recognition (ICDAR), 2017 14th IAPR International Conference on, vol. 1, pp. 1095–1100. IEEE (2017)
19.
Zurück zum Zitat Mash’Al, M., Sadri, J.: Persian calligraphy using genetic algorithm. In: 1st Iranian Conference on Pattern Recognition and Image Analysis, PRIA 2013 (2013) Mash’Al, M., Sadri, J.: Persian calligraphy using genetic algorithm. In: 1st Iranian Conference on Pattern Recognition and Image Analysis, PRIA 2013 (2013)
20.
Zurück zum Zitat Naz, S., Umar, A.I., Ahmad, R., Ahmed, S.B., Shirazi, S.H., Siddiqi, I., Razzak, M.I.: Offline cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks. Neurocomputing 177, 228–241 (2016)CrossRef Naz, S., Umar, A.I., Ahmad, R., Ahmed, S.B., Shirazi, S.H., Siddiqi, I., Razzak, M.I.: Offline cursive Urdu-Nastaliq script recognition using multidimensional recurrent neural networks. Neurocomputing 177, 228–241 (2016)CrossRef
21.
Zurück zum Zitat Plamondon, R., Guerfali, W.: The generation of handwriting with delta-lognormal synergies. Biol. Cybern. 78(2), 119–132 (1998)MATHCrossRef Plamondon, R., Guerfali, W.: The generation of handwriting with delta-lognormal synergies. Biol. Cybern. 78(2), 119–132 (1998)MATHCrossRef
22.
Zurück zum Zitat Ratanamahatana, C.A., Keogh, E.: Everything you know about dynamic time warping is wrong. In: Third workshop on mining temporal and sequential data. Citeseer (2004) Ratanamahatana, C.A., Keogh, E.: Everything you know about dynamic time warping is wrong. In: Third workshop on mining temporal and sequential data. Citeseer (2004)
23.
Zurück zum Zitat Sadikoglu, F., Soroush, B.: Intelligent system for Persian calligraphy learning. Procedia Comput. Sci. 102, 555–561 (2016)CrossRef Sadikoglu, F., Soroush, B.: Intelligent system for Persian calligraphy learning. Procedia Comput. Sci. 102, 555–561 (2016)CrossRef
24.
Zurück zum Zitat Srihari, S.N., Cha, S.H., Arora, H., Lee, S.: Individuality of handwriting: a validation study. In: Proceedings of the international conference on document analysis and recognition, ICDAR, pp. 106–109 (2001) Srihari, S.N., Cha, S.H., Arora, H., Lee, S.: Individuality of handwriting: a validation study. In: Proceedings of the international conference on document analysis and recognition, ICDAR, pp. 106–109 (2001)
25.
Zurück zum Zitat Theis, L., van den Oord, A., Bethge, M.: A note on the evaluation of generative models. In: International conference on learning representations (2016) Theis, L., van den Oord, A., Bethge, M.: A note on the evaluation of generative models. In: International conference on learning representations (2016)
26.
Zurück zum Zitat Toomarian, N., Barhen, J.: Fast temporal neural learning using teacher forcing. In: Neural Networks, 1991., IJCNN-91-Seattle international joint conference on, vol. 1, pp. 817–822. IEEE (1991) Toomarian, N., Barhen, J.: Fast temporal neural learning using teacher forcing. In: Neural Networks, 1991., IJCNN-91-Seattle international joint conference on, vol. 1, pp. 817–822. IEEE (1991)
27.
Zurück zum Zitat Van Ness, J.: On the dominance of non-parametric Bayes rule discriminant algorithms in high dimensions. Pattern Recogn. 12(6), 355–368 (1980)CrossRef Van Ness, J.: On the dominance of non-parametric Bayes rule discriminant algorithms in high dimensions. Pattern Recogn. 12(6), 355–368 (1980)CrossRef
28.
Zurück zum Zitat Yaghmaei, F.: Discovering writing habits and using them to synthesize Persian handwritten words [in Persian]. J. Mach. Vis. Image Process. (2016) Yaghmaei, F.: Discovering writing habits and using them to synthesize Persian handwritten words [in Persian]. J. Mach. Vis. Image Process. (2016)
29.
Zurück zum Zitat Zamani, P., Williams, B., Soltanian-Zadeh, H.: Extracting Motion Primitives from Persian Handwriting [in Persian]. In: 14th Iranian Conference on Biomedical Engineering (2007) Zamani, P., Williams, B., Soltanian-Zadeh, H.: Extracting Motion Primitives from Persian Handwriting [in Persian]. In: 14th Iranian Conference on Biomedical Engineering (2007)
30.
Zurück zum Zitat Zhang, Y., Zhang, Y., Cai, W.: Separating style and content for generalized style transfer. In: Proceedings of the IEEE conference on computer vision and pattern recognition, vol. 1 (2018) Zhang, Y., Zhang, Y., Cai, W.: Separating style and content for generalized style transfer. In: Proceedings of the IEEE conference on computer vision and pattern recognition, vol. 1 (2018)
Metadaten
Titel
Model-based Persian calligraphy synthesis via learning to transfer templates to personal styles
verfasst von
Amirhossein Ahmadian
Kazim Fouladi
Babak Nadjar Araabi
Publikationsdatum
18.06.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal on Document Analysis and Recognition (IJDAR) / Ausgabe 3/2020
Print ISSN: 1433-2833
Elektronische ISSN: 1433-2825
DOI
https://doi.org/10.1007/s10032-020-00353-1