Skip to main content
Top

2021 | OriginalPaper | Chapter

Survey on Deep Learning-Based Kuzushiji Recognition

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Owing to the overwhelming accuracy of the deep learning method demonstrated at the 2012 image classification competition, deep learning has been successfully applied to a variety of other tasks. The high-precision detection and recognition of Kuzushiji, a Japanese cursive script used for transcribing historical documents, has been made possible through the use of deep learning. In recent years, competitions on Kuzushiji recognition have been held, and many researchers have proposed various recognition methods. This study examines recent research trends, current problems, and future prospects in Kuzushiji recognition using deep learning.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Hiragana is one of the three different character sets used in Japanese writing. Each Hiragana character represents a particular syllable. There are 46 basic characters.
 
2
Kanji is another one of the three character sets used in the Japanese language. Along with syllabaries, Kanji is made up of ideographic characters, and each letter symbolizes a specific meaning. Most Kanji characters were imported from China, although some were developed in Japan. Although there are approximately 50,000 Kanji characters, only approximately 2,500 are actually used in daily life in Japan.
 
3
A Jibo is a root Kanji character of Hiragana. For example, the character https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-68787-8_7/MediaObjects/510916_1_En_7_Figa_HTML.gif is derived from different Jibos including https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-68787-8_7/MediaObjects/510916_1_En_7_Figb_HTML.gif and https://static-content.springer.com/image/chp%3A10.1007%2F978-3-030-68787-8_7/MediaObjects/510916_1_En_7_Figc_HTML.gif
 
4
In the same way as Hiragana, Katakana is one of the three different character sets used in Japanese. Katakana is also a phonetic syllabary, in which each letter represents the sound of a syllable. There are also 46 basic characters.
 
9
A contest held annually by the Pattern Recognition and Media Understanding (PRMU) for the purpose of revitalizing research group activities.
 
11
One book was eliminated from the competition.
 
12
Furigana is made up of phonetic symbols occasionally written next to difficult or rare Kanji to show their pronunciation.
 
14
A set of technology standards intended to make it easier for researchers, students, and the public at large to view, manipulate, compare, and annotate digital images on the web. https://​iiif.​io/​.
 
Literature
1.
go back to reference Yamada, S., et al.: Historical character recognition (HCR) project report (2). IPSJ SIG Comput. Hum. (CH) 50(2), 9–16 (2001). (in Japanese) Yamada, S., et al.: Historical character recognition (HCR) project report (2). IPSJ SIG Comput. Hum. (CH) 50(2), 9–16 (2001). (in Japanese)
2.
go back to reference Yamada, S., Waizumi, Y., Kato, N., Shibayama, M.: Development of a digital dictionary of historical characters with search function of similar characters. IPSJ SIG Comput. Hum. (CH) 54(7), 43–50 (2002). (in Japanese) Yamada, S., Waizumi, Y., Kato, N., Shibayama, M.: Development of a digital dictionary of historical characters with search function of similar characters. IPSJ SIG Comput. Hum. (CH) 54(7), 43–50 (2002). (in Japanese)
3.
go back to reference Onuma, M., Zhu, B., Yamada, S., Shibayama, M., Nakagawa, M.: Development of cursive character pattern recognition for accessing a digital dictionary to support decoding of historical documents. IEICE Technical Report, vol. 106, no. 606, PRMU2006-270, pp. 91–96 (2007). (in Japanese) Onuma, M., Zhu, B., Yamada, S., Shibayama, M., Nakagawa, M.: Development of cursive character pattern recognition for accessing a digital dictionary to support decoding of historical documents. IEICE Technical Report, vol. 106, no. 606, PRMU2006-270, pp. 91–96 (2007). (in Japanese)
4.
go back to reference Horiuchi, T., Kato, S.: A study on Japanese historical character recognition using modular neural networks. Int. J. Innov. Comput. Inf. Control 7(8), 5003–5014 (2011) Horiuchi, T., Kato, S.: A study on Japanese historical character recognition using modular neural networks. Int. J. Innov. Comput. Inf. Control 7(8), 5003–5014 (2011)
5.
go back to reference Kato, S., Asano, R.: A study on historical character recognition by using SOM template. In: Proceedings of 30th Fuzzy System Symposium, pp. 242–245 (2014). (in Japanese) Kato, S., Asano, R.: A study on historical character recognition by using SOM template. In: Proceedings of 30th Fuzzy System Symposium, pp. 242–245 (2014). (in Japanese)
6.
go back to reference Hayasaka, T., Ohno, W., Kato, Y.: Recognition of obsolete script in pre-modern Japanese texts by Neocognitron. J. Toyota Coll. Technol. 48, 5–12 (2015). (in Japanese) Hayasaka, T., Ohno, W., Kato, Y.: Recognition of obsolete script in pre-modern Japanese texts by Neocognitron. J. Toyota Coll. Technol. 48, 5–12 (2015). (in Japanese)
7.
go back to reference Hayasaka, T., Ohno, W., Kato, Y., Yamamoto, K.: Recognition of hentaigana by deep learning and trial production of WWW application. In: Proceedings of IPSJ Symposium of Humanities and Computer Symposium, pp. 7–12 (2016). (in Japanese) Hayasaka, T., Ohno, W., Kato, Y., Yamamoto, K.: Recognition of hentaigana by deep learning and trial production of WWW application. In: Proceedings of IPSJ Symposium of Humanities and Computer Symposium, pp. 7–12 (2016). (in Japanese)
8.
go back to reference Ueda, K., Sonogashira, M., Iiyama, M.: Old Japanese character recognition by convolutional neural net and character aspect ratio. ELCAS J. 3, 88–90 (2018). (in Japanese) Ueda, K., Sonogashira, M., Iiyama, M.: Old Japanese character recognition by convolutional neural net and character aspect ratio. ELCAS J. 3, 88–90 (2018). (in Japanese)
9.
go back to reference Kojima, T., Ueki, K.: Utilization and analysis of deep learning for Kuzushiji translation. J. Japan Soc. Precis. Eng. 85(12), 1081–1086 (2019). (in Japanese)CrossRef Kojima, T., Ueki, K.: Utilization and analysis of deep learning for Kuzushiji translation. J. Japan Soc. Precis. Eng. 85(12), 1081–1086 (2019). (in Japanese)CrossRef
10.
go back to reference Yang, Z., Doman, K., Yamada, M., Mekada, Y.: Character recognition of modern Japanese official documents using CNN for imblanced learning data. In: Proceedings of 2019 International Workshop on Advanced Image Technology (IWAIT), no. 74 (2019) Yang, Z., Doman, K., Yamada, M., Mekada, Y.: Character recognition of modern Japanese official documents using CNN for imblanced learning data. In: Proceedings of 2019 International Workshop on Advanced Image Technology (IWAIT), no. 74 (2019)
11.
go back to reference Nagai, A.: Recognizing three character string of old Japanese cursive by convolutional neural networks. In: Proceedings of Information Processing Society of Japan (IPSJ) Symposium, pp. 213–218 (2017). (in Japanese) Nagai, A.: Recognizing three character string of old Japanese cursive by convolutional neural networks. In: Proceedings of Information Processing Society of Japan (IPSJ) Symposium, pp. 213–218 (2017). (in Japanese)
12.
go back to reference Ueki, K., Kojima, T., Mutou, R., Nezhad, R.S., Hagiwara, Y.: Recognition of Japanese connected cursive characters using multiple softmax outputs. In: Proceedings of International Conference on Multimedia Information Processing and Retrieval (2020) Ueki, K., Kojima, T., Mutou, R., Nezhad, R.S., Hagiwara, Y.: Recognition of Japanese connected cursive characters using multiple softmax outputs. In: Proceedings of International Conference on Multimedia Information Processing and Retrieval (2020)
13.
go back to reference Hu, X., Inamoto, M., Konagaya, A.: Recognition of Kuzushi-ji with deep learning method: a case study of Kiritsubo chapter in the tale of Genji. In: The 33rd Annual Conference of the Japanese Society for Artificial Intelligence (2019) Hu, X., Inamoto, M., Konagaya, A.: Recognition of Kuzushi-ji with deep learning method: a case study of Kiritsubo chapter in the tale of Genji. In: The 33rd Annual Conference of the Japanese Society for Artificial Intelligence (2019)
14.
go back to reference Kitamoto, A., Clanuwat, T., Miyazaki, T., Yayamoto, K.: Analysis of character data: potential and impact of Kuzushiji recognition by machine learning. J. Inst. Electron. Inf. Commun. Eng. 102(6), 563–568 (2019). (in Japanese) Kitamoto, A., Clanuwat, T., Miyazaki, T., Yayamoto, K.: Analysis of character data: potential and impact of Kuzushiji recognition by machine learning. J. Inst. Electron. Inf. Commun. Eng. 102(6), 563–568 (2019). (in Japanese)
15.
go back to reference Kitamoto, A., Clanuwat, T., Lamb, A., Bober-Irizar, M.: Progress and results of kaggle machine learning competition for Kuzushiji recognition. In: Proceedings of the Computers and the Humanities Symposium, pp. 223–230 (2019). (in Japanese) Kitamoto, A., Clanuwat, T., Lamb, A., Bober-Irizar, M.: Progress and results of kaggle machine learning competition for Kuzushiji recognition. In: Proceedings of the Computers and the Humanities Symposium, pp. 223–230 (2019). (in Japanese)
16.
go back to reference Kitamoto, A., Clanuwat, T., Bober-Irizar, M.: Kaggle Kuzushiji recognition competition - challenges of hosting a world-wide competition in the digital humanities. J. Japanese Soc. Artif. Intell. 35(3), 366–376 (2020). (in Japanese) Kitamoto, A., Clanuwat, T., Bober-Irizar, M.: Kaggle Kuzushiji recognition competition - challenges of hosting a world-wide competition in the digital humanities. J. Japanese Soc. Artif. Intell. 35(3), 366–376 (2020). (in Japanese)
17.
go back to reference Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. arXiv:1506.01497 (2015) Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. arXiv:​1506.​01497 (2015)
19.
go back to reference Clanuwat, T., Lamb, A., Kitamoto, A.: KuroNet: pre-modern Japanese Kuzushiji character recognition with deep learning. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR2019) (2019) Clanuwat, T., Lamb, A., Kitamoto, A.: KuroNet: pre-modern Japanese Kuzushiji character recognition with deep learning. In: Proceedings of International Conference on Document Analysis and Recognition (ICDAR2019) (2019)
20.
go back to reference Lamb, A., Clanuwat, T., Kitamoto, A.: KuroNet: regularized residual U-nets for end-to-end Kuzushiji character recognition. In: Proceedings of SN Computer Science (2020) Lamb, A., Clanuwat, T., Kitamoto, A.: KuroNet: regularized residual U-nets for end-to-end Kuzushiji character recognition. In: Proceedings of SN Computer Science (2020)
21.
go back to reference Le, A.D., Clanuwat, T., Kitamoto, A.: A human-inspired recognition system for premodern Japanese historical documents. arXiv:1905.05377 (2019) Le, A.D., Clanuwat, T., Kitamoto, A.: A human-inspired recognition system for premodern Japanese historical documents. arXiv:​1905.​05377 (2019)
22.
go back to reference Le, A.D.: Automated Transcription for Pre-Modern Japanese Kuzushiji Documents by Random Lines Erasure and Curriculum Learning. arXiv:2005.02669 (2020) Le, A.D.: Automated Transcription for Pre-Modern Japanese Kuzushiji Documents by Random Lines Erasure and Curriculum Learning. arXiv:​2005.​02669 (2020)
23.
go back to reference Le, A.D.: Detecting Kuzushiji characters from historical documents by two-dimensional context box proposal network. Future Data Secur. Eng. 731–738 Le, A.D.: Detecting Kuzushiji characters from historical documents by two-dimensional context box proposal network. Future Data Secur. Eng. 731–738
24.
go back to reference Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18(5–6), 602–610 (2005)CrossRef Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18(5–6), 602–610 (2005)CrossRef
25.
go back to reference Yamamoto, S., Tomejiro, O.: Labor saving for reprinting Japanese rare classical books. J. Inf. Process. Manag. 58(11), 819–827 (2016). (in Japanese) Yamamoto, S., Tomejiro, O.: Labor saving for reprinting Japanese rare classical books. J. Inf. Process. Manag. 58(11), 819–827 (2016). (in Japanese)
26.
go back to reference Ueki, K., Kojima, T.: Feasibility study of deep learning based Japanese cursive character recognition. IIEEJ Trans. Image Electron. Vis. Comput. 8(1), 10–16 (2020) Ueki, K., Kojima, T.: Feasibility study of deep learning based Japanese cursive character recognition. IIEEJ Trans. Image Electron. Vis. Comput. 8(1), 10–16 (2020)
27.
go back to reference Ueki, K., Kojima, T.: Japanese cursive character recognition for efficient transcription. In: Proceedings of the International Conference on Pattern Recognition Applications and Methods (2020) Ueki, K., Kojima, T.: Japanese cursive character recognition for efficient transcription. In: Proceedings of the International Conference on Pattern Recognition Applications and Methods (2020)
28.
go back to reference Takeuchi, M.: Development of embedded system for recognizing Kuzushiji by deep learning. In: Proceedings of the 33rd Annual Conference of the Japanese Society for Artificial Intelligence (2019). (in Japanese) Takeuchi, M.: Development of embedded system for recognizing Kuzushiji by deep learning. In: Proceedings of the 33rd Annual Conference of the Japanese Society for Artificial Intelligence (2019). (in Japanese)
29.
go back to reference Sando, K., Suzuki, T., Aiba, A.: A constraint solving web service for recognizing historical Japanese KANA texts. In: Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART) (2018) Sando, K., Suzuki, T., Aiba, A.: A constraint solving web service for recognizing historical Japanese KANA texts. In: Proceedings of the 10th International Conference on Agents and Artificial Intelligence (ICAART) (2018)
30.
go back to reference Yamazaki, A., Suzuki, T., Sando, K., Aiba, A.: A handwritten Japanese historical kana reprint support system. In: Proceedings of the 18th ACM Symposium on Document Engineering (2018) Yamazaki, A., Suzuki, T., Sando, K., Aiba, A.: A handwritten Japanese historical kana reprint support system. In: Proceedings of the 18th ACM Symposium on Document Engineering (2018)
31.
go back to reference Panichkriangkrai, C., Li, L., Kaneko, T., Akama, R., Hachimura, K.: Character segmentation and transcription system for historical Japanese books with a self-proliferating character image database. Int. J. Doc. Anal. Recogn. (IJDAR) 20, 241–257 (2017)CrossRef Panichkriangkrai, C., Li, L., Kaneko, T., Akama, R., Hachimura, K.: Character segmentation and transcription system for historical Japanese books with a self-proliferating character image database. Int. J. Doc. Anal. Recogn. (IJDAR) 20, 241–257 (2017)CrossRef
32.
go back to reference Clanuwat, T., Bober-Irizar, M., Kitamoto, A., Lamb, A., Yamamoto, K., Ha, D.: Deep learning for classical Japanese literature. arXiv:1812.01718 (2018) Clanuwat, T., Bober-Irizar, M., Kitamoto, A., Lamb, A., Yamamoto, K., Ha, D.: Deep learning for classical Japanese literature. arXiv:​1812.​01718 (2018)
33.
go back to reference Nguyen, H.T., Ly, N.T., Nguyen, K.C., Nguyen, C.T., Nakagawa, M.: Attempts to recognize anomalously deformed Kana in Japanese historical documents. In: Proceedings of the International Workshop on Historical Document Imaging and Processing (HIP 2017) (2017) Nguyen, H.T., Ly, N.T., Nguyen, K.C., Nguyen, C.T., Nakagawa, M.: Attempts to recognize anomalously deformed Kana in Japanese historical documents. In: Proceedings of the International Workshop on Historical Document Imaging and Processing (HIP 2017) (2017)
34.
go back to reference Graves, A., Fernandez, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the International Conference on Machine Learning, pp. 369–376 (2006) Graves, A., Fernandez, S., Gomez, F., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the International Conference on Machine Learning, pp. 369–376 (2006)
35.
go back to reference Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1724–1734 (2014) Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1724–1734 (2014)
37.
go back to reference Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual Explanations from Deep Networks via Gradientbased Localization. arXiv:1610.02391 (2016) Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-CAM: Visual Explanations from Deep Networks via Gradientbased Localization. arXiv:​1610.​02391 (2016)
38.
40.
go back to reference Wang, J., et al.: Deep high-resolution representation learning for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. (2020) Wang, J., et al.: Deep high-resolution representation learning for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. (2020)
41.
go back to reference Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformation for deep neural networks. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017) Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K.: Aggregated residual transformation for deep neural networks. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)
42.
go back to reference Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. (NIPS) 30, 3148–3156 (2017) Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. (NIPS) 30, 3148–3156 (2017)
46.
go back to reference Takahashi, R., Matsubara, T., Uehara, K.: Data Augmentation using Random Image Cropping and Patching for Deep CNNs. arXiv:1811.09030 (2018) Takahashi, R., Matsubara, T., Uehara, K.: Data Augmentation using Random Image Cropping and Patching for Deep CNNs. arXiv:​1811.​09030 (2018)
47.
go back to reference Chen, K., et al.: Hybrid task cascade for instance segmentation. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4974–4983 (2019) Chen, K., et al.: Hybrid task cascade for instance segmentation. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4974–4983 (2019)
48.
go back to reference Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting Text in Natural Image with Connectionist Text Proposal Network. arXiv:1609.03605 (2016) Tian, Z., Huang, W., He, T., He, P., Qiao, Y.: Detecting Text in Natural Image with Connectionist Text Proposal Network. arXiv:​1609.​03605 (2016)
49.
go back to reference Heafield, K.: KenLM: faster and smaller language model queries. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 187–197 (2011) Heafield, K.: KenLM: faster and smaller language model queries. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 187–197 (2011)
50.
go back to reference Kim, S., Hori, T., Watanabe, S.: Joint CTC-attention based end-to-end speech recognition using multi-task learning. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2017) Kim, S., Hori, T., Watanabe, S.: Joint CTC-attention based end-to-end speech recognition using multi-task learning. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2017)
52.
53.
Metadata
Title
Survey on Deep Learning-Based Kuzushiji Recognition
Authors
Kazuya Ueki
Tomoka Kojima
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-68787-8_7

Premium Partner