2020 | OriginalPaper | Chapter

Tibetan Case Grammar Error Correction Method Based on Neural Networks

Authors: Cizhen Jiacuo, Secha Jia, Sangjie Duanzhu, Cairang Jia

Published in: Chinese Lexical Semantics

Publisher: Springer International Publishing

Abstract

Grammatical error correction (GEC) is an important research topic in natural language processing (NLP). This work targets genitive and ergative case errors in formal Tibetan text. We collect 1,793,563 consecutive sentence pairs as the training set and use two test sets: 5,000 sentence pairs with the same distribution as the training set and 1,159 sentence pairs from a different distribution. We first preprocess the Tibetan text with compositional rules, then build a neural network that combines BERT and a Bi-LSTM to estimate the probability that a given token is genitive or ergative. In experiments, the proposed model reaches 98.38% and 86.16% accuracy on the two test sets, respectively.
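
The abstract does not spell out the preprocessing rules or the model details, so the following is only a minimal sketch of how the task could be framed: each free-standing case particle is masked and paired with a genitive/ergative label, which a classifier (BERT with a Bi-LSTM in the paper) would then predict. All names here (`split_syllables`, `masked_instances`, `accuracy`, the `[MASK]` placeholder) are hypothetical, the particle lists come from standard Tibetan grammar rather than from the chapter, and forms fused to the preceding syllable (such as འི) are not handled.

```python
# Hypothetical sketch: genitive/ergative correction framed as classification
# at case-particle positions. Not the authors' implementation.

TSHEG = "\u0f0b"  # Tibetan syllable delimiter (tsheg)
SHAD = "\u0f0d"   # Tibetan clause/sentence terminator (shad)
MASK = "[MASK]"   # assumed placeholder token

# Free-standing case particles from standard Tibetan grammar (assumption:
# the chapter may use a different or larger inventory).
GENITIVE = {"གི", "ཀྱི", "གྱི", "ཡི"}
ERGATIVE = {"གིས", "ཀྱིས", "གྱིས", "ཡིས"}


def split_syllables(sentence):
    """Split on the tsheg and drop trailing shad marks and whitespace."""
    parts = (p.strip(" " + SHAD) for p in sentence.split(TSHEG))
    return [p for p in parts if p]


def label(syllable):
    """Return 'GEN', 'ERG', or 'OTHER' for a single syllable."""
    if syllable in GENITIVE:
        return "GEN"
    if syllable in ERGATIVE:
        return "ERG"
    return "OTHER"


def masked_instances(sentence):
    """Build one (masked_syllables, position, gold_label) per case particle."""
    syls = split_syllables(sentence)
    instances = []
    for i, syl in enumerate(syls):
        gold = label(syl)
        if gold != "OTHER":
            masked = syls[:i] + [MASK] + syls[i + 1:]
            instances.append((masked, i, gold))
    return instances


def accuracy(predicted, gold):
    """Fraction of particle positions where the predicted label is correct."""
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    if not gold:
        return 0.0
    return sum(p == g for p, g in zip(predicted, gold)) / len(gold)


if __name__ == "__main__":
    # "He wrote the history of Tibet": one ergative and one genitive particle.
    example = "ཁོང་གིས་བོད་ཀྱི་ལོ་རྒྱུས་བྲིས།"
    for masked, pos, gold in masked_instances(example):
        print(pos, gold, TSHEG.join(masked))
```

In this framing, a correction system would replace each masked particle with the class the model finds more probable, and the accuracy figures would correspond to the share of particle positions labelled correctly.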

Metadata
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-38189-9_43