Skip to main content
Top

2020 | OriginalPaper | Chapter

Towards the Uzbek Language Endings as a Language Resource

Authors : Sanatbek Matlatipov, Ualsher Tukeyev, Mersaid Aripov

Published in: Advances in Computational Collective Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The Uzbek language belongs to low-resource languages. It is very important to increase the number of language resources such as dictionaries, corpora (monolingual and bilingual) for the Uzbek language. Dictionaries may be different kinds: monolingual, orthographical, bilingual, grammar special dictionaries: stems dictionaries, affixes dictionaries, etc. For different NLP tasks of agglutinative languages, such as morphological analysis, information retrieval, machine translation (segmentation preprocessing) in some cases needs a dictionary of words’ endings. In this paper, we proposed the first electronic dictionary of Uzbek words’ endings in variants for morphological segmentation preprocessing useful for neural machine translation. The resource analysed by the initial version of the Lexicon free stemming tool [3] created by authors. For creation of Uzbek words’ endings’ electronic dictionary, it was used a combinatorial approach inferring apply for part of speech of the Uzbek language: nouns, adjectives, numerals, verbs, participles, moods, voices.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Tukeyev, U., Sundetova, A., Abduali, B., Akhmadiyeva, Z., Zhanbussunov, N.: Inferring of the morphological chunk transfer rules on the base of complete set of Kazakh endings. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 563–574. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45246-3_54CrossRef Tukeyev, U., Sundetova, A., Abduali, B., Akhmadiyeva, Z., Zhanbussunov, N.: Inferring of the morphological chunk transfer rules on the base of complete set of Kazakh endings. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 563–574. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-45246-3_​54CrossRef
3.
go back to reference Tukeyev, U., Turganbayeva, A., Abduali, B., Rakhimova, D., Amirova, D., Karibayeva, A.: Lexicon-free stemming for Kazakh language information retrieval. In: IEEE 12th International Conference on Application of Information and Communication Technologies, AICT 2018, Kazakhstan, Almaty, 17–19 October 2018, pp. 95–98 (2018) Tukeyev, U., Turganbayeva, A., Abduali, B., Rakhimova, D., Amirova, D., Karibayeva, A.: Lexicon-free stemming for Kazakh language information retrieval. In: IEEE 12th International Conference on Application of Information and Communication Technologies, AICT 2018, Kazakhstan, Almaty, 17–19 October 2018, pp. 95–98 (2018)
4.
go back to reference Tukeyev, U.: Automaton models of the morphology analysis and the completeness of the endings of the Kazakh language. In: Proceedings of the International Conference on “Turkic Languages Processing” TURKLANG-2015, Kazan, Tatarstan, Russia, 17–19 September 2015, pp. 91–100 (2015) Tukeyev, U.: Automaton models of the morphology analysis and the completeness of the endings of the Kazakh language. In: Proceedings of the International Conference on “Turkic Languages Processing” TURKLANG-2015, Kazan, Tatarstan, Russia, 17–19 September 2015, pp. 91–100 (2015)
5.
go back to reference Sennrich, R., Haddow, B., Birch, A.: Neural machine translation of rare words with subword units. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1715–1725 (2016) Sennrich, R., Haddow, B., Birch, A.: Neural machine translation of rare words with subword units. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1715–1725 (2016)
6.
go back to reference Creutz, M., Lagus, K.: Unsupervised discovery of morphemes. In: Proceedings of the ACL-2002 Workshop on Morphological and Phonological Learning, vol. 6, pp. 21–30 (2002) Creutz, M., Lagus, K.: Unsupervised discovery of morphemes. In: Proceedings of the ACL-2002 Workshop on Morphological and Phonological Learning, vol. 6, pp. 21–30 (2002)
7.
go back to reference Koskenniemi, K.: Two-level morphology: a general computational model for word-form recognition and production. Ph.D. thesis, University of Helsinki (1983) Koskenniemi, K.: Two-level morphology: a general computational model for word-form recognition and production. Ph.D. thesis, University of Helsinki (1983)
8.
go back to reference Oflazer, K.: Two-level description of Turkish morphology. Literary Linguist. Comput. 9(2), 137–148 (1994)CrossRef Oflazer, K.: Two-level description of Turkish morphology. Literary Linguist. Comput. 9(2), 137–148 (1994)CrossRef
9.
go back to reference Kessikbayeva, G., Cicekli, I.: Rule based morphological analyzer of Kazakh language. In: Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, Baltimore, Maryland, USA, pp. 46–54 (2014) Kessikbayeva, G., Cicekli, I.: Rule based morphological analyzer of Kazakh language. In: Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, Baltimore, Maryland, USA, pp. 46–54 (2014)
10.
go back to reference Madatov, Kh.: A prolog format of uzbek WordNet’s entries. In: Human Language Technology as a Challenge for Computer Science and Linguistics, pp. 316–320 (2019) Madatov, Kh.: A prolog format of uzbek WordNet’s entries. In: Human Language Technology as a Challenge for Computer Science and Linguistics, pp. 316–320 (2019)
Metadata
Title
Towards the Uzbek Language Endings as a Language Resource
Authors
Sanatbek Matlatipov
Ualsher Tukeyev
Mersaid Aripov
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-63119-2_59

Premium Partner