Skip to main content
Top

2018 | OriginalPaper | Chapter

Building Wordnet for Russian Language from Ru.Wiktionary

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper presents a method of fully-automatic transformation of the free-content Russian dictionary ru.wiktionary to WordNet-like thesaurus. The primary concern of this study is to describe a procedure of relating words to their meanings throughout Wiktionary pages and establish synonym and hyponym-hypernym relation between specific senses of words. The produced database contains 104696 synsets and is publicly available in alpha version as a python package wiki-ru-wordnet.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Miller, G.A., Charles, W.G.: Contextual correlates of semantic similarity. Lang. Cogn. Processes 6(1), 1–28 (1991)CrossRef Miller, G.A., Charles, W.G.: Contextual correlates of semantic similarity. Lang. Cogn. Processes 6(1), 1–28 (1991)CrossRef
2.
go back to reference Azarova, I., Mitrofanova, O., Sinopalnikova, A., Yavorskaya, M., Oparin, I.: Russnet: building a lexical database for the Russian language. In: Proceedings Workshop on Wordnet Structures and Standardisation and How this Affect Wordnet Applications and Evaluation, Las Palmas, pp. 60–64 (2002) Azarova, I., Mitrofanova, O., Sinopalnikova, A., Yavorskaya, M., Oparin, I.: Russnet: building a lexical database for the Russian language. In: Proceedings Workshop on Wordnet Structures and Standardisation and How this Affect Wordnet Applications and Evaluation, Las Palmas, pp. 60–64 (2002)
3.
go back to reference Baroni, M., Lenci, A.: How we blessed distributional semantic evaluation. In: Proceedings of the GEMS 2011 Workshop on Geometrical Models of Natural Language Semantics GEMS 2011, pp. 1–10. Association for Computational Linguistics, Stroudsburg (2011) Baroni, M., Lenci, A.: How we blessed distributional semantic evaluation. In: Proceedings of the GEMS 2011 Workshop on Geometrical Models of Natural Language Semantics GEMS 2011, pp. 1–10. Association for Computational Linguistics, Stroudsburg (2011)
4.
go back to reference Braslavski, P., Ustalov, D., Mukhin, M.: A spinning wheel for yarn: user interface for a crowdsourced thesaurus. In: Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 101–104. Association for Computational Linguistics, Gothenburg, April 2014 Braslavski, P., Ustalov, D., Mukhin, M.: A spinning wheel for yarn: user interface for a crowdsourced thesaurus. In: Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 101–104. Association for Computational Linguistics, Gothenburg, April 2014
5.
go back to reference Dmitry, U.: Expanding hierarchical contexts for constructing a semantic word network. In: Computational Linguistics and Intellectual Technologies: Papers from the Annual conference “Dialogue”, vol. 1, pp. 369–381. RGGU (2017) Dmitry, U.: Expanding hierarchical contexts for constructing a semantic word network. In: Computational Linguistics and Intellectual Technologies: Papers from the Annual conference “Dialogue”, vol. 1, pp. 369–381. RGGU (2017)
6.
go back to reference Faralli, S., Panchenko, A., Biemann, C., Ponzetto, S.P.: Linked disambiguated distributional semantic networks. In: The Semantic Web - ISWC 2016–15th International Semantic Web Conference Proceedings, Part II, Kobe, 17–21 October 2016, pp. 56–64 (2016) Faralli, S., Panchenko, A., Biemann, C., Ponzetto, S.P.: Linked disambiguated distributional semantic networks. In: The Semantic Web - ISWC 2016–15th International Semantic Web Conference Proceedings, Part II, Kobe, 17–21 October 2016, pp. 56–64 (2016)
7.
go back to reference Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., Ruppin, E.: Placing search in context: the concept revisited. In: Proceedings of the 10th International Conference on World Wide Web. WWW 2001, pp. 406–414. ACM, New York (2001) Finkelstein, L., Gabrilovich, E., Matias, Y., Rivlin, E., Solan, Z., Wolfman, G., Ruppin, E.: Placing search in context: the concept revisited. In: Proceedings of the 10th International Conference on World Wide Web. WWW 2001, pp. 406–414. ACM, New York (2001)
8.
go back to reference Gelfenbeyn, I., Goncharuk, A., Lehelt, V., Lipatov, A., Shilo, V.: Automatic translation of wordnet’s semantic network into Russian. In: Proceedings of the International Dialog Conference, pp. 193–198 (2003) Gelfenbeyn, I., Goncharuk, A., Lehelt, V., Lipatov, A., Shilo, V.: Automatic translation of wordnet’s semantic network into Russian. In: Proceedings of the International Dialog Conference, pp. 193–198 (2003)
9.
go back to reference Gonçalo Oliveira, H., Gomes, P.: Eco and onto.pt: a flexible approach for creating a portuguese wordnet automatically. Lang. Res. Eval. 48(2), 373–393 (2014)CrossRef Gonçalo Oliveira, H., Gomes, P.: Eco and onto.pt: a flexible approach for creating a portuguese wordnet automatically. Lang. Res. Eval. 48(2), 373–393 (2014)CrossRef
10.
go back to reference Rubenstein, H., Goodenough, J.B.: Contextual correlates of synonymy. Commun. ACM 8(10), 627–633 (1965)CrossRef Rubenstein, H., Goodenough, J.B.: Contextual correlates of synonymy. Commun. ACM 8(10), 627–633 (1965)CrossRef
11.
go back to reference Krizhanovsky, A.A., Smirnov, A.V.: An approach to automated construction of a general-purpose lexical ontology based on wiktionary. J. Comput. Syst. Sci. Int. 52(2), 215–225 (2013)CrossRef Krizhanovsky, A.A., Smirnov, A.V.: An approach to automated construction of a general-purpose lexical ontology based on wiktionary. J. Comput. Syst. Sci. Int. 52(2), 215–225 (2013)CrossRef
12.
go back to reference Loukachevitch, N.V., Dobrov, B.V., Chetviorkin, I.I.: Ruthes-lite, a publicly available version of thesaurus of Russian language ruthes. In: Computational Linguistics and Intellectual Technologies: Papers from the Annual conference “Dialogue”, vol. 13, pp. 340–350. RGGU (2014) Loukachevitch, N.V., Dobrov, B.V., Chetviorkin, I.I.: Ruthes-lite, a publicly available version of thesaurus of Russian language ruthes. In: Computational Linguistics and Intellectual Technologies: Papers from the Annual conference “Dialogue”, vol. 13, pp. 340–350. RGGU (2014)
13.
go back to reference Loukachevitch, N.V., Lashevich, G., Gerasimova, A.A., Ivanov, V.V., Dobrov, B.V.: Creating Russian wordnet by conversion. In: Komp’juternaja Lingvistika i Intellektual’nye Tehnologii, vol. 15, pp. 405–415. Rossiiskii Gosudarstvennyi Gumanitarnyi Universitet (2016) Loukachevitch, N.V., Lashevich, G., Gerasimova, A.A., Ivanov, V.V., Dobrov, B.V.: Creating Russian wordnet by conversion. In: Komp’juternaja Lingvistika i Intellektual’nye Tehnologii, vol. 15, pp. 405–415. Rossiiskii Gosudarstvennyi Gumanitarnyi Universitet (2016)
14.
go back to reference Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)CrossRef Miller, G.A.: Wordnet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)CrossRef
15.
go back to reference Panchenko, A., Loukachevitch, N.V., Ustalov, D., Paperno, D., Meyer, C.M., Konstantinova, N.: RUSSE: the first workshop on Russian semantic similarity. In: Computational Linguistics and Intellectual Technologies: Papers from the Annual conference “Dialogue”, vol. 2, pp. 89–105. RGGU, Moscow (2015) Panchenko, A., Loukachevitch, N.V., Ustalov, D., Paperno, D., Meyer, C.M., Konstantinova, N.: RUSSE: the first workshop on Russian semantic similarity. In: Computational Linguistics and Intellectual Technologies: Papers from the Annual conference “Dialogue”, vol. 2, pp. 89–105. RGGU, Moscow (2015)
17.
go back to reference Perez, L.A., Gonçalo, H.O., Gomes, P.: Extracting lexical-semantic knowledge from the Portuguese wiktionary. In: 15th Portuguese Conference on Artificial Intelligence (EPIA 2011), Lisbon, Portugal (2011) Perez, L.A., Gonçalo, H.O., Gomes, P.: Extracting lexical-semantic knowledge from the Portuguese wiktionary. In: 15th Portuguese Conference on Artificial Intelligence (EPIA 2011), Lisbon, Portugal (2011)
18.
go back to reference Ustalov, D., Panchenko, A., Biemann, C.: Automatic induction of synsets from a graph of synonyms. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, July 30 - August 4, vol. 1: Long Papers, pp. 1579–1590 (2017) Ustalov, D., Panchenko, A., Biemann, C.: Automatic induction of synsets from a graph of synonyms. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, July 30 - August 4, vol. 1: Long Papers, pp. 1579–1590 (2017)
19.
go back to reference Zesch, T., Müller, C., Gurevych, I.: Extracting lexical semantic knowledge from wikipedia and wiktionary. In: Proceedings of the Conference on Language Resources and Evaluation (LREC), electronic proceedings. Ubiquitious Knowledge Processing, Universitt Darmstadt, Mai (2008) Zesch, T., Müller, C., Gurevych, I.: Extracting lexical semantic knowledge from wikipedia and wiktionary. In: Proceedings of the Conference on Language Resources and Evaluation (LREC), electronic proceedings. Ubiquitious Knowledge Processing, Universitt Darmstadt, Mai (2008)
20.
go back to reference Zesch, T., Müller, C., Gurevych, I.: Using wiktionary for computing semantic relatedness. In: Proceedings of AAAI, pp. 861–867 (2008) Zesch, T., Müller, C., Gurevych, I.: Using wiktionary for computing semantic relatedness. In: Proceedings of AAAI, pp. 861–867 (2008)
Metadata
Title
Building Wordnet for Russian Language from Ru.Wiktionary
Author
Yuliya Chernobay
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-71746-3_10

Premium Partner