Skip to main content

2020 | OriginalPaper | Buchkapitel

Wordnet – a Basic Resource for Natural Language Processing: The Case of plWordNet

verfasst von : Agnieszka Dziob, Tomasz Naskręt

Erschienen in: Advances in Computational Collective Intelligence

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper presents a wide scope of wordnet applications on the example of applications of plWordNet – a wordnet of Polish. Wordnets are large lexical-semantic databases functioning as primary resources for language technology. They are machine-readable dictionaries. Thus, they are indispensible for tasks such as basic flow of text processing, text mining, word sense disambiguation, information extraction and retrieval. On a larger scale, wordnets are used in research, education and business. In this paper a few examples of specific plWordNet applications are described in detail.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
As early as 2004, as Morato et al. mentioned, ontologies and semantic web were one of the most dynamically developing areas of wordnet applications.
 
Literatur
9.
Zurück zum Zitat Bond, F., Foster, R.: Linking and extending an open multilingual wordnet. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 1, pp. 1352–1362 (2013) Bond, F., Foster, R.: Linking and extending an open multilingual wordnet. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. 1, pp. 1352–1362 (2013)
10.
Zurück zum Zitat Bond, F., Janz, A., Piasecki, M.: A comparison of sense-level sentiment scores. In: Proceedings of the 10th Global Wordnet Conference, pp. 363–372 (2019) Bond, F., Janz, A., Piasecki, M.: A comparison of sense-level sentiment scores. In: Proceedings of the 10th Global Wordnet Conference, pp. 363–372 (2019)
11.
Zurück zum Zitat Czerski, D., Ciesielski, K., Dramiński, M., Kłopotek, M., Łoziński, P., Wierzchoń, S.: What NEKST?—semantic search engine for polish internet. In: De Tré, G., Grzegorzewski, P., Kacprzyk, J., Owsiński, J.W., Penczek, W., Zadrożny, S. (eds.) Challenging Problems and Solutions in Intelligent Systems. SCI, vol. 634, pp. 335–347. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30165-5_16CrossRef Czerski, D., Ciesielski, K., Dramiński, M., Kłopotek, M., Łoziński, P., Wierzchoń, S.: What NEKST?—semantic search engine for polish internet. In: De Tré, G., Grzegorzewski, P., Kacprzyk, J., Owsiński, J.W., Penczek, W., Zadrożny, S. (eds.) Challenging Problems and Solutions in Intelligent Systems. SCI, vol. 634, pp. 335–347. Springer, Cham (2016). https://​doi.​org/​10.​1007/​978-3-319-30165-5_​16CrossRef
12.
Zurück zum Zitat Dębowski, Ł., Broda, B., Nitoń, B., Charzyńska, E.: Jasnopis-a program to compute readability of texts in polish based on psycholinguistic research. Nat. Lang. Process. Cogn. Sci. pp. 51–61 (2015) Dębowski, Ł., Broda, B., Nitoń, B., Charzyńska, E.: Jasnopis-a program to compute readability of texts in polish based on psycholinguistic research. Nat. Lang. Process. Cogn. Sci. pp. 51–61 (2015)
13.
Zurück zum Zitat Dziob, A., Piasecki, M.: Dynamic verbs in the wordnet of polish. Cogn. Stud. (18) (2018) Dziob, A., Piasecki, M.: Dynamic verbs in the wordnet of polish. Cogn. Stud. (18) (2018)
14.
Zurück zum Zitat Dziob, A., Piasecki, M., Rudnicka, E.: plWordNet 4.1-a linguistically motivated, corpus-based bilingual resource. In: Proceedings of the 10th Global Wordnet Conference, pp. 353–362 (2019) Dziob, A., Piasecki, M., Rudnicka, E.: plWordNet 4.1-a linguistically motivated, corpus-based bilingual resource. In: Proceedings of the 10th Global Wordnet Conference, pp. 353–362 (2019)
15.
Zurück zum Zitat Eder, M., Piasecki, M., Walkowiak, T.: An open stylometric system based on multilevel text analysis. Cognitive Studies| Études cognitives (17) (2017) Eder, M., Piasecki, M., Walkowiak, T.: An open stylometric system based on multilevel text analysis. Cognitive Studies| Études cognitives (17) (2017)
16.
Zurück zum Zitat Graliński, F., Jassem, K., Marcińczuk, M., Wawrzyniak, P.: Named entity recognition in machine anonymization. Recent Advances in Intelligent Information Systems, pp. 247–260 (2009) Graliński, F., Jassem, K., Marcińczuk, M., Wawrzyniak, P.: Named entity recognition in machine anonymization. Recent Advances in Intelligent Information Systems, pp. 247–260 (2009)
17.
Zurück zum Zitat Griesel, M., Bosch, S., Mojapelo, M.L.: Thinking globally, acting locally-progress in the african wordnet project. In: Proceedings of the 10th Global Wordnet Conference, pp. 191–196 (2019) Griesel, M., Bosch, S., Mojapelo, M.L.: Thinking globally, acting locally-progress in the african wordnet project. In: Proceedings of the 10th Global Wordnet Conference, pp. 191–196 (2019)
18.
Zurück zum Zitat Hajnicz, E., Bartosiak, T.: Connections between the semantic layer of Walenty valency dictionary and plWordNet. In: Proceedings of the 10th Global Wordnet Conference, pp. 99–107 (2019) Hajnicz, E., Bartosiak, T.: Connections between the semantic layer of Walenty valency dictionary and plWordNet. In: Proceedings of the 10th Global Wordnet Conference, pp. 99–107 (2019)
19.
Zurück zum Zitat Kędzia, P., Piasecki, M., Orlińska, M.: Word sense disambiguation based on large scale Polish CLARIN heterogeneous lexical resources. Cognitive Studies (15) (2015) Kędzia, P., Piasecki, M., Orlińska, M.: Word sense disambiguation based on large scale Polish CLARIN heterogeneous lexical resources. Cognitive Studies (15) (2015)
20.
Zurück zum Zitat Kocoń, J., Janz, A., Piasecki, M.: Context-sensitive sentiment propagation in WordNet. In: Proceedings of the 9th Global Wordnet Conference, pp. 333–338 (2018) Kocoń, J., Janz, A., Piasecki, M.: Context-sensitive sentiment propagation in WordNet. In: Proceedings of the 9th Global Wordnet Conference, pp. 333–338 (2018)
22.
Zurück zum Zitat Kocoń, J., Marcińczuk, M.: Supervised approach to recognise Polish temporal expressions and rule-based interpretation of timexes. Nat. Lang. Eng. 23(3), 385–418 (2017)CrossRef Kocoń, J., Marcińczuk, M.: Supervised approach to recognise Polish temporal expressions and rule-based interpretation of timexes. Nat. Lang. Eng. 23(3), 385–418 (2017)CrossRef
23.
Zurück zum Zitat Maciołek, P., Dobrowolski, G.: Cluo: web-scale text mining system for open source intelligence purposes. Comput. Sci. 14(1), 45–62 (2013)MathSciNetCrossRef Maciołek, P., Dobrowolski, G.: Cluo: web-scale text mining system for open source intelligence purposes. Comput. Sci. 14(1), 45–62 (2013)MathSciNetCrossRef
24.
Zurück zum Zitat Marchewka, A., et al.: Recognition of emotions, valence and arousal in large-scale multi-domain text reviews pp. 274–280 (2019) Marchewka, A., et al.: Recognition of emotions, valence and arousal in large-scale multi-domain text reviews pp. 274–280 (2019)
26.
Zurück zum Zitat Maryl, M., Piasecki, M., Walkowiak, T.: Literary exploration machine: a Web-based application for textual scholars. In: Selected papers from the CLARIN Annual Conference, (147) pp. 128–144 (2018) Maryl, M., Piasecki, M., Walkowiak, T.: Literary exploration machine: a Web-based application for textual scholars. In: Selected papers from the CLARIN Annual Conference, (147) pp. 128–144 (2018)
27.
Zurück zum Zitat Maziarz, M., Piasecki, M.: Towards mapping thesauri onto plWordNet. In: Proceedings of the 9th Global WordNet Conference (GWC 2018), pp. 45–53 (2018) Maziarz, M., Piasecki, M.: Towards mapping thesauri onto plWordNet. In: Proceedings of the 9th Global WordNet Conference (GWC 2018), pp. 45–53 (2018)
28.
Zurück zum Zitat Maziarz, M., Piasecki, M., Rudnicka, E.: Słowosieć-polski wordnet. Proces tworzenia tezaurusa. Polonica 34, 79–98 (2014) Maziarz, M., Piasecki, M., Rudnicka, E.: Słowosieć-polski wordnet. Proces tworzenia tezaurusa. Polonica 34, 79–98 (2014)
29.
Zurück zum Zitat Maziarz, M., Szpakowicz, S., Piasecki, M.: A procedural definition of multi-word lexical units. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 427–435 (2015) Maziarz, M., Szpakowicz, S., Piasecki, M.: A procedural definition of multi-word lexical units. In: Proceedings of the International Conference Recent Advances in Natural Language Processing, pp. 427–435 (2015)
30.
Zurück zum Zitat McCrae, J.P., Rademaker, A., Bond, F., Rudnicka, E., Fellbaum, C.: English wordnet 2019–an open-source wordnet for english. In: Proceedings of the 10th Global Wordnet Conference, pp. 245–252 (2019) McCrae, J.P., Rademaker, A., Bond, F., Rudnicka, E., Fellbaum, C.: English wordnet 2019–an open-source wordnet for english. In: Proceedings of the 10th Global Wordnet Conference, pp. 245–252 (2019)
31.
Zurück zum Zitat Miller, G.: WordNet: An Electronic Lexical Database. MIT Press (1998) Miller, G.: WordNet: An Electronic Lexical Database. MIT Press (1998)
32.
Zurück zum Zitat Morato, J., Marzal, M.A., Lloréns, J., Moreiro, J.: WordNet applications. In: Proceedings of 2nd Global Wordnet Conference, pp. 270–278 (2004) Morato, J., Marzal, M.A., Lloréns, J., Moreiro, J.: WordNet applications. In: Proceedings of 2nd Global Wordnet Conference, pp. 270–278 (2004)
33.
Zurück zum Zitat Mykowiecka, A., Marciniak, M.: Combining wordnet and morphosyntactic information in terminology clustering. In: Proceedings of COLING 2012, pp. 1951–1962 (2012) Mykowiecka, A., Marciniak, M.: Combining wordnet and morphosyntactic information in terminology clustering. In: Proceedings of COLING 2012, pp. 1951–1962 (2012)
34.
Zurück zum Zitat Naskręt, T.: A collaborative system for building and maintaining wordnets. In: Proceedings of the 10th Global Wordnet Conference, pp. 323–328 (2019) Naskręt, T.: A collaborative system for building and maintaining wordnets. In: Proceedings of the 10th Global Wordnet Conference, pp. 323–328 (2019)
35.
Zurück zum Zitat Naskręt, T., Dziob, A., Piasecki, M., Saedi, C., Branco, A.: WordnetLoom-a multilingual wordnet editing system focused on graph-based presentation. In: Proceedings of the 9th Global Wordnet Conference, pp. 191–200 (2018) Naskręt, T., Dziob, A., Piasecki, M., Saedi, C., Branco, A.: WordnetLoom-a multilingual wordnet editing system focused on graph-based presentation. In: Proceedings of the 9th Global Wordnet Conference, pp. 191–200 (2018)
36.
Zurück zum Zitat Nowaczyk, A., Jackowska-Strumiłło, L.: Rozpoznawanie emocji w tekstach polskojęzycznych z wykorzystaniem metody słów kluczowych. Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska 7(2), 102–105 (2017)CrossRef Nowaczyk, A., Jackowska-Strumiłło, L.: Rozpoznawanie emocji w tekstach polskojęzycznych z wykorzystaniem metody słów kluczowych. Informatyka, Automatyka, Pomiary w Gospodarce i Ochronie Środowiska 7(2), 102–105 (2017)CrossRef
37.
Zurück zum Zitat Ogrodniczuk, M., Bronk, Z., Kieras, W.: Multisłownik: linking plWordNet-based lexical data for lexicography and educational purposes. In: Proceedings of the 9th Global Wordnet Conference, pp. 368–375 (2018) Ogrodniczuk, M., Bronk, Z., Kieras, W.: Multisłownik: linking plWordNet-based lexical data for lexicography and educational purposes. In: Proceedings of the 9th Global Wordnet Conference, pp. 368–375 (2018)
38.
Zurück zum Zitat Pedersen, B.S., Nimb, S., Olsen, I.R., Olsen, S.: Merging danNet with princeton wordnet. In: Proceedings of the 10th Global Wordnet Conference, pp. 125–134 (2019) Pedersen, B.S., Nimb, S., Olsen, I.R., Olsen, S.: Merging danNet with princeton wordnet. In: Proceedings of the 10th Global Wordnet Conference, pp. 125–134 (2019)
39.
Zurück zum Zitat Piasecki, M., Broda, B., Szpakowicz, S.: A Wordnet from the ground up. Oficyna Wydawnicza Politechniki Wrocławskiej Wrocław (2009) Piasecki, M., Broda, B., Szpakowicz, S.: A Wordnet from the ground up. Oficyna Wydawnicza Politechniki Wrocławskiej Wrocław (2009)
42.
Zurück zum Zitat Piasecki, M., Walkowiak, T., Rudnicka, E., Bond, F.: Lexical Platform-the first step towards user-centred integration of lexical resources. Cognitive Studies| Études cognitives (18) (2018) Piasecki, M., Walkowiak, T., Rudnicka, E., Bond, F.: Lexical Platform-the first step towards user-centred integration of lexical resources. Cognitive Studies| Études cognitives (18) (2018)
43.
Zurück zum Zitat Piasecki, M., Wendelberger, M., Maziarz, M.: Extraction of the multi-word lexical units in the perspective of the wordnet expansion. In: Proceedings of the International Conference RANLP, pp. 512–520 (2015) Piasecki, M., Wendelberger, M., Maziarz, M.: Extraction of the multi-word lexical units in the perspective of the wordnet expansion. In: Proceedings of the International Conference RANLP, pp. 512–520 (2015)
45.
Zurück zum Zitat Rudnicka, E., Maziarz, M., Piasecki, M., Szpakowicz, S.: A strategy of mapping Polish Wordnet onto Princeton WordNet. In: Proceedings of COLING 2012: Posters, pp. 1039–1048 (2012) Rudnicka, E., Maziarz, M., Piasecki, M., Szpakowicz, S.: A strategy of mapping Polish Wordnet onto Princeton WordNet. In: Proceedings of COLING 2012: Posters, pp. 1039–1048 (2012)
46.
Zurück zum Zitat Rudnicka, E., Piasecki, M., Bond, F., Grabowski, Ł., Piotrowski, T.: Sense equivalence in plWordNet to Princeton WordNet mapping. Int. J. Lexicography 1, 1–30 (2019) Rudnicka, E., Piasecki, M., Bond, F., Grabowski, Ł., Piotrowski, T.: Sense equivalence in plWordNet to Princeton WordNet mapping. Int. J. Lexicography 1, 1–30 (2019)
47.
Zurück zum Zitat Rudnicka, E.K., Witkowski, W., Kaliński, M.: Towards the methodology for extending princeton wordnet. Cogn. Stud.| Études cognitives (15), pp. 335–351 (2015) Rudnicka, E.K., Witkowski, W., Kaliński, M.: Towards the methodology for extending princeton wordnet. Cogn. Stud.| Études cognitives (15), pp. 335–351 (2015)
48.
Zurück zum Zitat Rutkowski, S., Rychlik, P., Mykowiecka, A.: Estimating senses with sets of lexically related words for Polish word sense disambiguation. In: Proceedings of the 10th Global Wordnet Conference, pp. 118–124 (2019) Rutkowski, S., Rychlik, P., Mykowiecka, A.: Estimating senses with sets of lexically related words for Polish word sense disambiguation. In: Proceedings of the 10th Global Wordnet Conference, pp. 118–124 (2019)
49.
Zurück zum Zitat Rybiński, K.: Political sentiment analysis of Polish politicians. e-politicon, 24, 162–195 (2017) Rybiński, K.: Political sentiment analysis of Polish politicians. e-politicon, 24, 162–195 (2017)
50.
Zurück zum Zitat Twardowski, B., Gawrysiak, P.: Domain dependent product feature and opinion extraction based on e-commerce websites. In: Zgrzywa, A., Choroś, K., Siemiński, A. (eds.) Multimedia and Internet Systems: Theory and Practice, pp. 261–270. Springer, Berlin Heidelberg, Berlin (2013). https://doi.org/10.1007/978-3-642-32335-5_25CrossRef Twardowski, B., Gawrysiak, P.: Domain dependent product feature and opinion extraction based on e-commerce websites. In: Zgrzywa, A., Choroś, K., Siemiński, A. (eds.) Multimedia and Internet Systems: Theory and Practice, pp. 261–270. Springer, Berlin Heidelberg, Berlin (2013). https://​doi.​org/​10.​1007/​978-3-642-32335-5_​25CrossRef
51.
Zurück zum Zitat Wróblewska, A.: Polish corpus of annotated descriptions of images. In: Proceedings of the 11th Int. Conference on Language Resources and Evaluation (2018) Wróblewska, A.: Polish corpus of annotated descriptions of images. In: Proceedings of the 11th Int. Conference on Language Resources and Evaluation (2018)
52.
Zurück zum Zitat Wróblewska, A., Protaziuk, G., Bembenik, R., Podsiadły-Marczykowska, T.: Associations between texts and ontology. In: Bembenik R., Skonieczny L., Rybinski H., Kryszkiewicz M., Niezgodka M. (eds.) Intelligent Tools for Building a Scientific Information Platform. Studies in Computational Intelligence, vol 467. Springer, Berlin, Heidelberg (2013). https://doi.org/10.1007/978-3-642-35647-6_20 Wróblewska, A., Protaziuk, G., Bembenik, R., Podsiadły-Marczykowska, T.: Associations between texts and ontology. In: Bembenik R., Skonieczny L., Rybinski H., Kryszkiewicz M., Niezgodka M. (eds.) Intelligent Tools for Building a Scientific Information Platform. Studies in Computational Intelligence, vol 467. Springer, Berlin, Heidelberg (2013). https://​doi.​org/​10.​1007/​978-3-642-35647-6_​20
53.
Zurück zum Zitat Zaśko-Zielińska, M., Piasecki, M., Szpakowicz, S.: A large wordnet-based sentiment lexicon for Polish. In: Proceedings of the International Conference RANLP, pp. 721–730 (2015) Zaśko-Zielińska, M., Piasecki, M., Szpakowicz, S.: A large wordnet-based sentiment lexicon for Polish. In: Proceedings of the International Conference RANLP, pp. 721–730 (2015)
Metadaten
Titel
Wordnet – a Basic Resource for Natural Language Processing: The Case of plWordNet
verfasst von
Agnieszka Dziob
Tomasz Naskręt
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-63119-2_56

Premium Partner