Skip to main content
Erschienen in: Cognitive Computation 5/2023

07.09.2020

A Reservoir Computing Approach to Word Sense Disambiguation

verfasst von: Kiril Simov, Petia Koprinkova-Hristova, Alexander Popov, Petya Osenova

Erschienen in: Cognitive Computation | Ausgabe 5/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Reservoir computing (RC) has emerged as an alternative approach for the development of fast trainable recurrent neural networks (RNNs). It is considered to be biologically plausible due to the similarity between randomly designed artificial reservoir structures and cortical structures in the brain. The paper continues our previous research on the application of a member of the family of RC approaches—the echo state network (ESN)—to the natural language processing (NLP) task of Word Sense Disambiguation (WSD). A novel deep bi-directional ESN (DBiESN) structure is proposed, as well as a novel approach for exploiting reservoirs’ steady states. The models also make use of ESN-enhanced word embeddings. The paper demonstrates that our DBiESN approach offers a good alternative to previously tested BiESN models in the context of the word sense disambiguation task having smaller number of trainable parameters. Although our DBiESN-based model achieves similar accuracy to other popular RNN architectures, we could not outperform the state of the art. However, due to the smaller number of trainable parameters in the reservoir models, in contrast to fully trainable RNNs, it is to be expected that they would have better generalization properties as well as higher potential to increase their accuracy, which should justify further exploration of such architectures.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Wang P, Qian Y, Soong FK, He L, Zhao H. 2015. Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. arXiv:1510.06168. Wang P, Qian Y, Soong FK, He L, Zhao H. 2015. Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. arXiv:1510.​06168.
2.
Zurück zum Zitat Wang P, Qian Y, Soong FK, He L, Zhao H. 2015. A Unified Tagging Solution: Bidirectional LSTM Recurrent Neural Network with Word Embedding. arXiv:1511.00215. Wang P, Qian Y, Soong FK, He L, Zhao H. 2015. A Unified Tagging Solution: Bidirectional LSTM Recurrent Neural Network with Word Embedding. arXiv:1511.​00215.
3.
Zurück zum Zitat Huang Z, Xu W, Yu K. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv:1508.01991. Huang Z, Xu W, Yu K. 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. arXiv:1508.​01991.
4.
Zurück zum Zitat Wang W, Chang B. Graph-based Dependency Parsing with Bidirectional LSTM. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics Volume 1: Long Papers Association for Computational Linguistics, Berlin Germany; 2016. p. 2306–2315. https://doi.org/10.18653/v1/P16-1218. Wang W, Chang B. Graph-based Dependency Parsing with Bidirectional LSTM. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics Volume 1: Long Papers Association for Computational Linguistics, Berlin Germany; 2016. p. 2306–2315. https://​doi.​org/​10.​18653/​v1/​P16-1218.
5.
Zurück zum Zitat Popov A. Neural network models for word sense disambiguation: an overview. Cybernetics and Information Technologies 2018;18:139–151.MathSciNetCrossRef Popov A. Neural network models for word sense disambiguation: an overview. Cybernetics and Information Technologies 2018;18:139–151.MathSciNetCrossRef
7.
Zurück zum Zitat Cho K, van Merriënboer B., Bahdanau D, Bengio Y. On the Properties of Neural Machine translation: encoder–Decoder approaches. proceedings of SSST-8, Eighth Workshop on Syntax Semantics and Structure in Statistical Translation Association for Computational Linguistics Doha Qatar; 2014 . p. 103–111. https://doi.org/10.3115/v1/W14-4012. Cho K, van Merriënboer B., Bahdanau D, Bengio Y. On the Properties of Neural Machine translation: encoder–Decoder approaches. proceedings of SSST-8, Eighth Workshop on Syntax Semantics and Structure in Statistical Translation Association for Computational Linguistics Doha Qatar; 2014 . p. 103–111. https://​doi.​org/​10.​3115/​v1/​W14-4012.
9.
Zurück zum Zitat Jaeger H. 2002. Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the Echo State Network Approach, GMD Report 159 German National Research Center for Information Technology. Jaeger H. 2002. Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the Echo State Network Approach, GMD Report 159 German National Research Center for Information Technology.
15.
Zurück zum Zitat Gallicchio C, Micheli A, Pedrelli L. 2018. Comparison between Deep ESNs and Gated RNNs on Multivariate Time-Series Prediction. arXiv:1812.11527. Gallicchio C, Micheli A, Pedrelli L. 2018. Comparison between Deep ESNs and Gated RNNs on Multivariate Time-Series Prediction. arXiv:1812.​11527.
16.
Zurück zum Zitat Frank SL, Čerňanský M.P. Generalization and Systematicity in Echo State Networks, in the Annual Meeting of the Cognitive Science Society; 2008. pp. 733–738. Frank SL, Čerňanský M.P. Generalization and Systematicity in Echo State Networks, in the Annual Meeting of the Cognitive Science Society; 2008. pp. 733–738.
19.
Zurück zum Zitat Twiefel J, Hinaut X, Soares MB, Strahl E, Wermter S. Using Natural Language Feedback in a Neuro-Inspired Integrated Multimodal Robotic Architecture. 25th IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN New york, NY, USA, August 26-31, 2016; 2016. p. 52–57. https://doi.org/10.1109/ROMAN.2016.7745090. Twiefel J, Hinaut X, Soares MB, Strahl E, Wermter S. Using Natural Language Feedback in a Neuro-Inspired Integrated Multimodal Robotic Architecture. 25th IEEE International Symposium on Robot and Human Interactive Communication, RO-MAN New york, NY, USA, August 26-31, 2016; 2016. p. 52–57. https://​doi.​org/​10.​1109/​ROMAN.​2016.​7745090.
21.
Zurück zum Zitat Squartini S, Cecchi S, Rossini M, Piazza F. Echo State Networks for Real-Time Audio Applications. Advances in Neural Networks – ISNN 2007, ed. by D. Liu, S. Fei, Z. Hou, H. Zhang, C. Sun Springer Berlin Heidelberg, Berlin, Heidelberg; 2007. p. 731–740. https://doi.org/10.1007/978-3-540-72395-0_90. Squartini S, Cecchi S, Rossini M, Piazza F. Echo State Networks for Real-Time Audio Applications. Advances in Neural Networks – ISNN 2007, ed. by D. Liu, S. Fei, Z. Hou, H. Zhang, C. Sun Springer Berlin Heidelberg, Berlin, Heidelberg; 2007. p. 731–740. https://​doi.​org/​10.​1007/​978-3-540-72395-0_​90.
23.
Zurück zum Zitat Ramamurthy R, Stenzel R, Sifa R, Ladi A, Bauckhage C. Echo State Networks for Named Entity Recognition. Artificial Neural Networks and Machine Learning – ICANN 2019: Workshop and Special Sessions ed. by I.V. Tetko, V. Ku̇rková, P. Karpov, F. Theis Springer International Publishing, Cham; 2019. p. 110–120. Ramamurthy R, Stenzel R, Sifa R, Ladi A, Bauckhage C. Echo State Networks for Named Entity Recognition. Artificial Neural Networks and Machine Learning – ICANN 2019: Workshop and Special Sessions ed. by I.V. Tetko, V. Ku̇rková, P. Karpov, F. Theis Springer International Publishing, Cham; 2019. p. 110–120.
24.
Zurück zum Zitat Koprinkova-Hristova P, Popov A, Simov K, Osenova P. Echo State Network for Word Sense Disambiguation. Artificial intelligence: Methodology, Systems, and Applications - 18th International Conference, AIMSA 2018, Varna, Bulgaria, September 12-14, 2018 Proceedings; 2018. p. 73–82. https://doi.org/10.1007/978-3-319-99344-7_7. Koprinkova-Hristova P, Popov A, Simov K, Osenova P. Echo State Network for Word Sense Disambiguation. Artificial intelligence: Methodology, Systems, and Applications - 18th International Conference, AIMSA 2018, Varna, Bulgaria, September 12-14, 2018 Proceedings; 2018. p. 73–82. https://​doi.​org/​10.​1007/​978-3-319-99344-7_​7.
25.
Zurück zum Zitat Popov A, Koprinkova-Hristova P, Simov K, Osenova P. Echo State vs. LSTM Networks for Word Sense Disambiguation. Artificial Neural Networks and Machine Learning – ICANN 2019: Workshop and Special Sessions ed. by I.V. Tetko, V. Ku̇rková, P. Karpov, F. Theis Springer International Publishing, Cham; 2019. p. 94–109. Popov A, Koprinkova-Hristova P, Simov K, Osenova P. Echo State vs. LSTM Networks for Word Sense Disambiguation. Artificial Neural Networks and Machine Learning – ICANN 2019: Workshop and Special Sessions ed. by I.V. Tetko, V. Ku̇rková, P. Karpov, F. Theis Springer International Publishing, Cham; 2019. p. 94–109.
26.
Zurück zum Zitat Gallicchio C, Micheli A. A Reservoir Computing Approach for Human Gesture Recognition from Kinect Data. Proceedings of the Artificial Intelligence for Ambient Assisted Living 2016 co-located with 15th International Conference of the Italian Association for Artificial Intelligence AIxIA 2016 Genova, Italy, November 28th, 2016; 2016. p. 33–42. http://ceur-ws.org/Vol-1803/paper3.pdf. Gallicchio C, Micheli A. A Reservoir Computing Approach for Human Gesture Recognition from Kinect Data. Proceedings of the Artificial Intelligence for Ambient Assisted Living 2016 co-located with 15th International Conference of the Italian Association for Artificial Intelligence AIxIA 2016 Genova, Italy, November 28th, 2016; 2016. p. 33–42. http://​ceur-ws.​org/​Vol-1803/​paper3.​pdf.
29.
Zurück zum Zitat Simov KI, Koprinkova-Hristova PD, Popov A, Osenova P. Word Embeddings Improvement via Echo State Networks. IEEE International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2019, Sofia, Bulgaria, July 3-5, 2019; 2019. p. 1–6. https://doi.org/10.1109/INISTA.2019.8778297. Simov KI, Koprinkova-Hristova PD, Popov A, Osenova P. Word Embeddings Improvement via Echo State Networks. IEEE International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2019, Sofia, Bulgaria, July 3-5, 2019; 2019. p. 1–6. https://​doi.​org/​10.​1109/​INISTA.​2019.​8778297.
30.
Zurück zum Zitat Koprinkova-Hristova PD, Tontchev N. Echo State Networks for Multi-dimensional Data Clustering. Artificial Neural Networks and Machine Learning - ICANN 2012 - 22nd International Conference on Artificial Neural Networks, Lausanne, Switzerland, September 11-14, 2012, Proceedings, Part I; 2012. p. 571–578. https://doi.org/10.1007/978-3-642-33269-2_72. Koprinkova-Hristova PD, Tontchev N. Echo State Networks for Multi-dimensional Data Clustering. Artificial Neural Networks and Machine Learning - ICANN 2012 - 22nd International Conference on Artificial Neural Networks, Lausanne, Switzerland, September 11-14, 2012, Proceedings, Part I; 2012. p. 571–578. https://​doi.​org/​10.​1007/​978-3-642-33269-2_​72.
31.
Zurück zum Zitat Gallicchio C, Micheli A. 2016. A reservoir computing approach for human gesture recognition from kinect data inproceedings of the AI for ambient assisted living. Gallicchio C, Micheli A. 2016. A reservoir computing approach for human gesture recognition from kinect data inproceedings of the AI for ambient assisted living.
32.
Zurück zum Zitat Raganato A, Camacho-Collados J, Navigli R. Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers Association for Computational Linguistics, Valencia, Spain; 2017. p. 99–110. https://www.aclweb.org/anthology/E17-1010. Raganato A, Camacho-Collados J, Navigli R. Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers Association for Computational Linguistics, Valencia, Spain; 2017. p. 99–110. https://​www.​aclweb.​org/​anthology/​E17-1010.
34.
Zurück zum Zitat Jaeger H. 2007. Discovering multiscale dynamical features with hierarchical echo state networks. Tech. rep., Jacobs University Bremen. Jaeger H. 2007. Discovering multiscale dynamical features with hierarchical echo state networks. Tech. rep., Jacobs University Bremen.
35.
Zurück zum Zitat Fernández S, Graves A, Schmidhuber J. Sequence Labelling in Structured Domains with Hierarchical Recurrent Neural Networks. Proceedings of the 20th International Joint Conference on Artifical Intelligence Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, IJCAI’07; 2007. p. 774–779. Fernández S, Graves A, Schmidhuber J. Sequence Labelling in Structured Domains with Hierarchical Recurrent Neural Networks. Proceedings of the 20th International Joint Conference on Artifical Intelligence Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, IJCAI’07; 2007. p. 774–779.
39.
Zurück zum Zitat Bengio Y, Lee DH, Bornschein J, Lin Z. 2015. Towards biologically plausible deep learning. arXiv:1502.04156. Bengio Y, Lee DH, Bornschein J, Lin Z. 2015. Towards biologically plausible deep learning. arXiv:1502.​04156.
41.
42.
43.
Zurück zum Zitat Snyder B, Palmer M. The English all-words task. Proceedings of SENSEVAL-3, the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text Association for Computational Linguistics, Barcelona, Spain; 2004. p. 41–43. https://www.aclweb.org/anthology/W04-0811. Snyder B, Palmer M. The English all-words task. Proceedings of SENSEVAL-3, the Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text Association for Computational Linguistics, Barcelona, Spain; 2004. p. 41–43. https://​www.​aclweb.​org/​anthology/​W04-0811.
44.
Zurück zum Zitat Pradhan S, Loper E, Dligach D, Palmer M. SemEval-2007 Task-17: English Lexical Sample, SRL and All Words. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) Association for Computational Linguistics, Prague, Czech Republic; 2007. p. 87–92. https://www.aclweb.org/anthology/S07-1016. Pradhan S, Loper E, Dligach D, Palmer M. SemEval-2007 Task-17: English Lexical Sample, SRL and All Words. Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) Association for Computational Linguistics, Prague, Czech Republic; 2007. p. 87–92. https://​www.​aclweb.​org/​anthology/​S07-1016.
45.
Zurück zum Zitat Navigli R, Jurgens D, Vannella D. SemEval-2013 Task 12: Multilingual Word Sense Disambiguation. Second Joint Conference on Lexical and Computational Semantics *SEM, Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation SemEval 2013 Association for Computational Linguistics, Atlanta, Georgia, USA; 2013 . p. 222–231. https://www.aclweb.org/anthology/S13-2040. Navigli R, Jurgens D, Vannella D. SemEval-2013 Task 12: Multilingual Word Sense Disambiguation. Second Joint Conference on Lexical and Computational Semantics *SEM, Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation SemEval 2013 Association for Computational Linguistics, Atlanta, Georgia, USA; 2013 . p. 222–231. https://​www.​aclweb.​org/​anthology/​S13-2040.
46.
Zurück zum Zitat Moro A, Navigli R. SemEval-2015 Task 13: Multilingual All-Words Sense Disambiguation and Entity Linking. Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) (Association for Computational Linguistics, Denver, Colorado); 2015. p. 288–297. https://doi.org/10.18653/v1/S15-2049. Moro A, Navigli R. SemEval-2015 Task 13: Multilingual All-Words Sense Disambiguation and Entity Linking. Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015) (Association for Computational Linguistics, Denver, Colorado); 2015. p. 288–297. https://​doi.​org/​10.​18653/​v1/​S15-2049.
47.
Zurück zum Zitat Yuxin C, Le-Ngoc T, Champagne B, Changjiang X. Recursive least squares constant modulus algorithm for blind adaptive array. IEEE Transactions on Signal Processing 2004;52(5):1452.MathSciNetCrossRefMATH Yuxin C, Le-Ngoc T, Champagne B, Changjiang X. Recursive least squares constant modulus algorithm for blind adaptive array. IEEE Transactions on Signal Processing 2004;52(5):1452.MathSciNetCrossRefMATH
49.
Zurück zum Zitat Belkin M, Hsu D, Ma S, Mandal S. 2018. Reconciling modern machine learning practice and the bias-variance trade-off arxiv: Machine Learning. Belkin M, Hsu D, Ma S, Mandal S. 2018. Reconciling modern machine learning practice and the bias-variance trade-off arxiv: Machine Learning.
Metadaten
Titel
A Reservoir Computing Approach to Word Sense Disambiguation
verfasst von
Kiril Simov
Petia Koprinkova-Hristova
Alexander Popov
Petya Osenova
Publikationsdatum
07.09.2020
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 5/2023
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-020-09758-w

Weitere Artikel der Ausgabe 5/2023

Cognitive Computation 5/2023 Zur Ausgabe

Premium Partner