Published in: Journal of Intelligent Information Systems 1/2019

12-04-2018

Word sense disambiguation application in sentiment analysis of news headlines: an applied approach to FOREX market prediction

Authors: Saeed Seifollahi, Mehdi Shajari


Abstract

Sentiment analysis of textual content has become a popular approach to market prediction. Without a word sense disambiguation step, however, it is questionable whether the sentiment expressed by the context is identified correctly. Many studies in natural language processing have focused on word sense disambiguation, yet the link between these two logically related fields has remained weak. With two motivations, we therefore propose a system for FOREX market prediction that exploits word sense disambiguation in the sentiment analysis of news headlines and predicts the directional movement of a currency pair. The first motivation is to implement a novel word sense disambiguation method that can determine the proper senses of all significant words in a news headline. The main contributions that make this possible are novel approaches termed Relevant Gloss Retrieval, Similarity Threshold, and Verb Nominalization, together with optimization measures that decrease execution time. The second motivation is to prove that determining the proper senses of significant words in textual content improves the determination of the sentiment conveyed by the context and, consequently, any application based on sentiment analysis. Including word sense disambiguation in the proposed system achieves this second motivation: tests carried out on the same dataset show that the proposed system outperforms one of the best systems proposed for market prediction (to the best of our knowledge), improving accuracy from 83.33% to 91.67%. Ample detail is provided for reproducing the system.
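The role that word sense disambiguation plays in sentiment scoring can be illustrated with a minimal sketch. This is not the authors' method: it is a simplified Lesk-style gloss overlap with a cut-off, and the sense inventory `TOY_GLOSSES`, the polarity table `SENSE_POLARITY`, the stopword list, and the function names are all hypothetical and written only for this example. A real system would more plausibly draw glosses from WordNet and polarities from a resource such as SentiWordNet.

```python
# Hypothetical toy sense inventory: glosses invented for this example,
# not taken from WordNet or from the paper.
TOY_GLOSSES = {
    "rise": {
        "rise.increase": "increase in value price or amount",
        "rise.stand_up": "move body from sitting to standing position",
    },
}

# Hypothetical sense-level polarities (+1 positive, 0 neutral).
SENSE_POLARITY = {"rise.increase": 1, "rise.stand_up": 0}

STOPWORDS = {"a", "and", "from", "in", "of", "or", "the", "to"}


def tokens(text):
    """Lowercase, split on whitespace, and drop stopwords."""
    return {w for w in text.lower().split() if w not in STOPWORDS}


def disambiguate(word, headline, threshold=1):
    """Pick the sense whose gloss overlaps most with the headline context.

    Returns None when the word is unknown or when the best overlap is
    below `threshold` (a crude stand-in for a similarity cut-off).
    """
    context = tokens(headline) - {word}
    best_sense, best_score = None, 0
    for sense, gloss in TOY_GLOSSES.get(word, {}).items():
        score = len(context & tokens(gloss))
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense if best_score >= threshold else None


def headline_sentiment(headline):
    """Sum the polarities of the disambiguated senses in a headline."""
    return sum(
        SENSE_POLARITY.get(disambiguate(w, headline), 0)
        for w in tokens(headline)
    )


if __name__ == "__main__":
    for h in ("Dollar rise lifts price of imports",
              "Crowd rise to standing ovation"):
        print(h, "->", disambiguate("rise", h), headline_sentiment(h))
```

A lexicon keyed on surface words would score "rise" identically in both headlines; scoring the selected sense instead lets the second headline stay neutral, which is the kind of effect the abstract attributes to disambiguation.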

Metadata

Title: Word sense disambiguation application in sentiment analysis of news headlines: an applied approach to FOREX market prediction
Authors: Saeed Seifollahi, Mehdi Shajari
Publication date: 12-04-2018
Publisher: Springer US
Published in: Journal of Intelligent Information Systems, Issue 1/2019
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI: https://doi.org/10.1007/s10844-018-0504-9