
2015 | OriginalPaper | Chapter

A Proposed Framework for Evaluating the Effectiveness of Financial News Sentiment Scoring Datasets

Authors: Islam Qudah, Fethi A. Rabhi, Maurice Peat

Published in: Enterprise Applications and Services in the Finance Industry

Publisher: Springer International Publishing


Abstract

The impact of financial news on financial markets has been studied extensively, and a number of news sentiment scoring techniques are widely used in research and industry. However, results from sentiment studies are hard to interpret when contextual and sentiment-related parameters change. Sometimes the conditions that lead to the results are not fully documented and the results are not repeatable. Based on service-oriented computing principles, this paper proposes a framework that automates the process of incorporating different contextual parameters when running news sentiment impact studies. The framework also preserves the parameters, datasets, and conditions of each study so that end users can reproduce their results. This is demonstrated using a case study that shows how end users can flexibly select different contextual and sentiment-related parameters and conduct news impact studies on daily stock prices.


Metadata
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-28151-3_3
