Published in: Soft Computing 12/2017

18.01.2016 | Methodologies and Application

Exploring mutual information-based sentimental analysis with kernel-based extreme learning machine for stock prediction

Authors: Feng Wang, Yongquan Zhang, Qi Rao, Kangshun Li, Hao Zhang


Abstract

Stock price volatility prediction is regarded as one of the most attractive and meaningful research issues in financial markets. Existing research has pointed out that prediction accuracy and prediction speed are the two most important factors in stock prediction. In this paper, we focus on how to design a methodology that improves prediction accuracy while also speeding up the prediction process, and we propose a new prediction model that combines mutual information-based sentimental analysis with the extreme learning machine (ELM) to enhance prediction performance. Our work makes two major contributions. (1) Because the words in news documents are not absolutely negative or positive, and financial news documents vary in length, we propose a new sentimental analysis methodology based on mutual information to improve the efficiency of feature selection; it differs from the traditional sentimental analysis algorithm and also uses a new weighting scheme in the feature weighting process. (2) Because ELM is a fast learning model that has been applied successfully in many research fields, we propose a prediction model, named MISA-K-ELM, that combines mutual information-based sentimental analysis (MISA) with kernel-based ELM. This model has the benefits of both statistical sentimental analysis and ELM, and it balances the requirements of prediction accuracy and prediction speed. We conduct experiments on HKEx 2001 stock market datasets to validate the performance of the proposed MISA-K-ELM. Both the historical market prices and the market news are used in MISA-K-ELM. To test the efficiency of MISA, we first compare the prediction accuracy of an ELM model using MISA with that of an ELM model using traditional sentimental analysis.
Then, we compare the proposed MISA-K-ELM with existing state-of-the-art learning algorithms, such as the Back-Propagation Neural Network (BP-NN) and the Support Vector Machine (SVM). Our experimental results show that (1) the MISA model helps achieve higher prediction accuracy than traditional sentimental analysis models; (2) MISA-K-ELM and MISA-SVM have higher prediction accuracy than MISA-BP-NN and MISA-B-ELM; (3) both MISA-K-ELM and MISA-B-ELM achieve faster prediction speed than MISA-SVM and MISA-BP-NN in most cases; (4) in most cases, MISA-K-ELM has higher prediction accuracy than the other three methodologies.
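The abstract names kernel-based ELM but does not spell out its equations. As a rough illustration only, the sketch below follows the commonly used kernel ELM formulation, in which the output weights are obtained in closed form as beta = (K + I/C)^{-1} T and predictions are K(x, X_train) beta. The RBF kernel, the values of `C` and `gamma`, the class name `KernelELM`, and the toy two-cluster data are all assumptions made for this example; they are not the authors' features, settings, or datasets.

```python
import numpy as np


def rbf_kernel(A, B, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A ** 2, axis=1)[:, None]
        + np.sum(B ** 2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


class KernelELM:
    """Hypothetical minimal kernel ELM classifier (illustrative, not the paper's code)."""

    def __init__(self, C=1.0, gamma=1.0):
        self.C = C          # regularization strength (assumed value)
        self.gamma = gamma  # RBF kernel width (assumed value)

    def fit(self, X, y):
        self.X_train = np.asarray(X, dtype=float)
        self.classes_, y_idx = np.unique(y, return_inverse=True)
        # Encode targets as +1 for the true class and -1 elsewhere.
        T = -np.ones((len(y_idx), len(self.classes_)))
        T[np.arange(len(y_idx)), y_idx] = 1.0
        K = rbf_kernel(self.X_train, self.X_train, self.gamma)
        # Closed-form solution: beta = (K + I/C)^{-1} T, no iterative training.
        self.beta = np.linalg.solve(K + np.eye(K.shape[0]) / self.C, T)
        return self

    def predict(self, X):
        K_test = rbf_kernel(np.asarray(X, dtype=float), self.X_train, self.gamma)
        scores = K_test @ self.beta
        return self.classes_[np.argmax(scores, axis=1)]


# Toy data: two well-separated clusters standing in for two movement classes.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2.0, 0.5, (20, 2)), rng.normal(2.0, 0.5, (20, 2))])
y = np.array([0] * 20 + [1] * 20)

model = KernelELM(C=10.0, gamma=0.5).fit(X, y)
print(model.predict(np.array([[-2.0, -2.0], [2.0, 2.0]])))
```

Because training reduces to a single linear solve rather than an iterative weight update, a model of this form is typically quick to fit on moderately sized datasets, which is consistent with the speed argument made in the abstract.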

Footnotes
1
The software can be downloaded from http://ictclas.org.

3
The Levenberg–Marquardt algorithm is implemented in a MATLAB toolbox.

4
The notation \(\#(X)\) denotes the count of object X.

Metadata
Title
Exploring mutual information-based sentimental analysis with kernel-based extreme learning machine for stock prediction
Authors
Feng Wang
Yongquan Zhang
Qi Rao
Kangshun Li
Hao Zhang
Publication date
18.01.2016
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 12/2017
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-015-2003-z
