ABSTRACT
Event representative learning aims to embed news events into continuous space vectors for capturing syntactic and semantic information from text corpus, which is benefit to event-driven quantitative investments. However, the financial market reaction of events is also influenced by the lead-lag effect, which is driven by internal relationships. Therefore, in this paper, we present a knowledge graph-based event embedding framework for quantitative investments. In particular, we first extract structured events from raw texts, and construct the knowledge graph with the mentioned entities and relations simultaneously. Then, we leverage a joint model to merge the knowledge graph information into the objective function of an event embedding learning model. The learned representations are fed as inputs of downstream quantitative trading methods. Extensive experiments on real-world dataset demonstrate the effectiveness of the event embeddings learned from financial news and knowledge graphs. We also deploy the framework for quantitative algorithm trading. The accumulated portfolio return contributed by our method significantly outperforms other baselines.
- Jacob Benesty, Jingdong Chen, Yiteng Huang, and Israel Cohen. 2009. Pearson correlation coefficient. In Noise reduction in speech processing. Springer, 1--4.Google ScholarDigital Library
- Kurt Bollacker, Colin Evans, Praveen Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: a collaboratively created graph database for structuring human knowledge. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 1247--1250.Google ScholarDigital Library
- Wesley S Chan. 2003. Stock price reaction to news and no-news: drift and reversal after headlines. Journal of Financial Economics, Vol. 70, 2 (2003), 223--260.Google ScholarCross Ref
- Dawei Cheng, Ye Liu, Zhibin Niu, and Liqing Zhang. 2018a. Modeling similarities among multi-dimensional financial time series. IEEE Access, Vol. 6 (2018), 43404--43413.Google ScholarCross Ref
- Dawei Cheng, Yi Tu, Zhibin Niu, and Liqing Zhang. 2018b. Learning Temporal Relationships Between Financial Signals. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2641--2645.Google Scholar
- Edouard Delasalles, Ali Ziat, Ludovic Denoyer, and Patrick Gallinari. 2019. Spatio-temporal neural networks for space-time data modeling and relation discovery. Knowledge and Information Systems, Vol. 61, 3 (2019), 1241--1267.Google ScholarDigital Library
- Shumin Deng, Ningyu Zhang, Wen Zhang, Jiaoyan Chen, Jeff Z Pan, and Huajun Chen. 2019. Knowledge-driven stock trend prediction and explanation via temporal convolutional network. In Companion Proceedings of The 2019 World Wide Web Conference. 678--685.Google ScholarDigital Library
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 4171--4186.Google Scholar
- Xiao Ding, Kuo Liao, Ting Liu, Zhongyang Li, and Junwen Duan. 2019. Event Representation Learning Enhanced with External Commonsense Knowledge. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 4896--4905.Google ScholarCross Ref
- Xiao Ding, Yue Zhang, Ting Liu, and Junwen Duan. 2015. Deep learning for event-driven stock prediction. In Twenty-fourth international joint conference on artificial intelligence .Google Scholar
- Xiao Ding, Yue Zhang, Ting Liu, and Junwen Duan. 2016. Knowledge-driven event embedding for stock prediction. In Proceedings of coling 2016, the 26th international conference on computational linguistics: Technical papers. 2133--2142.Google Scholar
- Nan Du, Hanjun Dai, Rakshit Trivedi, Utkarsh Upadhyay, Manuel Gomez-Rodriguez, and Le Song. 2016. Recurrent marked temporal point processes: Embedding event history to vector. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1555--1564.Google ScholarDigital Library
- Oren Etzioni, Michele Banko, Stephen Soderland, and Daniel S Weld. 2008. Open information extraction from the web. Commun. ACM, Vol. 51, 12 (2008), 68--74.Google ScholarDigital Library
- Eugene F Fama. 1965. The behavior of stock-market prices. The journal of Business, Vol. 38, 1 (1965), 34--105.Google ScholarCross Ref
- Fuli Feng, Xiangnan He, Xiang Wang, Cheng Luo, Yiqun Liu, and Tat-Seng Chua. 2019. Temporal relational ranking for stock prediction. ACM Transactions on Information Systems (TOIS), Vol. 37, 2 (2019), 1--30.Google ScholarDigital Library
- Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining. 855--864.Google ScholarDigital Library
- Florian Holzschuher and René Peinl. 2013. Performance of graph query languages: comparison of cypher, gremlin and native access in Neo4j. In Proceedings of the Joint EDBT/ICDT 2013 Workshops. 195--204.Google ScholarDigital Library
- Kewei Hou. 2007. Industry information diffusion and the lead-lag effect in stock returns. The Review of Financial Studies, Vol. 20, 4 (2007), 1113--1138.Google ScholarCross Ref
- Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).Google Scholar
- Shimon Kogan, Dimitry Levin, Bryan R Routledge, Jacob S Sagi, and Noah A Smith. 2009. Predicting risk from financial reports with regression. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. 272--280.Google ScholarDigital Library
- Rafal Kuc and Marek Rogozinski. 2013. Elasticsearch server .Packt Publishing Ltd.Google Scholar
- Quoc Le and Tomas Mikolov. 2014. Distributed representations of sentences and documents. In International conference on machine learning. 1188--1196.Google ScholarDigital Library
- David Leinweber and Jacob Sisk. 2011. Event-driven trading and the "new news". The Journal of Portfolio Management, Vol. 38, 1 (2011), 110--124.Google ScholarCross Ref
- Qing Li, Jinghua Tan, Jun Wang, and HsinChun Chen. 2020. A Multimodal Event-driven LSTM Model for Stock Prediction Using Online News. IEEE Transactions on Knowledge and Data Engineering (2020).Google ScholarCross Ref
- Ying Li, Ting Jin, Meng Xi, Shengpeng Liu, and Zhiling Luo. 2018. Massive Text Mining for Abnormal Market Trend Detection. In 2018 IEEE International Conference on Big Data (Big Data). IEEE, 4135--4141.Google Scholar
- Zhongguo Li and Maosong Sun. 2009. Punctuation as implicit annotations for Chinese word segmentation. Computational Linguistics, Vol. 35, 4 (2009), 505--512.Google ScholarDigital Library
- Zhige Li, Derek Yang, Li Zhao, Jiang Bian, Tao Qin, and Tie-Yan Liu. 2019. Individualized indicator for all: Stock-wise technical indicator optimization with stock embedding. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 894--902.Google ScholarDigital Library
- Yankai Lin, Shiqi Shen, Zhiyuan Liu, Huanbo Luan, and Maosong Sun. 2016. Neural relation extraction with selective attention over instances. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2124--2133.Google ScholarCross Ref
- Andrew W Lo and A Craig MacKinlay. 1990. When are contrarian profits due to stock market overreaction? The review of financial studies, Vol. 3, 2 (1990), 175--205.Google Scholar
- Ronny Luss and Alexandre d'Aspremont. 2015. Predicting abnormal returns from news using text classification. Quantitative Finance, Vol. 15, 6 (2015), 999--1012.Google ScholarCross Ref
- Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, Nov (2008), 2579--2605.Google Scholar
- Tiago Macedo and Fred Oliveira. 2011. Redis Cookbook: Practical Techniques for Fast Data Manipulation ." O'Reilly Media, Inc.".Google Scholar
- Daniel Myers and James W McGuffee. 2015. Choosing scrapy. Journal of Computing Sciences in Colleges, Vol. 31, 1 (2015), 83--89.Google ScholarDigital Library
- Shirui Pan, Jia Wu, Xingquan Zhu, Chengqi Zhang, and Yang Wang. 2016. Tri-party deep network representation. Network, Vol. 11, 9 (2016), 12.Google Scholar
- Swarnadeep Saha et al. 2018. Open information extraction from conjunctive sentences. In Proceedings of the 27th International Conference on Computational Linguistics. 2288--2299.Google Scholar
- Swarnadeep Saha, Harinder Pal, et al. 2017. Bootstrapping for numerical open ie. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 317--323.Google ScholarCross Ref
- Maarten Sap, Ronan Le Bras, Emily Allaway, Chandra Bhagavatula, Nicholas Lourie, Hannah Rashkin, Brendan Roof, Noah A Smith, and Yejin Choi. 2019. Atomic: An atlas of machine commonsense for if-then reasoning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 3027--3035.Google ScholarDigital Library
- J Shaheen. 2017. Apache Kafka: Real Time Implementation with Kafka Architecture Review. International Journal Of Advanced Science And Technology, Vol. 109 (2017), 35--42.Google ScholarCross Ref
- Jianfeng Si, Arjun Mukherjee, Bing Liu, Sinno Jialin Pan, Qing Li, and Huayi Li. 2014. Exploiting social relations and sentiment for stock prediction. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 1139--1145.Google ScholarCross Ref
- Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research, Vol. 15, 1 (2014), 1929--1958.Google Scholar
- Paul C Tetlock, Maytal Saar-Tsechansky, and Sofus Macskassy. 2008. More than words: Quantifying language to measure firms' fundamentals. The Journal of Finance, Vol. 63, 3 (2008), 1437--1467.Google ScholarCross Ref
- Jingyuan Wang, Yang Zhang, Ke Tang, Junjie Wu, and Zhang Xiong. 2019. AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1900--1908.Google ScholarDigital Library
- Boyi Xie, Rebecca Passonneau, Leon Wu, and Germán G Creamer. 2013. Semantic frames to predict stock price movement. In Proceedings of the 51st annual meeting of the association for computational linguistics. 873--883.Google Scholar
- Yang Yang, ZHOU Deyu, Yulan He, and Meng Zhang. 2019. Interpretable Relevant Emotion Ranking with Event-Driven Attention. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 177--187.Google ScholarCross Ref
- Wenbo Zhang, Xiao Ding, and Ting Liu. 2018. Learning target-dependent sentence representations for chinese event detection. In China Conference on Information Retrieval. Springer, 251--262.Google ScholarCross Ref
- Sendong Zhao, Quan Wang, Sean Massung, Bing Qin, Ting Liu, Bin Wang, and ChengXiang Zhai. 2017. Constructing and embedding abstract event causality networks from text snippets. In Proceedings of the Tenth ACM International Conference on Web Search and Data Mining. 335--344.Google ScholarDigital Library
- Ali Ziat, Edouard Delasalles, Ludovic Denoyer, and Patrick Gallinari. 2017. Spatio-temporal neural networks for space-time series forecasting and relations discovery. In 2017 IEEE International Conference on Data Mining (ICDM). IEEE, 705--714.Google ScholarCross Ref
Index Terms
- Knowledge Graph-based Event Embedding Framework for Financial Quantitative Investments
Recommendations
Query Associations Over Big Financial Knowledge Graph
Big Scientific Data ManagementAbstractKnowledge graph, as the core technology of artificial intelligence, is playing a more and more important role in the financial field. In this paper, we study the problem of querying associations over big financial knowledge graph formed by equity ...
Financial Portfolio Construction for Quantitative Trading Using Deep Learning Technique
Artificial Intelligence and Soft ComputingAbstractStock portfolio construction is a difficult task which involves the simultaneous consideration of dynamic financial data as well as investment criteria (e.g.: investors required return, risk tolerance, goals, and time frame). The objective of this ...
Stock Market Return and Household Financial Investments
ICEBI '21: Proceedings of the 2021 5th International Conference on E-Business and InternetThe purpose of this study project was to explore the link between household investment and stock market fluctuations in the United States. The research's aims are to identify the relationship between household investment and the stock market index in ...
Comments