This paper proposes and demonstrates an approach to the often-attempted problem of market prediction, framed as a classification task. We restrict our study to a widely purchased and well-recognized commodity, West Texas Intermediate crude oil, which experiences significant volatility. For this purpose, nine learners are trained on features extracted from monthly International Energy Agency (IEA) reports to predict whether oil futures were undervalued, overvalued, or accurately valued between 2003 and 2015. The often-touted “Efficient Market Hypothesis” (EMH) suggests that individual investors cannot “beat the market,” because market and external forces, such as geopolitical crises and natural disasters, are nearly impossible to predict. However, four algorithms performed statistically better, at the 95% confidence level, than the “Zero-Rule” and “Random-Guess” strategies, which are expected to roughly reflect the EMH. Furthermore, adding text features can significantly improve performance compared to using only the price history from the oil futures data, challenging the validity of the semi-strong version of the EMH in the crude oil market.
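To make the two baseline strategies concrete, the following is a minimal Python sketch of a Zero-Rule predictor (always output the most frequent training class) and a Random-Guess predictor (pick a class uniformly at random) for the three-class valuation task. It is not the authors' implementation: the function names, the toy labels, and the fixed seed are all assumptions introduced for illustration, and the paper may define or evaluate these baselines differently.

```python
import random
from collections import Counter

# The three valuation classes named in the abstract.
LABELS = ["undervalued", "accurate", "overvalued"]


def zero_rule(train_labels):
    """Return a predictor that always outputs the most frequent training label."""
    majority = Counter(train_labels).most_common(1)[0][0]
    return lambda n: [majority] * n


def random_guess(labels, seed=0):
    """Return a predictor that picks a label uniformly at random (seed is an assumption)."""
    rng = random.Random(seed)
    return lambda n: [rng.choice(labels) for _ in range(n)]


def accuracy(predicted, actual):
    """Fraction of predictions that match the actual labels."""
    return sum(p == a for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical toy labels, not data from the paper.
train = ["overvalued", "overvalued", "accurate", "undervalued", "overvalued"]
test = ["overvalued", "accurate", "overvalued", "undervalued"]

zr_predictions = zero_rule(train)(len(test))
rg_predictions = random_guess(LABELS)(len(test))

zr_acc = accuracy(zr_predictions, test)
rg_acc = accuracy(rg_predictions, test)
print(f"Zero-Rule accuracy:    {zr_acc:.2f}")
print(f"Random-Guess accuracy: {rg_acc:.2f}")
```

A trained learner would then be compared against these two accuracy figures; the abstract reports that four of the nine learners beat both baselines at the 95% confidence level.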
- Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures
Joseph D. Prusa
Ryan T. Sagul
Taghi M. Khoshgoftaar
- Publication date
- Springer US