Skip to main content
Top
Published in: Information Systems Frontiers 1/2019

30-05-2018

Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures

Authors: Joseph D. Prusa, Ryan T. Sagul, Taghi M. Khoshgoftaar

Published in: Information Systems Frontiers | Issue 1/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper proposes and demonstrates an approach for the often-attempted problem of market prediction, framed as classification task. We restrict our study to a widely purchased and well recognized commodity, West Texas Intermediate crude oil, which experiences significant volatility. For this purpose, nine learners using features extracted from monthly International Energy Agency (IEA) reports to predict undervalued, overvalued, and accurate valuation of the oil futures between 2003 and 2015. The often touted “Efficient Market Hypothesis” (EMH) suggests that it is impossible for individual investors to “beat the market” as market and external forces, such as geopolitical crises and natural disasters, are nearly impossible to predict. However, four algorithms were statistically better at the 95% confidence interval than “Zero-Rule” and “Random-Guess” strategies which are expected to pseudo-reflect the EMH. Furthermore, the addition of text features can significantly improve performance compared to only using price history from the oil futures data, challenging the validity of the semi-strong versions of the EMH in the crude oil market.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The interval of $2.00 was selected to provide the largest minority class membership possible for the training period (2000–2002).
 
2
Reports before 1995 consist of images of scanned documents.
 
Literature
go back to reference Berenson, M.L., Goldstein, M., Levine, D. (1983). Intermediate statistical methods and applications: a computer package approach, 2nd edn. Upper Saddle River: Prentice Hall. Berenson, M.L., Goldstein, M., Levine, D. (1983). Intermediate statistical methods and applications: a computer package approach, 2nd edn. Upper Saddle River: Prentice Hall.
go back to reference Crawford, M., Khoshgoftaar, T.M., Prusa, J.D. (2016). Reducing feature set explosion to facilitate real-world review spam detection. In The twenty-ninth international flairs conference. Crawford, M., Khoshgoftaar, T.M., Prusa, J.D. (2016). Reducing feature set explosion to facilitate real-world review spam detection. In The twenty-ninth international flairs conference.
go back to reference Fama, E.F. (1970). Efficient capital markets: A review of theory and empirical work. The journal of Finance, 25(2), 383–417.CrossRef Fama, E.F. (1970). Efficient capital markets: A review of theory and empirical work. The journal of Finance, 25(2), 383–417.CrossRef
go back to reference Froot, K.A., & Frankel, J.A. (1989). Forward discount bias: Is it an exchange risk premium?. The Quarterly Journal of Economics, 104(1), 139–161.CrossRef Froot, K.A., & Frankel, J.A. (1989). Forward discount bias: Is it an exchange risk premium?. The Quarterly Journal of Economics, 104(1), 139–161.CrossRef
go back to reference Graham, J.R., & Harvey, C.R. (1996). Market timing ability and volatility implied in investment newletters’ asset allocation recommendations (Tech. Rep.). National Bureau of Economic Research. Graham, J.R., & Harvey, C.R. (1996). Market timing ability and volatility implied in investment newletters’ asset allocation recommendations (Tech. Rep.). National Bureau of Economic Research.
go back to reference Graham, J.R., & Harvey, C.R. (1997). Grading the performance of market-timing newsletters. Financial Analysts Journal, 53(6), 54–66.CrossRef Graham, J.R., & Harvey, C.R. (1997). Grading the performance of market-timing newsletters. Financial Analysts Journal, 53(6), 54–66.CrossRef
go back to reference Grossman, S.J., & Stiglitz, J.E. (1980). On the impossibility of informationally efficient markets. The American economic review, 70(3), 393–408. Grossman, S.J., & Stiglitz, J.E. (1980). On the impossibility of informationally efficient markets. The American economic review, 70(3), 393–408.
go back to reference Jensen, M.C. (1968). The performance of mutual funds in the period 1945–1964. The Journal of Finance, 23(2), 389–416.CrossRef Jensen, M.C. (1968). The performance of mutual funds in the period 1945–1964. The Journal of Finance, 23(2), 389–416.CrossRef
go back to reference Jones, C.P., & Litzenberger, R.H. (1970). Quarterly earnings reports and intermediate stock price trends. The Journal of Finance, 25(1), 143–148.CrossRef Jones, C.P., & Litzenberger, R.H. (1970). Quarterly earnings reports and intermediate stock price trends. The Journal of Finance, 25(1), 143–148.CrossRef
go back to reference Lai, K., & et al. (2005). Journal of Systems Science and Complexity, 18(2), 145–166. Lai, K., & et al. (2005). Journal of Systems Science and Complexity, 18(2), 145–166.
go back to reference Laibson, D. (1997). Golden eggs and hyperbolic discounting. The Quarterly Journal of Economics, 112(2), 443–478.CrossRef Laibson, D. (1997). Golden eggs and hyperbolic discounting. The Quarterly Journal of Economics, 112(2), 443–478.CrossRef
go back to reference Lawrence, R. (1997). Using neural networks to forecast stock market prices. University of Manitoba, 333. Lawrence, R. (1997). Using neural networks to forecast stock market prices. University of Manitoba, 333.
go back to reference Li, X., & Yu, T. (2016). Forecasting oil price trends with sentiment of online news articles. Procedia Computer Science, 91(2016), 1081–1087.CrossRef Li, X., & Yu, T. (2016). Forecasting oil price trends with sentiment of online news articles. Procedia Computer Science, 91(2016), 1081–1087.CrossRef
go back to reference Malkiel, B.G. (2005). Reflections on the efficient market hypothesis: 30 years later. Financial Review, 40(1), 1–9.CrossRef Malkiel, B.G. (2005). Reflections on the efficient market hypothesis: 30 years later. Financial Review, 40(1), 1–9.CrossRef
go back to reference Nassirtoussi, A.K., Aghabozorgi, S., Wah, T.Y., Ngo, D.C.L. (2014). Text mining for market prediction:A systematic review. Expert Systems with Applications, 41(16), 7653–7670.CrossRef Nassirtoussi, A.K., Aghabozorgi, S., Wah, T.Y., Ngo, D.C.L. (2014). Text mining for market prediction:A systematic review. Expert Systems with Applications, 41(16), 7653–7670.CrossRef
go back to reference Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523.CrossRef Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513–523.CrossRef
go back to reference Sebastiani, F. (2002). Machine learning in automated text categorization. ACM computing surveys (CSUR), 34(1), 1–47.CrossRef Sebastiani, F. (2002). Machine learning in automated text categorization. ACM computing surveys (CSUR), 34(1), 1–47.CrossRef
go back to reference Seker, S.E., Mert, C., Al-Naami, K., Ozalp, N., Ayan, U. (2014). Time series analysis on stock market for text mining correlation of economy news. Retrieved from CoRR arXiv:1403.2002. Seker, S.E., Mert, C., Al-Naami, K., Ozalp, N., Ayan, U. (2014). Time series analysis on stock market for text mining correlation of economy news. Retrieved from CoRR arXiv:1403.​2002.
go back to reference Seliya, N., Khoshgoftaar, T.M., Van Hulse, J. (2009). A study on the relationships of classifier performance metrics. In 21st international conference on Tools with artificial intelligence, 2009. ictai’09 (pp. 59–66). Seliya, N., Khoshgoftaar, T.M., Van Hulse, J. (2009). A study on the relationships of classifier performance metrics. In 21st international conference on Tools with artificial intelligence, 2009. ictai’09 (pp. 59–66).
go back to reference Sewell, M.V. (2012). The efficient market hypothesis: Empirical evidence. International Journal of Statistics and Probability, 1(2), 164.CrossRef Sewell, M.V. (2012). The efficient market hypothesis: Empirical evidence. International Journal of Statistics and Probability, 1(2), 164.CrossRef
go back to reference Weiss, G.M., & Provost, F. (2003). Learning when training data are costly: the effect of class distribution on tree induction. Journal of Artificial Intelligence Research, 19, 315–354.CrossRef Weiss, G.M., & Provost, F. (2003). Learning when training data are costly: the effect of class distribution on tree induction. Journal of Artificial Intelligence Research, 19, 315–354.CrossRef
go back to reference Witten, I.H., Frank, E., Hall, M.A., Pal, C.J. (2016). Data mining: practical machine learning tools and techniques. Morgan Kaufmann. Witten, I.H., Frank, E., Hall, M.A., Pal, C.J. (2016). Data mining: practical machine learning tools and techniques. Morgan Kaufmann.
go back to reference Xie, W., Yu, L., Xu, S., Wang, S. (2006). A new method for crude oil price forecasting based on support vector machines. In Computational Science—ICCS 2006 (pp. 444–451). Xie, W., Yu, L., Xu, S., Wang, S. (2006). A new method for crude oil price forecasting based on support vector machines. In Computational Science—ICCS 2006 (pp. 444–451).
go back to reference Yu, L., Dai, W., Tang, L. (2016). A novel decomposition ensemble model with extended extreme learning machine for crude oil price forecasting. Engineering Applications of Artificial Intelligence, 47, 110–121.CrossRef Yu, L., Dai, W., Tang, L. (2016). A novel decomposition ensemble model with extended extreme learning machine for crude oil price forecasting. Engineering Applications of Artificial Intelligence, 47, 110–121.CrossRef
go back to reference Yu, L., Wang, S., Lai, K. (2005). A rough-set-refined text mining approach for crude oil market tendency forecasting. International Journal of Knowledge and Systems Sciences, 2(1), 33– 46. Yu, L., Wang, S., Lai, K. (2005). A rough-set-refined text mining approach for crude oil market tendency forecasting. International Journal of Knowledge and Systems Sciences, 2(1), 33– 46.
go back to reference Zhang, J.-L., Zhang, Y.-J., Zhang, L. (2015). A novel hybrid method for crude oil price forecasting. Energy Economics, 49, 649– 659.CrossRef Zhang, J.-L., Zhang, Y.-J., Zhang, L. (2015). A novel hybrid method for crude oil price forecasting. Energy Economics, 49, 649– 659.CrossRef
Metadata
Title
Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures
Authors
Joseph D. Prusa
Ryan T. Sagul
Taghi M. Khoshgoftaar
Publication date
30-05-2018
Publisher
Springer US
Published in
Information Systems Frontiers / Issue 1/2019
Print ISSN: 1387-3326
Electronic ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-018-9859-2

Other articles of this Issue 1/2019

Information Systems Frontiers 1/2019 Go to the issue

Premium Partner