Skip to main content
Top

2024 | OriginalPaper | Chapter

An Overview on Evaluation Methods of Sequence Prediction Problems

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Sequence prediction problems are prevalent in various domains, including natural language processing, time series analysis, bioinformatics, and condition-based maintenance. Evaluating the performance of sequence prediction models is crucial to assess their accuracy, robustness, and generalization capabilities. This paper presents an overview of evaluation methods used for sequence prediction problems. Throughout the paper, we emphasize the importance of selecting suitable evaluation methods that align with the specific characteristics and goals of typical sequence prediction problems. We also provide insights into the considerations associated with each evaluation method. The paper discusses the fundamental metrics commonly employed, such as accuracy, precision, recall, and F1-score, which provide insights into the overall performance of sequence prediction models. Additionally, some more specialized metrics tailored to sequence prediction, are presented. These metrics account for the unique characteristics and challenges of sequence data. In the paper evaluation techniques specific to distinct types of sequence prediction problems are evaluate such as, perplexity, BLEU score, and ROUGE score which are widely used to evaluate language models and machine translation systems. In time series analysis, metrics such as mean absolute error (MAE), root mean squared error (RMSE), and mean absolute percentage error (MAPE) are commonly employed.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Selvin, S., Vinayakumar, R., Gopalakrishnan, E.A., Menon, V.K., Soman, K.P.: Stock price prediction using LSTM, RNN and CNN-sliding window model. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 1643–1647, Manipal (2017) Selvin, S., Vinayakumar, R., Gopalakrishnan, E.A., Menon, V.K., Soman, K.P.: Stock price prediction using LSTM, RNN and CNN-sliding window model. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 1643–1647, Manipal (2017)
2.
go back to reference Sharma, A., Bhuriya, D., Singh, U.: Survey of stock market prediction using machine learning approach. In: 2017 International Conference of Electronics, Communication and Aerospace Technology (ICECA), vol. 2, pp. 506–509. RVS Technical Campus (2017) Sharma, A., Bhuriya, D., Singh, U.: Survey of stock market prediction using machine learning approach. In: 2017 International Conference of Electronics, Communication and Aerospace Technology (ICECA), vol. 2, pp. 506–509. RVS Technical Campus (2017)
3.
go back to reference Mehta, Y., Malhar, A., Shankarmani, R.: Stock price prediction using machine learning and sentiment analysis. In: 2021 2nd International Conference for Emerging Technology (INCET), pp. 1–4, Belgaum, May 2021 Mehta, Y., Malhar, A., Shankarmani, R.: Stock price prediction using machine learning and sentiment analysis. In: 2021 2nd International Conference for Emerging Technology (INCET), pp. 1–4, Belgaum, May 2021
4.
go back to reference Chen, M.Y., Chen, B.T.: A hybrid fuzzy time series model based on granular computing for stock price forecasting. Inf. Sci. 294, 227–241 (2015)MathSciNetCrossRef Chen, M.Y., Chen, B.T.: A hybrid fuzzy time series model based on granular computing for stock price forecasting. Inf. Sci. 294, 227–241 (2015)MathSciNetCrossRef
5.
go back to reference Li, Y., Pan, Y.: A novel ensemble deep learning model for stock prediction based on stock prices and news. Int. J. Data Sci. Anal. 1–11 (2022) Li, Y., Pan, Y.: A novel ensemble deep learning model for stock prediction based on stock prices and news. Int. J. Data Sci. Anal. 1–11 (2022)
6.
go back to reference Behera, R.K., Jena, M., Rath, S.K., Misra, S.: Co-LSTM: convolutional LSTM model for sentiment analysis in social big data. Inf. Process. Manag. 58(1), 102435 (2021)CrossRef Behera, R.K., Jena, M., Rath, S.K., Misra, S.: Co-LSTM: convolutional LSTM model for sentiment analysis in social big data. Inf. Process. Manag. 58(1), 102435 (2021)CrossRef
7.
go back to reference Hornyák, O., Iantovics, L.B.: AdaBoost algorithm could lead to weak results for data with certain characteristics. Mathematics 11(8), 1801 (2023)CrossRef Hornyák, O., Iantovics, L.B.: AdaBoost algorithm could lead to weak results for data with certain characteristics. Mathematics 11(8), 1801 (2023)CrossRef
8.
go back to reference Zhang, L., Wang, S., Liu, B.: Deep learning for sentiment analysis: a survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 8(4), e1253 (2018)CrossRef Zhang, L., Wang, S., Liu, B.: Deep learning for sentiment analysis: a survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 8(4), e1253 (2018)CrossRef
9.
go back to reference Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8(1), 6085 (2018)CrossRef Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8(1), 6085 (2018)CrossRef
10.
go back to reference Jelinek, F., Mercer, R.L., Bahl, L.R., Baker, J.K.: Perplexity—a measure of the difficulty of speech recognition tasks. J. Acoust. Soc. Am. 62(S1), S63–S63 (1977)CrossRef Jelinek, F., Mercer, R.L., Bahl, L.R., Baker, J.K.: Perplexity—a measure of the difficulty of speech recognition tasks. J. Acoust. Soc. Am. 62(S1), S63–S63 (1977)CrossRef
11.
go back to reference Reiter, E.: A structured review of the validity of BLEU. Comput. Linguist. 44(3), 393–401 (2018)CrossRef Reiter, E.: A structured review of the validity of BLEU. Comput. Linguist. 44(3), 393–401 (2018)CrossRef
12.
go back to reference Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004) Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004)
14.
go back to reference Dabre, R., Chu, C., Kunchukuttan, A.: A survey of multilingual neural machine translation. ACM Comput. Surv. (CSUR) 53(5), 1–38 (2020)CrossRef Dabre, R., Chu, C., Kunchukuttan, A.: A survey of multilingual neural machine translation. ACM Comput. Surv. (CSUR) 53(5), 1–38 (2020)CrossRef
15.
go back to reference Ranathunga, S., Lee, E.S.A., Prifti Skenduli, M., Shekhar, R., Alam, M., Kaur, R.: Neural machine translation for low-resource languages: a survey. ACM Comput. Surv. 55(11), 1–37 (2023) Ranathunga, S., Lee, E.S.A., Prifti Skenduli, M., Shekhar, R., Alam, M., Kaur, R.: Neural machine translation for low-resource languages: a survey. ACM Comput. Surv. 55(11), 1–37 (2023)
16.
go back to reference Banerjee, S., Lavie, A.: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65–72 (2005) Banerjee, S., Lavie, A.: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65–72 (2005)
17.
go back to reference Snover, M., Dorr, B., Schwartz, R., Micciulla, L., Makhoul, J.: A study of translation edit rate with targeted human annotation. In: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pp. 223–231 (2006) Snover, M., Dorr, B., Schwartz, R., Micciulla, L., Makhoul, J.: A study of translation edit rate with targeted human annotation. In: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pp. 223–231 (2006)
18.
go back to reference Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statis-tics. In: Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 150–157 (2003) Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statis-tics. In: Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 150–157 (2003)
19.
go back to reference Ribeiro, M.T., Wu, T., Guestrin, C., Singh, S. Beyond accuracy: behavioral testing of NLP models with CheckList. arXiv preprint arXiv:2005.04118 (2020) Ribeiro, M.T., Wu, T., Guestrin, C., Singh, S. Beyond accuracy: behavioral testing of NLP models with CheckList. arXiv preprint arXiv:​2005.​04118 (2020)
20.
go back to reference Kou, G., Yang, P., Peng, Y., Xiao, F., Chen, Y., Alsaadi, F.E.: Evaluation of feature selection methods for text classification with small datasets using multiple criteria decision-making methods. Appl. Soft Comput. 86, 105836 (2020)CrossRef Kou, G., Yang, P., Peng, Y., Xiao, F., Chen, Y., Alsaadi, F.E.: Evaluation of feature selection methods for text classification with small datasets using multiple criteria decision-making methods. Appl. Soft Comput. 86, 105836 (2020)CrossRef
21.
go back to reference Guerreiro, A.P., Fonseca, C.M., Paquete, L.: The hypervolume indicator: computational problems and algorithms. ACM Comput. Surv. (CSUR) 54(6), 1–42 (2021)CrossRef Guerreiro, A.P., Fonseca, C.M., Paquete, L.: The hypervolume indicator: computational problems and algorithms. ACM Comput. Surv. (CSUR) 54(6), 1–42 (2021)CrossRef
22.
go back to reference Shang, K., Ishibuchi, H., He, L., Pang, L.M.: A survey on the hypervolume indicator in evolutionary multiobjective optimization. IEEE Trans. Evol. Comput. 25(1), 1–20 (2020)CrossRef Shang, K., Ishibuchi, H., He, L., Pang, L.M.: A survey on the hypervolume indicator in evolutionary multiobjective optimization. IEEE Trans. Evol. Comput. 25(1), 1–20 (2020)CrossRef
23.
go back to reference Premkumar, M., et al.: A new arithmetic optimization algorithm for solving real-world multi-objective CEC-2021 constrained optimization problems: diversity analysis and validations. IEEE Access 9, 84263–84295 (2021)CrossRef Premkumar, M., et al.: A new arithmetic optimization algorithm for solving real-world multi-objective CEC-2021 constrained optimization problems: diversity analysis and validations. IEEE Access 9, 84263–84295 (2021)CrossRef
24.
go back to reference Van Veldhuizen, D.A., Lamont, G.B.: Evolutionary computation and convergence to a pare-to front. In: Late Breaking Papers at the Genetic Programming 1998 Conference, pp. 221–228 (1998) Van Veldhuizen, D.A., Lamont, G.B.: Evolutionary computation and convergence to a pare-to front. In: Late Breaking Papers at the Genetic Programming 1998 Conference, pp. 221–228 (1998)
25.
go back to reference Marler, R.T., Arora, J.S.: Survey of multi-objective optimization methods for engineering. Struct. Multidiscip. Optim. 26, 369–395 (2004)MathSciNetCrossRef Marler, R.T., Arora, J.S.: Survey of multi-objective optimization methods for engineering. Struct. Multidiscip. Optim. 26, 369–395 (2004)MathSciNetCrossRef
26.
go back to reference Gunantara, N.: A review of multi-objective optimization: methods and its applications. Cogent Eng. 5(1), 1502242 (2018)CrossRef Gunantara, N.: A review of multi-objective optimization: methods and its applications. Cogent Eng. 5(1), 1502242 (2018)CrossRef
27.
go back to reference Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)CrossRef Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)CrossRef
28.
go back to reference Lei, Y., Li, N., Guo, L., Li, N., Yan, T., Lin, J.: Machinery health prognostics: a systematic review from data acquisition to RUL prediction. Mech. Syst. Sig. Process. 104, 799–834 (2018)CrossRef Lei, Y., Li, N., Guo, L., Li, N., Yan, T., Lin, J.: Machinery health prognostics: a systematic review from data acquisition to RUL prediction. Mech. Syst. Sig. Process. 104, 799–834 (2018)CrossRef
29.
go back to reference Wang, B., Lei, Y., Yan, T., Li, N., Guo, L.: Recurrent convolutional neural network: a new framework for remaining useful life prediction of machinery. Neurocomputing 379, 117–129 (2020)CrossRef Wang, B., Lei, Y., Yan, T., Li, N., Guo, L.: Recurrent convolutional neural network: a new framework for remaining useful life prediction of machinery. Neurocomputing 379, 117–129 (2020)CrossRef
30.
go back to reference Ma, M., Mao, Z.: Deep-convolution-based LSTM network for remaining useful life prediction. IEEE Trans. Industr. Inf. 17(3), 1658–1667 (2020)CrossRef Ma, M., Mao, Z.: Deep-convolution-based LSTM network for remaining useful life prediction. IEEE Trans. Industr. Inf. 17(3), 1658–1667 (2020)CrossRef
Metadata
Title
An Overview on Evaluation Methods of Sequence Prediction Problems
Author
Olivér Hornyák
Copyright Year
2024
DOI
https://doi.org/10.1007/978-3-031-54674-7_32

Premium Partners