Top

Published in:

2024 | OriginalPaper | Chapter

An Overview on Evaluation Methods of Sequence Prediction Problems

Author : Olivér Hornyák

Published in: The 17th International Conference Interdisciplinarity in Engineering

Publisher: Springer Nature Switzerland

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Sequence prediction problems are prevalent in various domains, including natural language processing, time series analysis, bioinformatics, and condition-based maintenance. Evaluating the performance of sequence prediction models is crucial to assess their accuracy, robustness, and generalization capabilities. This paper presents an overview of evaluation methods used for sequence prediction problems. Throughout the paper, we emphasize the importance of selecting suitable evaluation methods that align with the specific characteristics and goals of typical sequence prediction problems. We also provide insights into the considerations associated with each evaluation method. The paper discusses the fundamental metrics commonly employed, such as accuracy, precision, recall, and F1-score, which provide insights into the overall performance of sequence prediction models. Additionally, some more specialized metrics tailored to sequence prediction, are presented. These metrics account for the unique characteristics and challenges of sequence data. In the paper evaluation techniques specific to distinct types of sequence prediction problems are evaluate such as, perplexity, BLEU score, and ROUGE score which are widely used to evaluate language models and machine translation systems. In time series analysis, metrics such as mean absolute error (MAE), root mean squared error (RMSE), and mean absolute percentage error (MAPE) are commonly employed.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter A P300 Based Brain-Computer Interface LabVIEW Instrument for Controlling an Experimental Prototype of Juices Vending Machine Using the Unicorn EEG Headset

next chapter A Tale of Two Automotive Security Services: A Formal Analysis

Selvin, S., Vinayakumar, R., Gopalakrishnan, E.A., Menon, V.K., Soman, K.P.: Stock price prediction using LSTM, RNN and CNN-sliding window model. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 1643–1647, Manipal (2017)

Sharma, A., Bhuriya, D., Singh, U.: Survey of stock market prediction using machine learning approach. In: 2017 International Conference of Electronics, Communication and Aerospace Technology (ICECA), vol. 2, pp. 506–509. RVS Technical Campus (2017)

Mehta, Y., Malhar, A., Shankarmani, R.: Stock price prediction using machine learning and sentiment analysis. In: 2021 2nd International Conference for Emerging Technology (INCET), pp. 1–4, Belgaum, May 2021

Chen, M.Y., Chen, B.T.: A hybrid fuzzy time series model based on granular computing for stock price forecasting. Inf. Sci. 294, 227–241 (2015)MathSciNetCrossRef

Li, Y., Pan, Y.: A novel ensemble deep learning model for stock prediction based on stock prices and news. Int. J. Data Sci. Anal. 1–11 (2022)

Behera, R.K., Jena, M., Rath, S.K., Misra, S.: Co-LSTM: convolutional LSTM model for sentiment analysis in social big data. Inf. Process. Manag. 58(1), 102435 (2021)CrossRef

Hornyák, O., Iantovics, L.B.: AdaBoost algorithm could lead to weak results for data with certain characteristics. Mathematics 11(8), 1801 (2023)CrossRef

Zhang, L., Wang, S., Liu, B.: Deep learning for sentiment analysis: a survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 8(4), e1253 (2018)CrossRef

Che, Z., Purushotham, S., Cho, K., Sontag, D., Liu, Y.: Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 8(1), 6085 (2018)CrossRef

10.

Jelinek, F., Mercer, R.L., Bahl, L.R., Baker, J.K.: Perplexity—a measure of the difficulty of speech recognition tasks. J. Acoust. Soc. Am. 62(S1), S63–S63 (1977)CrossRef

11.

Reiter, E.: A structured review of the validity of BLEU. Comput. Linguist. 44(3), 393–401 (2018)CrossRef

12.

Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out, pp. 74–81 (2004)

13.

Stahlberg, F.: Neural machine translation: a review. J. Artif. Intell. Res. 69, 343–418 (2020)MathSciNetCrossRef

14.

Dabre, R., Chu, C., Kunchukuttan, A.: A survey of multilingual neural machine translation. ACM Comput. Surv. (CSUR) 53(5), 1–38 (2020)CrossRef

15.

Ranathunga, S., Lee, E.S.A., Prifti Skenduli, M., Shekhar, R., Alam, M., Kaur, R.: Neural machine translation for low-resource languages: a survey. ACM Comput. Surv. 55(11), 1–37 (2023)

16.

Banerjee, S., Lavie, A.: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65–72 (2005)

17.

Snover, M., Dorr, B., Schwartz, R., Micciulla, L., Makhoul, J.: A study of translation edit rate with targeted human annotation. In: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, pp. 223–231 (2006)

18.

Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using n-gram co-occurrence statis-tics. In: Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp. 150–157 (2003)

19.

Ribeiro, M.T., Wu, T., Guestrin, C., Singh, S. Beyond accuracy: behavioral testing of NLP models with CheckList. arXiv preprint arXiv:2005.04118 (2020)

20.

Kou, G., Yang, P., Peng, Y., Xiao, F., Chen, Y., Alsaadi, F.E.: Evaluation of feature selection methods for text classification with small datasets using multiple criteria decision-making methods. Appl. Soft Comput. 86, 105836 (2020)CrossRef

21.

Guerreiro, A.P., Fonseca, C.M., Paquete, L.: The hypervolume indicator: computational problems and algorithms. ACM Comput. Surv. (CSUR) 54(6), 1–42 (2021)CrossRef

22.

Shang, K., Ishibuchi, H., He, L., Pang, L.M.: A survey on the hypervolume indicator in evolutionary multiobjective optimization. IEEE Trans. Evol. Comput. 25(1), 1–20 (2020)CrossRef

23.

Premkumar, M., et al.: A new arithmetic optimization algorithm for solving real-world multi-objective CEC-2021 constrained optimization problems: diversity analysis and validations. IEEE Access 9, 84263–84295 (2021)CrossRef

24.

Van Veldhuizen, D.A., Lamont, G.B.: Evolutionary computation and convergence to a pare-to front. In: Late Breaking Papers at the Genetic Programming 1998 Conference, pp. 221–228 (1998)

25.

Marler, R.T., Arora, J.S.: Survey of multi-objective optimization methods for engineering. Struct. Multidiscip. Optim. 26, 369–395 (2004)MathSciNetCrossRef

26.

Gunantara, N.: A review of multi-objective optimization: methods and its applications. Cogent Eng. 5(1), 1502242 (2018)CrossRef

27.

Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)CrossRef

28.

Lei, Y., Li, N., Guo, L., Li, N., Yan, T., Lin, J.: Machinery health prognostics: a systematic review from data acquisition to RUL prediction. Mech. Syst. Sig. Process. 104, 799–834 (2018)CrossRef

29.

Wang, B., Lei, Y., Yan, T., Li, N., Guo, L.: Recurrent convolutional neural network: a new framework for remaining useful life prediction of machinery. Neurocomputing 379, 117–129 (2020)CrossRef

30.

Ma, M., Mao, Z.: Deep-convolution-based LSTM network for remaining useful life prediction. IEEE Trans. Industr. Inf. 17(3), 1658–1667 (2020)CrossRef

Title: An Overview on Evaluation Methods of Sequence Prediction Problems
Author: Olivér Hornyák
Publisher: Springer Nature Switzerland
Book: The 17th International Conference Interdisciplinarity in Engineering
Print ISBN: 978-3-031-54673-0

Electronic ISBN: 978-3-031-54674-7

Copyright Year: 2024
DOI: https://doi.org/10.1007/978-3-031-54674-7_32

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Premium Partners