Skip to main content
Erschienen in: Environmental Earth Sciences 6/2024

01.03.2024 | Original Article

Modeling and forecasting rainfall patterns in India: a time series analysis with XGBoost algorithm

verfasst von: Pradeep Mishra, Abdullah Mohammad Ghazi Al Khatib, Shikha Yadav, Soumik Ray, Achal Lama, Binita Kumari, Divya Sharma, Ramesh Yadav

Erschienen in: Environmental Earth Sciences | Ausgabe 6/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This study utilizes time series analysis and machine learning techniques to model and forecast rainfall patterns across different seasons in India. The statistical models, i.e., autoregressive integrated moving average (ARIMA) and state space model and machine learning models, i.e., Support Vector Machine, Artificial Neural Network and Random Forest Model were developed and their performance was compared against XGBoost, an advanced machine learning algorithm, using training and testing datasets. The results demonstrate the superior accuracy of XGBoost compared to the statistical models in capturing complex nonlinear rainfall patterns. While ARIMA models tend to overfit the training data, state space models prove more robust to outliers in the testing set. Diagnostic checks show the models adequately capture the time series properties. The analysis indicates essential unchanging rainfall patterns in India for 2023–2027, with implications for water resource management and climate-sensitive sectors like agriculture and power generation. Overall, the study highlights the efficacy of modern machine learning approaches like XGBoost for forecasting complex meteorological time series. The framework presented enables rigorous validation and selection of optimal techniques. Further applications of such sophisticated data analysis can significantly enhance planning and research on the Indian monsoons amidst climate change challenges.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Al Khatib AMG, Yonar H, Abotaleb M, Mishra P, Yonar A, Karakaya K, Badr A, Dhaka V (2021) Modeling and forecasting of egg production in India using time series models. Eurasian J Vet Sci 37(4):265–273CrossRef Al Khatib AMG, Yonar H, Abotaleb M, Mishra P, Yonar A, Karakaya K, Badr A, Dhaka V (2021) Modeling and forecasting of egg production in India using time series models. Eurasian J Vet Sci 37(4):265–273CrossRef
Zurück zum Zitat Ali S, Shahbaz M (2020) Streamflow forecasting by modeling the rainfall–streamflow relationship using artificial neural networks. Model Earth Syst Environ 6:1645–1656CrossRef Ali S, Shahbaz M (2020) Streamflow forecasting by modeling the rainfall–streamflow relationship using artificial neural networks. Model Earth Syst Environ 6:1645–1656CrossRef
Zurück zum Zitat Desai VP, Kamat RK, Oza KS (2022) Rainfall modeling and prediction using neural networks: a case study of Maharashtra. Disaster Adv 15:39–43CrossRef Desai VP, Kamat RK, Oza KS (2022) Rainfall modeling and prediction using neural networks: a case study of Maharashtra. Disaster Adv 15:39–43CrossRef
Zurück zum Zitat Dutta PS, Tahbider H (2014) Prediction of rainfall using data mining technique over Assam. Indian J Comput Sci Eng 5:85–90 Dutta PS, Tahbider H (2014) Prediction of rainfall using data mining technique over Assam. Indian J Comput Sci Eng 5:85–90
Zurück zum Zitat Gil-Alana LA, Cunado J, Perez de Gracia F (2008) Tourism in the Canary Islands: forecasting using several seasonal time series models. J Forecast 27(7):621–636MathSciNetCrossRef Gil-Alana LA, Cunado J, Perez de Gracia F (2008) Tourism in the Canary Islands: forecasting using several seasonal time series models. J Forecast 27(7):621–636MathSciNetCrossRef
Zurück zum Zitat Joshi H, Tyagi D (2021) Forecasting and modeling monthly rainfall in Bengaluru, India: an application of time series models. Int J Sci Res Math Stat Sci 8(1):39–46 Joshi H, Tyagi D (2021) Forecasting and modeling monthly rainfall in Bengaluru, India: an application of time series models. Int J Sci Res Math Stat Sci 8(1):39–46
Zurück zum Zitat Luk KC, Ball JE, Sharma A (2001) An application of artificial neural networks for rainfall forecasting. Math Commun Model 33:683–693CrossRef Luk KC, Ball JE, Sharma A (2001) An application of artificial neural networks for rainfall forecasting. Math Commun Model 33:683–693CrossRef
Zurück zum Zitat Mishra P, Al Khatib AMG, Sardar I, Mohammed J, Ray M, Manish K et al (2020) Modelling and forecasting of COVID-19 in India. J Infect Dis Epidemiol 6(5):1–11 Mishra P, Al Khatib AMG, Sardar I, Mohammed J, Ray M, Manish K et al (2020) Modelling and forecasting of COVID-19 in India. J Infect Dis Epidemiol 6(5):1–11
Zurück zum Zitat Mishra P, Al Khatib AMG, Sardar I, Mohammed J, Karakaya K, Dash A et al (2021) Modeling and forecasting of sugarcane production in India. Sugar Tech 23(6):1317–1324CrossRef Mishra P, Al Khatib AMG, Sardar I, Mohammed J, Karakaya K, Dash A et al (2021) Modeling and forecasting of sugarcane production in India. Sugar Tech 23(6):1317–1324CrossRef
Zurück zum Zitat Navone HD, Ceccatto HA (1994) Predicting Indian monsoon rainfall: a neural network approach. Clim Dyn 10:305–312CrossRef Navone HD, Ceccatto HA (1994) Predicting Indian monsoon rainfall: a neural network approach. Clim Dyn 10:305–312CrossRef
Zurück zum Zitat Niranjan HK, Kumari B, Raghav YS, Mishra P, Al Khatib AMG, Abotaleb M (2022) Modeling and forecasting of tea production in India. J Anim Plant Sci 32(6):1598–1604 Niranjan HK, Kumari B, Raghav YS, Mishra P, Al Khatib AMG, Abotaleb M (2022) Modeling and forecasting of tea production in India. J Anim Plant Sci 32(6):1598–1604
Zurück zum Zitat Pal S, Mazumdar D (2018) Forecasting monthly rainfall using artificial neural network. Rashi 3:65–73 Pal S, Mazumdar D (2018) Forecasting monthly rainfall using artificial neural network. Rashi 3:65–73
Zurück zum Zitat Praveen B, Talukdar S, Shahfahad MS, Mondal J, Sharma P, Islam ARMdT, Rahman A (2020) Analyzing trend and forecasting of rainfall changes in India using non-parametrical and machine learning approaches. Sci Rep 10:10342CrossRefPubMedPubMedCentral Praveen B, Talukdar S, Shahfahad MS, Mondal J, Sharma P, Islam ARMdT, Rahman A (2020) Analyzing trend and forecasting of rainfall changes in India using non-parametrical and machine learning approaches. Sci Rep 10:10342CrossRefPubMedPubMedCentral
Zurück zum Zitat Raghav YS, Mishra P, Alakkari KM, Singh M, Al Khatib AMG, Balloo R (2022) Modelling and forecasting of pulses production in south Asian countries and its role in nutritional security. Legume Res Int J 45(4):454–461 Raghav YS, Mishra P, Alakkari KM, Singh M, Al Khatib AMG, Balloo R (2022) Modelling and forecasting of pulses production in south Asian countries and its role in nutritional security. Legume Res Int J 45(4):454–461
Zurück zum Zitat Rahman UH, Ray S, Al Khatib AMG, Lal P, Mishra P, Fatih C et al (2022) State of art of SARIMA model in second wave on COVID-19 in India. Int J Agric Stat Sci 18(1):141–152 Rahman UH, Ray S, Al Khatib AMG, Lal P, Mishra P, Fatih C et al (2022) State of art of SARIMA model in second wave on COVID-19 in India. Int J Agric Stat Sci 18(1):141–152
Zurück zum Zitat Sahoo A, Samantaray S, Ghose DK (2019) Stream flow forecasting in Mahanadi river basin using artificial neural networks. Procedia Comput Sci 157:168–174CrossRef Sahoo A, Samantaray S, Ghose DK (2019) Stream flow forecasting in Mahanadi river basin using artificial neural networks. Procedia Comput Sci 157:168–174CrossRef
Zurück zum Zitat Virmani A (2006) India’s economic growth history: fluctuations, trends, break points and phases. Indian Econ Rev 41:81–103 Virmani A (2006) India’s economic growth history: fluctuations, trends, break points and phases. Indian Econ Rev 41:81–103
Zurück zum Zitat Yadav S, Mishra P, Kumari B, Shah IA, Karakaya K, Shrivastri S et al (2022) Modelling and forecasting of maize production in South Asian countries. Econ Aff 67(4):519–531 Yadav S, Mishra P, Kumari B, Shah IA, Karakaya K, Shrivastri S et al (2022) Modelling and forecasting of maize production in South Asian countries. Econ Aff 67(4):519–531
Zurück zum Zitat Yonar H, Yonar A, Mishra P, Abotaleb M, Al Khatib AMG, Makarovskikh T, Cam M (2022) Modeling and forecasting of milk production in different breeds in Turkey. Indian J Anim Sci 92:105CrossRef Yonar H, Yonar A, Mishra P, Abotaleb M, Al Khatib AMG, Makarovskikh T, Cam M (2022) Modeling and forecasting of milk production in different breeds in Turkey. Indian J Anim Sci 92:105CrossRef
Zurück zum Zitat Beņkovskis K (2008) Short-term forecasts of Latvia's real gross domestic product growth using monthly indicators Beņkovskis K (2008) Short-term forecasts of Latvia's real gross domestic product growth using monthly indicators
Zurück zum Zitat Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD '16). Association for Computing Machinery, New York, pp 785–794. https://doi.org/10.1145/2939672.2939785 Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (KDD '16). Association for Computing Machinery, New York, pp 785–794. https://​doi.​org/​10.​1145/​2939672.​2939785
Zurück zum Zitat Soto-Ferrari M (2020) A time-series forecasting performance comparison for neural networks with state space and ARIMA models. In: Proceedings of the 5th N.A. international conference on industrial engineering and operations management Detroit, Michigan, USA Soto-Ferrari M (2020) A time-series forecasting performance comparison for neural networks with state space and ARIMA models. In: Proceedings of the 5th N.A. international conference on industrial engineering and operations management Detroit, Michigan, USA
Zurück zum Zitat Swain S, Nandi S, Patel P (2018) Development of an ARIMA model for monthly rainfall forecasting over Khordha District, Odisha, India, In: Sa P, Bakshi S, Hatzilygeroudis I, Sahoo M (eds) Recent findings in intelligent computing techniques. Advances in intelligent systems and computing, vol 708. Springer, Singapore. https://doi.org/10.1007/978-981-10-8636-6_34 Swain S, Nandi S, Patel P (2018) Development of an ARIMA model for monthly rainfall forecasting over Khordha District, Odisha, India, In: Sa P, Bakshi S, Hatzilygeroudis I, Sahoo M (eds) Recent findings in intelligent computing techniques. Advances in intelligent systems and computing, vol 708. Springer, Singapore. https://​doi.​org/​10.​1007/​978-981-10-8636-6_​34
Metadaten
Titel
Modeling and forecasting rainfall patterns in India: a time series analysis with XGBoost algorithm
verfasst von
Pradeep Mishra
Abdullah Mohammad Ghazi Al Khatib
Shikha Yadav
Soumik Ray
Achal Lama
Binita Kumari
Divya Sharma
Ramesh Yadav
Publikationsdatum
01.03.2024
Verlag
Springer Berlin Heidelberg
Erschienen in
Environmental Earth Sciences / Ausgabe 6/2024
Print ISSN: 1866-6280
Elektronische ISSN: 1866-6299
DOI
https://doi.org/10.1007/s12665-024-11481-w

Weitere Artikel der Ausgabe 6/2024

Environmental Earth Sciences 6/2024 Zur Ausgabe