Hostname: page-component-5db58dd55d-688nx Total loading time: 0 Render date: 2026-05-31T12:54:45.576Z Has data issue: false hasContentIssue false

Prediction and uncertainty quantification of drought in North Benin

Published online by Cambridge University Press:  24 November 2025

Bernardin Marie Augustin Sèdjro Ligan*
Affiliation:
Université Mohammed VI Polytechnique (UM6P) , Ben Guerir, Marrakesh-Safi, Morocco
Gbègninougbo Aurel Davy Tchokponhoue
Affiliation:
Université Mohammed VI Polytechnique (UM6P) , Ben Guerir, Marrakesh-Safi, Morocco
Alain Thierry Iliho Manzi
Affiliation:
Université Mohammed VI Polytechnique (UM6P) , Ben Guerir, Marrakesh-Safi, Morocco
Kora B. Désiré Simperegui
Affiliation:
International Fertilizer Development Center (IFDC) , Accra, Greater Accra, Ghana
Rodrigue Bignon Wilfried Vodounon
Affiliation:
Université d’Abomey-Calavi Institut de Mathématiques et des Sciences Physiques (IMSP) , Dangbo, Ouémé, Bénin
*
Corresponding author: Bernardin Marie Augustin Sèdjro Ligan; Email: bernardin.ligan@um6p.ma

Abstract

Drought forecasting is a critical tool for mitigating the severe impacts of water scarcity, particularly in regions like North Benin, where agriculture is a cornerstone of livelihoods. Despite the vital importance of its accurate prediction in resource management, the ability to quantify uncertainties in forecasts is a significant pain point to enable more informed and trustworthy decision-making. So, this study aims to develop an uncertainty-aware prediction model for drought forecasting in six key localities within the Alibori department—Banikoara, Gogounou, Kandi, Karimama, Malanville, and Segbana—each facing unique challenges due to drought. To achieve this, we conducted a comprehensive experiment involving six machine learning models (linear regression, ridge regression, random forest, Xgboost, LightGBM, and SVM) and four deep learning models (Conv1D, LSTM, GRU, and Conv1D-LSTM) using the Standardized Precipitation Index at a 6-month scale. To address the uncertainty quantification challenge, we employed the Ensemble Batch Prediction Interval, a conformal prediction method specifically designed for time series data. Our comparative analysis, framed within the Borda count methodology, utilized performance metrics such as R2, RMSE, MSE, and carbon footprint, as well as uncertainty quantification metrics, including empirical coverage and the width of prediction intervals. The top-performing models achieved $ {R}^2 $ scores of 98.29, 97.84, 97.76, 97.42, 96.61, and 97.07%, and prediction interval coverages of 0.94, 0.79, 0.93, 0.77, 0.73, and 0.93, respectively, for Banikoara, Gogounou, Malanville, Kandi, Segbana, and Karimama. The Conv1D-LSTM model stood out as the most effective, offering an optimal balance between predictive accuracy and uncertainty coverage.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Flowchart of the proposed methodology: From multivariate time series preprocessing to model training, uncertainty quantification with EnbPI, and final performance evaluation using various statistical metrics.

Figure 1

Figure 2. Alibori department cartography and geographic distribution of the locations. The cropland cover is derived from the MODIS land cover.

Figure 2

Table 1. Description of weather variables used in the study

Figure 3

Table 2. SPI interpretation (Lloyd-Hughes and Saunders, 2002)

Figure 4

Table 3. Lag retained per covariate and locality

Figure 5

Figure 3. Barplots of the correlations between SPI6 and lagged weather variables.

Figure 6

Table 4. Architecture of the Conv1d model used in the study

Figure 7

Table 5. Architecture of the LSTM model used in the study

Figure 8

Table 6. Architecture of the GRU model used in the study

Figure 9

Table 7. Architecture of the Conv1D-LSTM hybrid model used in the study

Figure 10

Table 8. Performance of different models for Malanville

Figure 11

Figure 4. Malanville’s prediction intervals and observations of the most effective algorithm (a) and the least effective algorithm (b) using the EnBPI conformal prediction method for the SPI6. The shaded blue area represents the 90% prediction interval, the black line indicates the model’s point predictions, green dots show observations within the interval, and red dots denote observations outside the interval.

Figure 12

Figure 5. Malanville’s Taylor Diagram.

Figure 13

Table 9. Performance of different models for Karimama

Figure 14

Figure 6. Karimama’s prediction intervals and observations of the most effective algorithm (a) and the least effective algorithm (b) using the EnBPI conformal prediction method for the SPI6. The shaded blue area represents the 90% prediction interval, the black line indicates the model’s point predictions, green dots show observations within the interval, and red dots denote observations outside the interval.

Figure 15

Figure 7. Karimama’s Taylor Diagram.

Figure 16

Table 10. Performance of different models for Kandi

Figure 17

Figure 8. Kandi’s prediction intervals and observations of the most effective algorithm (a) and the least effective algorithm (b) using the EnBPI conformal prediction method for the SPI6. The shaded blue area represents the 90% prediction interval, the black line indicates the model’s point predictions, green dots show observations within the interval, and red dots denote observations outside the interval.

Figure 18

Figure 9. Kandi’s Taylor Diagram.

Figure 19

Table 11. Performance of different models for Gogounou

Figure 20

Figure 10. Gogounou’s prediction intervals and observations of the most effective algorithm (a) and the least effective algorithm (b) using the EnBPI conformal prediction method for the SPI6. The shaded blue area represents the 90% prediction interval, the black line indicates the model’s point predictions, green dots show observations within the interval, and red dots denote observations outside the interval.

Figure 21

Figure 11. Gogounou’s Taylor Diagram.

Figure 22

Table 12. Performance of different models for Banikoara

Figure 23

Figure 12. Banikoara’s prediction intervals and observations of the most effective algorithm (a) and the least effective algorithm (b) using the EnBPI conformal prediction method for the SPI6. The shaded blue area represents the 90% prediction interval, the black line indicates the model’s point predictions, green dots show observations within the interval, and red dots denote observations outside the interval.

Figure 24

Figure 13. Banikoara’s Taylor Diagram.

Figure 25

Table 13. Performance of different models for Segbana

Figure 26

Figure 14. Segbana’s prediction intervals and observations of the most effective algorithm (a) and the least effective algorithm (b) using the EnBPI conformal prediction method for the SPI6. The shaded blue area represents the 90% prediction interval, the black line indicates the model’s point predictions, green dots show observations within the interval, and red dots denote observations outside the interval.

Figure 27

Figure 15. Segbana’s Taylor Diagram.

Author comment: Prediction and uncertainty quantification of drought in North Benin — R0/PR1

Comments

Bernardin LIGAN

University Mohammed VI Polytechnic (UM6P)

Ben Guerir, Morocco, 43150

Bernardin.LIGAN@um6p.ma

September 11, 2024.

Environmental Data Science (EDS)

Dear Editors,

I am pleased to submit our manuscript titled Prediction and Uncertainty Quantification of Drought in North Benin for consideration for publication in the EDS Journal. Our study addresses a critical issue in climate science and agricultural planning by developing a novel approach for drought forecasting that incorporates uncertainty quantification in North Benin, a region where agriculture is the primary livelihood for the majority of the population.

In our research, we focused on six localities within the Alibori department of North Benin: Banikoara, Gogounou, Kandi, Karimama, Malanville, and Segbana. Each of these areas faces unique challenges due to frequent droughts, which have severe socio-economic impacts. To enhance the reliability of drought predictions, we employed an Ensemble Batch Prediction Interval (EnbPI) method, a conformal prediction approach tailored to time series data, to effectively quantify uncertainties in our forecasts.

Our study leveraged a comprehensive range of machine learning (ML) and deep learning (DL) models—linear regression, ridge regression, random forest, XGBoost, LightGBM, SVR, Conv1D, LSTM, GRU, and Conv1D-LSTM—to predict drought scenarios and assess their performance through a Borda count methodology. This comparative analysis was based on metrics such as R², RMSE, MAE, carbon footprint, and the coverage and width of prediction intervals. Our results demonstrated that the Conv1D-LSTM model provided the best balance between accuracy and uncertainty coverage, achieving a high R² value across all six localities.

We believe this research makes a significant contribution to the field by demonstrating the feasibility and importance of uncertainty quantification in drought forecasting, particularly in low-income countries like Benin. Our findings can inform decision-makers, agricultural researchers, and local communities to adopt more resilient strategies in response to drought scenarios.

Furthermore, in line with our commitment to transparency and reproducibility, we have made our full code and dataset publicly available on GitHub, enabling other researchers to build upon our work and further enhance the prediction of droughts in similar settings.

We believe that our manuscript is well-suited for publication in EDS, given its relevance to the journal’s readership and its potential to contribute valuable insights into the intersection of climate science, machine learning, and agricultural sustainability.

We appreciate your consideration of our manuscript and look forward to your feedback.

Thank you for your time and consideration.

Sincerely,

Bernardin LIGAN

Review: Prediction and uncertainty quantification of drought in North Benin — R0/PR2

Conflict of interest statement

No

Comments

Report on Prediction and uncertainty quantification of drought in North Benin

The study attempts to identify the uncertainties from the meteorological drought index namely the SPI through applying the Machine learning and deep learning in the Northern Benin with datasets covering 1981-2021. Some results are shown especially the R-square (R2) but others can be seen. The topic is interesting for water resource monitoring over the region and environmental management. It fits well with journal with clear objectives. However, the choice of the SPI time scale lacks of justification. In addition, the introduction should be updated with at least three to five recent references (e.g., 2023 and 2024).

My major concerns are listed as follows:

1) The data description is missing is very critical for understanding of the performance of the models used. I was particularly waiting to see the training model, the validation and then the prediction with their associated uncertainties. Please, at least consider using station data for Kandi to validate the gridded data before proceed.

2) While trying to predict the drought and assert the uncertainties, it is difficult to figure out from the methodology the employed equations for each machine learning method. I encourage the authors to show the equations for the methods used. For instance, RNN has been applied, how many layers and sub-layers (hidden) were utilised, the weight applied, etc.

3) The uncertainties are not shown except the R-square which is likely to be contribution in the section of abstract

4) Please, use more recent references. For example, one recent reference, you may refer to is: Environmental Data Science, Volume 3, 2024, e11; DOI: https://doi.org/10.1017/eds.2024.10 (Arsène Nounangnon Aïzansi et al, 2024)

5) Be aware of autocorrelation among the variables used to predict the SPI.

Specific comments

1) In the abstract, please clarify the values of the uncertainties and the SPI scale, and the reason for chosen the scale. Did you try for 12-month, 15-month, 24-month or 3-month?

2) Abstract: to tackle or to enable? Please, revise

3) I suggest that author (s) describe (s) in a table all the data used, time spanning, horizontal resolution, the variables.

4) Under the experimental part, “A batch size of 64 was used for all model training, with the number of epochs adjusted per locality to prevent overfitting”. Verify the spelling of overfitting

5) From “followed by three …. and linear activations, respectively”, why “D” are in capital ?

6) “which violate the exchangeability assumption these methods rely on (Barber et al., 2023; Xu and Xie, 2023)”. Rely on …; the sentence is incomplete.

7) “employing 30 bootstraps”, does it mean the size sample chosen for bootstrapping?

8) Figure 8, the title is confusing. Is it R-square that is presented in the figure or MAE? Please, clarify.

9) There are minor adjustments of some sentences, please read carefully and correct.

Review: Prediction and uncertainty quantification of drought in North Benin — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

Ligan et al. present an uncertainty-aware drought prediction model for six localities in Benin’s Alibori department. The study addresses an important issue and suggests a potentially useful and efficient tool. However, the paper suffers from several significant flaws that prevent a proper assessment of the results' reliability. If the following suggested revisions are addressed, the paper will be suitable for publication.

Section 1

- Correct the typo “de” in line 1 on page 3.

Section 2.1

- Correct the typo “Pi” in equation (1) on page 5.

Section 2.2

-The sections in the manuscript are ordered incorrectly (2.2, 2.1, 2.2, 2.3). from page 4. Could you please correct the section numbering to follow a logical order? Also correct the table numbering to follow a logical order to match its reference in the manuscript.

- Once you introduce an acronym, you don’t need to write out the full phrase again.

--- For example, Standardized Precipitation Index (SPI) should only be spelled out the first time, and after that, you can simply use SPI throughout the rest of the paper. Similarly to others.

--- Are you using atmospheric data specific to each site, or are you relying on gridded data for broader regions?

--- Could you provide details on how the data for each town were extracted?

--- Did you consider using the local grid cell corresponding to each town’s longitude and latitude coordinates?

- There are other data sources, such as ERA5 from the Climate Data Store, that provide finer spatial resolution (e.g., 0.25° × 0.25°) compared to NASA POWER (1/2° × 5/8°). What led you to choose NASA POWER over these alternatives?

--- Did you consider using ERA5 or similar datasets for better spatial detail?

--- Were there specific advantages of NASA POWER (e.g., data availability, temporal coverage, or preprocessing convenience) that influenced your choice?

- Add the source of the SPI interpretation Table 2 on page 5.

- The table of lagged variables (page 6) is interesting. It would be valuable to explore this data further.

--- Consider adding bar charts for each site, showing the correlations per month of lag for each variable. Although this would require additional space, it is an important aspect worth highlighting.

- Add the Min-Max scaler formula.

- Include a methodology flowchart for better clarity.

- Clarify the normalization process:

---Was normalization done before or after splitting the dataset into training, validation, and test sets?

---To avoid data leakage, it is crucial to split the dataset first and then apply normalization only to the training set. The same normalization parameters should then be used for the validation and test sets.

--- Since real-world applications do not have access to test data during training, the normalization approach needs to be explicitly justified. Was this procedure followed in this study?

- Can you give information about the computer specifications used for the analysis?

Section 2.3.1

- Discuss the limitations of each model used.

Section 2.3.2

- Clarify the description of the validation set:

--- The text states: “To train the models, an additional validation set was created using 80% of the training dataset.” This phrasing might confuse readers, as it could suggest that the validation set itself comprises 80% of the training data. Consider rephrasing for clarity.

- Specify the approach used to adjust the number of epochs (e.g., validation monitoring, early stopping) to prevent overfitting.

- Provide details on hyperparameters, including:Learning rate, Optimizer, Any regularization techniques applied…

Section 2.4

- Write the limitations of the Ensemble Batch Prediction Interval methods

- Explain how uncertainty quantification affects the usefulness of predictions.

- Explain the rationale for selecting the Borda Count method, with more references

- Add sources for the evaluation metrics used.

- The formula used for R² in your study appears to be different from the conventional definition. Did this formulation come from a specific reference or prior work?

- Consider adding a Taylor diagram to visually compare model performance in terms of correlation, variance, and overall error. This would provide a more intuitive and comprehensive assessment of model accuracy and robustness.

Section 3

- Please increase the font size of the plot’s titles and axes, which are currently difficult to read.

- Include a visualization of the loss function during both training and validation to monitor potential overfitting or underfitting.

- Did you rescale the predicted values back to their original units before computing evaluation metrics such as RMSE, MAE, and R²? Make it clear in the methodology flowchart for better clarity.

--- Since the data was scaled before training, computing metrics directly on the scaled data may lead to values that are not interpretable in the original context.

--- If rescaling was not done before metric calculation, could you clarify whether this affects the reported performance of the models?

Section 4

- More information regarding the impact of this work would be helpful. Provide stakeholders important insights.

- Discuss the limitations of relying solely on SPI6. Add the study’s limitations and your suggestions to the results.

The authors should make it clear how novel the research is. Without a compelling argument for its uniqueness, the paper may fail to captivate readers. It is critical for authors to explain what distinguishes their work and how it advances the field.

- What novel contributions does your research make in terms of methodology or application?

- How does your approach improve uncertainty quantification in drought forecasting for these specific regions?

Recommendation: Prediction and uncertainty quantification of drought in North Benin — R0/PR4

Comments

Please use the GitHub integration with Zenodo to create a permanent record of your data, code and models, and update your Data Availability Statement to include the Zenodo DOI: https://docs.github.com/en/repositories/archiving-a-github-repository/referencing-and-citing-content

Decision: Prediction and uncertainty quantification of drought in North Benin — R0/PR5

Comments

No accompanying comment.

Author comment: Prediction and uncertainty quantification of drought in North Benin — R1/PR6

Comments

Cover Letter – Revised Manuscript Submission

Bernardin LIGAN

University Mohammed VI Polytechnic (UM6P)

Ben Guerir, Morocco, 43150

Bernardin.LIGAN@um6p.ma

August 15, 2025

Environmental Data Science (EDS)

Dear Editors,

I am pleased to resubmit our revised manuscript titled Prediction and Uncertainty Quantification of

Drought in North Benin for consideration in Environmental Data Science. This work addresses the

critical challenge of drought forecasting in a vulnerable region where agriculture is the main source

of livelihood. Our approach combines state-of-the-art machine learning and deep learning models

with the Ensemble Batch Prediction Interval (EnbPI) method to produce reliable forecasts and

quantify predictive uncertainties.

In the first submission, we presented results for six localities in North Benin—Banikoara,

Gogounou, Kandi, Karimama, Malanville, and Segbana—highlighting the capacity of hybrid deep

learning models, particularly Conv1D-LSTM, to achieve a strong balance between predictive

accuracy and robust uncertainty coverage.

Following the reviewers’ valuable feedback, we have substantially improved the manuscript:

• Clarified methodological details, including normalization procedures, avoidance of data

leakage, and scaling back predictions before computing metrics.

• Added missing equations, expanded model descriptions with limitations, and integrated a

methodology flowchart.

• Included a Taylor diagram for visual model comparison and enhanced the explanation of the

Borda count method with relevant references.

• Corrected formula inconsistencies, reorganized section and table numbering, and updated

execution environment details.

• Ensured complete transparency and reproducibility by making the dataset available on

Zenodo and linking the source code on GitHub with a Zenodo DOI.

We believe that these revisions have significantly strengthened the manuscript, both in

methodological rigor and clarity of presentation. We are confident that the improved version offers

greater value to the journal’s readership and advances the understanding of uncertainty-aware

drought forecasting in data-scarce regions.

Thank you for considering our revised submission. We look forward to your favorable response.

Sincerely,

Bernardin LIGAN

Review: Prediction and uncertainty quantification of drought in North Benin — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

It is partly improved and I have no further comments.

Review: Prediction and uncertainty quantification of drought in North Benin — R1/PR8

Conflict of interest statement

There are no competing interests

Comments

The study aims to develop an uncertainty-aware prediction model for drought forecasting in six key localities within the Alibori department—Banikoara, Gogounou, Kandi, Karimama, Malanville, and Segbana—each facing unique challenges due to drought. There are some minor concerns that should be addressed.

Specific comments

1) Under the abstracts, the full name should be given before using the abbreviation, for example, Deep Learning (DL) should be defined in the first use. Please verify this through the MS, line 30.

2) which model gave which performance, i.e., XGBoost, LightGBM or which one ?

Most of work considered RMSE as one of the best metrics for models comparison; however, authors preferred interval coverage and R-square. Please what are advantages of showing interval coverage values over the RMSE?

3) Line 45; uncertainty quantification “of” drought ….

4) Line 13 in page 14, cRMSD(Centered Root-Mean-Squared-Difference). Space is needed

5) References section, https://scikit-learn.org/stable/about.html#citing-scikit-learn; Authors can get the reference as indicated (If you use scikit-learn in a scientific publication, we would appreciate citations to the following paper: Scikit-learn: Machine Learning in Python, Pedregosa et al., JMLR 12, pp. 2825-2830, 2011).

Recommendation: Prediction and uncertainty quantification of drought in North Benin — R1/PR9

Comments

There are still some open matters mentioned by one of the Reviewers. The Associate Editor would also like to point out that certain Authors‘ responses to Reviewers’ comments could have been better or more prominently incorporated in the manuscript. For example, the Authors‘ responses to Reviewers’ inquiries on “uncertainty quantification” with regards to the proposed method should have been stressed more clearly, as it seems a centerpiece of the Authors' work.

Decision: Prediction and uncertainty quantification of drought in North Benin — R1/PR10

Comments

No accompanying comment.

Author comment: Prediction and uncertainty quantification of drought in North Benin — R2/PR11

Comments

Dear Editors,

We are pleased to resubmit our second revised manuscript entitled Prediction and Uncertainty Quantification of Drought in North Benin for further consideration in Environmental Data Science.

This study is one of the first of its kind in a low-income country context, focusing on six localities in North Benin that are particularly vulnerable to drought. Our approach integrates machine learning, deep learning, and conformal prediction methods to deliver both accurate forecasts and reliable uncertainty quantification. This dual perspective is central to our contribution, as it provides decision-makers, researchers, and local communities with robust tools to anticipate and respond to drought scenarios.

In this second revision, we carefully addressed the remaining open matters raised by the reviewers and the Associate Editor. In particular:

We have more clearly emphasized the role of uncertainty quantification, highlighting its centrality in both methodology and results.

We revised the abstract to ensure that abbreviations (e.g., Deep Learning, DL) are defined at first mention, and that key metrics (RMSE, R2, and interval coverage) are presented with greater clarity.

We explained why interval coverage is particularly relevant for assessing model performance in an uncertainty-aware setting, complementing conventional metrics such as RMSE.

We implemented all minor corrections suggested (grammar, phrasing, and formatting issues such as spacing in cRMSD).

We updated the references to properly cite the scikit-learn library (Pedregosa et al., JMLR 2011), as recommended.

We believe that this second revised version strengthens the manuscript both in methodological rigor and in clarity of presentation. We are confident that the improved version now fully addresses the reviewers’ concerns and is suitable for publication.

We thank you for your continued consideration and look forward to your feedback.

Sincerely,

Bernardin LIGAN (on behalf of all co-authors)

Review: Prediction and uncertainty quantification of drought in North Benin — R2/PR12

Conflict of interest statement

Reviewer declares none.

Comments

I have no more comments to authors.

Recommendation: Prediction and uncertainty quantification of drought in North Benin — R2/PR13

Comments

No accompanying comment.

Decision: Prediction and uncertainty quantification of drought in North Benin — R2/PR14

Comments

No accompanying comment.