1 Introduction

Flooding in the Southern part of Ethiopia is mostly associated with the geography of the catchment and the natural drainage systems formed by the principal river basins [1, 2]. The extreme rainfall event results in an extreme runoff [3, 4] and usually causes a significant damage to the natural system and human–environment [5]. The extreme event frequency analysis and determination of its magnitude at different return periods are important for sustainable water resources management [6,7,8]. Streamflow drought defines a shortage of water resources, which includes groundwater, reservoir, or streamflow levels. According to A. K. Mishra and V. P. Singh [9], investigation of the past hydrological drought characteristics provides better information for effective monitoring of upcoming drought events.

The best-fit probability distribution model (PDM) for the extreme event study is determined based on goodness-of-fit measures. PDM is most widely used in hydrological extreme events estimation and prediction [1]. Understanding of the characteristics of hydrological extreme events using the best-fit PDM is very important to enhance effective management and utilization of water resources in the catchment [10, 11].

Ethiopia is considered as the water tower of the eastern Africa region has frequently experienced hydrological drought at an increasing intensity throughout the past many decades. For instance, the study conducted by Edossa et al. [12] shows an increasing occurrence of drought events, with the frequency of drought, occurred within 5 years. The study conducted by Gebrehiwot et al. [13] showed that the frequency of recurrent drought in Ethiopia has been showing an increasing trend for over the past decades. Moreover, flooding in the lower part of Kulfo River catchment becomes an unbeatable disaster and a challenge for the rural community living around the lower catchment. Farmers in the catchment extensively use Kulfo River for their irrigation, but their crops are affected by flooding each year.

Many previous studies have investigated long-term hydrological extreme even related to climate change [14, 15]. Changes in Earth’s climate system affect the balance of hydrological cycle and eventually lead to increased occurrence of extreme events such as flood and droughts [16,17,18]. Recent investigations also show that global climate change will create and intensify more severe frequent floods and droughts in the region [19, 20].

Extreme hydrological event analysis is easy to apply, but it includes large uncertainties in the choice of the best-fit probability distribution model, especially when study flood frequency analysis. Moreover, it cannot produce the accurate flood frequency curve for hydraulic structure and it is dependent on the PDMs. Some of the most frequently used PDMs in hydrology are the Gumbel, Generalized Extreme Values Distribution (GEV), Gamma (3P), Log-Normal (2P), Log-Normal (3P) and Log-Pearson (3P) PDMs [21]. However, the choice of an appropriate PDM is still one of the main problems for hydrologist.

To the knowledge of the experts, there was no attempt/study carried out related to hydrological extreme event analysis using the best-fit PDM in Kulfo River. This research was, therefore, initiated to characterize the hydrological extremes using the best-fit PDM of Kulfo River. The objectives of this study were to conduct (i) the best-fit PDM for determining flood frequency curve (ii) the temporal variation of hydrological drought using at different timescales (3, 6, and 12 months) and characterize the hydrological droughts, duration, severity, intensity and relative frequency, and finally, analyze the flood frequency curve over return periods of 2, 5, 10, 20, 50, and 100 years.

2 Materials and methods

2.1 Study area

This study was conducted in Kulfo River Southern Ethiopian Rift Valley Basin, near Arba Minch town with a geographical location between 5o 55′ and 6o 15′ N latitude and 37o 18′ and 37o 36′ E longitude, the elevation of between 1253 and 3550 m above sea level, and has a total area of 492 km2 (Fig. 1). The mean annual temperature rises to 14–23 °C, the catchment receives up to 620–1250 mm rainfall annually, and the mean annual discharge is estimated at around 1.8–22.8 m3/s (for the observation period of 1985–2014). The Kulfo catchment can be classified by four main soil types, namely cambisols, ferralsols, regosols, and fluvisols [22]. It is characterized by severe land degradation resulting in soil erosion, flooding, sediment, and nutrient intrusion to Lake Chamo [23]. In the catchment, agriculture is the major land use activity and occupies the flat alluvial land of the catchment surrounding the river. Thick bushlands, open woodland, forest, grassland with cultivated land are found on the catchment [24]. Extreme flooding events, in and around the catchment, induce restriction on agricultural productivity reduction through the cultivation of nonfarming activities. The Kulfo River also serves as a potential water source for the urban population and covers around 70% safe water supply to the rural community [25].

Fig. 1
figure 1

Regional map of Ethiopia (a), Abay–Chamo Rift Valley Basins watershed map (b), drainage network, hydrological station, and altitude, based on a digital elevation model of Shuttle Radar Topographic Mission 30 m of study area Kulfo catchment (c)

2.2 Data sources

Thirty-year daily discharge data series for the period of 1985–2014 were collected from Kulfo River station near the town of Arba Minch which were obtained from the Ministry of Water, Irrigation and Energy of Ethiopia (MoWIE) (http://www.mowr.gov.et/). However, in this study, the monthly discharge data analyzed from the above thirty-year daily discharge data have been used for hydrological drought assessment and annual daily peak-discharge data series were used for flood frequency analysis.

2.3 Probability distribution models (PDMs)

The PDM is the integral of the probability density function. This function is very important because it measures the inconsistency among the empirical and theoretical distributions. This is, therefore, for hydrological extreme event analysis the best-fit PDM has been identified as the best method [26].

2.3.1 Log-Normal (2P)

The probability density function and cumulative distribution function (cdf) of the 2-parameter Log-Normal (2P) are given as:

$$f\left( x \right) = \frac{1}{{x\sigma \gamma \sqrt {2\pi } }}exp\left[ { - \frac{1}{{2\sigma_{\gamma }^{2} }}\left( {\ln \left( x \right) - \mu \gamma } \right)^{2} } \right]$$
(1)
$$F\left( x \right) = \frac{1}{{x\sigma \gamma \sqrt {2\pi } }}\mathop \int \limits_{0}^{x} \left( {\frac{1}{x}exp\left[ { - \frac{1}{{2\sigma_{\gamma }^{2} }}\left( {\ln \left( x \right) - \mu \gamma } \right)^{2} } \right]} \right){\text{d}}x$$
(2)

where the assortment of random variable x is greater than zero. The logarithm of the x variable, y = ln(x), is best explained by a Normal PDM. Method of momentum (MOM) estimator has been used, and the 2 parameters are expressed as follows:

$$\sigma \gamma = \left[ {\ln \left( {1 + \frac{{\sigma_{X}^{2} }}{{\mu_{X}^{2} }}} \right)} \right]^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}}}}$$
(3)
$$\mu \gamma = \ln \left( {\mu x} \right) - \frac{1}{2} \sigma_{\gamma }^{2} .$$
(4)

where \(\gamma\) is coefficient of skewness, and \(X\) is annual daily peak discharge of observed data, which are the commonly used parameters of the Log-Normal (2P) PDM. Here, \(\sigma \gamma\) is the standard deviation of coefficient of skewness and \(\mu \gamma\) is the mean of coefficient of skewness for the Log-Normal (2P) PDM. The Log-Normal (2P) should be more fitted than the simple Normal PDM as it is appropriate for a “heavy tail” on one side, which can be best-fit annual daily peak discharge of observed data.

2.3.2 Log-Normal (3P)

The probability density function of the three-parameter Log-Normal distribution is:

$$f\left( x \right) = \frac{1}{{\left( {x - x_{o} } \right)\sigma \sqrt {2\mu } }}exp\left\{ { - \frac{{\left[ {\ln \left( {x - x_{o} } \right) - \mu^{2} } \right]}}{{2\sigma^{2} }}} \right\}$$
(5)

where \(0 \le x_{o } < x, - \infty < \pi \left\langle {\infty ,\sigma } \right\rangle 0. \pi , \sigma , and x_{o}\) are the parameters of distribution.

2.3.3 Gumbel

Gumbel is extreme value (EV) distributions case, thus extreme value type I corresponds to Gumbel distribution, and the PDM is:

$$f\left( x \right) = \frac{1}{\alpha }exp\left[ { - \frac{x - u}{\alpha } - exp\left( { - \frac{x - u}{\alpha }} \right)} \right]$$
(6)
$$\alpha = \frac{{\sqrt 6 S_{x} }}{\pi }$$
(7)
$$u = \bar{x} - 0.5772\alpha$$
(8)

α-Scale and μ-shape are parameters of Gumbel distribution, estimated by L-moment.

2.3.4 Generalized Extreme Values (GEV)

The GEV distribution is a family of continuous probability distributions that combines the Gumbel (EV1) and Weibull distributions. GEV distribution makes use of 3 parameters: location \(\left( \xi \right)\), scale \(\left( \alpha \right)\), and shape \(\left( k \right)\) [27]. The parameters are estimated by weighted moments. The probability distribution function is:

$$f_{\xi ,\mu ,\sigma } \left( x \right) = \frac{1}{\sigma }\left( {1 + \xi \frac{{\left( {x + \mu } \right)}}{\sigma }} \right)^{1 - 1/\xi } exp\left( { - \left( {1 + \xi \frac{{\left( {x + \mu } \right)}}{\sigma }} \right)^{ - 1/\xi } } \right) \xi \ne 0$$
(9)
$$f_{0,\pi ,\sigma } \left( x \right) = \frac{1}{\sigma }e^{{ - \left( {x - \pi } \right)/\sigma }} \exp \left( { - e^{{ - \left( {x - \pi } \right)/\sigma }} } \right) \xi = 0$$
(10)
$$k = 7.8590c + 2.9554c^{2}$$

where \(c = \frac{2}{{3 + \tau_{3} }} - \frac{\ln 2}{\ln 3},\alpha = \frac{{\lambda_{2} k}}{{\left( {1 - 2^{ - k} } \right)\varGamma \left( {1 + k} \right)}},\xi = \lambda_{1} - \alpha \left\{ {1 - \varGamma \left( {1 + k} \right)} \right\}/k\)

2.3.5 Gamma (3P)

The other regularly used PDM in extreme event study is the Gamma (3P) PDM which is a 2-parameter Gamma distribution with a third parameter for the location. The pdf and cumulative distribution function of Gamma (3P) are:

$$f\left( x \right) = \frac{1}{\alpha \varGamma \left( \beta \right)}\left[ {\left( {\frac{x - }{\alpha }} \right)^{\beta - 1} } \right]exp\left[ { - \frac{{\left( {x - } \right)}}{\alpha }} \right]$$
(11)
$$F\left( x \right) = \frac{1}{\alpha \varGamma \left( \beta \right)}\mathop \int \limits_{{}}^{x} \left[ {\left( {\frac{x - }{\alpha }} \right)^{\beta - 1} } \right]\exp \left[ { - \frac{{\left( {x - } \right)}}{\alpha }} \right]{\text{d}}x$$
(12)

The parameters are shape \(\beta\), scale \(\alpha\), and location ξ which are estimated by MOM estimators that are,

$$\beta = {\raise0.7ex\hbox{$4$} \!\mathord{\left/ {\vphantom {4 {\gamma^{2} }}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\gamma^{2} }$}}$$
(13)
$$\alpha = {\raise0.7ex\hbox{${\sigma \gamma }$} \!\mathord{\left/ {\vphantom {{\sigma \gamma } 2}}\right.\kern-0pt} \!\lower0.7ex\hbox{$2$}}$$
(14)
$$\zeta = \mu - {\raise0.7ex\hbox{${2\sigma }$} \!\mathord{\left/ {\vphantom {{2\sigma } \gamma }}\right.\kern-0pt} \!\lower0.7ex\hbox{$\gamma $}}$$
(15)

2.3.6 Log-Pearson (3P)

The Log-Pearson (3P), one more Gamma family of PDM, defines a random variable and logarithm following the Log-Pearson type III PDM. The pdf and cdf of Log-Pearson (3P) are as follows:

$$f\left( x \right) = \frac{1}{\alpha \varGamma \left( \beta \right)}\left[ {\left( {\frac{x - }{\alpha }} \right)^{\beta - 1} } \right]\exp \left[ { - \frac{{\left( {\ln \left( x \right) - } \right)}}{\alpha }} \right]$$
(16)
$$F\left( x \right) = \frac{1}{\alpha \varGamma \left( \beta \right)}\mathop \int \limits_{{}}^{x} \left[ {\left( {\frac{x - }{\alpha }} \right)^{\beta - 1} } \right]\exp \left[ { - \frac{{\left( {\ln \left( x \right) - } \right)}}{\alpha }} \right]{\text{d}}x$$
(17)

2.4 Goodness-of-fit tests

Goodness-of-fit test statistics are used to validate a specified or assumed PDM [28], while empirical distribution function tests offer a measure of the inconsistency among the empirical and theoretical distributions. The mostly used empirical distribution function tests are the Kolmogorov–Smirnov (K–S) [29], the Anderson–Darling (A–D) [30], and Chi-squared (C–S) test. The K–S, A–D, and C–S tests were applied in this study. The factor of best-fit can also be quantified by using a quantile–quantile (Q–Q) plot. The goodness-of-fit tests were carried out using Easy Fit, available at http://www.mathwave.com/easyfit-distribution-fitting.html.

2.4.1 Kolmogorov–Smirnov (K–S)

The Kolmogorov–Smirnov (K–S) test statistic, the highest class of empirical distribution function (EDF) statistics, is based on the highest alteration among the hypothetical and empirical distributions [26]. The main target of this test is to relate the empirical cumulative frequency (Sn(x)) with the cumulative distribution function of an assumed theoretical distribution (FX(x)) [31]. The highest variance among Sn(x) and FX(x) is the K-S test statistic. For a model size n, the data are reordered in ascending order where \(x_{1} < x_{2} < . . . < x_{n}\) and the K-S statistic is evaluated for each value as given:

$$S_{n} \left( x \right) = 0;if X < X_{1} = {\raise0.7ex\hbox{$k$} \!\mathord{\left/ {\vphantom {k n}}\right.\kern-0pt} \!\lower0.7ex\hbox{$n$}};if X_{k} \left\langle { X_{k + 1} = 1;if X} \right\rangle X_{n}$$
(18)
$$D_{n} = max\left| {F_{x} \left( x \right) - S_{n} \left( x \right)} \right|$$
(19)
$$P\left( {D_{n} \le D_{n}^{\alpha } } \right) = 1 - \alpha$$
(20)

where \(D_{n}^{\alpha }\) is the critical value, \(\alpha\) is the highest level, and \(k\) is the descending order of the data set.

2.4.2 Anderson–Darling (A–D)

The Anderson–Darling (A–D) test was first developed by Anderson and Darling [30], and this test is accurate at the tails of the PDM [32]. In belongings with comparatively high extremes, it may be the A-D test to be more appropriate to select the best-fitted PDM to annual daily peak-discharge data [26]. The A–D test statistic, the quadratic class of the EDF test statistic, is explained as A2 as follows:

$$A^{2} = - \mathop \sum \limits_{i = 1}^{n} \left[ {\left( {2i - 1} \right)\ln F_{X} \left( {x_{i} } \right) + \ln \frac{{\left[ {1 - F_{X} \left( {x_{n + 1 - i} } \right)} \right]}}{n}} \right] - n$$
(21)

where \(F_{X} \left( {x_{i} } \right)\) is the cdf of the projected distribution at \(x_{i}\), for \(i = 1, 2, \ldots ,n\). The annual daily peak-discharge data essentially ordered in ascending order, as \(x_{1} < x_{2} < . . . < x_{n}\).

In the K-S test, both the hypothetical and experimental cumulative distribution functions are comparatively smooth at the tails of the PDMs, which means the A-D test gives more accurate to the tails [32]. This shows a more fitted test when the tails of the selected theoretical distribution are the emphasis of the analysis, as with hydrological events [26].

2.4.3 Chi-squared (C–S)

Chi-squared (C–S) test is a goodness-of-fit test that relates how the theoretical continuous probability distribution functions fit with the empirical continuous probability distribution functions. The C–S test is defined as:

$$x^{2} = \mathop \sum \limits_{i = 1}^{k} \frac{{\left( {Q_{i} - E_{i} } \right)}}{{E_{i} }}$$
(22)

where \(x^{2}\) is the test statistic, \(Q_{i}\) is the observed frequency, and \(E_{i}\) is the expected frequency defined as:

$$E_{1} = F\left( {x_{2} } \right) - F\left( {x_{2} } \right)$$
(23)

where \(F\) is the cumulative distribution function (CDF) of theoretical continuous probability distribution functions, and \(x_{1 } and x_{2}\) are the lower and upper limits.

2.4.4 Quantile–Quantile (Q–Q) plot

Quantile–Quantile (Q–Q) plot is an effective method to analyze whether the fitted distributions are reliable with the given set of observations. It is a mean of comparing annual daily peak-discharge data to a theoretical distribution.

The observed data \(x_{i }\) are ranked in ascending order and denoted from \(x_{1:n}\) to \(x_{n:n}\), where \(n\) is the total number of observations. A plotting position of the probability \(P_{i:n}\) is computed for each \(x_{1:n}\) using the plotting position formula defined as:

$$P_{i:n} = \frac{{\left( {i - 0.4} \right)}}{{\left( {n + 0.2} \right)}}$$
(24)

The Q–Q plot, each \(x_{1:n}\), is paired with \(y_{1:n}\), which is computed from the assumed cumulative distribution function, \(F\left( x \right)\). The set of points \(\left( {x_{1:n} ,y_{1:n} } \right)\) is plotted on normal graph with a \(1 :1\) straight line extended from the basis.

2.5 Drought indices analysis

2.5.1 Streamflow drought index (SDI)

According to Nalbantis and Tsakiris [33], if a time series of monthly discharge \(Q_{i,j}\) is available, in which i signifies the water year and j the month within that water year (j = 1 for October and j = 12 for September), \(V_{j,k}\) can be found based on the equation:

$$Vi,k = \mathop \sum \limits_{j = 1}^{3k} Qi,j i = 1, 2, j = 1,2, \ldots , 12 k = 1, 2, 3, 4$$
(25)

in which \(Vi,j\) is the overall amount of monthly discharge for the i-th water year and the k-th reference period, k = 1 for October–December, k = 2 for October–March, k = 3 for October-June, and k = 4 for October-September. Based on the overall amount of monthly discharge \(V_{i,k}\), the streamflow drought index (SDI) is defined for each reference period k of the i-th water year as given:

$$SDI_{i,k} = \frac{{V_{i,k} - \overline{{V_{k} }} }}{{s_{k} }} i = 1, 2, \ldots , k = 1, 2, 3, 4$$
(26)

where \(\bar{V}_{k}\) and \(s_{k}\) are, respectively, the mean and the standard deviation (SD) of overall amount of monthly discharge of the reference period k as these are simulated over an extent of time. In this definition the lopping level is set to \(V_{k}\) although other values based on normal criteria could also be used.

Usually, for small basins, streamflow may follow a skewed probability distribution which can well be approached by the family of the gamma distribution functions, and the SDI index is defined as:

$$SDI_{i,k} = \frac{{y_{i,k} - \overline{{y_{k} }} }}{{s_{y,k} }} i = 1, 2, \ldots , k = 1, 2, 3, 4$$
(27)

in which

$$y_{i,k} = \ell n\left( {V_{i,k} } \right), i = 1, 2, \ldots , k = 1, 2, 3, 4$$
(28)

are the natural logarithms of overall amount of flow with mean \(\bar{y}_{k}\) and standard deviation (SD) \(S_{y,k}\) as these statistics are estimated over an extent of time. According to Nalbantis and Tsakiris [33], classes of hydrological drought are explained for SDI in the same way to those used in the standardized precipitation index (SPI) as shown in (Table 1).

Table 1 Definition of hydrological drought indices (SDI)

2.5.2 Streamflow drought characteristics

A drought index is the main variable in order to evaluate the consequence of drought and to decide various drought characteristics, such as length of drought (L), intensity (I), magnitude (M), and relative frequency (RF) [9].

2.5.2.1 Drought duration

In order to measure the length of drought (duration) and magnitude of drought (severity), a threshold value must be defined. The drought length (L) is the period length in which the SDI value is continuously negative, started from the SDI values is equal to minus one and ends when the SDI values turn out to be positive.

2.5.2.2 Drought severity

The drought severity (S) is the summation of SDI values within the drought duration, which is defined by Eq. (29), and intensity of drought is the ratio of the severity of drought to its duration.

$$S_{i} = - \mathop \sum \limits_{i = 1}^{D} SDI_{i}$$
(29)
2.5.2.3 Relative frequency

The relative frequency (RF) of drought is the ratio of number of years with drought events (n) (negative SDI) to number of total years (N) [34], and the RF is defined as:

$$RF = \frac{n}{N} \times 100$$
(30)

2.6 Homogeneity, trend, independence, and stationarity test on annual daily peak discharge

There are several statistical tests to check the homogeneity, trend, independence, and stationarity of a data series. Some tests are known as parametric because they assume that the analyzed variable is normally distributed. Those that do not make this assumption are called nonparametric tests [35].

In this study, the standard normal homogeneity (SNHT) test for homogeneity [36], Mann–Kendall (M–K) test for trend detection, and Wald–Wolfowitz (W–W) test for independence and stationarity of data [21, 37] have been used. The homogeneity, trend, independence, and stationarity test of annual daily peak-discharge data series were carried out in r-programming using trend packages.

2.7 Flood frequency analysis (FFA)

The FFA has been carried out for the observed (1985–2014) periods of the annual daily peak-discharge data series were used. It was computed using flood frequency distribution (FFD) package (Run_Flood.m), run using MATLAB software. The Gumbel, Generalized Extreme Values (GEV), Gamma (3P), Log-Normal (2P), Log-Normal (3P), and Log-Pearson (3P) PDMs were used. The FFA has been performed for T-return periods of 2, 5, 10, 50, and 100 years. In this study, probability distribution model parameters were determined using the L-moment and method of moments.

3 Results and discussion

This study analyzed the characteristics of hydrological extremes in the Kulfo River of Southern Ethiopian Rift Valley Basin. In the catchment, due to heavy rainfall flooding has been a frequent phenomenon where the lower part of the catchment is affected. In preparation for hydrological extreme events, it is important to understand the extreme hydrological events such as flood and drought. Hydrological extreme event assessment contains consistent evidence as to the primary factor in the decision-making process [38].

3.1 Goodness-of-fit tests

The best-fit distribution was taken as the critical value at alpha 0.05, for K–S, A–D, and C–S tests as shown in Table 2. This value chooses which distribution is accepted or rejected in this study. The result shows all the distributions are statistically accepted. On the other hand, it showed the significance of the distribution. The ranking is given on the difference between statistical and critical value. Among all, the GEV probability distribution model is given the first rank in all three tests. Therefore, Generalized Extreme Values Distribution (GEV) is selected as the best-fit probability distribution model among Gumbel, Gamma (3P), Log-Normal (2P), Log-Normal (3P), and Log-Pearson (3P) probability distribution models.

Table 2 Goodness-of-fit test of the Kulfo River using Kolmogorov–Smirnov (K–S), Anderson–Darling (A–D), and Chi-squared (C–S) test

The quantile–quantile (Q–Q) plot was used to visually examine the level of best-fit probability distribution models. From the visual point of view, there is an insignificant difference to select among the various distributions in the study. The result of the quantile–quantile (Q–Q) plot for the study shows that all distributions are the best-fit with a significant correlation \((R > 0.9)\)(Fig. 2). In quantile–quantile (Q–Q) plot theoretically, all points should fall on the 1:1 line if the probability distribution models are the best-fit [39]. In many cases, it is difficult to find out the best-fit one from the visual display of the Q–Q plot. Q–Q plot result shown in (Fig. 2) in all models all points falls in the 1:1 line and significantly correlated. Therefore, in addition to the Q–Q plot further statistical test needs to find out the best-fit probability distribution model.

Fig. 2
figure 2

Quantile-Quantile (Q-Q) plot of annual daily observed vs. estimated peak-discharge data series for Kulfo River (1985–2014), correlation coefficient(R) as the computed p value is greater than the significance level alpha = 0.05, and the red straight line is 1:1 line

3.2 Temporal variations of SDI

In our study, SDI for 3-, 6-, and 12-month scales are computed to examine the characteristics of hydrological drought in short-, medium-, and long-term period, respectively. The alteration of SDI according to year is shown in Fig. 3. The SDI alteration shows that in the 6 and 12 month timescales exceed the threshold level for extreme dry (SDI < − 2) and extreme wet (SDI > 2). The SDI for the medium and long period values shows that the worst hydrological drought occurred during the hydrological year of 1991, 1992, and 2014 had the most severity indicating extreme drought events, but the 1992 drought event is not observed in the short-term period (3-month drought). However, extreme wet is observed in all timescales. Moreover, it showed that in 1985,1986, 1987, 1990, 1991, 1992, 1993, 1994, 1995, 1999, 2008, 2009, and 2014 hydrological drought events with varying severities occurred during these 30 years. Similarly, the study was conducted by Gebrehiwot et al. [13]; in the Northern Highlands of Ethiopia, severe drought events occurred in the year of 1985, 1987, 1991, and 2009. The Kulfo River has a leading role in the water balance of Lake Chamo as one of Rift Valley Lake in Southern Ethiopia. However, in a recent study, the amount of Lake Chamo water level slowly decreased [40]. This is maybe a result of the above continuously occurred drought events that affect the level of Lake. Therefore, the drought events, combined with the increasing water demand for irrigation and drinking and anthropogenic activates on the catchment, have decreased significantly the amount of water flow to the Lake Chamo.

Fig. 3
figure 3

Temporal variation of SDI for all set of timescales (3, 6 and 12 month) in Kulfo River of Southern Ethiopian Rift Valley Basin (1985–2014)

3.3 Hydrological drought characteristics

The hydrological characteristics of drought for (3, 6, and 12 month) timescales are shown in Tables 3 and 4. Recapitulation has shown the apparent difference in duration, severity, intensity, and relative frequency of each timescale, and it describes the characteristics of drought in a one timescale had a difference with others. The longest drought duration is observed in 3- and 6-month timescales as 6 year (1990–1995), with the relative frequency of 20%. The lowest drought duration is observed in 3-, -6, and 12-month timescales that are 1 year (1997, 1999, 2008, 2009, and 2014), with the relative frequency of 3.3%. The percentage of drought occurrence is at three timescales and the drought classes in the Kulfo River (Table 4). The result shows that for 3-, -6, and 12-month timescales, mild drought range (57.8–66.7%) occurs more frequently than expected in the Kulfo River. However, the incidence of drought events severity of mild and above for all timescales has approximately similar values, showing that drought occurs once every 1–3 years. This result confirmed by previous studies in the Awash River central Ethiopian Rift Valley Basin, for example, around 49% mild drought occurred most frequently and the basin hits by drought once every 2 years [12, 40].

Table 3 Recapitulation of streamflow drought events
Table 4 Drought occurrence in Kulfo River for observation period (1985–2014)

3.4 Homogeneity, trend, independence, and stationarity test of annual daily peak discharge

For carrying out homogeneity, trend detection, independence, and stationarity analysis, daily annual peak-discharge data series are with the help of the standard normal homogeneity (SNHT) test, Mann–Kendall (M–K) test, and Wald–Wolfowitz (W–W) test, respectively. The test statistic of various tests and acceptance or rejection of the null hypothesis of annual daily peak-discharge data series is presented in Table 5. The annual daily peak-discharge data series \(\left( {p = 0.39} \right)\) show significant \((p > 0.05)\) homogenous according to the SNHT test. According to the M–K test, the annual daily peak-discharge data series \(\left( {p = 0.75} \right)\) show significant \((p > 0.05)\) no trend observed. However, the W–W test result shows the data series is independent and stationary as the computed p value is lower than the significance level alpha = 0.05, accept the alternative hypothesis Ha. The time series plot shows there are no significant change point and trend detected (Fig. 4).

Table 5 Homogeneity, trend detection, independence, and stationarity test of annual daily peak discharge at significance level alpha = 0.05
Fig. 4
figure 4

Change point and trend of annual daily peak discharge of Kulfo River (1985–2014)

3.5 Flood frequency

The best-fit probability distribution model of the flood magnitude to return period events is given in Fig. 5. The result of the best-fit probability distribution model and the flood magnitude for return periods of 2, 5, 10, 20, 50, and 100 years were 42.2, 56.1, 68, 78.7, 91.9, and 101.2 m3/s, respectively. These flood magnitudes are important in the engineering design of hydraulic structures in the Kulfo River. The 50-year flood event was occurred in the river on an average of every 50 years and had a probability of 2%. The flood magnitude 68.0 and 78.7 m3/s would reoccur within the 10- to 20-year return period and had a probability of 5 and 10%, respectively. The highest flood magnitude occurred in 100-year return period which had the lowest probability of occurrence.

Fig. 5
figure 5

Flood magnitude and return period of Generalized Extreme Values (GEV) probability distribution model of Kulfo River

The results (Fig. 5) which show the hydrograph of flood magnitude at different return periods are significant in their flood risk management and early warning system. Flood causes more loss of life and distraction than any other natural event. The study conducted by Council [41] concludes that distraction from floods in many countries is increasing over time. The frequency and intensity of extreme hydrological events are changing in ways that are difficult to understand and predict.

4 Conclusion

Based on the Kolmogorov–Smirnov (K–S), Anderson–Darling (A–D), and Chi-square (C–S) test result among six probability distribution models (Log-Normal (2P), Gumbel, Generalized Extreme Values (GEV), Gamma (3P), Log-Pearson (3P), and Log-Normal (3P)), the Generalized Extreme Values (GEV) were found the best-fit probability distribution model in the Kulfo River.

The hydrological drought characteristics in the region showed an apparent change in all timescale; the extended drought duration is observed in 3- and 6-month timescales 6 years (1990–1995), with the relative frequency of 20%. This, in turn, delivers an indication for proper policy enforcement to safeguard and control drought-associated risks in the study area.

The 50-year return period of flood indicates increment in flood events of 10- and 20-year return periods. This will have significant impacts on the community and the river’s morphology. Therefore, the government authorities in the basin need to inform the community hydrological risks to create consciousness and work on flood control projects. The information should include best management practices such as flood and drought protection and early warning systems.

The flood frequency studies can be used in the application of hydraulic structures and the delineation of regulatory floodplains, and also, substantially the return period of these hydrological events can be used in understanding the risk level of damage by extreme flood events, while for the expansion of a long-term regional water plan and the readiness for drought in the region, the current information of drought characteristics is important.

Hydrological drought analyses based on one indicator may not be enough because drought singularities are related to multiple variables (e.g., rainfall, river discharge, and soil moisture).