Introduction

Water is the elixir of life and is crucial for sustainable development. Earlier, it was considered to be a limitless or at least fully renewable natural resource. However, in the last 20 years or so, there has been a tremendous pressure on this precious natural resource mainly due to rapid industrialization and human population. This is because an increase in the human population will simply result in increasing the demand for irrigation purpose to meet food production requirements. Though, the advancement in agricultural technology has been impressive in many regions, poor irrigation management has resulted in considerable depletion of the groundwater table, damaged soils and deterioration in the water quality thus making the availability of water in the future highly uncertain. Keeping in mind the scarcity of available water resources in the near future and its impending threats, it has become imperative on the part of water scientists as well as planners to quantify the available water resources for its judicial use. Thus, a ready reckoner to monitor the fluctuations in groundwater levels well in advance is the need of the hour to devise sustainable water management protocols (Sreekanth et al. 2009).

Due to the reliability and sustainability of the resource, groundwater has been generally accepted to be the best quality source of water for both drinking and irrigation purposes around the world (Hoque et al. 2009; Adiat et al. 2012). This, in addition to social development and agricultural production, has led to an increase in the awareness on the need to utilize and explore groundwater resources as alternative sources of water supply (Abdul et al. 2001).

Groundwater is reserved in the subsurface in a geologic system called aquifer. Groundwater level is an indicator of groundwater availability, groundwater flow and the physical characteristics of an aquifer or groundwater system (Nair and Sindhu 2016). A decrease in groundwater levels can trigger a number of eco-environmental problems capable of seriously affecting both local agricultural production and economic development (Li et al. 2019). Groundwater level is an important indicator of groundwater balance. Influence of climatic factors and human activities can make groundwater level exhibits cyclical and random characteristics. Therefore, the accurate prediction of groundwater level is of great significance for the rational utilization of groundwater resources and the sustainable development of the social economy (Li et al. 2019).

Electrical resistivity geophysical prospecting technique has been extensively utilized by many researchers in various domains of groundwater studies (Adiat et al. 2013). For instance, vertical electrical sounding (VES) technique can significantly contribute to the accurate location of aquifer not only through the development of its geometry (Zakari et al. 2015) but also by establishing relationship between the hydrogeological and geoelectrical parameters (Adiat et al. 2013).

An understanding of groundwater dynamics with the application of computer and mathematical tools can be used to predict groundwater flow and level fluctuation (Mao et al. 2002). In this direction, several studies were carried out for forecasting the groundwater levels using conceptual/physical models that are not only laborious, but also have practical limitations (Daliakopoulos et al. 2005; Lallahem et al. 2005a, b) as many inter-related variables are involved. In the recent past, soft computing tools like artificial neural networks (ANNs) have been used increasingly in various fields of science and technology for prediction purposes (Brion et al. 2002). In particular, ANNs have been found useful in the area of groundwater modeling.

The ANN is a general-purpose model with a limited set of variables and is used as a universal functional approximator (Hornik 1991). It can forecast many nonlinear time series events (Hill et al. 1996; Tang and Fishwick 1993; Zhang 2003) over conventional simulation methods (French et al. 1992). Basically, ANNs are intelligent systems that are related in some way to a simplified biological model of the human brain. They are composed of many simple elements called neurons operating in parallel and connected to each other in the forward path by some multipliers called connection weights. Usually, ANNs are trained by adjusting the values of these connection weights between the network elements. ANNs have applications in various fields like forecasting, system identification, pattern recognition, classification, speech recognition, image processing, etc. Many studies on various aspects of groundwater studies have adopted ANN as a research tool. Such aspects of groundwater studies include, but not limited to, groundwater remediation (Gumrah et al. 2000; Zhao et al. 2007; Yan and Minsker 2006), subsurface characterization (Parkin et al. 2007), groundwater pollution (Gemitzi et al. 2009; Coppola et al. 2007) and parameter estimation (Aziz and Wong 1992; Ajmera and Rastogi 2008).

Simulation of karstic and leaky aquifers (Coppola et al. 2003), fluctuation of alluvial aquifer groundwater level (Esmaili 2003), evaluation of dynamic water level in karstic aquifer (Lallahem et al. 2005a, b), simulation of the effects of hydrological, weather and humidity conditions on groundwater level (Shaouuan et al. 2007), etc., has been carried out by researchers around the world. It is, however, observed, within the context of literature review done for this study, that empirical relationship between geoelectric parameters and groundwater level has not been established in the literature. Furthermore, prediction of groundwater level from geoelectric parameters still remains a challenge that has hitherto not been carried out. More importantly, the capability of ANN has not been employed in this aspect of groundwater study in the study area. In order to overcome this challenge, an attempt would be made in this study to adopt ANN as a tool to predict groundwater level from geoelectric parameters. In addition to the attainment of the above aim, the study was also carried out to achieve the following objectives:

  1. 1.

    generate geoelectric parameters of hydrogeological importance;

  2. 2.

    produce piezometric head parameters;

  3. 3.

    establish nonparametric relationship between the input data, i.e., geoelectric parameters and groundwater level (i.e., the output) in the study area;

  4. 4.

    develop ANN and validate model for predicting groundwater column in basement complex terrain.

Description of the study area

The study area is Ijebu-Jesa, the capital of the Oriade Local Government Area of Osun state, southwestern Nigeria. It falls between latitude 7° 40′ N and 7° 43′ N and longitude 4° 48′ E and 4° 50′ E. The topography of the area is gently undulating. The climate is well defined with wet and dry seasons with annual rainfall varying between 150 and 200 cm. The annual relative humidity is over 80% with temperature ranging from 24 to 27 °C. The vegetation of the area is of rain forest.

The study area falls within the basement complex of the southwestern Nigeria. It forms part of the African crystalline shield which consists predominantly of migmatite and undifferentiated gneisses and quartzite (Rahaman 1976). The important structural features in the basement rocks include joints, faults, fractures, lineations and geological boundaries. These structural features are relevant in the control of groundwater accumulation and movement.

The major rock associations of this area form part of the Proterozoic Ilesha schist belt in southwestern part of Nigeria. This is predominantly developed in the western half of the country. In terms of structural features, lithology and mineralization, the schist belts of Nigeria show considerable similarities to the Achaean green stone belts (Olusegun et al. 1995). However, the latter usually contain much larger proportions of mafic and ultramafic bodies and assemblages of lower metamorphic grade (Olusegun et al. 1995; Ajayi and Ogedengbe 2003).

Rocks in the Ilesha schist belt are structurally divided into two main segments by two major fracture zones often called the Iwaraja faults in the eastern part and the Ifewara faults in the western part (Elueze 1986; Folami 1992; Kayode 2006). The regional geological map showing the study area is shown in Fig. 1. The northern part of the fault comprises mostly of amphibolites, amphibole schist, meta-ultramafic and meta-pelites. Extensive psammitic units with minor meta-pelite constitute the eastern segment (Rahaman 1976). These are found as quartzite and quartz schist. All these assemblages are associated with migmatitic gneisses and are cut by a variety of granitic bodies (Rahaman 1976; Elueze 1986; Olusegun et al. 1995; Ajayi and Ogedengbe 2003).

Fig. 1
figure 1

(adapted from Odeyemi 1993)

Regional geological map of Okemesi Fold Belt showing study area

The rocks of the Ilesha district may be broadly grouped into gneiss–migmatite complex, mafic–ultramafic suite (or amphibolites complex), meta-sedimentary assemblages and intrusive suite of granitic rocks. A variety of minor rock types are also related to these units. The gneiss–migmatite complex comprises migmatitic and granitic, calcareous and granulitic rocks. The mafic–ultramafic suite is composed mainly of amphibolites and amphibole schist and minor meta-ultramafites, made up of anthophyllite–tremolite–chlorite and talc schist (Rahaman 1976).

The meta-sedimentary assemblages, chiefly meta-pelites and psammitic units, are found as quartzite and quartz schist. The intrusive suite consists essentially of Pan African (c. 600 Ma.) granitic units. The minor rocks include garnet–quartz–chlorite bodies, biotite–garnet rock, syenitic bodies and dolerites (Olusegun et al. 1995; Folami 1992, Rahaman 1976).

The Ijebu-Jesa segment of the Ilesha schist belt falls into the migmatite–gneiss group with meta-sedimentary assemblages chiefly found as quartz schist. The quartz schist was mainly exposed by erosion within the study area (Fig. 2).

Fig. 2
figure 2

(modified after the Geologic Survey, Nigeria)

Local geological map of the study area

Methodologies

The study was executed in three phases which were groundwater level measurements, geophysical survey and prediction of groundwater level using ANN. The detail descriptions of methodology used in each of the phases are presented in “Groundwater level measurements”–“Prediction using artificial neural network (ANN)” sections below.

Groundwater level measurements

A total number of fifty-one (51) wells were accessible in the study area. The well location is shown in Fig. 3. The static water level and depth to the bottom of each well were determined at the peaks of dry and raining seasons and mean values obtained from the values obtained from the two seasons. In order to justify the use of the mean value used for the data obtained for the two seasons, reliability of the mean was statistically examined by carrying out some measures of dispersion which include the range and mean deviation. The coefficient of variation (CV), (the ratio of the standard deviation to the mean), was also determined to further ascertain the level of dispersion, occasioned by seasonal variation, on the values obtained in the two periods. The ground water level (GWL) was thereafter obtained from the two measured values.

Fig. 3
figure 3

Base map of the study area showing well/VES locations

Geophysical survey

The vertical electrical resistivity (VES) data were acquired using the Ohmega Terrameter and its accessories. A total of fifty-one (51) VES stations were occupied at each of the well location as shown in Fig. 3 (i.e., parametric VES). The Schlumberger array was adopted with electrode spacing (AB/2) ranging from 1 to 100 m. The coordinates of measurement station were taken using Garmin GPS 7.0. The VES data acquired were processed qualitatively and quantitatively. The qualitative analysis involved mere inspection of the curve for its type; the quantitative analysis on the other hand involved partial curve matching to generate geoelectric parameters that served as initial model parameters for subsequent computer iteration.

The information obtained from the results of the interpretation of the VES data was utilized to estimate the geoelectric parameters [aquifer resistivity (AQR), aquifer thickness (AQT), overburden resistivity (OR), overburden thickness and (OT) and coefficient of anisotropy (COA)] that were used as input parameters to develop the artificial neural network (ANN) model.

Prediction using artificial neural network (ANN)

Steps of the artificial neural network (ANN) as adopted in the study:

The final stage of the methodology was the implementation of the artificial neural network. Neural Network Toolbox in MATLAB Version (8.3) 2015 was used. A back projection based feed forward neural network was used to model the input and output parameters using MATLAB neural network toolbox. The procedural steps involved the following: parameters/variables selection, ANN architecture development, data processing and the model performance evaluation.

Parameters/variables selection

Five parameters [i.e., overburden resistivity (OR); aquifer resistivity (AQR); overburden thickness (OT), aquifer thickness (AQT)] and coefficient of anisotropy (COA) were used as the input parameters while the measured groundwater level (GWL) was the model output.

Data processing

Data preprocessing involves partitioning of the data into three sets which were the training, validation and testing sets in the ratio: 70:15:15, respectively. The training set was the largest set used by neural network to learn patterns present in the data. The 70% training set was selected randomly across the range of the whole data so as to ensure that the data contain sufficient representation of the patterns. This will enable the network to mimic the underlying relationship between input and output variables adequately. In order to shuffle the data, the training data were presented to the network in random order so as to ensure that the response of the training network will not vary with the order of pattern presentation (Manisha et al. 2008). Training was controlled by some conditions such as: the maximum number of iterations, target performance which specifies the tolerance between the neural network prediction and actual outputs, the maximum run time and the minimum allowed gradient. The desired results were generated in the output layer. The network achieves the desired learning by adjusting its interconnected weights continuously until there was a close match between the output from the neurons and the output from the training data. The difference between the predicted outputs and the original outputs is referred to as error. At the end of the training phase, the neural network correctly reproduced the target output values for the training data provided with minimal error. 15% of the data was used for testing, whereas the remaining 15% was allocated for model validation. The validation set was used to make a final check on the performance of the trained network after the completion of the training/testing processes. Validation set is an independent data set which was not used during training process. Performance criteria adopted in this study were mean square error (MSE) and regression coefficient (R).

ANN architecture development

The architecture of neural network defines its structure. It is the most important part of ANN development. Different neural network architectures were developed in order to establish a relationship between the input and output. All the networks were of the feed forward type. An ANN consists of input, hidden and output layers as shown in Fig. 4. The network architectures were trained by varying the number of hidden layer and then by varying number of neurons in each hidden layer.

Fig. 4
figure 4

Schematic diagram of the ANN architecture used for the study

The process involves determining the number of input neurons, the number of output neurons, the number of hidden layers and number of neurons in the hidden layers. Since the numbers of input and output neurons are decided by the nature of the problem, the number of hidden layer and associated hidden neurons represent the major decision to be made in overall architecture design.

In order to simplify the network architecture and thus reduce computational effort, a single layer network was adopted. In addition to this, it has been established that a single layer feed forward network with sufficient neurons can adequately approximate any nonlinear function (Haykin 1999; Hornik 1991). Starting with two neurons and then increasing the number while monitoring the performance criteria for each resulting architecture each time, the training was carried out until there was no significant improvement in the error. In order to objectively evaluate the model performance, the mean square error (MSE) and regression coefficient (R) were computed and are summarized.

Model performance evaluation

In order to assess the efficiency of the artificial neural network model, it was validated with 15% of the field data. The five parameters (i.e., OR, AQR, OT, AQT and COA) for all the validation data set represent the input parameters. These input parameters were put into the ANN model. The model separately simulated the input data to produce the output groundwater level (GWL). The model outputs (Pj) were compared with the expected outputs (Tj) (i.e., output from the field data measured). The efficiency of the model was determined by estimating the mean square error (MSE) and the regression coefficient (R) for the model.

Results and discussions

Results of the groundwater level measurements

The results of the measurements obtained from the wells/boreholes are presented in Table 1. The value of the coefficient of variation (CV) as determined from the results shown on Table 1 is 0.2% while the range varies from − 0.8 to 1.5. The small values obtained for both the CV and the range are an indication that the dispersion is small suggesting that seasonal variation has little effect on the water fluctuation in the wells and this further justifies the use of average of the two values. The groundwater level (GWL) shown in the last column of Table 1 was used as the output parameter for the ANN model.

Table 1 Results of the groundwater level measurement

Results of the geophysical survey

The geoelectric parameters obtained from the results of the interpretation of the VES are presented in Table 2. The geoelectric sections developed from results of the interpretation of the VES revealed that the subsurface is largely characterized by a maximum of between three and four layers. These layers are the top soil, the weathered basement, the fractured basement layer and the fresh basement. The weathered and or the fractured basement constitute the aquifer in the area. In other words, weathered basement aquifer and fractured basement aquifer are the aquifer types obtainable in the area. It is also observed that all the wells/boreholes in the area tap water from the delineated aquifer, this thus suggests that there is justification for predicting groundwater level from the geoelectric parameters obtained from the results of parametric soundings carried in the study area. The parameters shown in Table 2 were used as the input parameters for the ANN model.

Table 2 Geoelectric parameter for ANN input and output

Results of the ANN parameters selection

Five parameters, OR, AQR, OT, AQT and COA, shown in Table 2 were selected and used as the input parameters for the ANN model. These five parameters had been established to be significantly controlling the flow and accumulation of groundwater in the basement complex terrain and particularly in the study area (Adiat et al. 2018).

Results of the ANN data processing/ANN architecture development

The parameters shown in Table 2 are randomly partitioned into three in the ratio 70:15:15 to represent training, testing and validation data, respectively.

The results of performance evaluation of several networks trained were compared. It was observed that the number of iterations required by each network architecture differs. This implies that the number of iterations required by each network architecture to converge is not constant and this accounts for different iteration numbers recorded for different architecture (Table 3).

Table 3 Summary of performance evaluations of ANN architectures

The effect of network architecture on its performance is displayed in the table. No appreciable improvement was accomplished by the addition of extra neuron into the network before or after 22 hidden neurons.

Network with 22 hidden neurons gave the best validation performance values of 0.24903, 0.99998, 0.95223 and 0.98731 for regression coefficient (R), training, testing and validation, respectively. The closeness of the R value to 1 is an indication that both the predicted output (i.e., model output) and the expected output (i.e., the output used for the simulation) are well fitted.

Apart from the best performance in terms of MSE and R, network with 22 neurons provides simpler architecture needed for better computational efficiency when compared to others. Consequently, the network with 5 inputs, 22 hidden neurons and 1 output neuron was selected.

Results of the ANN model performance evaluation

Figure 5 is the regression coefficient (R) plot for the best architecture of 22 hidden layers. For training, neural network fits data along the blue line, the actual network outputs plotted in terms of the associated target values and R for training is 0.99998 which is high and close to 1. The closeness of the R value to 1 is an indication that both the predicted output and the expected output are well fitted. For testing, neural network fits data along the red line, the actual network outputs plotted in terms of the associated target values and R for testing is 0.95223. Validation data fit on the green light with R value of 0.98534. The value of R for all training, testing and validation simulated together is 0.98731. The closeness of the R value to 1 is an indication that the model fits the data well with accurate prediction of the groundwater level.

Fig. 5
figure 5

Regression plot for neural network with 22 neurons (hidden layers)

The results obtained from training of the data are presented in Table 4. The outputs generated by the ANN model [i.e., the predicted output (Pj)] as compared with the output used for the model training [i.e., the expected output (Tj)] are shown in the table. The mean square error (MSE) (one of the criteria used to evaluate the effectiveness of the performance of the model is shown in the table while the regression coefficient (R) had earlier been presented in Fig. 5).

Table 4 ANN summary table for training data set

The validation results for the ANN models are presented in Table 5. The mean square prediction error as shown in the table is 0.0014286 suggesting that the error limit of the prediction accuracy of the ANN model is 0.0014286. The regression value of 0.98534 was obtained for the ANN model (Fig. 6). The regression coefficient values of 0.98534 obtained for the ANN model indicate that the output values of the validation data and the expected output are well fitted in the model. Furthermore, the closeness of the regression value to 1 in the ANN model is an indication that the model is efficient in terms of groundwater level prediction.

Table 5 Table for validation data set
Fig. 6
figure 6

Regression plot for the validation of the ANN model with 22 hidden layers

High R value and small MSE value obtained ANN show that the model fits the data well. The results obtained from analysis of the data clearly showed that ANN model is best fit for predicting the groundwater level. This suggests that model can be applied in other areas of similar geology.

Conclusion

Prediction of groundwater level (GWL) from geoelectric parameters is still a challenge in groundwater studies. This is partly because the empirical relationship between groundwater level and geoelectric parameters has hitherto not been established. An attempt was made in this study to overcome these challenges by exploring the capability of artificial neural network (ANN) to model nonlinear system.

In order to achieve the objectives of the study, the piezometric head parameters of fifty-one (51) accessible wells in the study area were measured at the peaks of dry and raining seasons. These were used to obtain the GWL in these wells having established that a seasonal variation does not have significant effect on the groundwater fluctuation in the area. The GWL so obtained was used as the output parameter for the ANN model.

A total of fifty-one (51) parametric vertical electrical soundings (VES) stations were occupied at each of the well location by adopting Schlumberger array configuration with electrode spacing (AB/2) ranging from 1 to 100 m. The results obtained from the quantitative interpretation of the VES data were used to generate geoelectric parameters which include aquifer resistivity (AQR), aquifer thickness (AQT), overburden resistivity (OR), overburden thickness and (OT) and coefficient of anisotropy (COA). These parameters had been established to be controlling the groundwater flow and storage in the basement complex terrain and particularly the study area. These geoelectric parameters served as the input parameters for ANN model.

The capability of ANN as a nonlinear modeling system was thereafter applied to produce a model that can predict the GWL from the input parameters. The efficiency of the model was evaluated by estimating the mean square error (MSE) and the regression coefficient (R) for the model.

The results established that seasonal variation has little effect on the water fluctuation in the wells. The geoelectric sections developed from results of the interpretation of the VES revealed that the subsurface is largely characterized by a maximum of between three and four layers which are the top soil, the weathered basement, the fractured basement layer and the fresh basement. Two aquifer types (weathered basement aquifer and fractured basement aquifer) are obtainable in the area. It was also established that all the wells/boreholes in the area tap water from the delineated aquifer, this thus suggests that there is justification for predicting groundwater level from the geoelectric parameters obtained from the results of parametric soundings carried in the study area.

The data set was partitioned into training, testing and validation in ratio the 70:15:15, respectively. Apart from the best performance in terms of MSE and R, network with 22 neurons provides simpler architecture needed for better computational efficiency when compared to others. Consequently, the network with 5 inputs, 22 hidden neurons and 1 output neuron was selected.

For training, neural network fits data and R for training is 0.99998 which is high and close to 1. The closeness of the R value to 1 is an indication that both the predicted output and the expected output are well fitted. For testing, neural network fits and R for testing is 0.95223. The model was validated using validation data set which is 15% of the data set and randomly selected. The results of the validation showed that the mean square error (MSE) and the regression coefficient (R) for the ANN model were 0.0014286 and 0.98731, respectively. The results obtained from the validation of the ANN techniques showed that the modeling technique has effectiveness in predicting the groundwater level. This shows that the model is capable of producing effective and reliable prediction results. The model can be applied in other areas of similar geology. Therefore, from the study, the results are satisfactory and demonstrate that neural networks can be a useful prediction tool and it can be concluded that ANN is an effective tool for predicting groundwater level for the purposes of effective planning and management of groundwater resources.