Abstract
Inappropriate evaluation of uniaxial compression indexes (E and UCS) of rocks in high seismic intensity areas such as dam regions can lead to underestimation of the load, and possible settlement of the structure. Indirect assessments of these rock mechanical indexes based on non-destructive experiments and by using intelligent models is a well-accepted method to overcome associated limitations with laboratory tests of E and UCS. This study introduces the mutual information (MI) method as a unique system for variable importance measurement (VIM) and feature selection. Conducting MI-VIM assessments between various analyses of marl core samples (depth, density, ultrasonic tests (νd, Vp and Vs), Brazilian test (σt), triaxial compression test (C and and ϕ) and point load test (Is(50)) indicated that Vs and σt had the highest importance for E and UCS prediction. adaptive boosting–neural network ensemble (Adaboost–NNE) was used for the prediction of E and UCS. Testing of the generated Adaboost–NNE indicated that this model could accurately predict UCS and E with correlations of determinations 0.98 and 0.92, respectively. These results showed that VIM of MI coupled with Adaboost–NNE could develop a robust model that can be used for the prediction and modeling of other indexes of rocks.
Avoid common mistakes on your manuscript.
Introduction
The study of geotechnical and mechanical properties of rocks would be critical keys for the construction, control, and maintenance of high seismic intensity regions such as dam areas. Seydoon dam (Khoozestan Province, southwest of Iran) has been constructed over a series of marls, shale and sandstones. Among these rocks, marls are particularly important due to their specific properties. Marls are composed of clay and carbonate minerals in different proportions and their characters mainly depend on the type and percentage of carbonate and clay minerals (Bellair and Pomerol 1980; El Amrani et al. 1998). It is well accepted that the force load in dam areas has a large effect on the mechanical properties of rocks, such as elasticity and strength (Abrams 1917; Watstein 1953; Malvar and Ross 1998; Yan and Lin 2006). Uniaxial compressive strength (UCS) and Young’s modulus (E) can be used to determine the durability of rocks against weathering agents and their fabrics. UCS and E are also can be used to determine their deformation and bearing capacity (Dehghan et al. 2010; Matin et al. 2017). Therefore, the determination of UCS and E for marls from the ground of the dam can play an essential role in understanding their mechanical properties and help to do appropriate maintenance for the site.
American Standards for Testing Materials (ASTM) and International Society for Rock Mechanics (ISRM) have been introduced as standard procedures for the determination of UCS and E. However, direct determinations of these indexes based on ASTM and ISRM approaches in the laboratory have few drawbacks (complex sample preparation, expensive and time consuming process) (Sousa 2014; Jamshidi et al. 2016a, b; Armaghani et al. 2016a, b). For solving limitations associated with laboratory tests, various investigations were performed for the indirect determination of UCS and E, based on non-destructive index experiments. In those studies, mineral properties (composition, porosity and density), ultrasonic tests [P-wave velocity (Vp) and S-wave velocity (Vp)] and other standard indexes were used to predict UCS and E by regression or other intelligent computing methods [i.e., artificial neural networks (ANNs) or the adaptive neuro-fuzzy inference system (ANFIS)] (Demirdag et al. 2010; Ersoy and Kanik 2012; Armaghani et al. 2016a, b). For generating a robust predictive model, it is essential to build a highly accurate system based on the most relevant parameters (inputs). Development of a model which can explore inter-correlations through various variables, detect and select the most effective ones [variable importance measurement (VIM)], and use them as inputs of a precise predictive model has several advantages: as irrelevant variables make noises in modeling, by VIM redundant variables can be removed (save time and reduce cost of analyses) and outliers (influential points) identified (Chehreh Chelgani et al. 2016a, b; Matin and Chelgani 2016; Matin et al. 2016; Shahbazi et al. 2017).
Mutual information (MI) is an intelligent computer method which can explore both the linear and nonlinear relationship between wide ranges of inputs and rank them based on their influences. In other words, MI provides a decision-making system that can be used to select the most effective inputs (variables which can represent the influence of other ones) and reduce the noises for the development of a model (Chelgani et al. 2018). In a predictive modeling problem, various researches indicated that combination of intelligent predictor models and development of an ensemble of predictors (experts) can construct an accurate model to deal with complicated problems (Masoudnia et al. 2012; Hadavandi et al. 2015, 2016). One of the popular ensemble methods is the neural network ensemble (NNE) (Hansen and Salamon 1990) and an efficient approach for creating an NNE model is Adaptive Boosting (Adaboost) that can adaptively improve the probability of sampling cases for accurate training experts for the NNE model. This approach can develop a model by using a wide distribution of inputs and reduce the prediction errors by considering the information of previous experts (Hansen and Salamon 1990; Freund and Schapire 1996; Solomatine and Shrestha 2004; Masoudnia et al. 2012; Tian et al. 2012; Zhai et al. 2012). Although the last decade has witnessed increasing applications of MI and Adaboost–NNE models, they have not yet been used in the exploration and prediction of earth science sectors. This work explores the relationships between the mechanical properties of marls (from the cores of Seydoon dam) to predict UCS and E based on various analyses (depth, density, ultrasonic tests, Brazilian test, point load test, etc.). The interpretation of variables was evaluated by using MI and Adaboost–NNE predictive models. The results of this investigation would be useful for the maintenance of the dam and could introduce a unique model for the prediction of other rock and geomechanical indexes.
Materials and methods
Database
Thirty-nine core marl samples were collected from the Sydoom dam area by drilling exploration boreholes to different depths during geotechnical studies. The density of samples was determined by the weighting of cores. All sample analyses were based on ISRM and ASTM procedures. Core marls were highly weathered and weak against water and cutting blade; therefore, a high-speed thin cutting blade was used for sample preparation. Dynamic Poisson ratio (νd), S-wave (Vs) and P-wave (Vp) velocities were measured by 200 and 1000 kHz ultrasonic transducers through core samples. Brazilian test σt (MPa) was performed by using a 15 ton jack and a pump to generate forces to a cylindrical specimen between Brazilian frames. Point load test (Is(50)) core sample was subjected to a comprehensive load between two conical platens and, as a result of tension, the broke point was recorded. UCS and triaxial compression test [cohesion of rock material (C (MPa)) and friction angle of rock material (ϕ (°))] of samples were performed by an MTS machine. These tests were done using different confining pressures (1–6 MPa). The results of various experiments and their representative UCS and E are reported in Table 1.
Mutual information
Mutual information (MI) as a powerful VIM tool can quantify the inter-dependency between random variables. In other words, MI is the amount of shared information between model inputs. The MI (0 « I(X;Y) « 1) between two variables (X and Y) is defined based on the joint probability distribution p(x,y) and the product distribution p(x)p(y):
where Ep is the mathematical expectation. MI can reduce the prediction error by the maximization scheme between input variables and targets. In other words, MI can consider the information of more than single input to predict an output. This system detects the most relative variables, ranked them based on VIM and feed them to the predictive model. Variable selection by MI reduces learning algorithm time, increases the size of the search space and prevents overfitting in the predictive model (Kerroum et al. 2010; Lee and Kim 2013; Han et al. 2015; Hansen and Salamon 1990; Freund and Schapire 1996; Solomatine and Shrestha 2004; Masoudnia et al. 2012; Tian et al. 2012; Zhai et al. 2012).
Adaptive boosting
One approach for modeling complicated relationships is generating an ensemble system by combining the single prediction models (based on components) and exploiting the different local behavior of these base models to improve the performance of the overall prediction model (Masoudnia et al. 2012). The neural network ensemble (NNE) is a popular ensemble model that is developed based on a combination of neural network experts (Masoudnia et al. 2012; Tian et al. 2012; Zhai et al. 2012). The sequential manipulating of instances to train an individual neural network is one of the typical methods for the construction of NNEs that is called boosting method (Freund and Schapire 1996). For the last two decades, boosting as one of the most powerful ensemble methods was generated with a high learning capability. Adaptive boosting (Adaboost) changes the distribution of training set based on the performance of the previous NN components which is added in an ensemble model (Adaboost–NNE) (Tian et al. 2012; Zhai et al. 2012). Adaboost–NNE adaptively increases the probability of instances, which have higher prediction errors, by the previous components. The main idea in an Adaboost–NNE model is filtering out examples with the relative prediction error higher than the pre-set threshold value, and then following the Adaboost procedure (Hansen and Salamon 1990; Solomatine and Shrestha 2004). In this study, for prediction of rock mechanic indexes, an Adaboost–NNE model was developed in T iterations (T is the number of multi-layered perceptron neural network experts in the ensemble model). The training model is presented in algorithm 1:
Results and discussions
Variable selection
For making a robust system, before generating Adaboost–NNE model for the estimation of UCS and E, VIMs by MI (MI-VIM) is applied through all measured variables (νd, Vp, Vs, Is(50), σt, depth, density, C and ϕ) to evaluate their importance for the prediction, and as a result select the most effective inputs. MI-VIM results (Fig. 1) indicated that there are complicated interactions among variables, although σt and Vs showed a direct correlation with the outputs. On the other hand, MI ranked variables based on their importance and results illustrated that Vs and σt have the highest effectiveness for the prediction of UCS and E, respectively (Fig. 2). These outcomes demonstrated that Vs and σt can represent the correlation of other variables for the generation of predictive models and can be selected as input variables. There is a good agreement with these VIMs and theoretical studies where Castagna et al. (1985) and Pickett (1963) indicated that increasing the percentage of porosity would decrease the strength of rock. Therefore, the velocity of sonic waves (P or S) would be lower during passing through voids; therefore, results of ultrasonic tests potentially could be a good indicator of the mechanical properties of rocks. Moreover, several investigations demonstrated that sonic waves (Vp or Vs) can be strong predictors of UCS and E (Yasar and Erdogan, 2004a, b). On the other hand, the dependency evaluation of these indexes with Brazilian tensile strength (σt) (as an independent variable) showed that σt can be an appropriate predictor for UCS and E modeling (Jamshidi et al. 2016a, b; Fereidooni 2016).
The complex interactions of selected variables with the outputs are illustrated in Fig. 3. Linear correlations (Pearson correlation) between the MI selected variables and outputs (Table 2) indicate that there are significant positive correlations between them. To develop comprehensive Adaboost–NNE models (with 6 MLP experts) for the prediction of UCS and E based on MI-VIM selected variables, approximately 75% of records from the dataset were randomly applied for the training stage and the remaining 25% of the samples for the testing stage of the models. After training, the model was tested and the results of the testing phase showed that the Adaboost–NNE models could accurately predict the outputs, with the correlation of determination values (R2) of 0.92 and 0.98 for the E and UCS, respectively. The differences (Fig. 4) between laboratory measured variables (actual values) and Adaboost–NNE predicated ones showed that models could provide high satisfaction in the prediction of E and UCS. These results indicated the potential of MI coupled with Adaboost–NNE in the prediction of geomechanical indexes, and that these systems can be used for the assessment of other complicated variables in rock mechanics and other related disciplines.
Conclusion
Uniaxial compressive strength (UCS) and Young’s modulus (E) indexes can play critical roles for the control and maintenance of high seismic intensity regions such as dam areas. This study has introduced a new method [mutual information (MI)] for feature selections through rock properties based on variable importance measurements (VIMs) to predict UCS and E of marls from the Sydoon dam (Iran) by a powerful ensemble method called Adaboost–NNE (adaptive boosting- neural network ensemble). MI-VIM can assess the impact of each rock index individually and also in multivariate interactions with other variables. Based on MI-VIM results, the most effective features could be detected and selected to generate an unbiased and broadly applicable Adaboost–NNE model. Various rock mechanic analyses were performed [depth, density, ultrasonic tests (Vp and Vs), Brazilian test (σt), and point load test], MI-VIM through provided variables indicated that Vs and σt have the highest importance for the prediction of both UCS and E among other measured parameters. These two variables were selected to generate predictive Adaboost–NNE models. Testing results of the developed models indicated that Adaboost–NNE could predict UCS and E quite accurately with the correlation of determination values of 0.98 and 0.92, respectively. These results demonstrated that variable selection by MI and prediction by Adaboost–NNE make a robust system which can be used for expanding the knowledge surrounding modeling of rock and geomechanical indexes, and powerful tools for control and maintenance of other embankments.
References
Abrams DA (1917) Effect of rate of application of loading on the compressive strength of concrete. J ASTM 17:364–377
Armaghani DJ, Mohamad ET, Hajihassani M, Yagiz S, Motaghedi H (2016a) Application of several non-linear prediction tools for estimating uniaxial compressive strength of granitic rocks and comparison of their performances. Eng Comput 32:189–206
Armaghani DJ, Mohamad ET, Momeni E, Monjezi M, Narayanasamy MS (2016b) Prediction of the strength and elasticity modulus of granite through an expert artificial neural network. Arab J Geosci 9(48):1–16
Bellair M, Pomerol L (1980) Tratado de Geologı´a. Limusa, Mexico
Castagna JP, Batzle ML, Eastwood RL (1985) Relationships between compressional-wave and shear-wave velocities in clastic silicate rocks. Geophysics 50(4):571–581
Chelgani SC, Matin SS, Hower JC (2016a) Explaining relationships between coke quality index and coal properties by random forest method. Fuel 182:754–760
Chelgani SC, Matin SS, Makaremi S (2016b) Modeling of free swelling index based on variable importance measurements of parent coal properties by random forest method. Measurement 94:416–422
Chelgani SC, Shahbazi B, Hadavandi E (2018) Support vector regression modeling of coal flotation based on variable importance measurements by mutual information method. Measurement 114:102–108
Dehghan S, Sattari Gh, Chehreh Chelgani S (2010) Prediction of uniaxial compressive strength and modulus of elasticity for Travertine samples using regression and artificial neural networks. J Min Sci Technol 20:41–46
Demirdag S, Tufekci K, Kayacan R, Yavuz H, Altindag R (2010) Dynamic mechanical behavior of some carbonate rocks. Int J Rock Mech Min Sci 47:307–312
El Amrani Paaza N, Lamas F, Irigaray C, Chaco´n J (1998) Engineering geological characterisation of neogene marls in the southeastern Granada Basin, Spain. Eng Geol 50:165–175
Ersoy H, Kanik D (2012) Multicriteria decision-making analysis based methodology for predicting carbonate rocks’ uniaxial compressive strength. Earth Sci Res J 16(1):65–74
Fereidooni D (2016) Determination of the geotechnical characteristics of hornfelsic rocks with a particular emphasis on the correlation between physical and mechanical properties. Rock Mech Rock Eng 49(7):2595–2608
Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. In: Thirteenth International Conference on Machine Learning—ICML, pp 148–156
Hadavandi E, Shahrabi J, Shamshirband S (2015) A novel boosted-neural network ensemble for modeling multi-target regression problems. Eng Appl Artif Intell 45:204–219
Hadavandi E, Shahrabi J, Hayashi Y (2016) SPMoE: a novel subspace-projected mixture of experts model for multi-target regression problems. Soft Comput 20(5):2047–2065
Han M, Ren W, Liu X (2015) Joint mutual information-based input variable selection for multivariate time series modeling. Eng Appl Artif Intell 37:250–257
Hansen LK, Salamon P (1990) Neural network ensembles. IEEE Trans Pattern Anal Mach Intell 12:993–1001
Jamshidi A, Nikudel MR, Khamehchiyan M, Sahamieh RZ (2016a) The effect of specimen diameter size on uniaxial compressive strength, P-wave velocity and the correlation between them. Geomech Geoeng 11(1):13–19
Jamshidi A, Reza Nikudel M, Khamehchiyan M (2016b) A novel physico-mechanical parameter for estimating the mechanical strength of travertines after a freeze–thaw test. Bull Eng Geol Environ 76:1–10
Kerroum MA, Hammouch A, Aboutajdine D (2010) Textural feature selection by joint mutual information based on Gaussian mixture model for multispectral image classification. Pattern Recognit Lett 31:1168–1174
Lee J, Kim DW (2013) Feature selection for multi-label classification using multivariate mutual information. Pattern Recognit Lett 34(3):349–357
Malvar LJ, Ross CA (1998) Review of strain rate effects for concrete in tension. ACI Mater J 95(6):735–739
Masoudnia S, Ebrahimpour R, Arani SAAA (2012) Combining features of negative correlation learning with mixture of experts in proposed ensemble methods. Appl Soft Comput 12:3539–3551
Matin SS, Chelgani SC (2016) Estimation of coal gross calorific value based on various analyses by random forest. Fuel 177:274–278
Matin SS, Hower JC, Chelgani SC (2016) Explaining relationships among various coal analyses with coal grindability index by random forest. Int J Miner Process 155:140–146
Matin SS, Farahzadi L, Makaremi S, Chelgani SC, Sattari Gh (2017) Variable selection and prediction of uniaxial compressive strength and modulus of elasticity by Random Forest. Appl Soft Comput 70:980–987 (in press)
Pickett GR (1963) Acoustic character logs and their applications in formation evaluation. J Petrol Technol 15(6):659–667
Shahbazi B, Chelgani SC, Matin SS (2017) Prediction of froth flotation responses based on various conditioning parameters by random forest method. Coll Surf A Physicochem Eng Asp 529:936–941
Solomatine DP, Shrestha DL (2004) AdaBoost. RT: a boosting algorithm for regression problems. In: Neural networks, 2004. Proceedings. 2004 IEEE International Joint Conference, pp 1163–1168
Sousa LMO (2014) Petrophysical properties and durability of granites employed as building stone: a comprehensive evaluation. Bull Eng Geol Environ 73:569–588
Tian J, Li M, Chen F, Kou J (2012) Coevolutionary learning of neural network ensemble for complex classification tasks. Pattern Recogn 45:1373–1385
Watstein D (1953) Effect of straining rate on the compressive strength and elastic properties of concrete. ACI Mater J 49:729–744
Yan DM, Lin G (2006) Dynamic properties of plain concrete in direct tension. Cem Concr Res 36:1371–1378
Yasar E, Erdogan Y (2004a) Correlating sound velocity with the density, compressive strength and Young’s modulus of carbonate rocks. Int J Rock Mech Min Sci 41:871–875
Yasar E, Erdogan Y (2004b) Estimation of rock physicomechanical properties using hardness methods. Eng Geol 71:281–288
Zhai J, Xu H, Wang X (2012) Dynamic ensemble extreme learning machine based on sample entropy. Soft Comput 16:1493–1502
Acknowledgements
Open access funding provided by Lulea University of Technology.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Salehin, S., Hadavandi, E. & Chelgani, S.C. Exploring relationships between mechanical properties of marl core samples by a coupling of mutual information and predictive ensemble model. Model. Earth Syst. Environ. 6, 575–583 (2020). https://doi.org/10.1007/s40808-019-00672-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40808-019-00672-1