Non-parametric estimation of conditional moments for sensitivity analysis

doi:10.1016/j.ress.2008.02.023

Reliability Engineering & System Safety

Volume 94, Issue 2, February 2009, Pages 237-243

https://doi.org/10.1016/j.ress.2008.02.023

Abstract

In this paper, we consider the non-parametric estimation of conditional moments, which is useful for applications in global sensitivity analysis (GSA) and in the more general emulation framework. The estimation is based on the state-dependent parameter (SDP) estimation approach and allows for the estimation of conditional moments of order larger than unity. This makes it possible to identify a wider spectrum of parameter sensitivities than the variance-based main effects alone, such as shifts in the variance, skewness or kurtosis of the model output, thereby adding valuable information for the analyst at a small computational cost.

Introduction

In global sensitivity analysis (GSA), the mapping $Y = f (X)$ between an output Y of a computational model and a set of uncertain input factors $X = (X_{1}, \dots, X_{k})$ is analyzed in order to quantify the relative contribution of each input factor to the uncertainty of Y. Variance-based analysis is the most popular method in GSA. Variance-based sensitivity indices of single factors or of groups of factors are defined as [1], [24] $S_{I} = \frac{Var (E (Y | X_{I}))}{Var (Y)}$ (1), where $X_{I}$ denotes a group of factors indexed by $I = (i_{1}, \dots, i_{g})$, $1 \leq g \leq k$; they give the fraction of the variance of $Y$ that is explained by $X_{I}$.

The two most popular variance-based sensitivity measures are the main effect $S_{i} = \frac{Var (E (Y | X_{i}))}{Var (Y)}$ and the total effect $S_{Ti} = \frac{E (Var (Y | X_{- i}))}{Var (Y)}$ where $X_{- i}$ indicates all input factors except $X_{i}$ .

The main effect measures the individual contribution of the input factor $X_{i}$ to the uncertainty (variance) of the output Y, while the total effect measures the overall contribution of $X_{i}$ to Y, including all interaction terms of $X_{i}$ with the other input factors.
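The two definitions can be read side by side through the law of total variance. This is a standard identity rather than a result of this paper, and it is written here in the notation used above:

$$Var (Y) = Var (E (Y | X_{i})) + E (Var (Y | X_{i}))$$

$$S_{i} = 1 - \frac{E (Var (Y | X_{i}))}{Var (Y)}, \qquad S_{Ti} = 1 - \frac{Var (E (Y | X_{- i}))}{Var (Y)}$$

Under the additional assumption of independent input factors, $S_{i} \leq S_{Ti}$, with equality when $X_{i}$ is not involved in any interaction.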

There are clear links between variance-based sensitivity analysis and model emulation. First, a statistical approximation (the emulator) $\hat{f} (X)$ can be used to compute sensitivity indices in place of the original computational mapping $f (X)$ . Second, the variance-based sensitivity measures can be interpreted as the non-parametric $R^{2}$ or correlation ratio, used in statistics to measure the explanatory power of covariates in regression [2], [3]. In fact, it is well known that the inner argument $E (Y | X_{I})$ of (1) is the function of the subset of input factors that approximates $f (X)$ , by minimizing a quadratic loss (i.e. maximizing the $R^{2}$ ). Therefore, estimating $E (Y | X_{I})$ provides a route for both a model approximation and sensitivity estimation. Smoothing methods that provide more or less accurate and efficient estimations of $E (Y | X_{I})$ are becoming a popular approach to sensitivity analysis [4], [5], [6], [7], [8]. State-dependent parameter (SDP) modelling is one class of non-parametric smoothing approach first suggested by Young [9], [10]. The estimation is performed with the help of the ‘classical’ recursive (numerically non-intensive) Kalman filter (KF) and associated fixed interval smoothing (FIS) algorithms: it has been applied for sensitivity analysis by Ratto et al. in [11], [12].
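The route from an estimate of $E (Y | X_{i})$ to a sensitivity index can be sketched in a few lines. The sketch below is not the SDP/Kalman filter smoother of [9], [10]: as a stand-in, the conditional mean is approximated by averaging Y within equal-count bins of $X_{i}$. The function name `main_effect` and the toy model $Y = X_{1} + 0.5 X_{2}$ with independent uniform inputs are assumptions made for illustration; the toy model is chosen because its main effects (0.8 and 0.2) are known exactly.

```python
import random
import statistics


def main_effect(x, y, n_bins=20):
    """Estimate S_i = Var(E(Y|X_i)) / Var(Y) with a crude binned smoother."""
    # Sort the sample by x and split it into equal-count bins.
    order = sorted(range(len(x)), key=lambda t: x[t])
    size = len(x) // n_bins
    bin_means = []
    for j in range(n_bins):
        idx = order[j * size:(j + 1) * size]
        # The bin average of y approximates E(Y | X_i) inside the bin.
        bin_means.append(sum(y[t] for t in idx) / len(idx))
    # Variance of the smoothed curve over the total output variance.
    return statistics.pvariance(bin_means) / statistics.pvariance(y)


rng = random.Random(0)
n = 4000
x1 = [rng.random() for _ in range(n)]
x2 = [rng.random() for _ in range(n)]
y = [a + 0.5 * b for a, b in zip(x1, x2)]  # hypothetical toy model

# The same Monte Carlo sample serves every factor: no extra model runs.
s1 = main_effect(x1, y)
s2 = main_effect(x2, y)
print(f"S1 ~ {s1:.2f} (exact 0.8), S2 ~ {s2:.2f} (exact 0.2)")
```

A binned average is biased when bins are few or sparsely populated, which is one reason more refined smoothers are preferred in practice.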

Variance-based techniques have quite general applicability, since they apply to a very wide range of non-linear mappings $f (\cdot)$ and rely on only a few assumptions, namely that Y is square integrable and that the variance is an adequate measure of the uncertainty of Y. Nonetheless, these techniques are sometimes criticized, since sensitivity patterns that cannot be attributed to shifts in the mean (the first moment; see factor $X_{3}$ in Fig. 1) are not accounted for by $E (Y | X_{i})$ and the related variance-based sensitivity index. Such sensitivity patterns can be characterized by a shift in higher order moments, the simplest example being the heteroscedastic process, where the variance of Y changes along the conditioning term $X_{i}$. This has led to the development of a number of sensitivity techniques, such as entropy-based sensitivity measures [13], [14] or moment independent sensitivity measures [15], [16], that provide ‘main effects’ able to account for such phenomena.
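A small worked example may make the heteroscedastic case concrete. The model below is hypothetical and is not taken from the paper: let $Y = X_{1} X_{2}$, with $X_{1}$ uniform on $[-1, 1]$ and $X_{2}$ independent of $X_{1}$ with zero mean and unit variance. Then

$$E (Y | X_{1}) = X_{1} E (X_{2}) = 0 \quad \Rightarrow \quad S_{1} = \frac{Var (E (Y | X_{1}))}{Var (Y)} = 0$$

$$Var (Y | X_{1}) = X_{1}^{2} \, Var (X_{2}) = X_{1}^{2}, \qquad Var (Y) = E (X_{1}^{2}) = \frac{1}{3}$$

The main effect of $X_{1}$ vanishes, yet the conditional variance ranges from 0 at $X_{1} = 0$ to 1 at $X_{1} = \pm 1$, so $X_{1}$ clearly influences the spread of Y.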

In this paper, we show how non-parametric techniques can be applied to estimate conditional moments of order larger than one, allowing us to add valuable information to the standard variance-based analysis and, at the same time, avoid the computational load characterizing the latter class of sensitivity measures. In fact, the analysis does not require any additional model evaluation with respect to any standard smoothing method that may be applied to estimate the $E (Y | X_{i})$ terms.
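The idea of reusing one sample for higher order conditional moments can be illustrated as follows. This is a sketch under stated assumptions, not the SDP-based estimator of the paper: conditional moments are approximated by central moments of Y within equal-count bins of $X_{i}$. The function name `conditional_moments` and the test model (a sinusoidal mean plus skewed noise whose spread grows with x) are hypothetical.

```python
import math
import random


def conditional_moments(x, y, n_bins=10):
    """Per-bin mean, variance, skewness and kurtosis of y, binned on sorted x."""
    order = sorted(range(len(x)), key=lambda t: x[t])
    size = len(x) // n_bins
    rows = []
    for j in range(n_bins):
        ys = [y[t] for t in order[j * size:(j + 1) * size]]
        mean = sum(ys) / len(ys)
        # Central moments of the residuals y - mean within the bin.
        m2 = sum((v - mean) ** 2 for v in ys) / len(ys)
        m3 = sum((v - mean) ** 3 for v in ys) / len(ys)
        m4 = sum((v - mean) ** 4 for v in ys) / len(ys)
        rows.append({
            "mean": mean,
            "variance": m2,
            "skewness": m3 / m2 ** 1.5,
            "kurtosis": m4 / m2 ** 2,
        })
    return rows


rng = random.Random(1)
n = 5000
x = [rng.random() for _ in range(n)]
# Hypothetical model: the mean follows sin(2*pi*x); the noise is a centred
# exponential (positively skewed) whose scale grows with x.
y = [math.sin(2 * math.pi * xi) + (0.5 + xi) * (rng.expovariate(1.0) - 1.0)
     for xi in x]

moments = conditional_moments(x, y)
for j, row in enumerate(moments):
    print(j, {k: round(v, 2) for k, v in row.items()})
```

All four curves come from the same N model evaluations; only the post-processing changes.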

Section snippets

The method

Readers can refer to [12] for a discussion of the SDP approach to sensitivity analysis and to [10] for a more comprehensive discussion of SDP modelling and its algorithms. Here we synthesize some key concepts regarding the estimation of main effects.

Summarizing considerably, a state-dependent model approximating $E (Y | X_{i})$ , based on a Monte Carlo sample of dimension N, can be written as $Y_{t} = E (Y | X_{i, t}) + e_{i, t} = p_{i, t} (s_{i, t}) + e_{i, t}$ where $e_{i, t}$ is the observation noise (i.e. what is not explained by $E (Y | X_{i})$ ), $p_{i, t}$

Test function analysis

The various estimation procedures described in the previous section have been evaluated by application to the following test function due to Ishigami [19]: $Y = \sin X_{1} + a \sin^{2} (X_{2}) + b X_{3}^{4} \sin X_{1}$ where $X_{i}$ are independent and uniformly distributed in $[-\pi, \pi]$ . The values of the constants a and b (5 and 0.1, respectively) are the same as in [16].
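For readers who want to reproduce the setting, the test function and its main effects can be sketched as below. Two simplifications are assumed: a plain pseudo-random sample of 8192 points replaces the Sobol' quasi-random sequence, and a binned average replaces the SDP smoother. The helper `main_effect` is hypothetical. The reference values use the well-known closed-form partial variances of the Ishigami function.

```python
import math
import random
import statistics

A, B = 5.0, 0.1  # constants a and b quoted in the text


def ishigami(x1, x2, x3, a=A, b=B):
    """Ishigami test function."""
    return math.sin(x1) + a * math.sin(x2) ** 2 + b * x3 ** 4 * math.sin(x1)


def main_effect(x, y, n_bins=32):
    """Binned estimate of Var(E(Y|X_i)) / Var(Y); a stand-in for SDP."""
    order = sorted(range(len(x)), key=lambda t: x[t])
    size = len(x) // n_bins
    bin_means = []
    for j in range(n_bins):
        idx = order[j * size:(j + 1) * size]
        bin_means.append(sum(y[t] for t in idx) / len(idx))
    return statistics.pvariance(bin_means) / statistics.pvariance(y)


rng = random.Random(42)
n = 8192
# Three independent inputs, each uniform on [-pi, pi].
xs = [[rng.uniform(-math.pi, math.pi) for _ in range(n)] for _ in range(3)]
y = [ishigami(x1, x2, x3) for x1, x2, x3 in zip(*xs)]

# Closed-form variance and main effects of the Ishigami function.
var_y = A ** 2 / 8 + B * math.pi ** 4 / 5 + B ** 2 * math.pi ** 8 / 18 + 0.5
s_exact = [
    0.5 * (1 + B * math.pi ** 4 / 5) ** 2 / var_y,  # X1
    (A ** 2 / 8) / var_y,                           # X2
    0.0,                                            # X3: no shift in the mean
]
s_est = [main_effect(x, y) for x in xs]

for i, (est, exact) in enumerate(zip(s_est, s_exact), start=1):
    print(f"S{i}: estimated {est:.3f}, exact {exact:.3f}")
```

The zero main effect of $X_{3}$ is what makes this function a useful test: $X_{3}$ acts on Y only through its interaction with $X_{1}$, so its influence shows up in the conditional variance rather than in the conditional mean.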

The analysis is carried out using a Sobol’ quasi-random sequence of size 1024 [20]. Fig. 1 illustrates the first step of the analysis, where the conditional expectations

Conclusions

In this paper, we have discussed an extended use of smoothing procedures for the estimation of conditional moments in sensitivity analysis. This analysis can be performed at no additional cost with respect to standard smoothing analysis for the estimate of $E (Y | X_{i})$ and provides a very useful completion of the standard variance-based sensitivity analysis. In particular, it generates:

• estimates of significant patterns in the conditional variance that are an indication of an interaction structure

References (24)

M. Ratto et al. State dependent parameter meta-modelling and sensitivity analysis. Comput Phys Commun (2007).

E. Borgonovo. A new uncertainty importance measure. Reliab Eng Syst Saf (2007).

I.M. Sobol’. Sensitivity estimates for nonlinear mathematical models. Mat Model (1990).

K. Pearson. On the general theory of skew correlation and non-linear regression. Drapers Company Res Memo Biom Ser (1905).

K. Doksum et al. Nonparametric estimation of global functionals and a measure of the explanatory power of covariates in regression. Ann Statist (1995).

G. Li et al. Practical approaches to construct RS-HDMR component functions. J Phys Chem (2002).

G. Li et al. Random sampling-high dimensional model representation (RS-HDMR) and orthogonality of its different order component functions. J Phys Chem A (2006).

J. Oakley et al. Probabilistic sensitivity analysis of complex models: a Bayesian approach. J R Stat Soc B (2004).

C.B. Storlie et al. Multiple predictor smoothing methods for sensitivity analysis: description of techniques. Reliab Eng Syst Saf (2007).

C.B. Storlie et al. Multiple predictor smoothing methods for sensitivity analysis: example results. Reliab Eng Syst Saf (2007).

P.C. Young. Time variable and state dependent modelling of nonstationary and nonlinear time series.

P.C. Young. Stochastic dynamic modelling and signal processing: time variable and state dependent parameter estimation.
