Non-parametric estimation of conditional moments for sensitivity analysis

https://doi.org/10.1016/j.ress.2008.02.023Get rights and content

Abstract

In this paper, we consider the non-parametric estimation of conditional moments, which is useful for applications in global sensitivity analysis (GSA) and in the more general emulation framework. The estimation is based on the state-dependent parameter (SDP) estimation approach and allows for the estimation of conditional moments of order larger than unity. This allows one to identify a wider spectrum of parameter sensitivities with respect to the variance-based main effects, like shifts in the variance, skewness or kurtosis of the model output, so adding valuable information for the analyst, at a small computational cost.

Introduction

In global sensitivity analysis (GSA), the mapping Y=f(X) between an output Y of a computational model and a set of uncertain input factors X=(X1,,Xk) is analyzed in order to quantify the relative contribution of each input factor to the uncertainty of Y. Variance-based analysis is the most popular method in GSA. Variance-based sensitivity indices of single factors or of groups of them are defined as [1], [24]SI=Var(E(Y|XI))Var(Y)where XI denotes a group of factors indexed by I=(i1,,ig)1gk, and they tell the portion of variance of Y that is explained by XI.

The two most popular variance-based sensitivity measures are the main effectSi=Var(E(Y|Xi))Var(Y)and the total effectSTi=E(Var(Y|X-i))Var(Y)where X-i indicates all input factors except Xi.

The main effect measures the singular contribution of the input factor Xi to the uncertainty (variance) of the output Y, while the total effect measures the overall contribution of Xi on Y, including all interaction terms of Xi with all other input factors.

There are clear links between variance-based sensitivity analysis and model emulation. First, a statistical approximation (the emulator) f^(X) can be used to compute sensitivity indices in place of the original computational mapping f(X). Second, the variance-based sensitivity measures can be interpreted as the non-parametric R2 or correlation ratio, used in statistics to measure the explanatory power of covariates in regression [2], [3]. In fact, it is well known that the inner argument E(Y|XI) of (1) is the function of the subset of input factors that approximates f(X), by minimizing a quadratic loss (i.e. maximizing the R2). Therefore, estimating E(Y|XI) provides a route for both a model approximation and sensitivity estimation. Smoothing methods that provide more or less accurate and efficient estimations of E(Y|XI) are becoming a popular approach to sensitivity analysis [4], [5], [6], [7], [8]. State-dependent parameter (SDP) modelling is one class of non-parametric smoothing approach first suggested by Young [9], [10]. The estimation is performed with the help of the ‘classical’ recursive (numerically non-intensive) Kalman filter (KF) and associated fixed interval smoothing (FIS) algorithms: it has been applied for sensitivity analysis by Ratto et al. in [11], [12].

Variance-based techniques have a quite general applicability, since they apply to a very wide range of non-linear mappings f(·) and rely on only a few assumptions, namely Y has to be square integrable and the variance is an adequate measure of the uncertainty of Y. Nonetheless, these techniques are sometimes criticized, since all kinds of sensitivity patterns that cannot be attributed to shifts in the mean (the first moment—see factor X3 in Fig. 1), are not accounted for by E(Y|Xi) and the related variance-based sensitivity index. Such sensitivity patterns can be characterized by a shift in higher order moments: the simplest example of which is the heteroscedastic process, where the variance of Y changes along the conditioning term Xi. This lead to the development of a number of sensitivity techniques, such as entropy-based sensitivity measures [13], [14] or moment independent sensitivity measures [15], [16], that provide ‘main effects’ that are able to account for such phenomena.

In this paper, we show how non-parametric techniques can be applied to estimate conditional moments of order larger than one, allowing us to add valuable information to the standard variance-based analysis and, at the same time, avoid the computational load characterizing the latter class of sensitivity measures. In fact, the analysis does not require any additional model evaluation with respect to any standard smoothing method that may be applied to estimate the E(Y|Xi) terms.

Section snippets

The method

Readers can refer to [12] for a discussion of the SDP approach to sensitivity analysis and to [10] for a more comprehensive discussion of SDP modelling and its algorithms. Here we synthesize some key concepts regarding the estimation of main effects.

Summarizing considerably, a state-dependent model approximating E(Y|Xi), based on a Monte Carlo sample of dimension N, can be written asYt=E(Y|Xi,t)+ei,t=pi,t(si,t)+ei,twhere ei,t is the observation noise (i.e. what is not explained by E(Y|Xi)), pi,t

Test function analysis

The various estimation procedures described in the previous section have been evaluated by application to the following test function due to Ishigami [19]:Y=sinX1+asin2(X2)+bX34sinX1where Xi are independent and uniformly distributed in [-π,π]. The values of the constants a and b (5 and 0.1, respectively) are the same as in [16].

The analysis is carried out using a Sobol’ quasi-random sequence of size 1024 [20]. Fig. 1 illustrates the first step of the analysis, where the conditional expectations

Conclusions

In this paper, we have discussed an extended use of smoothing procedures for the estimation of conditional moments in sensitivity analysis. This analysis can be performed at no additional cost with respect to standard smoothing analysis for the estimate of E(Y|Xi) and provides a very useful completion of the standard variance-based sensitivity analysis. In particular, it generates:

  • estimates of significant patterns in the conditional variance that are an indication of an interaction structure

References (24)

  • M. Ratto et al.

    State dependent parameter meta-modelling and sensitivity analysis

    Comput Phys Commun

    (2007)
  • E. Borgonovo

    A new uncertainty importance measure

    Reliab Eng Syst Saf

    (2007)
  • I.M. Sobol’

    Sensitivity estimates for nonlinear mathematical models

    Mat Model

    (1990)
  • K. Pearson

    On the general theory of skew correlation and non-linear regression

    Drapers Company Res Memo Biom Ser

    (1905)
  • K. Doksum et al.

    Nonparametric estimation of global functionals and a measure of the explanatory power of covariates in regression

    Ann Statist

    (1995)
  • G. Li et al.

    Practical approaches to construct RS-HDMR component functions

    J Phys Chem

    (2002)
  • G. Li et al.

    Random sampling-high dimensional model representation (RS-HDMR) and orthogonality of its different order component functions

    J Phys Chem A

    (2006)
  • J. Oakley et al.

    Probabilistic sensitivity analysis of complex models: a Bayesian approach

    J R Stat Soc B

    (2004)
  • C.B. Storlie et al.

    Multiple predictor smoothing methods for sensitivity analysis: description of techniques

    Reliab Eng Syst Saf

    (2007)
  • C.B. Storlie et al.

    Multiple predictor smoothing methods for sensitivity analysis: example results

    Reliab Eng Syst Saf

    (2007)
  • P.C. Young

    Time variable and state dependent modelling of nonstationary and nonlinear time series

  • P.C. Young

    Stochastic dynamic modelling and signal processing: time variable and state dependent parameter estimation

  • Cited by (63)

    • Copula-based decomposition approach for the derivative-based sensitivity of variance contributions with dependent variables

      2018, Reliability Engineering and System Safety
      Citation Excerpt :

      Here we briefly review some key concepts of the SDP model and their application to the computation of variance based sensitivities. Readers can refer to references [38–41] for more details. 1) The first-order HDMR

    • Improving pooling method for regularization of convolutional networks based on the failure probability density

      2017, Optik
      Citation Excerpt :

      In Section 5, three image classification datasets, namely, CIFAR-10 [14], CIFAR-100 [14], and SVHN [15], were used to verify feasibility and accuracy of the proposed method.In Section 6 summarizes this paper and provides relevant conclusions. The pooling principle seeks to summarize the output of nerve cells of the neighborhood in the feature image of the same convolution kernel mapping in the NN [16–18]. In general, the pooling elements selected can be overlapped or not overlapped, but many experiments have proved that non-overlapped pooling units can achieve better effects [19].

    View all citing articles on Scopus
    View full text