A software framework for probabilistic sensitivity analysis for computationally expensive models

https://doi.org/10.1016/j.advengsoft.2016.06.005Get rights and content

Highlights

  • Sensitivity analysis (SA) approach is used to quantify the effects of correlated input parameters on model outputs.

  • Penalized spline regression model is used to approximate complex data.

Abstract

We provide a sensitivity analysis toolbox consisting of a set of Matlab functions that offer utilities for quantifying the influence of uncertain input parameters on uncertain model outputs. It allows the determination of the key input parameters of an output of interest. The results are based on a probability density function (PDF) provided for the input parameters. The toolbox for uncertainty and sensitivity analysis methods consists of three ingredients: (1) sampling method, (2) surrogate models, (3) sensitivity analysis (SA) method. Numerical studies based on analytical functions associated with noise and industrial data are performed to prove the usefulness and effectiveness of this study.

Introduction

In many fields such as structural reliability [30], [31], [32], material modeling [30], [31], [33], finance etc., mathematical (numerical) models are used for predicting the response of a system. Due to the increasing computer power, the complexity of the model is growing. Generally, the more complex the models are, the larger becomes the uncertainty in the model outputs due to randomness in the input parameters. It is essential to determine how much the model output is changed by the variation in input parameters as well as calibrate and validate the mathematical models. Sensitivity analysis (SA) is a great help for these purposes [1]. Therefore, uncertainty and sensitivity analysis have recently received widespread interest of researcher in many fields such as material modeling and structural design. Numerous SA approaches have been developed to quantify the models with uncorrelated parameters [2]. However, engineering systems are complex and frequently contain correlated input parameters such that if one parameter varies, it results in variations in other parameters. The variation in the output of the models with correlated input parameters (e.g., composition constraints in material modeling [3]) is not only contributed by the variations in input parameter itself, but also contributed by the correlated variations in other parameters [4]. Hence, it is more realistic to estimate the effects of changing more than one parameters on the model outputs simultaneously. It is essential to understand the relations among the uncertain input parameters for designing a SA.

A few methods have been developed to quantitatively assess the effect of correlated input parameters on the model outputs. For instance, Xu and Gernert [4] improved the original Fourier amplitude sensitivity test (FAST) associated with Iman and Conover method [18] – used to generate correlated samples – to properly measure the sensitivity index for a model with correlated input parameters. Then, they developed another method to evaluate the sensitivity index for the uncorrelated and correlated contributions, see [5]. Nevertheless, those methods are limited in estimating the first-order sensitivity indices and the latter is based on a weak assumption that the model output linearly relates to input parameters. Later, SA methods, see [1], [6], [7], were proposed to quantitatively assess the total-effect sensitivity index that is essential for model simplification, however, most of them deal with analytical functions but not for experimental or simulation data.

Hence, a unified framework that links different steps from generating sample, constructing the surrogate model and implementing the sensitivity analysis method is needed. In this article, a review and computer implementation for uncertainty and sensitivity analysis and its application in engineering analysis has been carried out to provide a robust and powerful modeling tool to support for designing uncertainty and sensitivity analysis. We employ a sensitivity analysis (SA) method for the case of correlated parameters [6] whose formulas were derived similarly to Sobol’ formulas for the case of uncorrelated parameters [8]. For the estimation, the sample data is generated from the joint and conditional probability distribution functions of input parameters which are required to account for the constraints in the inputs space. Gaussian copula is used to generate a joint cumulative distribution function (CDF) (multivariate normal distribution) that requires only marginal distributions and covariance matrix of input parameters.

Complex models are often very time-consuming and computationally expensive so that they cannot be used to compute sensitivity indices. Thus, the so-called surrogate-based approach is employed as an approximation of the real model for sensitivity analysis. In [10], the authors presented a penalized spline regression model for a single continuous predictor. Since predictor variables have nonlinear relationships with the model output, the regression models considering multiple smooth functions [11] are adopted in this article to approximate the observed data. Subsequently, the SA indices are computed based on penalized spline regression models.

The objective of this work is to provide a MATLAB toolbox consisting of a set of functions that can be used to randomly generate samples, construct the surrogate model and carry out the SA. The computer implementation has been presented in this article. The support MATLAB code can be found at the website (http://www.uni-weimar.de/Bauing/rabczuk/).

The article is outlined as follows. In the next section, we briefly describe the flow chart and structure of the framework. The sampling technique is shown in Section 3. Surrogate models containing polynomial and penalized spline regression models are presented in Section 4.2. The SA is described in Section 5. Application of the SA method for models with correlated input parameters are presented in Section 6 including two analytical models with additional noise and an industrial example. Finally, we close the manuscript with concluding remarks.

Section snippets

Matlab toolbox: A flowchart and structure of the code

A framework including sampling of correlated input parameters, construction of surrogate model, and implementation of sensitivity analysis are respectively described in Fig. 1. Also, the structure and purpose of the code are briefly depicted in Table 1. In the first step, sampling technique is used to randomly generate correlated input values which are then inserted into the computational model to obtain the model response in the second step. In the third step, the regression model is used to

Sampling method

Latin Hypercube Sampling (LHS) [14], [15] is an improved sampling strategy that enables a reliable approximation of the stochastic properties even for a small number of samples N. LHS is used to provide the design points which are spread throughout the design space. The LHS can be summarized as:

  • Divide the cumulative curve into N equal intervals on the cumulative distribution of each parameter;

  • A probability value is then randomly selected from each interval of the parameter distributionProbi=(1/N

Polynomial regression model

Given a computational model with d predictor variables, the general multiple regression model isYj=β0+i=1dβixji+ϵj.

The model response Y of N observed data points can be shown as:Y^=Xβ+ϵ,in which Y is an N × 1 vector, X=[1X1Xd], ϵ is an N × 1 error vector and β are the regression coefficients. By minimizingi=1N(YixiTβ)2=YXβ2the least-squares estimate of β is determined asβ^=(XTX)1XTY,with xj=[1,xj1,,xjd],j=1,,N. To test the model, we simply compute either the root mean square error (

Extension of Sobol’ approach for models with correlated inputs (ESACIs)

We consider a computational model f(X1, X2, …, Xd) defined in Rd with a realization matrix X=(X1,,Xd). If the realization matrix X is partitioned into y=(Xi1,,Xis),1is<k and z=(Xis+1,,Xd), the total variance of f(X1, X2, …, Xd) can be written as follows:V=Vy[Ezf(y,zˇ)]+Ey[Vz(f(y,zˇ))].

We can estimate the first-order of the group y bySy=Vy[Ez(f(y,zˇ))]V,and the total-effect index of the group z bySTz=Ez[Vy(f(yˇ,z))]V.

Similarly, the total-effect index of the group y can be computed bySTy=Ey[Vz

Application

In this article, we use the presented method applied to analytical models for the first two examples and industrial data for the last one. If the input parameters follow a correlated multivariate normal distribution, an application of the method is straightforward. In the general case, arbitrary multivariate data need to be transformed to a correlated multivariate normal distribution through copula-based methods. With regards to the first two analytical, a random noise will be added to model

Conclusions

A unified method for probabilistic sensitivity analysis for computationally expensive models with correlated inputs has been presented. The framework presents sequential steps consisting of generating random sample data by LHS, transforming an arbitrary distributions into a multivariate normal distribution, approximating the observed data via surrogate models and estimating the sensitivity indices for the model with correlated parameters. In summary, the computer implementation involves the

Acknowledgements

We gratefully acknowledge the support from National Basic Research Program of China (973 Program: 2011CB013800), NSFC (41130751), the Ministry of Science and Technology of China (SLDRCE14-B-31), Science and Technology Commission of Shanghai Municipality (16QA1404000), IRSES-MULTIFRAC. The support from the Alexander von Humboldt Foundation in the framework of the Sofja Kovalevskaja Award endowed by the Federal Ministry of Education and Research is acknowledged by X. Zhuang.

References (33)

  • C. Xu et al.

    Extending a global sensitivity analysis technique to models with correlated parameters

    Comput. Stat. Data Anal.

    (2007)
  • I.M. Sobol’

    Sensitivity analysis for non-linear mathematical models

    Math. Model. Comput. Exp.

    (1993)
  • Schumann E.....
  • N. Vu-Bac et al.

    A unified framework for stochastic predictions of mechanical properties of polymeric nanocomposites

    Comput. Mat. Sci.

    (2015)
  • D. Ruppert et al.

    Semiparametric regression

    (2003)
  • S.N. Lophaven et al.

    DACE A MATLAB Kriging toolbox

    Informatics and mathematical modeling

    (2002)
  • Cited by (542)

    View all citing articles on Scopus
    View full text