A comparison of uncertainty and sensitivity analysis results obtained with random and Latin hypercube sampling

doi:10.1016/j.ress.2004.09.006

Reliability Engineering & System Safety

Volume 89, Issue 3, September 2005, Pages 305-330

Abstract

Uncertainty and sensitivity analysis results obtained with random and Latin hypercube sampling are compared. The comparison uses results from a model for two-phase fluid flow obtained with three independent random samples of size 100 each and three independent Latin hypercube samples (LHSs) of size 100 each. Uncertainty and sensitivity analysis results with the two sampling procedures are similar and stable across the three replicated samples. Poor performance of regression-based sensitivity analysis procedures for some analysis outcomes results more from the inappropriateness of the procedure for the nonlinear relationships between model input and model results than from an inadequate sample size. Kendall's coefficient of concordance (KCC) and the top down coefficient of concordance (TDCC) are used to assess the stability of sensitivity analysis results across replicated samples, with the TDCC providing a more informative measure of analysis stability than KCC. A new sensitivity analysis procedure based on replicated samples and the TDCC is introduced.

Introduction

The identification and representation of the implications of uncertainty is widely recognized as a fundamental component of analyses of complex systems [1], [2], [3], [4], [5], [6], [7], [8], [9], [10]. The study of uncertainty is usually subdivided into two closely related activities referred to as uncertainty analysis and sensitivity analysis, where (i) uncertainty analysis involves the determination of the uncertainty in analysis results that derives from uncertainty in analysis inputs and (ii) sensitivity analysis involves the determination of relationships between the uncertainty in analysis results and the uncertainty in individual analysis inputs.

At an abstract level, the analysis or model under consideration can be represented as a function of the form $y = y(x) = f(x)$, where $x = [x_{1}, x_{2}, \dots, x_{nX}]$ is a vector of uncertain analysis inputs and $y = [y_{1}, y_{2}, \dots, y_{nY}]$ is a vector of analysis results. Further, a sequence of distributions $D_{1}, D_{2}, \dots, D_{nX}$ is used to characterize the uncertainty associated with the elements of $x$, where $D_{i}$ is the distribution associated with $x_{i}$ for $i = 1, 2, \dots, nX$. Correlations and other restrictions involving the elements of $x$ are also possible. The goal of uncertainty analysis is to determine the uncertainty in the elements of $y$ that derives from the uncertainty in the elements of $x$ characterized by the distributions $D_{1}, D_{2}, \dots, D_{nX}$ and any associated restrictions. The goal of sensitivity analysis is to determine relationships between the uncertainty associated with individual elements of $x$ and the uncertainty associated with individual elements of $y$.

A variety of approaches to uncertainty and sensitivity analysis are in use, including (i) differential analysis, which involves approximating a model with a Taylor series and then using variance propagation formulas to obtain uncertainty and sensitivity analysis results [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], (ii) response surface methodology, which is based on using classical experimental designs to select points for use in developing a response surface replacement for a model and then using this replacement model in subsequent uncertainty and sensitivity analyses based on Monte Carlo simulation and variance propagation [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], (iii) the Fourier amplitude sensitivity test (FAST) and other variance decomposition procedures, which involve the determination of uncertainty and sensitivity analysis results on the basis of the variance of model predictions and the contributions of individual variables to this variance [36], [37], [38], [39], [40], [41], [42], [43], [44], [45], [46], [47], [48], [49], [50], [51], [52], [53], [54], [55], (iv) fast probability integration, which is primarily an uncertainty analysis procedure used to estimate the tails of uncertainty distributions for model predictions [56], [57], [58], [59], [60], [61], [62], and (v) sampling-based (i.e. Monte Carlo) procedures, which involve the generation and exploration of a probabilistically based mapping from analysis inputs to analysis results [63], [64], [65], [66], [67], [68], [69], [70], [71], [72], [73]. Additional information on uncertainty and sensitivity analysis is available in a number of reviews [69], [70], [74], [75], [76], [77], [78], [79], [80]. The primary focus of this presentation is on sampling-based methods for uncertainty and sensitivity analysis.

Sampling-based approaches for uncertainty and sensitivity analysis are very popular [81], [82], [83], [84], [85], [86], [87], [88], [89], [90], [91], [92], [93], [94], [95], [96]. Desirable properties of these approaches include conceptual simplicity, ease of implementation, generation of uncertainty analysis results without the use of intermediate models, and availability of a variety of sensitivity analysis procedures [67], [69], [76], [97], [98]. Despite these positive properties, concern is often expressed about using these approaches because of the computational cost involved. In particular, the concern is that the sample sizes required to obtain meaningful results will be so large that analyses will be computationally impracticable for all but the most simple models. At times, statements are made that 1000 to 10,000s of model evaluations are required in a sampling-based uncertainty/sensitivity analysis.

In this presentation, results obtained with a computationally demanding model for two-phase fluid flow are used to illustrate that robust uncertainty and sensitivity analysis results can be obtained with relatively small sample sizes. Further, results are obtained and compared for replicated random and Latin hypercube samples (LHSs) [63], [73]. For the problem under consideration, random and LHSs of size 100 produce similar, stable results.
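As a rough illustration of the two sampling procedures being compared, the following Python sketch draws replicated random samples and replicated LHSs of size 100. It is a minimal sketch under stated assumptions: the inputs are taken to be independent and uniform on [0, 1), so the mapping to the distributions D_i and any correlation control are omitted, and the function names (random_sample, latin_hypercube_sample, is_stratified) are hypothetical. It is not the sampling code used in the analysis described here.

```python
import random


def random_sample(n, n_x, rng):
    """Simple random sample: n points drawn uniformly from the unit hypercube."""
    return [[rng.random() for _ in range(n_x)] for _ in range(n)]


def latin_hypercube_sample(n, n_x, rng):
    """LHS: split each variable's range into n equal-probability strata,
    draw one value per stratum, and pair strata at random across variables."""
    columns = []
    for _ in range(n_x):
        strata = list(range(n))
        rng.shuffle(strata)  # random pairing with the other variables
        columns.append([(k + rng.random()) / n for k in strata])
    # Transpose so that each row is one sample element x = [x_1, ..., x_nX].
    return [list(row) for row in zip(*columns)]


def is_stratified(sample, j):
    """True if variable j has exactly one value in each of the n strata."""
    n = len(sample)
    values = sorted(row[j] for row in sample)
    return all(k / n <= v < (k + 1) / n for k, v in enumerate(values))


rng = random.Random(2005)  # arbitrary seed, chosen only for reproducibility
random_replicates = [random_sample(100, 3, rng) for _ in range(3)]
lhs_replicates = [latin_hypercube_sample(100, 3, rng) for _ in range(3)]
print(all(is_stratified(s, j) for s in lhs_replicates for j in range(3)))
```

The stratification check is the property that distinguishes an LHS from a random sample: every variable is sampled once from each of its n equal-probability intervals, whereas a random sample gives no such guarantee.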

The presentation is organized as follows. The analysis problem is described in Section 2. Then, the following topics are considered: stability of uncertainty analysis results (Section 3), stability of sensitivity analysis results based on stepwise rank regression (Section 4), use of coefficients of concordance in comparing replicated sensitivity analyses (Section 5), sensitivity analysis based on replicated samples and the top down coefficient of concordance (Section 6), sensitivity analysis with reduced sample sizes (Section 7), and sensitivity analysis without regression analysis (Section 8). Finally, the presentation ends with a concluding discussion (Section 9).

Analysis problem

The analysis problem under consideration comes from the 1996 performance assessment (PA) for the Waste Isolation Pilot Plant (WIPP) [99], [100]. This PA was the core analysis that supported the successful Compliance Certification Application (CCA) by the US Department of Energy (DOE) to the US Environmental Protection Agency (EPA) for the operation of the WIPP [101]. With the certification of the WIPP by the EPA for the disposal of transuranic waste in May 1998 [102], the WIPP became the first

Uncertainty analysis results

The time-dependent results in Fig. 2 display the uncertainty in solutions to Eqs. (2.1)–(2.6) that results from uncertainty in the 31 variables in Table 1. The goal of this presentation is to illustrate the robustness of such uncertainty representations with respect to the type and size of the sample in use. As previously indicated, results at 1000 and 10,000 yr will be used for illustration.

One way to compare uncertainty analysis results is to present

Stepwise results

A sensitivity analysis based on stepwise regression analysis with rank-transformed data [118] was carried out for the replicated samples summarized in Fig. 4 (Tables 3–6). This analysis required α-values of 0.02 and 0.05 for variables to enter and to be retained in a given analysis, respectively, and was carried out with the stepwise program [119]. The summary tables (Tables 3–6) present results for both the individual replicates and for the three replicates of a given

Coefficients of concordance

Inspection of the results in Tables 3–6 suggests that the individual replicates are producing similar results. Kendall's coefficient of concordance (KCC) provides a way to formally assess this similarity (p. 305, Ref. [120]). This coefficient is based on the consideration of arrays of the form $\begin{matrix} & R_{1} & R_{2} & \cdots & R_{nR} \\ x_{1} & r(O_{11}) & r(O_{12}) & \cdots & r(O_{1,nR}) \\ x_{2} & r(O_{21}) & r(O_{22}) & \cdots & r(O_{2,nR}) \\ \vdots & \vdots & \vdots & & \vdots \\ x_{nX} & r(O_{nX,1}) & r(O_{nX,2}) & \cdots & r(O_{nX,nR}) \end{matrix}$ where $x_{1}, x_{2}, \dots, x_{nX}$ are the variables under consideration (i.e. $nX = 29$ with the exclusion of
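To make the rank array concrete, the following Python sketch computes Kendall's coefficient of concordance from an array with one row per variable and one column per replicate. It uses the standard textbook form without a correction for ties, W = 12 Σ_j (R_j − nR(nX + 1)/2)² / (nR² (nX³ − nX)), where R_j is the sum of the ranks in row j. This is an assumption made for illustration: the text above is truncated before the coefficient is defined, so the paper's exact expression and its handling of tied ranks may differ. The function name kendall_w is hypothetical.

```python
def kendall_w(ranks):
    """Kendall's coefficient of concordance for a rank array without ties.

    ranks[j][i] is the rank r(O_ji) of variable x_j in replicate R_i,
    with each column a permutation of 1..nX.
    """
    n_x = len(ranks)       # number of variables (rows)
    n_r = len(ranks[0])    # number of replicates (columns)
    row_sums = [sum(row) for row in ranks]
    mean_sum = n_r * (n_x + 1) / 2  # expected row sum under no agreement
    s = sum((t - mean_sum) ** 2 for t in row_sums)
    return 12 * s / (n_r ** 2 * (n_x ** 3 - n_x))


# Three replicates that rank four variables identically.
agree = [[1, 1, 1], [2, 2, 2], [3, 3, 3], [4, 4, 4]]
# Two replicates whose rankings are exact reversals of each other.
reverse = [[1, 4], [2, 3], [3, 2], [4, 1]]
print(kendall_w(agree), kendall_w(reverse))
```

Values near 1 indicate that the replicates order the variables in nearly the same way, and values near 0 indicate little agreement.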

Sensitivity analysis with the TDCC

Replicated samples and the TDCC provide the basis for a sensitivity analysis procedure to identify important sets of variables that does not depend on direct testing of the statistical significance of sensitivity measures (e.g. the significance of the coefficients in a stepwise regression model as defined by an α-value for entry into the model). Rather, important variables are identified by the similarity of outcomes in analyses performed for the individual replicated samples.
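The following Python sketch shows one way the TDCC can be computed from the same kind of rank array. It assumes the commonly cited Savage-score form, in which rank r is replaced by ss(r) = Σ_{i=r}^{nX} 1/i and TDCC = [Σ_j (Σ_i ss(r(O_ji)))² − nR² nX] / [nR² (nX − Σ_{i=1}^{nX} 1/i)], with no ties. The section defining the TDCC is not reproduced here, so treating this as the paper's exact formula is an assumption, and the function names savage_scores and tdcc are hypothetical.

```python
def savage_scores(n_x):
    """Savage scores ss(r) = sum_{i=r}^{n_x} 1/i, returned with ss(r) at index r-1."""
    scores = [0.0] * n_x
    tail = 0.0
    for r in range(n_x, 0, -1):
        tail += 1.0 / r
        scores[r - 1] = tail
    return scores


def tdcc(ranks):
    """Top down coefficient of concordance for a rank array without ties.

    ranks[j][i] is the rank of variable x_j in replicate R_i (1 = most important).
    """
    n_x = len(ranks)
    n_r = len(ranks[0])
    ss = savage_scores(n_x)
    total = sum(sum(ss[r - 1] for r in row) ** 2 for row in ranks)
    harmonic = ss[0]  # ss(1) equals 1 + 1/2 + ... + 1/n_x
    return (total - n_r ** 2 * n_x) / (n_r ** 2 * (n_x - harmonic))


# Two replicates of six variables that differ only in their two lowest ranks.
swap_bottom = [[1, 1], [2, 2], [3, 3], [4, 4], [5, 6], [6, 5]]
# Two replicates that differ only in their two highest ranks.
swap_top = [[1, 2], [2, 1], [3, 3], [4, 4], [5, 5], [6, 6]]
print(tdcc(swap_bottom), tdcc(swap_top))
```

Because Savage scores weight the highest ranks most heavily, a disagreement among the most important variables lowers the TDCC more than the same disagreement among the least important ones, whereas the tie-free KCC treats the two cases identically.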

The procedure

Sensitivity analysis with small samples

The sensitivity analysis results obtained with random and LHSs of size 100 are very similar and thus indicate that a sample size of 100 is adequate for the problem under consideration. The question naturally arises whether smaller sample sizes would also be adequate.

To partially address this question, the random samples were pooled to produce 300 observations, and then three samples of size 50 were obtained by randomly sampling from these 300 observations. Each new sample of size 50 was produced by

Sensitivity analysis without regression

The regression analyses summarized in Tables 3–6 exhibit various levels of success. Some analyses are quite good, with R² values above 0.9. Other analyses are not quite so good, with R² values in the range from 0.6 to 0.8. The analyses for WAS_PRES at 10,000 yr are effectively failures, with R² values in the vicinity of 0.2.

An important aspect of the analyses in Tables 3–6 is that the identification of dominant variables tends to remain the same

Discussion

Uncertainty and sensitivity analysis results obtained with replicated random and LHSs are compared. In particular, uncertainty and sensitivity analyses were performed for a large model for two-phase fluid flow with three independently generated random samples of size 100 each and also three independently generated LHSs of size 100 each.

For the outcomes under consideration, analyses with random and LHSs produced similar results. Specifically, there is little difference in the uncertainty and

Acknowledgements

Work performed for Sandia National Laboratories (SNL), which is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of Energy under contract DE-AC04-94AL85000. Review provided at SNL by M. Chavez, J. Garner, and S. Halliday. Editorial support provided by F. Puffer, J. Ripple, and K. Best of Tech Reps, Inc.

References (157)

W.L. Oberkampf et al., Error and uncertainty in modeling and simulation, Reliab Eng Syst Saf (2002)

J.C. Helton et al., Guest editorial: treatment of aleatory and epistemic uncertainty in performance assessments for complex systems, Reliab Eng Syst Saf (1996)

M.D. Morris et al., Exploratory designs for computational experiments, J Stat Plan Inf (1995)

R.I. Cukier et al., Nonlinear sensitivity analysis of multiparameter model systems, J Comput Phys (1978)

A. Saltelli et al., An alternative way to compute Fourier amplitude sensitivity test (FAST), Comput Stat Data Anal (1998)

A. Saltelli et al., About the use of rank transformation in sensitivity analysis of model output, Reliab Eng Syst Saf (1995)

T. Homma et al., Importance measures in global sensitivity analysis of nonlinear models, Reliab Eng Syst Saf (1996)

M.D. McKay, Nonparametric variance-based methods of assessing uncertainty importance, Reliab Eng Syst Saf (1997)

M.J.W. Jansen, Analysis of variance designs for model output, Comput Phys Commun (1999)

M.D. McKay et al., Evaluating prediction uncertainty in simulation models, Comput Phys Commun (1999)

A guide for uncertainty analysis in dose and risk assessments related to environmental contamination (1996)

Science and judgment in risk assessment (1994)

Issues in risk assessment (1993)

An SAB report: multi-media risk assessment for radon, review of uncertainty analysis of risks associated with exposure to radon (1993)

O. Øvreberg et al., Putting error bars on reservoir engineering forecasts, J Pet Technol (1992)

Evaluating the reliability of predictions made using environmental transfer models (1989)

M.B. Beck, Water-quality modeling: a review of the analysis of uncertainty, Water Resour Res (1987)

R. Tomovic et al., General sensitivity theory (1972)

P.M. Frank, Introduction to system sensitivity theory (1978)

J.-T. Hwang et al., The Green's function method of sensitivity analysis in chemical kinetics, J Chem Phys (1978)

E.P. Dougherty et al., A computational algorithm for the Green's function method of sensitivity analysis in chemical kinetics, Int J Chem Kinet (1979)

E.P. Dougherty et al., Further developments and applications of the Green's function method of sensitivity analysis in chemical kinetics, J Chem Phys (1979)

D.G. Cacuci et al., Sensitivity theory for general systems of nonlinear equations, Nucl Sci Eng (1980)

D.G. Cacuci, Sensitivity theory for nonlinear systems. I. Nonlinear functional analysis approach, J Math Phys (1981)

D.G. Cacuci, Sensitivity theory for nonlinear systems. II. Extensions to additional classes of responses, J Math Phys (1981)

D.G. Cacuci et al., On the application of the adjoint method of sensitivity analysis to problems in the atmospheric sciences, Atmósfera (1994)

H. Rabitz et al., Sensitivity analysis in chemical kinetics

T. Turányi, Sensitivity analysis of complex kinetic systems. Tools and applications, J Math Chem (1990)

L. Vuilleumier et al., First- and second-order sensitivity analysis of a photochemically reactive system (a Green's function approach), Environ Sci Technol (1997)

D.G. Cacuci, Sensitivity and uncertainty analysis (2003)

W.J. Hill et al., A review of response surface methodology: a literature review, Technometrics (1966)

R. Mead et al., A review of response surface methodology from a biometric viewpoint, Biometrics (1975)

R.H. Myers, Response surface methodology (1971)

R.H. Morton, Response surface methodology, Math Scientist (1983)
