1 Introduction
1.1 Background
1.2 Review of related work
Metamodelling technique | Problem characteristic | Test problem | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Polynomial | Kriging | RBF | MLS | NN | MARS | Splines | SVR | Other (e.g. HDMR) | Dimensionality | Nonlinearity | Sample size | Noise | Benchmark functions | Engineering problem | Engineering case study | |
Simpson et al. (1998) | x | x | x | x | Aerospike nozzle design | |||||||||||
Giunta and Watson (1998) | x | x | x | x | ||||||||||||
Yang et al. (2000) | x | x | x | x | x | x | Crashworthiness simulation | |||||||||
Varadarajan et al. (200) | x | x | x | Engine combustion modelling | ||||||||||||
Jin et al. (2001) | x | x | x | x | x | x | x | x | x | |||||||
Jin et al. (2003) | x | x | x | x | x | x | Structural design | |||||||||
Seabrook et al. (2003) | x | x | x | x | x | Engine mapping experiments | ||||||||||
Forsberg and Nilsson (2004) | x | x | x | Crash simulation | ||||||||||||
Fang et al. (2005) | x | x | x | x | Crash simulation | |||||||||||
Mullur and Messac (2006) | x | x | x | x | x | x | ||||||||||
Chen et al. (2006) | x | x | x | x | x | x | x | x | x | Wastewater treatment | ||||||
Ben-Ari and Steinberg (2007) | x | x | x | x | x | x | Piston dynamics, electric circuit | |||||||||
Kim et al. (2009) | x | x | x | x | x | x | ||||||||||
Zhu et al. (2009) | x | x | x | x | x | Automotive - body structure | ||||||||||
Paiva et al. (2009) | x | x | x | x | x | Aircraft wing design | ||||||||||
Zhao and Xue (2010) | x | x | x | x | x | x | x | x | ||||||||
x | x | x | x | x | Engine modelling experiments | |||||||||||
Li et al. (2010) | x | x | x | x | x | x | x | x | x | Process simulation | ||||||
Wang et al. (2011) | x | x | x | x | x | x | x | x | x | Crashworthiness optimisation | ||||||
Van Gelder et al. (2014) | x | x | x | x | x | x | x | Building simulation | ||||||||
Liu et al. (2016) | x | x | x | x | x | x | x | 9 engineering problems | ||||||||
Kroetz et al. (2017) | x | x | x | x | x | x | Structural reliability | |||||||||
x | x | x | x | x | x | x | x | x | ||||||||
Ostergard et al. (2018) | x | x | x | x |
1.3 Research objectives and contribution
2 Metamodelling techniques
2.1 Polynomial
2.2 Radial Basis Function (RBF)
- RBF can learn the complex and nonlinear relationship between the input and output parameters (Hagan and Demuth 1999).
- RBF can confront missing and noisy data with a good generalisation capability (Hagan and Demuth 1999).
- RBF models can be more efficient (in relation to the number of measurements required) than polynomials for large-scale nonlinear problems, since the approximation model is purely driven from the collected data (through learning), rather than assuming a fixed model type in advance (He and Rutland 2004).
- RBF is very fast in learning the relationship between the input and output variables because of using two-stage network training. The first stage is to determine the weights from the ‘Input’ to the ‘Hidden’ layer, and the second stage from the ‘Hidden’ layer to the ‘Output’ layer (Khan 2011).
2.3 Kriging
- Kriging models are highly flexible due to the wide range of correlation functions.
- Kriging models require fewer measurements due to the strong interpolation among the measured sample points.
- Kriging models can either ‘honour the data’ by providing an exact interpolation of the data, or ‘smooth the data’ by providing an inexact interpolation.
3 Research methodology
3.1 Test functions and test cases
Test function | No of variables (n) | Problem type | Modelling conditions | ||||
---|---|---|---|---|---|---|---|
Scale | Nonlinearity | DoE size (test points) | Noise | ||||
MBs (10n) | MBl (30n) | Validation | |||||
Test 1 | 2 | Small | Low | 20 | 60 | 1000 | 0–15% |
Test 2 | 3 | Small | Low | 30 | 90 | 1000 | 0–15% |
Test 3 | 4 | Small | Low | 40 | 120 | 1000 | 0–15% |
Test 4 | 2 | Small | High | 20 | 60 | 1000 | 0–15% |
Test 5 | 3 | Small | High | 30 | 90 | 1000 | 0–15% |
Test 6 | 4 | Small | High | 40 | 120 | 1000 | 0–15% |
Test 7 | 5 | Medium | Low | 50 | 150 | 1500 | 0–15% |
Test 8 | 6 | Medium | Low | 60 | 180 | 1500 | 0–15% |
Test 9 | 8 | Medium | Low | 80 | 240 | 1500 | 0–15% |
Test 10 | 6 | Medium | High | 60 | 180 | 1500 | 0–15% |
Test 11 | 7 | Medium | High | 70 | 210 | 1500 | 0–15% |
Test 12 | 8 | Medium | High | 80 | 240 | 1500 | 0–15% |
Test 13 | 10 | Large | Low | 100 | 300 | 2000 | 0–15% |
Test 14 | 16 | Large | Low | 160 | 480 | 2000 | 0–15% |
Test 15 | 20 | Large | Low | 200 | 600 | 2000 | 0–15% |
Test 16 | 10 | Large | High | 100 | 300 | 2000 | 0–15% |
Test 17 | 14 | Large | High | 140 | 420 | 2000 | 0–15% |
Test 18 | 18 | Large | High | 180 | 540 | 2000 | 0–15% |
- Small-scale (k ≤ 4) (6 test functions)
- Medium-scale (5 ≤ k ≤ 8) (6 test functions)
- Large-scale (k ≥ 9) (6 test functions)
- Low-order nonlinear problems (9 test functions)
- High-order nonlinear problems (9 test functions)
- Base test—‘0%’ noise condition (or smooth data);
- Repeat 1—‘5%’ noise condition;
- Repeat 2—‘10%’ noise condition;
- Repeat 3—‘15%’ noise condition.
3.2 Selection of Metamodelling techniques
- Two polynomial metamodels: order 2 (‘Poly 2’) and three (‘Poly 3_c2’—i.e. cubic polynomial with interaction order 2);
- Three RBF metamodels: using Gaussian, Thinplate and Multiquadric kernel functions (shown as ‘RBF_G’, ‘RBF_TP’ and ‘RBF_MQ’);
- Three Kriging metamodels: using Gaussian (‘Krig_G’), Matérn 3/2 (‘Krig_M32’) and Matérn 5/2 (‘Krig_M52’) correlation functions;
- Three Kriging metamodels with nugget factor, using Gaussian (‘KrigN_G’), Matérn 3/2 (‘KrigN_M32’) and Matérn 5/2 (‘KrigN_M52’) correlation functions;
3.3 Performance metrics and analysis procedure
3.4 Validation test cases
Type of experiment | Test problem | Problem characteristics |
---|---|---|
Physical | V1: Modelling of fuel consumption response for a diesel engine based on dynamometer testing data (Kianifar et al. 2014) | •Medium scale •High-order nonlinearity •Medium/large data set •Low noise |
V2: Modelling of NOx response for a diesel engine based on dynamometer testing data (Kianifar et al. 2014) | •Medium scale •High-order nonlinearity •Medium/large data set •High noise | |
Computer-based/CAE | V3: Modelling the head injury criteria (HIC) response for a tall man during a car-pedestrian impact based on dynamic CAE experiments (Zhao et al. 2010) | •Medium scale •High-order nonlinearity •Small sample data set •Medium noise (induced in the experiment) |
•Large scale •High-order nonlinearity •Small sample data set •No noise |
4 Results and analysis
4.1 Evaluation of metamodelling techniques for small sample DoEs
- Polynomial models can deliver good models (the NRMSE whiskers extend to 0), but their robustness is worse than most other models. In particular, the cubic polynomial model shows a very wide NRMSE range for the large-scale test functions, which can be explained by the number of coefficients needed for the larger scale test problems in relation to the DoE size.
- The performance of the RBF models improves relative to polynomials for the large-scale problems, in particular in terms of robustness.
- Kriging with nugget metamodelling delivers robust performance, but inferior to the standard Kriging in terms of accuracy (average NRMSE), which is not surprising given that there is no noise in the response.
4.2 Evaluation of metamodelling techniques for large sample DoEs
4.3 Sensitivity to noise
- The performance of the Kriging with nugget metamodelling technique is remarkably quite insensitive to noise levels within the range considered in the study. However, the average NRMSE of Kriging with nugget (with Matern 5/2 correlation function) outperforms the standard Kriging under highest levels of noise (15%) and with a large DoE sample size.
- As expected, a general trend of worsening NRMSE with higher noise is observed across the metamodelling techniques, with the exception of Kriging with nugget. The polynomial models also show good stability in relation to increase noise for large DoE sample sizes.
- It is also seen that the accuracy of modelling improves with the large DoE sample size, except for the RBF with Gaussian Kernel. In general, RBF metamodels show the greatest sensitivity to noise.
4.4 Evaluation of computational efficiency of Metamodelling techniques
DoE size | ||||||
---|---|---|---|---|---|---|
MBs | MBs | MBs | MBl | MBl | MBl | |
Model/scale | Small | Medium | Large | Small | Medium | Large |
Poly 2 | 0.01 | 0.01 | 0.46 | 0.00 | 0.02 | 0.20 |
Poly 3_c2 | 0.00 | 0.00 | 0.05 | 0.00 | 0.15 | 0.43 |
RBF_G | 0.00 | 0.00 | 0.06 | 0.01 | 0.01 | 0.20 |
RBF_TP | 0.00 | 0.00 | 0.02 | 0.01 | 0.02 | 0.09 |
RBF_MQ | 0.00 | 0.00 | 0.02 | 0.00 | 0.03 | 0.06 |
Krig_G | 1.81 | 8.61 | 102.22 | 5.88 | 55.03 | 653.34 |
Krig_M32 | 1.96 | 8.85 | 131.79 | 8.43 | 82.31 | 645.37 |
Krig_M52 | 1.92 | 8.09 | 109.66 | 7.12 | 48.38 | 1365.52 |
KrigN_G | 1.82 | 7.39 | 58.17 | 6.59 | 56.17 | 724.76 |
KrigN_M32 | 1.50 | 5.44 | 71.21 | 6.70 | 50.34 | 603.08 |
KrigN_M52 | 1.81 | 8.08 | 109.49 | 5.24 | 55.88 | 781.47 |
4.5 Overall guideline for choosing a robust metamodelling method
- Kriging with Matérn 5/2 correlation function outperforms all the other modelling methods, regardless of problem scale and DoE size.
- Kriging with Matérn 5/2 correlation function performs best for highly nonlinear functions, regardless of the problem scale and DoE size, while also performing reasonably accurate for low nonlinear problems. Polynomials (especially Cubic models) are also providing competitive results to the Krig_M52 method for low-order non-linear functions, however; a Polynomial might require a larger DoE sample to calculate the model coefficients.
- Kriging with Matérn 3/2 correlation function excels other modelling techniques in terms of accuracy and robustness.
- Kriging with nugget factor (Matérn 5/2 correlation function) can be a better modelling option for highly noisy data (15% noise) with large DoE samples.
5 Validation case studies
5.1 Diesel engine fuel consumption
- Medium scale: 6 design parameters;
- Medium- to high-order nonlinearity: due to the expected physical behaviour of the fuel consumption response (Kianifar et al. 2014);
- Medium DoE sample: 127 Model Building points for 6 parameters, which is bigger than a small DoE of 60 (i.e. 10 × 6) points, but smaller that the “large” DoE (i.e. 30 × 6 = 180);
- Low noise condition (< 5%): measurement of fuel flow for a Diesel engine on an engine dynamometer test setting is reasonably accurate—though not 0% noise.
5.2 Diesel engine NOx emissions
5.3 Metamodelling for crashworthiness/pedestrian impact simulation
- Medium scale: with 7 design parameters
- High-order nonlinear: due to the highly nonlinear behaviour of head injury response in relation to the design parameters (Zhao et al. 2010)
- Small sampling data: 70 points for 7 parameters (i.e. 10 × 7)
- About 10% noise condition, due to the applied error range during the simulation process
5.4 Metamodelling for the MARTHE simulation data
- Large scale: with 20 design parameters
- High-order nonlinear: due to the high-order nonlinear behaviour of the response (Volkova et al. 2006)
- Small sampling data: 200 points for 20 parameters (i.e. 10 × 20)
- 0% noise condition: collected from the MARTHE simulation code
6 Discussion, conclusions and future work
- Under 0% noise condition, Kriging with Matérn 5/2 correlation function outperforms the other methods for high-order nonlinear problems; however, for low-order problems, Polynomials are providing competitive response models in terms of accuracy, and have the advantage that are easier to interpret;
- Problem scale has an insignificant effect on the performance of modelling techniques in terms of accuracy, and Kriging with both Matérn 5/2 and Matérn 3/2 correlation functions provide highly accurate response models;
- Increasing the number of sampling points usually results in an enhancement in the performance of modelling techniques, in terms of accuracy and robustness; however, this is also dependent on the noise condition;
- For both small and large DoE sample sizes, Kriging with Matérn 5/2 correlation function excels in terms of accuracy and robustness;
- For engineering problems under noise condition, Kriging modelling technique with Matérn 3/2 correlation function performs better than other techniques; however, its performance might deteriorate under very high-noise conditions (i.e. 15% noise or more), where using a Kriging model with nugget factor might provide better models;
- In terms of computation-efficiency, both Polynomials and RBFs take a trivial amount of time to construct the models, while the Kriging can be very time-consuming, in particular for large samples; however, from a practical engineering point of view, the cost of computation is perhaps less significant compared to the benefits gained from a better quality model.