Response surface methodology with prediction uncertainty: A multi-objective optimisation approach

doi:10.1016/j.cherd.2011.12.012

Chemical Engineering Research and Design

Volume 90, Issue 9, September 2012, Pages 1235-1244

https://doi.org/10.1016/j.cherd.2011.12.012

Abstract

In the field of response surface methodology (RSM), the prediction uncertainty of the empirical model needs to be considered for effective process optimisation. Current methods combine the prediction mean and uncertainty through certain weighting strategies, either explicitly or implicitly, to form a single objective function for optimisation. This paper proposes to address this problem under the multi-objective optimisation framework. Overall, the method iterates through initial experimental design, empirical modelling and model-based optimisation to allocate promising experiments for the next iteration. Specifically, Gaussian process regression is adopted as the empirical model because the literature has demonstrated its prediction accuracy and its reliable quantification of prediction uncertainty. The non-dominated sorting genetic algorithm II (NSGA-II) is used to search for Pareto points, which are then clustered to give the experimental points to be tested in the next iteration. The application study, on the optimisation of a catalytic epoxidation process, demonstrates that the proposed method is a powerful tool to aid the development of chemical and potentially other processes.

Highlights

► The model prediction uncertainty is addressed in response surface methodology.
► The prediction mean and uncertainty are combined through a multi-objective optimisation method.
► The methodology is applied to the optimisation of a catalytic reaction process.

Introduction

Response surface methodology (RSM) is a widely used technique for rational experimental design and process optimisation in the absence of mechanistic information (Box and Draper, 1987, Myers and Montgomery, 1995). RSM starts from design of experiments (DoE), which determines the factor values at which experiments are conducted and data are collected. The data are then used to develop an empirical model that relates the process response to the factors. Subsequently, the model is used to search for a better process response, which is validated through experiment(s). The above procedure iterates until an optimal process is identified or the limit on experimental resources is reached. RSM has seen diverse applications in almost every area of scientific research and engineering practice, including the development of chemical and biochemical processes (Agatonovic-Kustrin et al., 1998, Baumes et al., 2004, Dutta et al., 2004, Hadjmohammadi and Kamel, 2008, Shao et al., 2007, Tang et al., 2010, Yan et al., 2011a, Yan et al., 2011b).

In traditional RSM, a first- or second-order polynomial function is adopted for empirical modelling. However, the restrictive functional form of polynomials has long been recognised as ineffective in modelling complex processes. Progress in adopting more flexible models in RSM includes artificial neural networks (ANN) (Agatonovic-Kustrin et al., 1998, Baumes et al., 2004, Dutta et al., 2004, Shao et al., 2007), support vector regression (SVR) (Hadjmohammadi and Kamel, 2008, Serna et al., 2008), and more recently Gaussian process regression (GPR) (Tang et al., 2010, Yan et al., 2011a, Yan et al., 2011b, Yuan et al., 2008). GPR, also known as the kriging model under a slightly different formulation, has been accepted as a powerful modelling tool in various fields, in particular in process systems engineering (Ge and Song, 2010, Grancharova et al., 2008, Likar and Kocijan, 2007). GPR is attractive partly because of its sound theoretical foundation: it can be derived from the perspective of either ANN in the limit of an infinite network, or Bayesian regression (Rasmussen and Williams, 2006). In practice, GPR has been shown to be superior to or comparable with ANN and SVR in terms of prediction accuracy (Hernández et al., 2008, Rasmussen, 1996, Yuan et al., 2008). Therefore, GPR is utilised in this study for empirical modelling.
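To make concrete what a GPR model supplies to the optimiser, the following is a minimal sketch of the standard zero-mean GPR predictive equations (Rasmussen and Williams, 2006). It is an illustration under stated assumptions, not the model fitted in this study: the squared-exponential kernel, the fixed hyperparameters (`length_scale`, `signal_std`, `noise_var`) and all function names are hypothetical, and no hyperparameter estimation is performed. Only the Python standard library is used.

```python
import math

def rbf_kernel(a, b, length_scale=1.0, signal_std=1.0):
    """Squared-exponential covariance between two factor vectors (assumed kernel)."""
    sq_dist = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return signal_std ** 2 * math.exp(-0.5 * sq_dist / length_scale ** 2)

def cholesky(A):
    """Lower-triangular L with L L^T = A, for symmetric positive-definite A."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def forward_solve(L, b):
    """Solve L y = b for lower-triangular L."""
    y = []
    for i in range(len(b)):
        y.append((b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i])
    return y

def backward_solve(L, y):
    """Solve L^T x = y for lower-triangular L."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_predict(X, y, x_new, noise_var=1e-8, length_scale=1.0, signal_std=1.0):
    """Return the predictive mean and variance of a zero-mean GP at x_new."""
    n = len(X)
    # Covariance of the training data, with noise variance on the diagonal.
    K = [[rbf_kernel(X[i], X[j], length_scale, signal_std)
          + (noise_var if i == j else 0.0) for j in range(n)] for i in range(n)]
    L = cholesky(K)
    alpha = backward_solve(L, forward_solve(L, y))        # K^{-1} y
    k_star = [rbf_kernel(xi, x_new, length_scale, signal_std) for xi in X]
    mean = sum(k * a for k, a in zip(k_star, alpha))      # k*^T K^{-1} y
    v = forward_solve(L, k_star)
    prior_var = rbf_kernel(x_new, x_new, length_scale, signal_std)
    var = prior_var - sum(vi * vi for vi in v)            # k** - k*^T K^{-1} k*
    return mean, max(var, 0.0)                            # clip rounding error

# Toy one-factor data (made up for illustration only).
X = [[0.0], [1.0], [2.0]]
y = [0.0, 1.0, 0.5]
mean_near, var_near = gp_predict(X, y, [1.0])   # at a training point
mean_far, var_far = gp_predict(X, y, [10.0])    # far from all data
```

In this toy case the variance is close to zero at a training point and returns to the prior variance far from the data, which is the behaviour the optimisation step relies on when it treats a large variance as a sign of insufficient nearby data.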

When an empirical model has been developed, the usual approach to process optimisation is to find the factor values x^* that give the maximal predicted response, and then conduct a new experiment at x^*. (Throughout this paper, we assume that the objective is to maximise the response variable.) However, this is not an ideal method, since it ignores the predictive uncertainty that quantifies the mismatch between the model prediction and the actual process. In fact, predictive uncertainty, usually expressed in terms of variance, is available in all empirical models through either classical statistical inference (e.g. for polynomial regression, ANN and SVR) or a Bayesian approach (e.g. for Bayesian ANN and GPR). A large predictive variance usually suggests that the experimental data around this point are not sufficient to give a reliable prediction. Hence, a design point that is predicted to give an inferior response with high variance may actually result in an improved process. Therefore, both the predictive mean and the variance must be jointly considered in the optimisation algorithm. In particular, new experiment(s) should be allocated so that either the mean prediction is large, or the prediction uncertainty is large.

In the literature, several methods have been proposed to handle prediction uncertainty when using empirical models for optimisation, including maximisation of prediction bounds (Apley et al., 2006, Tang et al., 2010, Yuan et al., 2008), minimisation of information free energy (Lin and Jang, 1998, Chen et al., 1998), maximisation of relative information gain (Coleman and Block, 2007), and maximisation of expected improvement (Jones et al., 1998, Jones, 2001). The basic rationale of these methods is to combine the prediction mean and uncertainty in the optimisation algorithm by using a user-determined weight, either explicitly or implicitly (this will be discussed subsequently). Clearly, the appropriateness of the selected weight needs to be carefully examined to ensure effective optimisation.
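The role of the weight can be illustrated with two of the criteria named above. The sketch below uses the common textbook forms of a prediction bound (mean plus `k` standard deviations, where `k` is an explicit user-chosen weight) and of expected improvement for a maximisation problem with a Gaussian predictive distribution. These forms and the function names are assumptions for illustration and may differ in detail from the formulations in the cited works; the candidate values are made up.

```python
import math

def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def normal_cdf(z):
    """Standard normal cumulative distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prediction_bound(mean, std, k=2.0):
    """Upper prediction bound; k is the explicit weight on the uncertainty."""
    return mean + k * std

def expected_improvement(mean, std, best_so_far):
    """Expected amount by which a Gaussian prediction exceeds best_so_far."""
    if std <= 0.0:
        return max(mean - best_so_far, 0.0)
    z = (mean - best_so_far) / std
    return (mean - best_so_far) * normal_cdf(z) + std * normal_pdf(z)

# Two hypothetical candidates as (predicted mean, predicted std).
candidate_a = (0.30, 0.01)   # high mean, low uncertainty
candidate_b = (0.25, 0.05)   # lower mean, higher uncertainty

for k in (0.5, 2.0):
    bound_a = prediction_bound(*candidate_a, k=k)
    bound_b = prediction_bound(*candidate_b, k=k)
    preferred = "A" if bound_a > bound_b else "B"
    print(f"k={k}: bound A={bound_a:.3f}, bound B={bound_b:.3f}, prefer {preferred}")
```

With these numbers the preferred candidate switches from A to B as `k` increases from 0.5 to 2, so the next experiment depends on a weight the user must choose. Expected improvement has no visible weight, but the trade-off between its two terms is fixed by the formula itself.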

This paper proposes an alternative approach to RSM in the presence of model uncertainty. The idea is to cast the problem into a multi-objective optimisation framework (Deb, 2001) that seeks to maximise both the prediction mean and the prediction variance simultaneously. Through this formulation, Pareto solutions can be identified using a standard multi-objective genetic algorithm; the non-dominated sorting genetic algorithm II (NSGA-II) (Deb et al., 2002) is adopted in this study. It has been well recognised that seeking the entire Pareto set gives a more complete picture of multi-objective problems than using a fixed weighting strategy (Deb, 2001). In addition, in the face of limited experimental resources, the identified Pareto points are clustered into a few groups, and only the points closest to the cluster centres are selected for experimentation in the next iteration. We further suggest visualising the clustered Pareto points and their predictive mean and uncertainty to aid decision-making by the experimenters. Compared with a fully automatic algorithm, this “interactive” approach may receive wider acceptance when used for investigating real processes, because it involves the active and subjective decision of the experimenter, and this human intervention brings in domain knowledge that is often difficult to incorporate properly in the modelling framework.
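The two steps described above, identifying Pareto points and picking one representative per cluster, can be sketched as follows. This is not NSGA-II: for brevity the sketch applies a brute-force non-dominated filter to a fixed list of candidates, whereas NSGA-II evolves the candidates themselves. The plain k-means routine, the choice to cluster in the (mean, uncertainty) plane, and the candidate values are all assumptions for illustration; this excerpt does not state which clustering method or space the study uses.

```python
import random

def dominates(a, b):
    """True if a is no worse than b in every objective and better in one (maximisation)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def pareto_front(points):
    """Indices of the points that no other point dominates."""
    return [i for i, p in enumerate(points)
            if not any(dominates(q, p) for j, q in enumerate(points) if j != i)]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid(group):
    return tuple(sum(coords) / len(group) for coords in zip(*group))

def kmeans(points, k, iters=50, seed=0):
    """Plain k-means; returns the k cluster centres."""
    centres = random.Random(seed).sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centres[c]))
            groups[nearest].append(p)
        # Keep the old centre if a cluster happens to be empty.
        centres = [centroid(g) if g else centres[c] for c, g in enumerate(groups)]
    return centres

def closest_to_centres(points, centres):
    """Index of the point nearest to each cluster centre."""
    return [min(range(len(points)), key=lambda i: sq_dist(points[i], c))
            for c in centres]

# Hypothetical candidates as (predicted mean, predicted std); both are maximised.
candidates = [(0.30, 0.01), (0.28, 0.03), (0.25, 0.05),
              (0.20, 0.04), (0.27, 0.02), (0.10, 0.06)]

front = pareto_front(candidates)                 # non-dominated candidates
front_points = [candidates[i] for i in front]
centres = kmeans(front_points, k=2)
chosen = [front[i] for i in closest_to_centres(front_points, centres)]
print("Pareto indices:", front)
print("Selected for the next iteration:", sorted(chosen))
```

Selecting the actual point nearest to each centre, rather than the centre itself, guarantees that every proposed experiment is a genuine Pareto point. In practice the objectives would usually be scaled before clustering so that neither dominates the distance calculation.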

The proposed algorithm will be validated through maximising the conversion rate of a catalytic reaction process for the epoxidation of cis-cyclooctene. Cyclooctene oxide is an important intermediate used in the synthesis of various fine chemicals and pharmaceuticals. Recently, cobalt ion-exchanged faujasite zeolite (Co²⁺–NaX) has been reported as an efficient heterogeneous catalyst for several epoxidation processes (Sebastian et al., 2006, Tang et al., 2010, Yan et al., 2011b), and it is being tested for cis-cyclooctene epoxidation in our laboratory. Hence, the current work serves a dual purpose: to propose a novel solution to RSM in the presence of model uncertainty, and to demonstrate its application to an important catalytic reaction.

Experimental

In this study, we are interested in maximising the molar conversion rate of cis-cyclooctene during its epoxidation with TBHP (tert-butyl hydroperoxide) over a Co²⁺–NaX catalyst, which serves as a testbed to validate the proposed RSM technique. Five process factors are considered: reaction temperature, initial cis-cyclooctene concentration, the molar ratio of TBHP/cis-cyclooctene, stirring rate and reaction time. The ranges of these factors are listed in Table 1.

Sodium form zeolite X (NaX) was

Process modelling and optimisation with prediction uncertainty

As with general RSM, the proposed method operates iteratively and is summarised in Fig. 1. The first step is to design initial experiments to obtain the data, which are subsequently used to develop a GPR model. Then, a model-based optimisation algorithm is used to identify promising point(s) that, when tested experimentally in the next iteration, may give improved process performance. As discussed in Section 1, an efficient optimisation method needs to consider both prediction

Results and discussion

This section applies the proposed RSM framework to maximising the conversion rate of cis-cyclooctene during its catalytic epoxidation. In the initial iteration, the knowledge about the process is relatively limited, and the HSS algorithm is used to obtain 10 design points across the full ranges of the five factors for experiments. The designs and corresponding conversion rates are given in Table 2. The conversion rates vary between 10% and 30%, and the best conversion obtained is 27.8%. The

Concluding remarks

This paper proposes an alternative approach to RSM in which the prediction uncertainty is quantified and accounted for. We have shown that the available methods assign weights to the prediction mean and uncertainty in one way or another, whilst the proposed method attempts to locate the Pareto points for this intrinsically multi-objective optimisation problem. For real experiments, we suggest clustering the Pareto points and presenting them graphically to aid the decision as to which points will be

Acknowledgements

Financial support from Singapore AcRF Tier 1 Grant (RG 19/09) is acknowledged. Woo Ren Ong participated in the catalytic epoxidation experiments as a partial requirement of his final year project.

References (41)

Chen T. et al. (2011). Interpretation of non-linear empirical data-based process models using global sensitivity analysis. Chemometrics and Intelligent Laboratory Systems.
Dutta J.R. et al. (2004). Optimization of culture parameters for extracellular protease production from a newly isolated Pseudomonas sp. using response surface and artificial neural network models. Process Biochemistry.
Grancharova A. et al. (2008). Explicit stochastic predictive control of combustion plants based on Gaussian process models. Automatica.
Likar B. et al. (2007). Predictive control of a gas–liquid separation plant based on a Gaussian process model. Computers and Chemical Engineering.
Martens H. et al. (1998). Validation and verification of regression in small data sets. Chemometrics and Intelligent Laboratory Systems.
Palmer K. et al. (2002). Metamodeling approach to optimization of steady-state flowsheet simulations model generation. Chemical Engineering Research and Design.
Sebastian J. et al. (2006). Effect of alkali and alkaline earth metal ions on the catalytic epoxidation of styrene with molecular oxygen using cobalt(II)-exchanged zeolite X. Journal of Catalysis.
Serna P. et al. (2008). Combining high-throughput experimentation, advanced data modeling and fundamental knowledge to develop catalysts for the epoxidation of large olefins and fatty esters. Journal of Catalysis.
Shao P. et al. (2007). Optimization of molecular distillation for recovery of tocopherol from rapeseed oil deodorizer distillate using response surface and artificial neural network models. Food and Bioproducts Processing.
Tang Q. et al. (2010). Response surface methodology using Gaussian processes: towards optimizing the trans-stilbene epoxidation over Co²⁺–NaX catalysts. Chemical Engineering Journal.
Yan W. et al. (2011). Development of high performance catalysts for CO oxidation using data-based modeling. Catalysis Today.
Yan W. et al. (2011). Bayesian migration of Gaussian process regression for rapid process modeling and optimization. Chemical Engineering Journal.
Yuan J. et al. (2008). Reliable multi-objective optimization of high-speed WEDM process based on Gaussian process regression. International Journal of Machine Tools and Manufacture.
Agatonovic-Kustrin S. et al. (1998). Application of neural networks for response surface modeling in HPLC optimization. Analytica Chimica Acta.
Apley D. et al. (2006). Understanding the effects of model uncertainty in robust design with computer experiments. Journal of Mechanical Design.
Baumes L. et al. (2004). Using artificial neural networks to boost high-throughput discovery in heterogeneous catalysis. QSAR & Combinatorial Science.
Box G.E.P. et al. (1987). Empirical Model Building and Response Surfaces.
Chang J. et al. (2004). Product and process development via sequential pseudo-uniform design. Industrial and Engineering Chemistry Research.
Chen J. et al. (1998). Product and process development using artificial neural-network model and information analysis. AIChE Journal.
Coleman M. et al. (2007). Nonlinear experimental design using Bayesian regularized neural networks. AIChE Journal.

