On-line monitoring of batch processes using generalized additive kernel principal component analysis

doi:10.1016/j.jprocont.2015.02.007

Journal of Process Control

Volume 28, April 2015, Pages 56-72

https://doi.org/10.1016/j.jprocont.2015.02.007

Highlights

• Generalized additive kernel PCA is introduced for on-line batch process monitoring.
• GAKPCA inherits the good properties of multiway PCA for on-line monitoring.
• Different unfolding approaches are closely related based on their kernel matrices.
• Its connection with correntropy shows robustness when the Gaussian kernel is used.

Abstract

By analyzing the special structure of the three-way array and generalizing the concept of additive kernels, this paper proposes the generalized additive kernel principal component analysis (GAKPCA) method for on-line monitoring of batch processes. The proposed method is a special nonlinear principal component analysis (PCA) method that can handle nonlinear relationships between different monitoring variables and/or time intervals. It inherits the good properties of the traditional multiway PCA (MPCA) method for on-line monitoring and solves some problems of the traditional multiway kernel PCA (MKPCA) method. For example, based on the decomposition of batch samples in the feature space, the total squared prediction error (SPE) statistic of an entire batch can be divided into K components corresponding to the K time intervals, and the score vectors can be estimated directly on-line by the least squares approach without filling in the unknown observations. As a special case, when the Gaussian kernel is used as the kernel function at each time interval, the proposed method is connected with the concept of correntropy, which can make the method robust. Experimental results on a fed-batch penicillin fermentation process demonstrate the validity of the proposed GAKPCA-based on-line monitoring method.

Introduction

Batch process monitoring is very important for ensuring operational safety and high quality in the biochemical, polymer, pharmaceutical, semiconductor and food industries. Because faults must be detected accurately and quickly so that normal operation can be recovered as soon as possible, on-line batch process monitoring has received much attention. Unlike traditional continuous processes, batch processes treat each batch of finite duration as a unit. The collected dataset of a batch process is often organized as a three-way array $\underline{X} (I \times J \times K)$, which contains the values of J monitoring variables at K time intervals in I batches. Many researchers have proposed ways of processing the three-way array, from different viewpoints, so that it can be used with traditional multivariate statistical process control (MSPC) methods such as principal component analysis (PCA), partial least squares (PLS) and independent component analysis (ICA) [1], [2], [3], [4], [5], [6], [7], [8], [9]. The commonly used modeling approaches summarized by Camacho et al. [8] mainly include: (i) the single-model approach based on batch-wise unfolding (BWU), variable-wise unfolding (VWU) or batch dynamic unfolding (BDU); (ii) the K-models approach based on local, evolving, uniformly weighted moving window or exponentially weighted evolving window models; (iii) the hierarchical or multi-block approach; and (iv) the multi-phase approach.

In [8], Camacho et al. compared these approaches theoretically by analyzing their covariance structures, and in [10] they further compared the modeling performance of the corresponding PLS models on actual industrial datasets. Generally speaking, VWU treats each J-dimensional sampling vector as a sample, so it considers only the correlation between different variables and cannot capture the dynamics of a process. In contrast, BWU treats each batch as a KJ-dimensional vector and gives equal weight to the different monitoring variables and time intervals in a batch run, so its model captures the correlation between variables and the dynamic information simultaneously. As a general unfolding mechanism, BDU uses the number of lagged measurement vectors (LMVs) as an additional degree of freedom in the model to balance the ability to capture dynamic structure against the parsimony of the model. According to the results in [10], BWU and BDU often perform better than VWU, while there is no statistically significant difference between BDU and BWU. Furthermore, the computational cost grows as the number of models increases, e.g. with the K-models approach. Therefore, the simple and direct BWU approach, which also gives stable and satisfactory monitoring performance, has been widely applied to batch process monitoring problems [1], [2], [6], [11], [12], [13].
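The two basic unfoldings described above can be sketched with NumPy. This is a minimal illustration, assuming the three-way array is stored with axes ordered (I, J, K) and that BWU columns are ordered time interval by time interval; the function names and the random data are hypothetical and not taken from the paper.

```python
import numpy as np

def batch_wise_unfold(X):
    """Unfold an (I, J, K) array into an (I, K*J) matrix: one row per batch.

    Columns are ordered time-major, i.e. [x_{i,1}^T, ..., x_{i,K}^T].
    """
    I, J, K = X.shape
    return X.transpose(0, 2, 1).reshape(I, K * J)

def variable_wise_unfold(X):
    """Unfold an (I, J, K) array into an (I*K, J) matrix: one row per sampling vector."""
    I, J, K = X.shape
    return X.transpose(0, 2, 1).reshape(I * K, J)

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3, 4))    # I=5 batches, J=3 variables, K=4 time intervals
X_bwu = batch_wise_unfold(X)      # shape (5, 12): each batch is one KJ-dimensional sample
X_vwu = variable_wise_unfold(X)   # shape (20, 3): each sampling vector is one sample
```

With this layout, entry `X[i, j, k]` lands in column `k * J + j` of row `i` under BWU, and in column `j` of row `i * K + k` under VWU.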

Among the various monitoring methods, multiway principal component analysis (MPCA) [1], [11] based on BWU is one of the most important on-line monitoring methods. On the one hand, like BWU as discussed above, MPCA takes into account both the correlation between different variables and the dynamics of the process. On the other hand, it fully exploits the structure of the three-way array and of the loading vectors to design on-line monitoring strategies that are easy to implement [11]. For example, because each batch sample can be decomposed along the sampling time, the total squared prediction error (SPE) statistic of a whole batch can be decomposed into the sum of the local SPE values at the K time intervals. To estimate the score vectors on-line, besides filling the unknown observations with zeros or with the current sampling values (assuming that all the trajectories have been centered and scaled in advance), MPCA can calculate the scores directly by projecting the already known observations into a reduced space using the least squares approach. These characteristics give MPCA concise on-line monitoring strategies and satisfactory monitoring performance in practice.
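The two MPCA properties mentioned above, the split of the total SPE into K local values and the least squares estimate of the scores from the observations known so far, can be sketched as follows. This is an illustrative sketch on synthetic data, not the procedure of [11]: the sizes `I`, `J`, `K`, the number of retained components `R`, the current time index `k`, and the omission of scaling are all assumptions made here for brevity.

```python
import numpy as np

rng = np.random.default_rng(1)
I, J, K, R = 30, 3, 6, 2          # batches, variables, time intervals, retained PCs (illustrative)

# Synthetic batch-wise unfolded data, centered column-wise (scaling omitted for brevity).
X = rng.normal(size=(I, K * J))
X = X - X.mean(axis=0)

# PCA loadings P (KJ x R) taken from the SVD of the unfolded matrix.
_, _, Vt = np.linalg.svd(X, full_matrices=False)
P = Vt[:R].T

# Total SPE of one complete batch and its split into K local contributions.
x = X[0]
e = x - P @ (P.T @ x)                            # residual in the KJ-dimensional space
spe_total = e @ e
spe_local = (e.reshape(K, J) ** 2).sum(axis=1)   # one value per time interval

# On-line score estimate at time k: least squares on the first k*J known entries only.
k = 4
P_known = P[: k * J]
t_hat, *_ = np.linalg.lstsq(P_known, x[: k * J], rcond=None)
```

Because the residual vector is just a concatenation of K blocks of length J, `spe_local.sum()` equals `spe_total`. When the whole batch is known (k = K), the least squares estimate reduces to the ordinary projection `P.T @ x`, since the columns of `P` are orthonormal.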

Nevertheless, MPCA assumes that the relationships between different variables and time intervals are purely linear, which does not hold in some situations. In some complicated industrial processes, nonlinear relationships may exist between some variables and/or time intervals, so nonlinear monitoring methods are needed. So far there are generally two categories of nonlinear monitoring methods [14]. One introduces the kernel trick into the above linear monitoring methods [15], [16], [17]. The other uses the idea of one-class classification from machine learning, such as Gaussian mixture models (GMM) [18], [19], k-nearest neighbor (k-NN) [20], [21], one-class support vector machine (OCSVM) [22], and support vector data description (SVDD) [23], [24] methods. The former is more direct and highly extensible: the linear monitoring methods mentioned above can all be extended to their kernel counterparts. The nonlinear version of MPCA, called multiway kernel principal component analysis (MKPCA), was first studied by Lee et al. [15]. It maps each batch (using BWU) from the original space into the feature space via a nonlinear transformation. Based on the kernel trick, MKPCA performs the eigen-decomposition of the kernel matrix (using polynomial, Gaussian, sigmoid, or other kernels) to compute the principal components in the feature space. Compared with other nonlinear methods, KPCA is easy to implement, since it requires only linear algebra and no nonlinear optimization [15]. Like MPCA, MKPCA also constructs two statistics to monitor the systematic part (i.e. the principal component subspace) and the noisy part (i.e. the residual subspace) respectively. By filling the unknown observations with zeros or the current sampling values, MKPCA can be used for on-line monitoring of batch processes.
However, the above kernel functions contain interaction terms between different time intervals, and thus mix the data information from different time intervals in the feature space. This causes problems for MKPCA in on-line monitoring applications. For example, the total SPE statistic in MKPCA often cannot be decomposed into K components associated independently with the K time intervals, and the score vectors cannot be estimated by projecting the already known observations into a reduced feature space. More details can be found in Section 2. Therefore, MKPCA does not inherit all the good properties of MPCA for on-line batch process monitoring.
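A short worked identity may make this mixing concrete. It is a sketch under the assumption that each batch is the stacked vector $x_{i} = [x_{i,1}^{T}, \dots, x_{i,K}^{T}]^{T}$ of its K sampling vectors; the homogeneous degree-2 polynomial kernel and the kernel width $\sigma$ are illustrative choices introduced here, and this is not necessarily the argument given in Section 2. For two batches $i$ and $i'$, the polynomial kernel on the unfolded vectors expands as

$$
(x_{i}^{T} x_{i'})^{2} = \Big( \sum_{k=1}^{K} x_{i,k}^{T} x_{i',k} \Big)^{2} = \sum_{k=1}^{K} (x_{i,k}^{T} x_{i',k})^{2} + \sum_{k \neq l} (x_{i,k}^{T} x_{i',k}) (x_{i,l}^{T} x_{i',l}),
$$

where the second sum couples time intervals $k$ and $l$. Similarly, the Gaussian kernel on the unfolded vectors factorizes into a product over time intervals rather than a sum:

$$
\exp\Big( -\frac{\| x_{i} - x_{i'} \|^{2}}{2\sigma^{2}} \Big) = \prod_{k=1}^{K} \exp\Big( -\frac{\| x_{i,k} - x_{i',k} \|^{2}}{2\sigma^{2}} \Big).
$$

In both cases the kernel value cannot be written as a sum of terms that each depend on a single time interval, which is consistent with the difficulty of splitting the SPE statistic into K independent components.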

To solve these problems, this paper explores a novel nonlinear PCA method for on-line monitoring of batch processes that exploits the structural characteristics of kernel matrices and the concept of generalized additive kernels. The method is called generalized additive kernel PCA (GAKPCA) for short. It can handle possible nonlinear relationships between different variables and/or time intervals while retaining the good properties of MPCA for on-line monitoring, such as the division of the total SPE statistic and the direct estimation of the score vectors. Based on the generalized additive kernels, the kernel matrices corresponding to BWU, VWU and BDU show particular connections, which complement the results in [8], where their covariance matrices are analyzed. In particular, when the Gaussian kernel is chosen as the kernel function at each time interval, the entire kernel function between two batches can be regarded as a generalized similarity measure between two J-dimensional random variables, which corresponds to the concept of correntropy defined in information theoretic learning [25]. The property of the correntropy induced metric (CIM) [25] makes our method robust if the correntropy is used throughout the modeling and monitoring process.

The rest of this paper is organized as follows. Section 2 analyzes the differences between MKPCA and MPCA for on-line batch process monitoring, and Section 3 then elaborates the rationale of the proposed GAKPCA method. Section 3.4 discusses the relationship between our method and the generalized additive kernels, and compares different unfolding approaches by analyzing their kernel matrices. After that, Section 4 further compares the special cases of GAKPCA and MKPCA that use the Gaussian kernel, and considers the connection between the corresponding GAKPCA method and the correntropy, which leads to the underlying robustness. In Section 5, a fed-batch fermentation process is used to test the actual on-line monitoring performance of the proposed method. Finally, some conclusions are drawn.

Section snippets

Preliminaries

In general, after trajectory synchronization and alignment [26], [27], [28], [29], the batch dataset often can be organized as a three-way array denoted by $\underline{X} (I \times J \times K)$, where I is the number of batches, J is the number of variables, and K is the number of time intervals in a batch. The kth observation of variable j in batch i is denoted by $x_{i,j,k}$ (i = 1, …, I, j = 1, …, J, and k = 1, …, K). The sampling vector at time k in batch i is denoted by $x_{i,k} = [x_{i,1,k}, \dots, x_{i,J,k}]^{T}$, the whole sampling vector of

Nonlinear PCA with special structure

Unlike the traditional MKPCA method, here we map each sampling vector $x_{i,k}$ into a feature space. The sampling vector in the feature space is denoted by $\phi(x_{i,k})$, which is assumed to be an M-dimensional column vector (M can be infinite). Then batch i in the feature space can be denoted by $\Phi(x_{i}) = [\phi(x_{i,1})^{T}, \dots, \phi(x_{i,K})^{T}]^{T}$, which is composed of K sampling components and hence has the dimension of KM. Here, we use two different symbols $\phi(\cdot)$ and $\Phi(\cdot)$ to denote the nonlinear transformation functions of

GAKPCA using the Gaussian kernel

Among various kernel functions, this section takes the Gaussian kernel as a special example to analyze the difference between GAKPCA and MKPCA on handling the nonlinear correlation. Based on its connection with the correntropy, GAKPCA presents some other interesting properties.
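The difference can be illustrated numerically. The sketch below assumes that the kernel between two batches is the sum over the K time intervals of a Gaussian kernel between the corresponding J-dimensional sampling vectors, and compares it with a single Gaussian kernel applied to the whole unfolded batch. The function names, the kernel width `sigma`, and the synthetic data are hypothetical. Dividing the sum by K gives a sample mean of kernel values, which has the form of the usual sample estimator of correntropy up to the normalizing constant of the Gaussian kernel (omitted here).

```python
import numpy as np

def gaussian_kernel(u, v, sigma):
    """Unnormalized Gaussian kernel between two vectors of equal length."""
    d = u - v
    return np.exp(-(d @ d) / (2.0 * sigma ** 2))

def additive_gaussian_kernel(A, B, sigma):
    """Sum over K time intervals of Gaussian kernels; A and B are (K, J) arrays."""
    sq = ((A - B) ** 2).sum(axis=1)           # squared distance at each time interval
    return np.exp(-sq / (2.0 * sigma ** 2)).sum()

def batchwise_gaussian_kernel(A, B, sigma):
    """Gaussian kernel applied to the whole KJ-dimensional unfolded batches."""
    return gaussian_kernel(A.ravel(), B.ravel(), sigma)

rng = np.random.default_rng(2)
K, J, sigma = 8, 3, 1.5                       # illustrative sizes and kernel width
A = rng.normal(size=(K, J))
B = A + 0.1 * rng.normal(size=(K, J))         # a batch close to A

B_out = B.copy()
B_out[3] += 50.0                              # gross outlier at a single time interval

k_add = additive_gaussian_kernel(A, B, sigma)
k_add_out = additive_gaussian_kernel(A, B_out, sigma)
k_bwu = batchwise_gaussian_kernel(A, B, sigma)
k_bwu_out = batchwise_gaussian_kernel(A, B_out, sigma)
corr = k_add / K                              # sample-mean form of the additive kernel
```

Each term of the additive kernel lies between 0 and 1, so a gross outlier at one time interval can lower `k_add` by at most 1, whereas the same outlier drives the batch-wise Gaussian kernel `k_bwu_out` essentially to zero. This bounded influence of a single time interval is one informal way to read the robustness associated with correntropy.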

Case study

A fed-batch penicillin fermentation process is considered as the case study to evaluate the on-line monitoring performance of the proposed method, and a standard modular simulator (Pensim V.2.0) developed by Birol et al. [42] is used to generate the batch dataset. The penicillin fermentation process consists of a main fermenter and accessory devices for tasks such as substrate feed, pH control and temperature control. It has been widely used in many papers [15], [18], [43] for its friendly interface,

Conclusions

In this paper, a special nonlinear PCA method called generalized additive kernel PCA (GAKPCA) has been proposed for on-line batch process monitoring based on the generalized additive kernels (sequential additive). Both the theoretical analysis and the experimental results show that the proposed method achieves satisfactory and robust on-line monitoring performance. Fig. 4 summarizes the relationships between the GAKPCA method and some other methods such as MPCA, MKPCA and correntropy. The main

Acknowledgements

The authors would like to thank the anonymous referees for their helpful comments, which helped to improve this paper.

References (49)

P. Nomikos et al., Multi-way partial least squares in monitoring batch processes, Chemom. Intell. Lab. Syst. (1995)
S. Wold et al., Modelling and diagnostics of batch processes and analogous kinetic experiments, Chemom. Intell. Lab. Syst. (1998)
E.N.M. van Sprang et al., Critical evaluation of approaches for on-line batch process monitoring, Chem. Eng. Sci. (2002)
C.K. Yoo et al., On-line monitoring of batch processes using multiway independent component analysis, Chemom. Intell. Lab. Syst. (2004)
H.J. Ramaker et al., Fault detection properties of global, local and time evolving models for batch process monitoring, J. Process Contr. (2005)
J. Camacho et al., The best approaches in the on-line monitoring of batch processes based on PCA: does the modelling structure matter?, Anal. Chim. Acta (2009)
K.L. Hu et al., Multivariate statistical process control based on multiway locality preserving projections, J. Process Contr. (2008)
J.M. Lee et al., Fault detection of batch processes using multiway kernel principal component analysis, Comput. Chem. Eng. (2004)
Y.W. Zhang et al., On-line batch process monitoring using hierarchical kernel partial least squares, Chem. Eng. Res. Des. (2011)
T. Chen et al., On-line multivariate statistical monitoring of batch processes using Gaussian mixture model, Comput. Chem. Eng. (2010)
S. Mahadevan et al., Fault detection and diagnosis in process data using one-class support vector machines, J. Process Contr. (2009)
Z.Q. Ge et al., Batch process monitoring based on support vector data description method, J. Process Contr. (2011)
Z.Q. Ge et al., Bagging support vector data description model for batch process monitoring, J. Process Contr. (2013)
J.M. González-Martínez et al., Real-time synchronization of batch trajectories for on-line multivariate statistical process control using dynamic time warping, Chemom. Intell. Lab. Syst. (2011)
S. García-Muñoz et al., Experiences in batch trajectory alignment for pharmaceutical process improvement through multivariate latent variable modelling, J. Process Contr. (2011)
S. Rännar et al., Adaptive batch monitoring using hierarchical PCA, Chemom. Intell. Lab. Syst. (1998)
R. He et al., Principal component analysis based on non-parametric maximum entropy, Neurocomputing (2010)
J.C. Munoz et al., Removal of the effects of outliers in batch process data through maximum correntropy estimator, Chemom. Intell. Lab. Syst. (2012)
J.H. Chen et al., Correntropy estimator for data reconciliation, Chem. Eng. Sci. (2013)
G. Birol et al., A modular simulation package for fed-batch fermentation: penicillin production, Comput. Chem. Eng. (2002)
C.R. Alvarez et al., Batch process monitoring in the original measurement's space, J. Process Contr. (2010)
S.J. Qin et al., Determining the number of principal components for best reconstruction, J. Process Contr. (2000)
H. Hoffmann, Kernel PCA for novelty detection, Pattern Recogn. (2007)
M. Yao et al., Batch process monitoring based on functional data analysis and support vector data description, J. Process Contr. (2014)

