Skip to main content

2018 | Buch

Data-Driven Prediction for Industrial Processes and Their Applications

insite
SUCHEN

Über dieses Buch

This book presents modeling methods and algorithms for data-driven prediction and forecasting of practical industrial process by employing machine learning and statistics methodologies. Related case studies, especially on energy systems in the steel industry are also addressed and analyzed. The case studies in this volume are entirely rooted in both classical data-driven prediction problems and industrial practice requirements. Detailed figures and tables demonstrate the effectiveness and generalization of the methods addressed, and the classifications of the addressed prediction problems come from practical industrial demands, rather than from academic categories. As such, readers will learn the corresponding approaches for resolving their industrial technical problems. Although the contents of this book and its case studies come from the steel industry, these techniques can be also used for other process industries. This book appeals to students, researchers, and professionals within the machine learning and data analysis and mining communities.

Inhaltsverzeichnis

Frontmatter
Chapter 1. Introduction
Abstract
This chapter gives an overall introduction to this book. First, we discuss the importance of the prediction for industrial process. Then, we divide the data-driven prediction methodology discussed in this book into a number of categories. Specifically, there are three categories, i.e., data feature-based methods, time scale-based ones, and prediction reliability-based ones. Besides, considering the characteristics of prediction modeling and industrial demands, this book introduces some commonly used prediction techniques, including the time series-based methods, the factor-based methods, the prediction intervals (PIs) construction methods, and the granular-based long-term prediction methods.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 2. Data Preprocessing Techniques
Abstract
It is hard for raw industrial data accumulated by commonly implemented supervisory control and data acquisition (SCADA) system on-site to be directly employed to construct a prediction model, given that such data are always mixed with high level noise, missing points, and outliers due to the possible real-time database malfunction, data transformation, or maintenance. Thereby, the data preprocessing techniques have to be implemented, which usually contain anomaly data detection, data imputation, and data de-noising techniques. As for the issue of outliers, in this chapter, we introduce the anomaly detection methods based on fuzzy C means (FCM), K-nearest-neighbor (KNN), and dynamic time warping (DTW) algorithms. To tackle the missing data points problem, a series of data imputation methods are also described. After introducing the generic regression filling and expectation maximum methods, we supplement a varied window similarity measure method, the segmented shape-representation-based method, and the non-equal-length granules correlation method for industrial data imputation. With respect to the high level noise embodied in raw data, we then give an introduction to the well-known empirical mode decomposition (EMD) method. To verify the effectiveness of these methods, this chapter also provides a number of industrial case studies.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 3. Industrial Time Series Prediction
Abstract
Time series prediction is a significant way for forecasting the variables involved in industrial process, which usually identifies the latent rules hidden behind the time series data of the variables by means of auto-regression. In this chapter we introduce the phase space reconstruction technique, which aims to construct the training dataset for modeling, and then a series of data-driven machine learning methods are provided for time series prediction, where some well-known artificial neural networks (ANNs) models are introduced, and a dual estimation-based echo state network (ESN) model is particularly proposed to simultaneously estimate the uncertainties of the output weights and the internal states by using a nonlinear Kalman-filter and a linear one for noisy industrial time series. In addition, the kernel based methods, including Gaussian processes (GP) model and support vector machine (SVM) model, are also presented in this chapter. Specifically, an improved GP-based ESN model is proposed for time series prediction, in which the output weights in ESN modeled by using GP avoids the ill-conditioned phenomenon associated with the generic ESN version. A number of case studies related to industrial energy system are provided to validate the performance of these methods.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 4. Factor-Based Industrial Process Prediction
Abstract
This chapter gives the factors-based prediction methods for industrial processes. Different from the mentioned time series-based prediction, this kind of approaches construct a forecasting model by treating the process variables (not the output or target variables) called “factors” as the model inputs, rather than the auto-regression mode used in time series version. To select the factors from lots of candidates, this chapter firstly introduces some commonly used feature selection approaches such as gray correlation method and convolution-based methods. As for the single-output model, this chapter introduces NNs-based model, Takagi-Sugeno (T-S) fuzzy model, and SVM. In particular, a multi-kernel setting of a LSSVM model can perform better explanatary ability for learning a nonlinear model and fit the regression problem more effectively. Besides, this chapter also introduces a multi-output LSSVM model, which considers the single fitting error of each output and the combined error as well, and aims at the issues of multiple interactional outputs in industrial system. This chapter also provides some case studies on industrial energy system for performance verification.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 5. Industrial Prediction Intervals with Data Uncertainty
Abstract
Prediction intervals (PIs) construction is a comprehensive prediction technique that provides not only the point estimates of the industrial variables, but also the reliability of the prediction results indicated by an interval. Reviewing the conventional PIs construction methods (e.g., delta method, mean and variance-based estimation method, Bayesian method, and bootstrap technique), we provide some recently developed approaches in this chapter. Here, a bootstrapping-based ESN ensemble (BESNE) model is specially proposed to produce reliable PIs for industrial time series, in which a simultaneous training method based on Bayesian linear regression is developed. Besides, to cope with the error accumulation caused by the traditional iterative mode of time series prediction, a non-iterative granular ESN is also reported for PIs construction, where the network connections are represented by the interval-valued information granules. In addition, we present a mixed Gaussian kernel-based regression model to construct PIs, in which a gradient descent algorithm is derived to optimize the hyper-parameters of the mixed Gaussian kernel. In order to tackle the incomplete testing input problem, a kernel-based high order dynamic Bayesian network (DBN) model for industrial time series is then proposed, which directly deals with the missing points involved in the inputs. Finally, we provide some case studies to verify the effectiveness of these approaches.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 6. Granular Computing-Based Long-Term Prediction Intervals
Abstract
In industrial practice, long-term prediction for process variables is fairly significant for the process industry, which is capable of providing the guidance for equipment control, operational scheduling, and decision-making. This chapter firstly introduces the basic principles of granularity partition, and a long-term prediction model for time series and factor-based prediction are developed in this chapter. In terms of time series prediction, the unequal-length temporal granules are constructed by exploiting dynamic time warping (DTW) technique, and a granular-computing (GrC)-based hybrid collaborative fuzzy clustering (HCFC) algorithm is proposed for the mentioned factor-based prediction problem. Besides, in this chapter, the long-term prediction approach is also combined with the PIs construction in order to provide the prediction reliability in the context of long-term time series task. Similarly, the PIs construction on multi-dimension problem is also introduced by employing the structure of the HCFC algorithm. To verify the effectiveness of these approaches, this chapter provides some experimental analysis on industrial data coming from an energy data center of a steel plant.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 7. Parameter Estimation and Optimization
Abstract
The selection of parameters or hyper-parameters gives great impact on the performance of a data-driven model. This chapter introduces some commonly used parameter optimization and estimation methods, such as the gradient-based methods (e.g., gradient descend, Newton method, and conjugate gradient method) and the intelligent optimization ones (e.g., genetic algorithm, differential evolution algorithm, and particle swarm optimization). In particular, in this chapter, the conjugate gradient method is employed to optimize the hyper-parameters in a LSSVM model based on noise estimation, which enable to alleviate the impact of noise on the performance of the LSSVM. As for dynamic models, this chapter introduces nonlinear Kalman-filter methods for parameter estimation. The well-known ones include the extended Kalman-filter, the unscented Kalman-filter, and the cubature Kalman-filter. Here, a dual estimation model based on two Kalman-filters is illustrated, which simultaneously estimates the uncertainties of internal state and the output. Besides, the probabilistic methods for parameter estimation are also introduced, where a Bayesian model, especially a variational inference framework, is elaborated in details. In such a framework, a particular variational relevance vector machine (RVM) model based on automatic relevance determination kernel is introduced, which provides the approximated posterior distributions over the kernel parameters. Finally, we give some case studies by employing a number of industrial data.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 8. Parallel Computing Considerations
Abstract
This chapter discusses the computational cost of machine learning model. To reduce its training time is a requisite of its industrial applications since a production process usually requires real-time responses. The commonly used method to accelerate the training process is to develop a parallel computing framework. In literature, two kinds of popular methods speeding up the training involves the one with a computer equipped with graphics processor unit (GPU) and the one with computer cluster including a number of computers. This chapter firstly introduces the basic ideas of GPU acceleration (e.g., the compute unified device architecture (CUDA) created by NVIDIA™) and the computer cluster framework (e.g., the MapReduce framework), then gives some specified examples of them. When training an EKF-based Elman network, the inversion operation of a Jacobian matrix is the most time-consuming procedure; a parallel computing strategy for such an operation is therefore proposed by using the CUDA-based GPU acceleration. Besides, with regard to the LSSVM modeling, a CUDA-based parallel PSO is then introduced for its hyper-parameters optimization. As for the computer cluster version, we design a parallelized EKF based on ESN by using MapReduce framework for acceleration. At the end, we also present a series of experimental analysis by using the practical energy data in steel industry to validate the performance of the accelerating approaches.
Jun Zhao, Wei Wang, Chunyang Sheng
Chapter 9. Data-Based Prediction for Energy Scheduling of Steel Industry
Abstract
Based on the results of a number of different forecasting modes introduced in the previous chapters, this chapter provides a practical case study related to the optimal scheduling for energy system in steel industry based on the prediction outcomes. As for the by-product gas scheduling problem, a two-stage scheduling method is introduced here. On the prediction stage, the states of the optimized objectives, the consumption of the outsourcing natural gas and oil, the power generation, and the gas holder levels are forecasted by using the previous data-driven learning methods. On the optimal scheduling stage, a rolling optimization procedure is performed by employing the predicted results. More typically, with respect to the scheduling task for the oxygen/nitrogen system in steel industry, a similar two-stage method is also developed, in which a granular-computing (GrC)-based prediction model is firstly established on the stage of a long-term prediction, and the scheduling solution is also optimized later. Furthermore, the results of the scheduling system applications also indicate the effectiveness of the real-time prediction and scheduling optimization.
Jun Zhao, Wei Wang, Chunyang Sheng
Backmatter
Metadaten
Titel
Data-Driven Prediction for Industrial Processes and Their Applications
verfasst von
Prof. Jun Zhao
Prof. Wei Wang
Dr. Chunyang Sheng
Copyright-Jahr
2018
Electronic ISBN
978-3-319-94051-9
Print ISBN
978-3-319-94050-2
DOI
https://doi.org/10.1007/978-3-319-94051-9