Probabilistic load flow method considering large-scale wind power integration

DENG, Xiaoyang; ZHANG, Pei; JIN, Kangmeng; HE, Jinghan; WANG, Xiaojun; WANG, Yuwei

doi:10.1007/s40565-019-0502-0

Probabilistic load flow method considering large-scale wind power integration

Open access
Published: 28 February 2019

Volume 7, pages 813–825, (2019)
Cite this article

Download PDF

You have full access to this open access article

Journal of Modern Power Systems and Clean Energy

Probabilistic load flow method considering large-scale wind power integration

Download PDF

Xiaoyang DENG¹,
Pei ZHANG¹,
Kangmeng JIN¹,
Jinghan HE¹,
Xiaojun WANG¹ &
…
Yuwei WANG²

3596 Accesses
25 Citations
Explore all metrics

Abstract

The increasing penetration of wind power brings great uncertainties into power systems, which poses challenges to system planning and operation. This paper proposes a novel probabilistic load flow (PLF) method based on clustering technique to handle large fluctuations from large-scale wind power integration. The traditional cumulant method (CM) for PLF is based on the linearization of load flow equations around the operating point, therefore resulting in significant errors when input random variables have large fluctuations. In the proposed method, the samples of wind power and loads are first generated by the inverse Nataf transformation and then clustered using an improved K-means algorithm to obtain input variable samples with small variances in each cluster. With such pre-processing, the cumulant method can be applied within each cluster to calculate cumulants of output random variables with improved accuracy. The results obtained in each cluster are combined according to the law of total probability to calculate the final cumulants of output random variables for the whole samples. The proposed method is validated on modified IEEE 9-bus and 118-bus test systems with additional wind farms. Compared with the traditional CM, 2m+1 point estimate method (PEM), Monte Carlo simulation (MCS) and Latin hypercube sampling (LHS) based MCS, the proposed method can achieve a better performance with consideration of both computational efficiency and accuracy.

Modeling multiple-criteria decision making of the electrical grid considering optimal demand management

Article 29 April 2024

Electricity load forecasting: a systematic review

Article Open access 09 September 2020

Electricity generation scheduling of thermal- wind-solar energy systems

Article 03 July 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Load flow study is a vital tool for power system planning and operation. However, there are many uncertainties, which result from changes in load demands, outages of generators and changes of network. Large-scale wind power integration further introduces great uncertainties into power systems. Many researchers have performed researches on applying probabilistic load flow (PLF) methods to handle these uncertainties.

A critical review was provided in [1], where the methods to solve PLF problems were classified into three types, namely simulation methods, approximation methods and analytical methods.

Monte Carlo simulation (MCS) can obtain accurate results after a large number of simulations, which are generally treated as reference results for comparisons. But, MCS is time-consuming. The application of importance sampling [2], Latin hypercube sampling (LHS) [3, 4] and Latin supercube sampling [5] reduced the computational burden of MCS. For handling correlated input random variables, Nataf transformation [6, 7] and copula function [8] were applied together with LHS. References [9,10,11] applied a quasi-Monte Carlo approach to solve PLF problems. That approach was more efficient than MCS.

The point estimate method (PEM), one kind of approximation methods, was widely applied to solve PLF problems. Reference [12] first proposed the 2m PEM to solve PLF. Reference [13] introduced a modified 2m PEM to handle correlated uncertain variables. Compared with 2m PEM, 2m+1 PEM had higher accuracy, but conducted more simulations [14]. For handling correlated uncertain variables, references [15, 16] provided a modified 2m+1 PEM based on Cholesky decomposition. Reference [17] applied 2m+1 PEM to solve probabilistic three-phase load flow for unbalanced electrical systems with wind farms. Reference [18] discussed the performance of five point estimate method (5PEM). Reference [19] proposed another approximation method, unscented transformation method (UTM), which could consider correlations of input variables. Approximation methods are generally more efficient than MCS. However, the accuracy and efficiency are sensitive to the number of input random variables.

Analytical methods do not need to run many times of simulations as Monte Carlo method does. In [20], a first-order second-moment method (FOSMM) was applied to obtain the mean and standard deviation of load flow solutions. The sequence operation methodology [21] is one of the analytical methods. It has a great advantage in terms of efficiency by sequence operation. But the sequence operation needs to meet new operation rules, which limits its application. Cumulant method (CM) is another analytical method to solve PLF and has an excellent performance on computational efficiency. In [22, 23], CM and Gram-Charlier expansion were applied to solve PLF. Reference [24] discussed the properties, advantages and deficiencies of three types of series expansions, namely Gram-Charlier, Edgeworth and Cornish-Fisher expansions. Furthermore, Cholesky decomposition [25] and joint cumulants [24, 26] were utilized to deal with correlations of input variables. Reference [27] applied Gaussian mixture approximation method to handle non-normal correlated random variables. References [28, 29] applied the maximum entropy instead of series expansions to calculate probability density functions (PDFs). It could improve the accuracy of PDFs, but required a relatively more complex process.

CM requires less computational effort than other methods. However, it may produce significant errors when input random variables have large fluctuations. The reason is that the linear relationship between input and output variables is estimated based on the linearization of load power equations at the operating point. When input random variables fluctuate away from the operating point, the relationship between input and output variables may change significantly. Reference [30] studied the error resulting from linear model and showed that the error would increase when the varying ranges of input variables increased. The wind power output can change significantly over time due to fluctuations of wind speeds. To solve PLF considering large-scale wind power integration, reference [31] divided PDFs of wind power into multiple intervals, and incorporated these intervals into the integral formulation of calculating cumulants. This method cannot handle the correlation of different input variables and is computationally complex. Our previous work [32] tried to solve probabilistic optimal power flow (P-OPF) with large fluctuations using the method of combined the traditional K-means clustering technique and CM. However, the traditional K-means performs inefficiently for large-scale systems.

To solve PLF considering large-scale wind power integration, this paper proposes a novel PLF method by combining an improved K-means clustering technique and CM. It tackles the problems that the traditional CM cannot handle input random variables with large fluctuations, and that the traditional K-means is not efficient for large-scale systems. Compared with existing methods, such as the traditional CM, 2m+1 PEM, LHS and MCS, the proposed method can achieve a better performance with consideration of both computational efficiency and accuracy. The proposed method can be used to analyze effects of uncertainties on power systems.

The rest of this paper is organized as follows: Section 2 introduces the CM for PLF formulation. The theoretical framework of the proposed method is described in Section 3. IEEE 9-bus and 118-bus test systems are modified for case studies in Section 4. Finally, conclusions are summarized in Section 5.

2 CM for PLF formulation

CM for PLF is based on the linearization of load flow equations. In the process, a set of equations, which consist of nodal power injections and line power flows, are formulated.

Let X be the vector of nodal power injections, U be the vector consisting of voltage angles at PV and PQ buses and voltage magnitudes at PQ buses, and Z be the vector of power flows in branches. AC power flow equations can be written as:

$$ \left\{ {\begin{array}{l} {\varvec{X} = \varvec{f}\left( \varvec{U} \right)} \\ {\varvec{Z} = \varvec{g}\left( \varvec{U} \right)} \\ \end{array} } \right. $$

(1)

where f(·) and g(·) are the corresponding power injection functions and corresponding line flow functions, respectively.

The linear equations can be obtained by linearizing (1) at the operating point:

$$ \left\{ {\begin{array}{l} {\varvec{U} = \varvec{U}_{0} + \varvec{J}_{0}^{ - 1} \Delta \varvec{X} = \varvec{U}_{0} + \varvec{H}_{0} \Delta \varvec{X}} \\ {\varvec{Z} = \varvec{Z}_{0} + \varvec{G}_{0} \Delta \varvec{U} = \varvec{Z}_{0} + \varvec{L}_{0} \Delta \varvec{X }} \\ \end{array} } \right. $$

(2)

where $ \varvec{U}_{0} $ and $ \varvec{Z}_{0} $ are the values of U and Z at the operating point, respectively; ∆U and ∆X are the vectors of the changes in U and X; $ \varvec{J}_{0}^{ - 1} $ is the inverse of Jacobian matrix at the operating point; $ \varvec{H}_{0} = \varvec{J}_{0}^{ - 1} $; $ \varvec{G}_{0} = \left( {{{\partial \varvec{Z}} \mathord{\left/ {\vphantom {{\partial \varvec{Z}} {\partial \varvec{U}}}} \right. \kern-0pt} {\partial \varvec{U}}}} \right)\left| {_{{\varvec{U} = \varvec{U}_{0} }} } \right. $; $ \varvec{L}_{0} = \varvec{G}_{0} \varvec{J}_{0}^{ - 1} $.

For power systems containing wind farms, fluctuations of both wind generation and load can result in uncertainties. The active wind power output can be obtained as follows [23]:

$$ P_{\text{w}} = \left\{ {\begin{array}{lll} {0 \, } \\ {{{\left( {\nu_{i} - \nu_{\text{ci}} } \right)} \mathord{\left/ {\vphantom {{\left( {\nu_{i} - \nu_{\text{ci}} } \right)} {\left( {\nu_{\text{r}} - \nu_{\text{ci}} } \right)}}} \right. \kern-0pt} {\left( {\nu_{\text{r}} - \nu_{\text{ci}} } \right)}}P_{\text{r}} } \\ {P_{\text{r}} \, } \\ {0 \, } \\ \end{array} } \right.\;\;\;\;\;\begin{array}{*{20}l} {0 < \nu_{i} < \nu_{\text{ci}} } \hfill \\ {\nu_{\text{ci}} \le \nu_{i} < \nu_{\text{r}} } \hfill \\ {\nu_{\text{r}} \le \nu_{i} < \nu_{\text{co}} } \hfill \\ {\nu_{i} \ge \nu_{\text{co}} } \hfill \\ \end{array} $$

(3)

where P_w is the active wind power output; P_r is the rated power of the wind farm; v_i is the wind speed of the wind farm; v_ci, v_r and v_co are the cut-in, rated and cut-out speeds of the wind farm, respectively. Wind power output is treated as a negative load, whose power factor is kept constant [13].

Thus, $ \Delta \varvec{X} $ can be reformed as follows:

$$ \Delta \varvec{X} = \Delta \varvec{W} - \Delta \varvec{L} $$

(4)

where W is the vector consisting of active and reactive wind power outputs at corresponding buses; L is the vector consisting of active and reactive load demands at corresponding buses; $ \Delta \varvec{W} $ and $ \Delta \varvec{L} $ are the vectors of the changes in W and L.

From (2), taking a specific variable in U for example, it can be converted to a linear combination as follows:

$$ \begin{aligned} u_{i} =\, & u_{{i0}} + \sum\limits_{{j = 1}}^{{N_{X} }} {h_{{0ij}} \left( {w_{j} - w_{{j0}} } \right)} - \sum\limits_{{j = 1}}^{{N_{X} }} {h_{{0ij}} \left( {l_{j} - l_{{j0}} } \right)} \\ =\, & u_{{si0}} + \sum\limits_{{j = 1}}^{{N_{X} }} {h_{{0ij}} w_{j} } - \sum\limits_{{j = 1}}^{{N_{X} }} {h_{{0ij}} l_{j} } \\ \end{aligned} $$

(5)

where $ u_{si0} = u_{i0} - \sum\limits_{j = 1}^{{N_{X} }} {h_{0ij} w_{j0} } + \sum\limits_{j = 1}^{{N_{X} }} {h_{0ij} l_{j0} } $; $ u_{i} $ is a specific variable in U; w_j is the j^th variable in W; l_j is the j^th variable in L; u_i0, w_j0 and l_j0 are the values of u_j, w_j, and l_j at the operating point, respectively; h_0ij is the value at row i and column j of $ \varvec{H}_{0} $; N_X is the number of variables of X. For active and reactive power flows in branches, they can also be expressed as a linear combination of input variables.

According to (5), system variables can be converted to a linear combination of input random variables. Assuming the independence among input random variables, the cumulants of output random variables can be calculated by combining the cumulants of input random variables based on the property of cumulants [22]. In general, there are correlations of input random variables. In this paper, the correlations of input random variables are handled by the Cholesky decomposition algorithm [25].

3 Proposed method

The fundamental reason why the traditional CM has high errors for solving PLF of power systems containing large-scale wind power is that the wind power output can change significantly over time due to fluctuations of wind speeds. Therefore, this paper focuses on how to reduce the fluctuations of input random variables. Given the probability distribution functions and correlation coefficient matrix of input random variables, the samples of input random variables can be generated through the inverse Nataf transformation [33]. The values of input variables at the same position form one point as shown in Fig. 1, where X_i is a column vector of samples for a specific input variable (wind power or load). Then, these points are grouped into several clusters through the K-means algorithm. After clustering, the samples in each cluster have small variances. Furthermore, the proposed method adopts the law of total probability to combine the results obtained using CM for PLF in all clusters.

3.1 Improved K-means algorithm

After generating the whole samples, the K-means clustering is applied to divide the whole samples into several clusters. Each cluster has a cluster center. In fact, the cluster centers form a multi-state model for random variables. For the case with only one input random variable, such as the load at one bus, the obtained cluster centers correspond to multiple load levels. The analysis on the cluster centers can represent the analysis on the whole load samples. For the case with two input random variables, such as two wind farm outputs in the same area, the K-means clustering divides their samples into a number of clusters. Each cluster center is a combination of two wind power output levels, which has implied the correlation of these two wind power outputs. For clustering samples of more input variables, the K-means algorithm is conducted in the multi-dimensional Euclidean space. The detailed analyses are introduced in the following subsections.

3.1.1 General steps of K-means algorithm

Step 1: Select initial cluster centers, which is expressed as the matrix M₀.

$$ \varvec{M}_{0} = \left[ {\begin{array}{*{20}c} {x_{11}^{0} } & {x_{12}^{0} } & \cdots & {x_{1i}^{0} } & \cdots & {x_{1n}^{0} } \\ \vdots & \vdots & {} & \vdots & {} & \vdots \\ {x_{j1}^{0} } & {x_{j2}^{0} } & \cdots & {x_{ji}^{0} } & \cdots & {x_{jn}^{0} } \\ \vdots & \vdots & {} & \vdots & {} & \vdots \\ {x_{K1}^{0} } & {x_{K2}^{0} } & \cdots & {x_{Ki}^{0} } & \cdots & {x_{Kn}^{0} } \\ \end{array} } \right] $$

(6)

where K is the number of clusters set in advance; $ x_{ji}^{0} $ is the initial center of the variable i in the j^th cluster. The j^th cluster center can be expressed as: $ \left( {x_{j1}^{0} ,x_{j2}^{0} , \cdots ,x_{ji}^{0} , \cdots ,x_{jn}^{0} } \right) $.

Step 2: Calculate the Euclidean distance of all points to each cluster center.

$$ E_{d} \left( {l,j} \right) = \sqrt {\sum\limits_{i = 1}^{n} {\left( {x_{li} - x_{ji}^{0} } \right)^{2} } } $$

(7)

where $ E_{d} \left( {l,j} \right) $ denotes the Euclidean distance of point l to the center of the j^th cluster.

Step 3: Assign all points to the closest cluster according to the Euclidean distance, and recalculate the cluster centers.

Step 4: Repeat Steps 2 and 3 until cluster centers don’t migrate.

3.1.2 Methods for improving performance of K-means

1)
Selection of the initial cluster centers

The clustering performance is sensitive to the initial cluster centers, so that it is important to select them. In this paper, 10% of samples are randomly selected for clustering first. The obtained cluster centers through the first clustering can reflect the locations of cluster centers for the whole samples to some extent. Then, the obtained cluster centers are used as initial cluster centers to perform K-means clustering for the whole samples.

2)
Determination of the appropriate value of K

It is necessary to determine the number of clusters before performing K-means clustering. Therefore, the weighted average radius (WAR) is proposed to evaluate the clustering performance. The WAR can be calculated as follows:

$$ R = \sum\limits_{j = 1}^{K} {p_{j} r_{j} } $$

(8)

where R is the WAR; p_j is the ratio of the number of points in the j^th cluster to the number of all points; r_j is the radius of the j^th cluster [34].

In general, the value of WAR decreases with the increase of the number of clusters. Furthermore, the value of WAR decreases slowly once the number of clusters exceeds one value, which indicates that the quality of clustering doesn’t improve significantly once the number of clusters is larger than that value. Therefore, that value is suggested as the appropriate value of K.

3)
Dimensionality reduction

For improving the efficiency of K-means to handle high-dimensional samples, the singular value decomposition (SVD) can be used when the number of input variables is high. X consisting of the samples of input random variables is an $ N \times n $ matrix. Carry on SVD to X:

$$ \varvec{X} = \varvec{U}_{x}\varvec{\varSigma}_{x} {\mathbf{V}}_{x}^{\text{T}} $$

(9)

where $ \varvec{\varSigma}_{x} $ is a diagonal matrix with singular values along the main diagonal; U_x and V^T_x are the left and right singular matrices derived by performing SVD on X, respectively.

The high-dimensional samples X can be converted to low-dimensional samples $ \varvec{X}^{'} $ as follows:

$$ \varvec{X}^{'} = \varvec{XV}_{x} \left( {1:r} \right) $$

(10)

where $ \varvec{V}_{x} \left( {1:r} \right) $ is the first r columns of V_x. The value of r is the number of singular values, whose quadratic sum exceeds 90% of the quadratic sum of all singular values [34]. It is more efficient to perform K-means on the low-dimensional samples $ \varvec{X}^{'} $ than on the high-dimensional samples X.

3.1.3 Overall procedure of improved K-means algorithm

According to the methods described in above subsections, the overall procedure of the improved K-means algorithm is shown in Fig. 2, where N_max is the number of input variables to perform dimensionality reduction.

3.2 Computation of final cumulants

After the K-means clustering, a number of clusters are identified. In each cluster, the cumulant method is utilized to solve PLF. Once the computation for all clusters is completed, the law of total probability is applied to combine the moments obtained in all clusters to obtain the final cumulants of output random variables for the whole samples.

Assuming y to be one of output random variables, its final cumulants can be calculated as follows:

Step 1: The cumulants obtained using CM in each cluster can be converted to the corresponding moments.

$$ \mu_{r}^{i} = \left\{ {\begin{array}{*{20}l} {k_{1}^{i} } \hfill & \quad{r = 1} \hfill \\ {k_{r}^{i} + \sum\limits_{j = 1}^{r - 1} {C_{r - 1}^{j} \mu_{j}^{i} k_{r - j}^{i} } } \hfill & \quad{r >1} \hfill \\ \end{array} } \right. $$

(11)

where $ \mu_{r}^{i} $ is the r^th moment of y for the i^th cluster; kⁱ_r is the r^th cumulant of y for the i^th cluster; $ C_{r - 1}^{j} $ is the binomial coefficient, which is equal to the number of subsets of j distinct elements of r−1 elements.

Step 2: The final moments for the whole samples can be calculated according to the law of total probability.

$$ \mu_{r}^{y} = \sum\limits_{i = 1}^{K} {p_{i} } \mu_{r}^{i} $$

(12)

where $ \mu_{r}^{y} $ is the r^th moment for the whole samples.

Step 3: The final cumulants for the original whole samples can be calculated as (13).

$$ k_{r}^{y} = \left\{ {\begin{array}{*{20}l} {\mu_{1}^{y} } \quad & {r = 1} \hfill\\ {\mu_{r}^{y} - \sum\limits_{j = 1}^{r - 1} {C_{r - 1}^{j} \mu_{j}^{y} k_{r - j}^{y} } } \quad & {r > 1} \hfill \\ \end{array} } \right. $$

(13)

where $ k_{r}^{y} $ is the r^th cumulant of the output variable y for the whole samples.

3.3 Procedure of solving PLF using proposed method

Figure 3 shows the flow chart of the proposed method, where k is the current cluster. A five-step procedure is described as follows.

Step 1: Apply the inverse Nataf transformation to generate wind speed and load samples. The wind power samples can be obtained according to (3).

Step 2: Apply the improved K-means to cluster the wind power and load samples into a number of clusters.

Step 3: In each cluster, the CM is used to solve probabilistic load flow considering correlations of wind power outputs and loads. The correlated samples are first transformed to uncorrelated samples using the Cholesky decomposition. Then, calculate the cumulants of uncorrelated samples [25]. Finally, the CM introduced in Section 2 are executed to calculate the cumulants of all output random variables.

Step 4: Calculate the final cumulants of output random variables using the method introduced in Section 3.2.

Step 5: Approximate the PDFs of output random variables using Gram-Charlier series expansion due to its good tail behavior [22].

This paper solves PLF problems for a determined network and does not consider equipment contingencies such as N−1 contingency. If equipment contingencies are required, the proposed method can be performed for each contingency. Then, the results can be combined according to the law of total probability.

4 Case study

The proposed method, namely the improved K-means based cumulant method (IKCM), is tested on modified IEEE 9-bus and 118-bus test systems [35], which are integrated with additional wind farms. Table 1 lists the particulars of wind farms. In addition, v_ci = 3 m/s, v_r = 13 m/s, and v_co = 25 m/s [33]. The wind farms are assumed to be PQ buses, whose power factors are kept constant at 0.85 lag [13]. In these two cases, MCS with 20000 samples is applied to solve PLF, and its results are treated as the benchmark to assess the accuracy and efficiency of the proposed method. In addition, the uncorrelated CM (UCM), the correlated CM (CCM), the 2m+1 PEM and LHS-based MCS are conducted for comparison purpose. The UCM does not consider correlations of input random variables. The CCM handles correlated input random variables using the Cholesky decomposition. The 2m+1 PEM is proposed in [15]. Reference [6] proposed an LHS-based PLF method and proved that it could obtain accurate results by hundreds of simulations. In this paper, the LHS-based MCS is conducted with 500 samples. The errors of cumulants and PDFs obtained using IKCM, UCM, CCM, 2m+1 PEM and LHS are measured by the indices of absolute percent error (APE) and average root mean square (ARMS), as shown in (14) and (15), respectively. The programs are developed using MATLAB and are executed on a PC with 2.6 GHz Intel (R) Core (TM) i5 duo processor and 8 GB DDR3 RAM.

$$ APE = \left| {{{\left( {r_{\text{o}} - r_{\text{MCS}} } \right)} \mathord{\left/ {\vphantom {{\left( {r_{\text{o}} - r_{\text{MCS}} } \right)} {r_{\text{MCS}} }}} \right. \kern-0pt} {r_{\text{MCS}} }}} \right| \times 100\% $$

(14)

$$ ARMS = \frac{1}{{N_{\text{p}} }}\sqrt {\sum\limits_{i = 1}^{{N_{\text{p}} }} {\left( {OM_{i} - MCS_{i} } \right)^{2} } } $$

(15)

where r_o is the cumulant value obtained using different methods except MCS; r_MCS is the cumulant value obtained using MCS; OM_i denotes the value of the i^th point on the PDFs obtained using different methods except MCS; MCS_i denotes the value of the i^th point on the PDFs obtained using MCS; N_p is the number of points on PDFs.

Table 1 Particulars of wind farms

Full size table

4.1 Case 1: modified IEEE 9-bus test system

4.1.1 Basic information

In modified IEEE 9-bus test system, all loads have constant power factors. The active load demand at each bus is modeled as a Gussian distribution, whose mean is provided in MATPOWER [35] and standard deviation is equal to 10% of its mean. Weibull distributions are used to model wind speeds. Table 2 lists the shape and scale parameters of wind speeds [36]. The correlation coefficient between loads is assumed to be 0.8, the correlation coefficient between wind speeds is assumed to be 0.76, and the correlation coefficient between the wind speed and load at the same bus is assumed to be 0.2 [33]. The PDFs of active load power and wind power at bus 7 are depicted by histograms as shown in Figs. 4 and 5.

Table 2 Parameters of wind speeds (case 1)

Full size table

4.1.2 Performance of improved K-means clustering

According to (12), the relationship between WAR and the number of clusters can be obtained, as shown in Fig. 6.

It can be observed that the WAR declines slowly after the number of clusters is more than 40. This implies that the clustering performance will not significantly improve when the number of clusters is above 40. Therefore, the K value is suggested to be 40.

Table 3 shows the clustering results of the K-means algorithm. After clustering, input random variable samples are grouped into 40 clusters. The variance can reflect the fluctuation of one random variable. For each cluster, the variance of the random variable is calculated. As a result, 40 variance values corresponding to 40 clusters are obtained. Among these 40 values, we choose the minimum, the maximum and the mean value to present the fluctuation level of each input random variable in each cluster. The chosen values are labelled as S_min, S_max and S_mean, respectively. The column labelled S is the variance of a specific input random variable for the original total samples. It can be observed that the variances after the improved K-means clustering are much smaller than those for the original whole samples.

Table 3 Comparison of variances of input variables

Full size table

4.1.3 Probabilistic results

Table 4 lists the results of different methods used to solve PLF problems for this test system. The results are aggregated into: VA which stands for voltage angles, VM which stands for voltage magnitudes, PL which stands for line active power flows, and QL which stands for line reactive power flows, since it is difficult to present all output variables individually. The columns labelled “ε_r1”, “ε_r2”, “ε_r3” and “ε_r4” are APE values of the first four cumulants compared with those obtained using MCS. The mean and maximum values of the APE values are shown to demonstrate the scope of APE values for a class of variables.

Table 4 Comparison of first four cumulants (case 1)

Full size table

It can be observed from Table 4 that all APE values obtained using IKCM are small, which indicates that the cumulants obtained using IKCM are approximately the same as those of MCS. The worst APE value of the proposed method’s results is 59.29%, which occurs at the ε_r4 of VM at bus 5. However, the actual error for this VM is only − 3.66 × 10⁻¹¹ p.u. This variable may mislead the comparison on APE and should not be applied to assess the performance of different methods. Compared with IKCM, the 2m+1 PEM has smaller values of ε_r1, but has much larger values of ε_r3 and ε_r4. The CCM has much larger values of ε_r1, ε_r2, ε_r3 and ε_r4 than IKCM. There are two points to be pointed out about the CCM. First, the values of ε_r3 and ε_r4 are much larger than the values of ε_r1 and ε_r2. Second, the results of reactive quantities (VM and QL) are worse than those of active quantities (VA and PL), which can be significantly observed from the values of ε_r3. The UCM has large values of ε_r2, ε_r3 and ε_r4. Compared with IKCM, LHS produces slightly larger errors. However, LHS has much smaller ε_r1 and ε_r2 than CCM and UCM, and has significantly smaller ε_r3 and ε_r4 than 2m+1 PEM. The second, third and fourth cumulants can reflect the variance, skewness and kurtosis of an output random variable, respectively. Therefore, the large values of ε_r2, ε_r3 and ε_r4 can result in distortions on PDF curves. The PDF curves of output variables (VA at PV and PQ buses, VM at PQ buses, PL and QL) are approximated using 7-order Gram-Charlier series expansion.

Figures 7 and 8 show the PDFs of the VM at bus 9 and the active power flow in line 7–8, respectively. From the comparison in Figs. 7 and 8, the PDFs of IKCM can better match MCS histograms than CCM, UCM and 2m+1 PEM. The PDFs of LHS are close to those of IKCM.

Figure 9 shows the ARMS results of PDF curves of all output random variables. The box plots corresponding to IKCM are all below those corresponding to CCM, UCM, 2m+1 PEM and LHS, which indicates that the PDFs produced by IKCM can approximate those of MCS better. Comparison between LHS and IKCM shows that the ARMS values of the PDFs of LHS are also small, and that the PDFs of LHS are close to those of IKCM. It should be pointed out that in Fig. 9d, there are two exception values in the box plots of IKCM. However, the actual ARMS values are only 0.0995% and 0.1170%. It can be seen that the CCM perform worse on reactive quantities than active quantities. This characteristic can also be observed from the results of cumulants and PDFs. The reason is that reactive quantities generally have higher degree of non-linearity than active quantities. Moreover, the computation time of each method consumes and the number of deterministic power flow (DPF) calculations conducted by each method are shown in Table 5. It can be seen that the proposed method spends much less time than LHS and MCS.

Table 5 Comparison of computation time (case 1)

Full size table

An additional experiment is conducted to examine the performance of the proposed method with more clusters, where the proposed method with 60 clusters is implemented on this test system. The results indicate that the proposed method with 60 clusters is more accurate than 40 clusters. For example, the ε_r1, ε_r2, ε_r3 and ε_r4 of QL in line 1–4, which are obtained using 40 clusters, are 1.45%, 0.41%, 8.64% and 18.18%, respectively. These values obtained using 60 clusters are 1.22%, 0.30%, 5.96% and 15.97%, respectively. It can be seen that more clusters will produce more accurate results. Obviously, more clusters will require more computation time.

It can be concluded that the proposed method has higher computational accuracy than CCM, UCM, 2m+1 PEM and LHS, and is more efficient than LHS and MCS. In addition, more clusters can achieve higher accuracy at the expense of efficiency.

4.2 Case 2: modified IEEE 118-bus test system

The modified IEEE 118-bus test system is used to examine the feasibility of the proposed method for a large system with multiple wind farms. Weibull distributions are used to model wind speeds. Table 6 lists the shape and scale parameters of wind speed distributions [36]. The correlations of wind speeds at buses 17 and 30, buses 59 and 80, and buses 92 and 100 are set to be 0.88, and others are set to be 0.48. All loads have constant power factors. The active load demand at each bus is modeled as Gussian distribution, whose mean is provided in MATPOWER [35] and standard deviation is equal to 10% of its mean.

Table 6 Parameters of wind speeds (case 2)

Full size table

The relationship between WAR and the number of clusters for case 2 can be obtained using (12). The number of clusters is suggested to be 40. In this test system, there are 105 input random variables, including 6 wind power outputs and 99 load demands. Therefore, the dimensionality reduction based on SVD is applied in the K-means process, where the first six singular values are selected and their sum is equal to 92.13% of the quadratic sum of all singular values. The computation times of the traditional K-means and the improved K-means with SVD are 3.08 s and 0.92 s, respectively. It can be seen that the K-means algorithm achieves an efficiency improvement through the dimensionality reduction based on SVD.

Table 7 presents the results of cumulants. Figures 10 and 11 show the PDFs of the PL in line 100–101 and the QL in line 79–80. It is of note that the proposed method has slightly large ε_r3 and ε_r4 values for very few system variables. The reason is that the Cholesky decomposition algorithm used to handle correlations has some errors for high-order cumulants when input random variables are non-normal distributions. The final PDFs of these output random variables obtained using the proposed method still have low ARMS values as shown in Fig. 12. The comparison between Table 8 and Table 5 shows that the 2m+1 PEM does not have the obvious advantage of computational efficiency over the proposed method when solving PLF problems for this large test system. This is because the 2m+1 PEM conducts 211 load flow simulations for this large test system due to 105 input random variables. According to the results for this IEEE 118-bus test system, the same conclusion that the proposed method has higher computational accuracy than CCM, UCM, 2m+1 PEM and LHS, and spends much less time than LHS and MCS, can be achieved.

Table 7 Comparison of first four cumulants (case 2)

Full size table

Table 8 Comparison of computation time (case 2)

Full size table

4.3 Discussion about stability of proposed method

The proposed method is based on the clustering algorithm. Theoretically, the result of clustering is the local optimal solution, which is influenced by the initial cluster centers. In order to examine the stability of the proposed method, probabilistic power flow for the modified IEEE 118-bus test system is conducted 100 times with random initial cluster centers. The APE values of the first four cumulants of each type of variables obtained in each simulation are summed and averaged. In Table 9, the columns labelled with ε_r1,mean, ε_r2,mean, ε_r3,mean and ε_r4,mean are the mean values of APE values of the first four cumulants for 100 simulations. It can be seen that the errors in Table 9 are approximately equal to the corresponding values of the proposed method in Table 7, which demonstrates that the proposed method can achieve stable and accurate results.

Table 9 Errors of first four cumulants for 100 simulations

Full size table

5 Conclusion

A novel PLF method considering large-scale wind power integration is proposed in this paper. In the process of the proposed method, an improved K-means algorithm is used to cluster the samples of input random variables, and the law of total probability is applied to combine the results obtained in each cluster. From the case studies on modified IEEE 9-bus and 118 bus test systems, some conclusions are drawn as follows:

1)
To solve PLF considering large-scale wind power integration, the proposed method can achieve higher accuracy than traditional CM, 2m+1 PEM and LHS, and higher efficiency than LHS and MCS. In other words, the proposed method can achieve a better performance with consideration of both computational efficiency and accuracy.
2)
More clusters will produce more accurate results at the expense of time. The suggested number of clusters should be determined in advance.
3)
The traditional CM considering the correlation of input random variables generally has significant errors for reactive quantities.
4)
The 2m+1 PEM has accurate results for the first two cumulants but not for the third and fourth cumulants, which results in significant errors in PDFs of output variables.

In conclusion, as the proposed method has been tested on the small and large test systems, it can provide an accurate and efficient tool for power system planning and operation with large-scale wind power.

References

Prusty BR, Jena D (2016) A critical review on probabilistic load flow studies in uncertainty constrained power systems with photovoltaic generation and a new approach. Renew Sust Energ Rev 69:1286–1302
Article Google Scholar
Glynn PW, Iglehart DL (1989) Importance sampling for stochastic simulations. Management Science 35(11):1367–1392
Article MathSciNet MATH Google Scholar
Yu H, Chung CY, Wong KP et al (2009) Probabilistic load flow evaluation with hybrid Latin hypercube sampling and Cholesky decomposition. IEEE Trans Power Syst 24:661–667
Article Google Scholar
Luo G, Chen JF, Cai DF et al (2013) Probabilistic assessment of available transfer capability considering spatial correlation in wind power integrated system. IET Gener Transm Distrib 7:1527–1535
Article Google Scholar
Hajian M, Rosehart WD, Zareipour H (2013) Probabilistic power flow by Monte Carlo simulation with Latin supercube sampling. IEEE Trans Power Syst 28:1550–1559
Article Google Scholar
Chen Y, Wen J, Cheng S (2013) Probabilistic load flow method based on Nataf transformation and Latin hypercube sampling. IEEE Trans Sustain Energy 4(2):294–301
Article Google Scholar
Liu Y, Gao S, Cui H et al (2016) Probabilistic load flow considering correlations of input variables following arbitrary distributions. Electr Power Syst Res 140:354–362
Article Google Scholar
Li B, Shahzad M, Qi B et al (2018) Probabilistic computational model for correlated wind farms using copula theory. IEEE Access 6:14179–14187
Article Google Scholar
Cui T, Franchetti F (2013) A quasi-Monte Carlo approach for radial distribution system probabilistic load flow. In: Proceedings of IEEE PES innovative smart grid technologies conference (ISGT), Washington DC, USA, 1-6 February 2013, pp 1–6
Chen W, Yan HQ, Pei XP et al (2016) A quasi Monte Carlo probabilistic load flow method of distribution system containing distributed generation and electric vehicle charging load based on Sobol sequence. In: Proceedings of China international conference on electricity distribution (CICED), Xi’an, China, 10–13 August 2016, pp 1–7
Xu X, Yan Z (2017) Probabilistic load flow calculation with quasi-Monte Carlo and multiple linear regression. Int J Electr Power Energy Syst 88:1–12
Article Google Scholar
Su CL (2005) Probabilistic load-flow computation using point estimate method. IEEE Trans Power Syst 20:1843–1851
Article Google Scholar
Aien M, Khajeh MG, Rashidinejad M et al (2014) Probabilistic power flow of correlated hybrid wind-photovoltaic power systems. IET Renew Power Gen 8:649–658
Article Google Scholar
Morales JM, Perez-Ruiz J (2007) Point estimate schemes to solve the probabilistic power flow. IEEE Trans Power Syst 22:1594–1601
Article Google Scholar
Morales JM, Baringo L, Conejo AJ et al (2010) Probabilistic power flow with correlated wind sources. IET Gener Transm Distrib 4:641–651
Article Google Scholar
Wang X, Gong Y, Jiang C (2015) Regional carbon emission management based on probabilistic power flow with correlated stochastic variables. IEEE Trans Power Syst 30:1094–1103
Article Google Scholar
Gupta N, Daratha N (2017) Probabilistic three-phase load flow for unbalanced electrical systems with wind farms. Int J Electr Power Energy Syst 87:154–165
Article Google Scholar
Gupta N (2016) Probabilistic load flow with detailed wind generator models considering correlated wind generation and correlated loads. Renew Energy 94:96–105
Article Google Scholar
Aien M, Fotuhi-Firuzabad M, Aminifar F (2012) Probabilistic load flow in correlated uncertain environment using unscented transformation. IEEE Trans Power Syst 27:2233–2241
Article Google Scholar
Wan C, Xu Z, Dong ZY et al (2012) Probabilistic load flow computation using first-order second-moment method. In: Proceeding of the 2012 IEEE power and energy society general meeting, San Diego, USA, 22–26 July 2012, pp 1-–6
Liu H, Tang C, Han J et al (2017) Probabilistic load flow analysis of active distribution network adopting improved sequence operation methodology. IET Gener Transm Distrib 11(9):2147–2153
Article Google Scholar
Zhang P, Lee ST (2004) Probabilistic load flow computation using the method of combined cumulants and Gram-Charlier expansion. IEEE Trans Power Syst 19:676–682
Article Google Scholar
Yuan Y, Zhou J, Ju P et al (2011) Probabilistic load flow computation of a power system containing wind farms using the method of combined cumulants and Gram-Charlier expansion. IET Renew Power Gen 5:448–454
Article Google Scholar
Fan M, Vittal V, Heydt GT et al (2012) Probabilistic power flow studies for transmission systems with photovoltaic generation using cumulants. IEEE Trans Power Syst 27:2251–2261
Article Google Scholar
Cai D, Chen J, Shi D et al (2012) Enhancements to the cumulant method for probabilistic load flow studies. In: Proceedings of 2012 IEEE power and energy society general meeting, San Diego, USA, 22–26 July 2012, pp 1–8
Ran X, Miao S (2016) Three-phase probabilistic load flow for power system with correlated wind, photovoltaic and load. IET Gener Transm Distrib 10:3093–3101
Article Google Scholar
Prusty BR, Jena D (2016) Combined cumulant and Gaussian mixture approximation for correlated probabilistic load flow studies: a new approach. CSEE J Power Energy Syst 2(2):71–78
Article Google Scholar
Sui BY, Hou K, Jia HJ et al (2018) Maximum entropy based probabilistic load flow calculation for power system integrated with wind power generation. J Mod Power Syst Clean Energy 6(5):1042–1054
Article Google Scholar
Amid P, Crawford C (2018) A cumulant-tensor based probabilistic load flow method. IEEE Trans Power Syst 33(5):5648–5656
Article Google Scholar
Allan RN, Da Silva AL, Burchett RC (1981) Evaluation methods and accuracy in probabilistic load flow solutions. IEEE Trans Power App Syst 100(5):2539–2546
Article Google Scholar
Zhu X, Liu X, Zhang JH (2013) Probabilistic load flow method considering large-scale wind power integration. Proceedings CSEE 33:77–85
Google Scholar
Deng X, He J, Zhang P (2017) A novel probabilistic optimal power flow method to handle large fluctuations of stochastic variables. Energies 10(10):1–21
Article Google Scholar
Li Y, Li W, Yan W et al (2014) Probabilistic optimal power flow considering correlations of wind speeds following different distributions. IEEE Trans Power Syst 29:1847–1854
Article Google Scholar
Leskovec J, Ullman JD (2014) Mining of massive datasets. Cambridge University Press, Cambridge
Book Google Scholar
MATPOWER homepage. http://www.pserc.cornell.edu/matpower. Accessed 20 January 2017
Masseran N, Razali AM, Ibrahim K (2012) An analysis of wind power density derived from several wind speed density functions: the regional assessment on wind power in Malaysia. Rebew Sust Energ Rev 16:6476–6487
Article Google Scholar

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (No. 2017YFB0903400).

Author information

Authors and Affiliations

School of Electrical Engineering, Beijing Jiaotong University, Beijing, China
Xiaoyang DENG, Pei ZHANG, Kangmeng JIN, Jinghan HE & Xiaojun WANG
Department of Electrical and Electronic Engineering, Imperial College London, London, UK
Yuwei WANG

Authors

Xiaoyang DENG
View author publications
You can also search for this author in PubMed Google Scholar
Pei ZHANG
View author publications
You can also search for this author in PubMed Google Scholar
Kangmeng JIN
View author publications
You can also search for this author in PubMed Google Scholar
Jinghan HE
View author publications
You can also search for this author in PubMed Google Scholar
Xiaojun WANG
View author publications
You can also search for this author in PubMed Google Scholar
Yuwei WANG
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pei ZHANG.

Additional information

CrossCheck date: 6 December 2018

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

DENG, X., ZHANG, P., JIN, K. et al. Probabilistic load flow method considering large-scale wind power integration. J. Mod. Power Syst. Clean Energy 7, 813–825 (2019). https://doi.org/10.1007/s40565-019-0502-0

Download citation

Received: 04 July 2018
Accepted: 06 December 2018
Published: 28 February 2019
Issue Date: 13 July 2019
DOI: https://doi.org/10.1007/s40565-019-0502-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Probabilistic load flow method considering large-scale wind power integration

Abstract

Similar content being viewed by others

Modeling multiple-criteria decision making of the electrical grid considering optimal demand management

Electricity load forecasting: a systematic review

Electricity generation scheduling of thermal- wind-solar energy systems

1 Introduction

2 CM for PLF formulation