1 Introduction
1.1 Related work
1.2 Contribution
2 Existing foundations
2.1 Changes in network data
-
Global Link Change (GLC): The change is triggered by a significantly increased or decreased link amount. This is assumed to happen globally, i.e., the changed link probability affects each node equally.
-
Local Link Change (LLC): Similar to GLCs, but the changed link behavior only affects a few nodes which either get more or less influence, i.e., the network structure changes to a more centralized or flat hierarchy.
-
Global Node Change (GNC): The node amount increases or decreases significantly, because new nodes enter the network or existing ones leave it.
-
Local Node Change (LNC): Only a few influential nodes enter or leave the network which results in a significant impact on the network structure.
Type | Examples |
---|---|
GLC | Increased/decreased communication in communication networks |
Changed activity in cyber networks (e.g., due to malware) | |
LLC | Formation of new hotspots in disease networks |
Changed route layout in transportation networks | |
GNC | New advertising strategy in customer networks |
Addition of new destinations in tourism networks | |
LNC | Restructuring of supply chains in logistics networks |
Creation of new leading positions in profession networks |
2.2 Metric-based network monitoring
-
Frobenius norm (F): \(\sqrt{\sum \limits _{i = 1}^{n_t} \sum \limits _{j = 1}^{n_t} a_{t,ij}^2}\).
-
Spectral norm (S): \(\max \limits _{\left\Vert x \right\Vert _2 = 1} \left\Vert A_t x \right\Vert _2\).
-
Closeness centrality (C): \(c_i^C = (\frac{1}{n-1}\sum \limits _{j \in V} \text {d}(i,j))^{-1}\).
-
Degree centrality (D): \(c_i^D = \sum \limits _{j = 1}^{n_t} a_{t,ij}\).
-
Betweenness centrality (B): \(c_i^B = \sum \limits _{i \ne j \ne l}\frac{\sigma _{jl}(i)}{\sigma _{jl}}\).
-
Eigenvector centrality (E): \(c_i^E = \frac{1}{\lambda }{\sum \limits _{j=1}^{n_t}} a_{t,ij} c_j^E\).
2.3 General online monitoring procedure
2.4 Control charts for traditional multivariate data
3 Multivariate metric-based monitoring solutions
Change type | F | S | C\(_m\) | C\(_d\) | D\(_m\) | D\(_d\) | B\(_m\) | B\(_d\) | E |
---|---|---|---|---|---|---|---|---|---|
GLC | \(++\) | \(++\) | \(+\) | − | \(++\) | − | \(+\) | − | o |
LLC | \(+\) | o | − | \(+\) | o | \(+\) | o | \(+\) | \(++\) |
GNC | − | \(++\) | \(+\) | − | \(+\) | − | \(+\) | − | o |
LNC | − | − | o | \(+\) | − | \(+\) | − | \(+\) | \(++\) |
3.1 Selection of a suitable set of metrics
-
I - Balanced Setups: This category represents the most flexible monitoring strategy. The goal is to achieve a reliable performance for as many as possible change types. Particularly, changes of moderate to high intensity should be detected. Corresponding sets include one metric from each of the performance groups A and B. The third metric is the average eigenvector centrality. This balanced setup is a promising candidate for ad hoc applications in which users do not know what to expect and how a change might look like (e.g., networks with high dynamics like social networks). Example setup: SB\(_d\)E.
-
IIa - Balanced Setups with a focus on global changes: In this category, it is still of interest to be sensitive to as many change types as possible, but with a clear focus on the detection of global changes. This category involves all 2 vs. 1 combinations (i.e., two metrics out of group A and one out of group B). Example setup: SD\(_m\)C\(_d\).
-
IIb - Balanced Setups with a focus on local changes: The same as IIa, just with a major focus on local changes (i.e., one metric out of group A and two out of group B). Example setup: B\(_d\)D\(_d\)S.
-
IIIa - Unbalanced setups for global changes: It is only of interest to detect global changes in the network - even those which are characterized by a small change intensity. The detection of other change types is not relevant. The corresponding metric sets are constructed with three metrics out of group A. Example setup: C\(_m\)D\(_m\)S.
-
IIIb - Unbalanced setups for local changes: The same as IIIa, just for local changes. The metric sets are constructed with three metrics out of group B. Example setup:C\(_d\)D\(_d\)B\(_d\).
-
IV - Setups with a particular focus on link changes: This category represents all metric sets in which the Frobenius norm is part of, since this metric is specialized to detect link changes. Changes purely triggered by nodes can also be emphasized by combining the Frobenius norm with other metrics that are sensitive to it. Example setup: FSB\(_d\).
3.2 Selection of a suitable multivariate control chart
3.3 Interpretation of the results and further analyses
4 Simulation study
Type | Data generation |
---|---|
GLC | In-Control: ER graph with probability p and n nodes |
Out of Control: ER graph with probability \({\tilde{p}}\) and n nodes | |
LLC | In-Control: ER graph with probability p and n nodes |
Out of Control: heterogenous ER graph with a changed | |
probability \({\tilde{p}}\) for k central nodes | |
GNC | In-Control: ER graph with at each time point a randomly |
chosen node size in \([n-n\cdot d, n + n\cdot d]\) and a randomly | |
chosen link amount in \([m - m\cdot d, m + m\cdot d]\). | |
Out of Control: ER graph with at each time point a | |
randomly chosen node size in \([{\tilde{n}}-{\tilde{n}}\cdot d, {\tilde{n}} + {\tilde{n}}\cdot d]\) and a | |
randomly chosen link amount in \([m - m\cdot d, m + m\cdot d]\). | |
LNC | In-Control: ER graph with probability p and at each time |
point a random node size \(n_{\text {flex}} \in [n - n\cdot d, n + n\cdot d]\). | |
Out of Control: heterogenous ER graph with at each time | |
point a random node size \({\tilde{n}}_{\text {flex}} \in [{\tilde{n}}-{\tilde{n}}\cdot d, {\tilde{n}} + {\tilde{n}}\cdot d]\) | |
and changed probability \({\tilde{p}}\) for k central nodes and a | |
probability of \(\frac{pn_{\text {flex}} - {\tilde{p}}k}{{\tilde{n}}_{\text {flex}}-k}\) for all other nodes. |
Type | Intensity | Parameter setup |
---|---|---|
GLC | Small | \(p = 0.4\), \({\tilde{p}} = 0.43\) |
Moderate | \(p = 0.4\), \({\tilde{p}} = 0.45\) | |
Heavy | \(p = 0.4\), \({\tilde{p}} = 0.5\) | |
LLC | Small | \(p = 0.35\), \({\tilde{p}} = 0.45\), \(k = 5\) |
Moderate | \(p = 0.4\), \({\tilde{p}} = 0.6\), \(k = 2\) | |
Heavy | \(p = 0.35\), \({\tilde{p}} = 0.65\), \(k = 1\) | |
GNC | Small | \(d = 0.05\), \(m = 800\), \({\tilde{n}} = 55\) |
Moderate | \(d = 0.05\), \(m = 800\), \({\tilde{n}} = 60\) | |
Heavy | \(d = 0.05\), \(m = 800\), \({\tilde{n}} = 75\) | |
LNC | Small | \(d = 0.15\), \(p = 0.4\), \({\tilde{n}} = 54\), \({\tilde{p}} = 0.6\), \(k = 4\) |
Moderate | \(d = 0.15\), \(p = 0.4\), \({\tilde{n}} = 52\), \({\tilde{p}} = 0.7,\) \(k = 2\) | |
Heavy | \(d = 0.15\), \(p = 0.4\), \({\tilde{n}} = 51\), \({\tilde{p}} = 0.8\), \(k = 1\) |
4.1 Setup
4.2 Phase I performances
4.3 Phase II performances
(a) Univariate setup performances | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Type | Intensity | F | S | C\(_m\) | C\(_d\) | D\(_m\) | D\(_d\) | B\(_m\) | B\(_d\) | E |
GLC | small | 3.12 | 3.24 | 3.03 | 94.78 | 3.12 | 109.89 | 3.17 | 55.63 | 50.61 |
moderate | 1.22 | 1.20 | 1.20 | 68.86 | 1.23 | 86.83 | 1.22 | 16.89 | 19.82 | |
heavy | 1.00 | 1.00 | 1.00 | 45.02 | 1.00 | 102.13 | 1.00 | 4.71 | 9.82 | |
LLC | small | 9.19 | 7.70 | 8.40 | 25.22 | 9.19 | 27.44 | 9.53 | 60.99 | 36.00 |
moderate | 14.57 | 9.97 | 12.14 | 3.14 | 14.57 | 3.45 | 14.57 | 6.04 | 4.43 | |
heavy | 22.52 | 11.60 | 17.67 | 1.41 | 32.52 | 1.46 | 21.72 | 2.27 | 1.81 | |
GNC | small | 144.02 | 2.19 | 1.41 | 45.33 | 2.17 | 78.00 | 1.34 | 8.16 | 11.98 |
moderate | 142.55 | 1.01 | 1.00 | 9.82 | 1.01 | 73.06 | 1.00 | 2.02 | 3.03 | |
heavy | 135.90 | 1.00 | 1.00 | 1.47 | 1.00 | 61.49 | 1.00 | 1.34 | 1.06 | |
LNC | small | 93.22 | 39.53 | 1.06 | 1.16 | 9.01 | 1.07 | 1.80 | 1.07 | 1.00 |
moderate | 90.57 | 95.01 | 4.45 | 1.02 | 38.66 | 1.00 | 4.46 | 1.07 | 1.00 | |
heavy | 88.24 | 93.16 | 23.84 | 1.01 | 62.04 | 1.01 | 13.16 | 1.02 | 1.01 |
(b) Multivariate setup performances | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
Type | Intensity | SB\(_d\)E | SD\(_m\)C\(_d\) | B\(_d\)D\(_d\)S | C\(_m\)D\(_m\)S | B\(_d\)C\(_d\)D\(_d\) | FSB\(_d\) | SAL | ||
GLC | small | 20.56 | 15.63 | 19.00 | 8.90 | 25.31 | 23.52 | 41.34 | ||
moderate | 2.22 | 1.82 | 2.25 | 1.66 | 6.28 | 2.39 | 10.61 | |||
heavy | 1.00 | 1.00 | 1.00 | 1.00 | 1.04 | 1.00 | 1.06 | |||
LLC | small | 27.72 | 18.47 | 25.47 | 68.36 | 22.07 | 29.05 | 23.56 | ||
moderate | 4.66 | 3.49 | 4.07 | 13.81 | 2.92 | 4.76 | 2.94 | |||
heavy | 1.86 | 1.60 | 1.76 | 17.27 | 1.52 | 2.04 | 1.51 | |||
GNC | small | 7.49 | 12.38 | 6.87 | 6.06 | 8.88 | 23.94 | 7.09 | ||
moderate | 1.48 | 1.50 | 1.34 | 1.17 | 1.61 | 5.04 | 1.44 | |||
heavy | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |||
LNC | small | 1.02 | 1.03 | 1.03 | 1.00 | 1.02 | 1.50 | 1.11 | ||
moderate | 1.02 | 1.05 | 1.04 | 1.11 | 1.13 | 1.81 | 1.10 | |||
heavy | 1.01 | 1.01 | 1.01 | 2.00 | 1.01 | 1.01 | 1.01 |
4.4 Extension to mixed-type changes
No. | In-Control | Changes |
---|---|---|
1 | \(K = 3, p_1 = 0.7, p_2 = 0.6,\) | \(p_1 = 0.9, p_2 = 0.7,\) |
\(p_3 = 0.8, p_{ij} = 0.1,\) | \(p_3 = 0.9, p_{ij} = 0.2,\) | |
\(n_1 = n_2 = 33, n_3 = 34\) | \(n_1 = n_2 = n_3 = 40\) | |
2 | \(K = 3, p_1 = 0.7, p_2 = 0.6,\) | |
\(p_3 = 0.8, p_{ij} = 0.1,\) | \(p_1 = 0.4, p_2 = 0.9\) | |
\(n_1 = n_2 = 33, n_3 = 34\) | ||
3 | \(K = 3, p_1 = 0.7, p_2 = 0.6,\) | |
\(p_3 = 0.3, p_{ij} = 0.1,\) | \(K = 4, p_4 = 0.3,\) | |
\(n_1 = 30, n_ 2 = 20,\) | \(n_3 = 30, n_4 = 20\) | |
\(n_3 = 50\) | ||
4 | \(K = 3, p_1 = 0.7, p_2 = 0.6,\) | |
\(p_3 = 0.3, p_{ij} = 0.1,\) | \(K = 4, p_3 = p_4 = 0.49\) | |
\(n_1 = 30, n_ 2 = 20,\) | \(n_3 = 30, n_4 = 20\) | |
\(n_3 = 50\) |
a. Univariate Setup Performances | |||||||||
---|---|---|---|---|---|---|---|---|---|
Case | F | S | C\(_m\) | C\(_d\) | D\(_m\) | D\(_d\) | B\(_m\) | B\(_d\) | E |
1 | 1.00 | 1.00 | 1.00 | 19.32 | 1.00 | 10.63 | 1.00 | 1.49 | 1.06 |
2 | 20.34 | 15.71 | 17.90 | 18.59 | 18.34 | 22.77 | 19.42 | 12.37 | 1.62 |
3 | 1.01 | 3.02 | 1.00 | 2.55 | 1.00 | 6.13 | 1.00 | 5.89 | 1.28 |
4 | 25.60 | 17.33 | 10.61 | 16.46 | 25.60 | 17.93 | 10.44 | 18.50 | 20.03 |
b. Multivariate Setup Performances | |||||||||
---|---|---|---|---|---|---|---|---|---|
Case | SB\(_d\)E | SD\(_m\)C\(_d\) | B\(_d\)D\(_d\)S | C\(_m\)D\(_m\)S | B\(_d\)C\(_d\)D\(_d\) | FSB\(_d\) | SAL | ||
1 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | ||
2 | 1.87 | 5.00 | 9.87 | 5.12 | 14.58 | 4.77 | 13.75 | ||
3 | 1.39 | 1.00 | 2.21 | 1.00 | 1.43 | 1.00 | 1.03 | ||
4 | 8.94 | 17.18 | 14.14 | 5.12 | 10.74 | 15.25 | 10.92 |