1 Introduction
2 Related Work
2.1 Attribute Graph Clustering
2.2 Self-Supervised Clustering
3 Method
3.1 Notations
3.2 Deep Self-Supervised Attribute Graph Cluster (DSAGC)
3.2.1 Pretext Task: Self-Supervised Symmetric Graph Auto-Encoder (SSGAE)
3.2.2 Reliable Sample Selection
3.2.3 Downstream Task
4 Experiments
4.1 Datasets
Dataset | Nodes | Features | Edges | Classes |
---|---|---|---|---|
Cora | 2708 | 1433 | 5429 | 7 |
Citeseer | 3312 | 3703 | 4732 | 6 |
Pubmed | 19717 | 500 | 44338 | 3 |
Wiki | 2405 | 4973 | 17981 | 17 |
4.2 Experiment Settings
4.3 Baselines and Evaluation Metrics
Method | ACC | NMI | ARI |
---|---|---|---|
k-means | 50.0 | 31.7 | 23.9 |
Spectral clustering | 39.8 | 29.7 | 17.4 |
Graph encoder | 30.1 | 5.9 | 4.6 |
DeepWalk | 52.9 | 38.4 | 29.1 |
DNGR | 41.9 | 31.8 | 14.2 |
DEC | 46.5 | 23.5 | 15.1 |
TADW | 53.6 | 36.6 | 24.0 |
GAE | 53.0 | 39.7 | 29.3 |
VGAE | 59.2 | 40.8 | 34.7 |
ARGE | 64.0 | 44.9 | 35.2 |
ARVGE | 63.8 | 45.0 | 37.4 |
DAEGC | 70.4 | 52.8 | 49.6 |
GC-VGE | 70.7 | 53.6 | 48.2 |
SDCN | 35.6 | 14.3 | 7.8 |
AGC | 68.9 | 53.7 | 48.6 |
EGAE | 72.4 | 54.0 | 47.2 |
GALA | 74.6 | 57.7 | 53.2 |
SSGAE | 75.2 | 56.6 | 54.8 |
DSAGC | 77.1 | 58.7 | 57.2 |
Method | ACC | NMI | ARI |
---|---|---|---|
k-means | 54.4 | 31.2 | 28.5 |
Spectral clustering | 30.8 | 9.0 | 8.2 |
Graph encoder | 29.3 | 5.7 | 4.3 |
DeepWalk | 33.7 | 8.9 | 9.2 |
DNGR | 32.6 | 18.0 | 4.3 |
DEC | 55.9 | 28.3 | 28.1 |
TADW | 45.5 | 29.1 | 22.8 |
GAE | 45.6 | 22.1 | 19.1 |
VGAE | 46.7 | 26.1 | 20.6 |
ARGE | 57.3 | 35.0 | 34.1 |
ARVGE | 54.4 | 26.1 | 24.5 |
DAEGC | 67.2 | 39.7 | 41.0 |
GC-VGE | 66.6 | 40.9 | 41.5 |
SDCN | 66.0 | 38.7 | 40.2 |
AGC | 67.0 | 41.1 | 41.9 |
EGAE | 67.4 | 41.2 | 43.2 |
GALA | 69.3 | 44.1 | 44.6 |
SSGAE | 71.2 | 45.2 | 46.9 |
DSAGC | 72.7 | 44.9 | 47.3 |
Method | ACC | NMI | ARI |
---|---|---|---|
k-means | 40.4 | 42.9 | 15.0 |
Spectral clustering | 22.0 | 18.2 | 1.5 |
Graph encoder | 20.7 | 12.1 | 0.5 |
DeepWalk | 38.5 | 32.4 | 17.3 |
DNGR | 37.6 | 35.9 | 18.0 |
DEC | 40.0 | 41.1 | 25.6 |
TADW | 31.0 | 27.1 | 4.5 |
GAE | 37.9 | 34.5 | 18.9 |
VGAE | 45.1 | 46.8 | 26.3 |
ARGE | 38.1 | 34.5 | 26.3 |
ARVGE | 38.7 | 33.9 | 10.7 |
DAEGC | 48.2 | 44.8 | 33.1 |
GC-VGE | 48.8 | 47.6 | 28.4 |
SDCN | 44.3 | 42.0 | 28.8 |
AGC | 47.7 | 45.3 | 34.3 |
EGAE | 51.5 | 48.0 | 33.1 |
GALA | 54.5 | 50.4 | 38.9 |
SSGAE | 57.1 | 52.2 | 35.3 |
DSAGC | 63.4 | 55.7 | 42.5 |
Method | ACC | NMI | ARI |
---|---|---|---|
k-means | 59.5 | 31.5 | 28.1 |
Spectral clustering | 52.8 | 9.7 | 6.2 |
Graph encoder | 53.1 | 21.0 | 18.4 |
DeepWalk | 54.3 | 10.2 | 8.8 |
DNGR | 46.8 | 15.3 | 5.9 |
DEC | 60.1 | 22.4 | 19.6 |
TADW | 51.1 | 24.4 | 21.7 |
GAE | 63.2 | 24.9 | 24.6 |
VGAE | 61.9 | 21.6 | 20.1 |
ARGE | 68.1 | 27.6 | 29.1 |
ARVGE | 51.3 | 11.7 | 7.8 |
DAEGC | 67.1 | 26.6 | 27.8 |
GC-VGE | 68.2 | 29.7 | 29.8 |
SDCN | 64.2 | 22.9 | 22.3 |
AGC | 69.8 | 31.6 | 31.9 |
EGAE | 70.6 | 32.0 | 33.0 |
GALA | 69.4 | 32.7 | 32.1 |
SSGAE | 69.0 | 28.8 | 29.5 |
DSAGC | 70.7 | 31.2 | 32.6 |
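
For readers reproducing the ACC column of the result tables above, the following is a minimal sketch of clustering accuracy. It assumes ACC follows the convention common in the graph clustering literature: predicted cluster ids are matched one-to-one to ground-truth classes with the Hungarian algorithm, and the fraction of correctly matched nodes is reported. The source does not spell out its exact evaluation code, so the function name `clustering_accuracy` and this matching convention are assumptions made for illustration.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def clustering_accuracy(y_true, y_pred):
    """Clustering accuracy under the best one-to-one cluster/class matching.

    Assumption: ACC uses Hungarian matching, as is common for clustering
    benchmarks; this is an illustrative helper, not the paper's own code.
    """
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()

    # Re-index arbitrary label values to 0..k-1 on both sides.
    true_ids, true_idx = np.unique(y_true, return_inverse=True)
    pred_ids, pred_idx = np.unique(y_pred, return_inverse=True)

    # contingency[i, j] = number of nodes in predicted cluster i with true class j.
    contingency = np.zeros((pred_ids.size, true_ids.size), dtype=np.int64)
    np.add.at(contingency, (pred_idx, true_idx), 1)

    # Pick the cluster-to-class assignment that maximizes the matched count.
    rows, cols = linear_sum_assignment(contingency, maximize=True)
    return contingency[rows, cols].sum() / y_true.size


if __name__ == "__main__":
    truth = [0, 0, 1, 1, 2, 2]
    # Same partition as the ground truth, with permuted cluster ids.
    print(clustering_accuracy(truth, [1, 1, 0, 0, 2, 2]))
    # One node placed in the wrong cluster.
    print(clustering_accuracy(truth, [1, 1, 0, 2, 2, 2]))
```

Because the score is invariant to how cluster ids are numbered, a prediction that recovers the true partition under different ids still scores 1.0. The tables report percentages, so the returned fraction would be multiplied by 100 for comparison.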
Activation layers | Citeseer | Cora | Pubmed | Wiki |
---|---|---|---|---|
\(L_{1}+L_{2}+L_{3}+L_{4}\) | \(66.09\pm 0.85\) | \(64.08\pm 3.05\) | \(61.23\pm 2.07\) | \(52.72\pm 1.00\) |
\(L_{1}+L_{4}\) | \(67.98\pm 0.71\) | \(71.92\pm 1.25\) | \(65.55\pm 2.04\) | \(55.94\pm 1.05\) |