Skip to main content
Top
Published in: Neural Computing and Applications 14/2022

06-03-2022 | Original Article

Clustering uncertain graphs using ant colony optimization (ACO)

Authors: Syed Fawad Hussain, Ifra Arif Butt, Muhammad Hanif, Sajid Anwar

Published in: Neural Computing and Applications | Issue 14/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In deterministic graphs, an edge between two vertices denotes a certain link. In contrast, in probabilistic graph, a link between two vertices merely implies the possibility of its existence based on probability. Probabilistic data results from uncertainties due to the preprocessing, data collection process, or the inherent nature of the problem which results in uncertain outcomes. These types of graphs are common in the real-world applications such as protein–protein interactions and identifying links in social media. Clustering probabilistic graphs is a challenging task since computing traditional metrics (like distance, paths, etc.) will all be probabilistic. Therefore, determining a valid clustering or making the data deterministic is an important research problem. We propose a new clustering algorithm for probabilistic graphs using the ant colony optimization (ACO) technique. The algorithm uses multiple versions of the probabilistic graph and employs a modified ACO to optimize the objective function. Moreover, heuristics are proposed to guide the algorithm for better accuracy and faster convergence. The proposed approach is tested against two real-world probabilistic graphs and five synthetic datasets using multiple cluster validity indices. Results show that ACO with heuristic guidance can produce good solutions that are comparable to or better than other traditional approaches.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Gotlieb CC, Kumar S (1968) Semantic clustering of index terms. J ACM (JACM) 15:493–513CrossRef Gotlieb CC, Kumar S (1968) Semantic clustering of index terms. J ACM (JACM) 15:493–513CrossRef
2.
go back to reference Pacheco TM, Gonçalves LB, Ströele V, Soares SSR (2018) An ant colony optimization for automatic data clustering problem. In: 2018 IEEE congress on evolutionary computation (CEC). IEEE, pp 1–8 Pacheco TM, Gonçalves LB, Ströele V, Soares SSR (2018) An ant colony optimization for automatic data clustering problem. In: 2018 IEEE congress on evolutionary computation (CEC). IEEE, pp 1–8
3.
go back to reference Hussain SF, Haris M (2019) A k-means based co-clustering (kCC) algorithm for sparse, high dimensional data. Expert Syst Appl 118:20–34CrossRef Hussain SF, Haris M (2019) A k-means based co-clustering (kCC) algorithm for sparse, high dimensional data. Expert Syst Appl 118:20–34CrossRef
4.
go back to reference Hussain SF (2011) Bi-clustering gene expression data using co-similarity. In: Proceedings of the international conferences on advanced data mining and applications (ADMA). Beijing, China, pp 190–200 Hussain SF (2011) Bi-clustering gene expression data using co-similarity. In: Proceedings of the international conferences on advanced data mining and applications (ADMA). Beijing, China, pp 190–200
5.
go back to reference Zhao B, Wang J, Li M et al (2014) Detecting protein complexes based on uncertain graph model. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 11:486–497CrossRef Zhao B, Wang J, Li M et al (2014) Detecting protein complexes based on uncertain graph model. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 11:486–497CrossRef
6.
go back to reference Vu K, Zheng R (2011) Robust coverage under uncertainty in wireless sensor networks. In: Proceedings of IEEE international conference on computer communications (INFOCOM). IEEE, pp 2015–2023 Vu K, Zheng R (2011) Robust coverage under uncertainty in wireless sensor networks. In: Proceedings of IEEE international conference on computer communications (INFOCOM). IEEE, pp 2015–2023
7.
go back to reference Ahmed NM, Chen L (2016) An efficient algorithm for link prediction in temporal uncertain social networks. Inf Sci 331:120–136MathSciNetCrossRef Ahmed NM, Chen L (2016) An efficient algorithm for link prediction in temporal uncertain social networks. Inf Sci 331:120–136MathSciNetCrossRef
8.
go back to reference Chen X, Chen M, Shi W et al (2019) Embedding uncertain knowledge graphs. In: Proceedings of the AAAI conference on artificial intelligence pp 3363–3370 Chen X, Chen M, Shi W et al (2019) Embedding uncertain knowledge graphs. In: Proceedings of the AAAI conference on artificial intelligence pp 3363–3370
9.
go back to reference Halim Z, Waqas M, Hussain SF (2015) Clustering large probabilistic graphs using multi-population evolutionary algorithm. Inf Sci 317:78–95CrossRef Halim Z, Waqas M, Hussain SF (2015) Clustering large probabilistic graphs using multi-population evolutionary algorithm. Inf Sci 317:78–95CrossRef
10.
go back to reference İnkaya T, Kayalıgil S, Özdemirel NE (2015) Ant colony optimization based clustering methodology. Appl Soft Comput 28:301–311CrossRef İnkaya T, Kayalıgil S, Özdemirel NE (2015) Ant colony optimization based clustering methodology. Appl Soft Comput 28:301–311CrossRef
11.
go back to reference Shelokar PS, Jayaraman VK, Kulkarni BD (2004) An ant colony approach for clustering. Anal Chim Acta 509:187–195CrossRef Shelokar PS, Jayaraman VK, Kulkarni BD (2004) An ant colony approach for clustering. Anal Chim Acta 509:187–195CrossRef
12.
go back to reference Jahanshahi M, Maleki E, Ghiami A (2017) On the efficiency of artificial neural networks for plastic analysis of planar frames in comparison with genetic algorithms and ant colony systems. Neural Comput Appl 28:3209–3227CrossRef Jahanshahi M, Maleki E, Ghiami A (2017) On the efficiency of artificial neural networks for plastic analysis of planar frames in comparison with genetic algorithms and ant colony systems. Neural Comput Appl 28:3209–3227CrossRef
13.
go back to reference AlFarraj O, AlZubi A, Tolba A (2019) Optimized feature selection algorithm based on fireflies with gravitational ant colony algorithm for big data predictive analytics. Neural Comput Appl 31:1391–1403CrossRef AlFarraj O, AlZubi A, Tolba A (2019) Optimized feature selection algorithm based on fireflies with gravitational ant colony algorithm for big data predictive analytics. Neural Comput Appl 31:1391–1403CrossRef
14.
go back to reference Gao W, Hu L, Zhang P (2018) Class-specific mutual information variation for feature selection. Pattern Recogn 79:328–339CrossRef Gao W, Hu L, Zhang P (2018) Class-specific mutual information variation for feature selection. Pattern Recogn 79:328–339CrossRef
15.
go back to reference Agrawal P, Sarma AD, Ullman J, Widom J (2010) Foundations of uncertain-data integration. In: Proceedings of the VLDB endowment 3, pp 1080–1090 Agrawal P, Sarma AD, Ullman J, Widom J (2010) Foundations of uncertain-data integration. In: Proceedings of the VLDB endowment 3, pp 1080–1090
16.
go back to reference Aggarwal CC (2013) A survey of uncertain data clustering algorithms. Taylor and Francis, EnglandCrossRef Aggarwal CC (2013) A survey of uncertain data clustering algorithms. Taylor and Francis, EnglandCrossRef
17.
go back to reference Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings 2nd international conference on knowledge discovery and data mining (KDD), pp 226–231 Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings 2nd international conference on knowledge discovery and data mining (KDD), pp 226–231
18.
go back to reference Kriegel H-P, Pfeifle M (2005) Density-based clustering of uncertain data. In: Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining, pp 672–677 Kriegel H-P, Pfeifle M (2005) Density-based clustering of uncertain data. In: Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining, pp 672–677
19.
go back to reference Kriegel H-P, Pfeifle M (2005) Hierarchical density-based clustering of uncertain data. In: Fifth IEEE international conference on data mining (ICDM’05) IEEE, p 4 Kriegel H-P, Pfeifle M (2005) Hierarchical density-based clustering of uncertain data. In: Fifth IEEE international conference on data mining (ICDM’05) IEEE, p 4
20.
go back to reference Ankerst M, Breunig MM, Kriegel HP, Sander J (1999) OPTICS: ordering points to identify the clustering structure. In: Proceedings of the ACM SIGMOD 28, pp 49–60 Ankerst M, Breunig MM, Kriegel HP, Sander J (1999) OPTICS: ordering points to identify the clustering structure. In: Proceedings of the ACM SIGMOD 28, pp 49–60
21.
go back to reference Chau M, Cheng R, Kao B, Ng J (2006) Uncertain data mining: an example in clustering location data. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 199–204 Chau M, Cheng R, Kao B, Ng J (2006) Uncertain data mining: an example in clustering location data. In: Pacific-Asia conference on knowledge discovery and data mining. Springer, pp 199–204
22.
go back to reference Ngai WK, Kao B, Chui CK et al (2006) Efficient clustering of uncertain data. In: Sixth international conference on data mining (ICDM’06). IEEE, pp 436–445 Ngai WK, Kao B, Chui CK et al (2006) Efficient clustering of uncertain data. In: Sixth international conference on data mining (ICDM’06). IEEE, pp 436–445
23.
go back to reference Cormode G, McGregor A (2008) Approximation algorithms for clustering uncertain data. In: Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems pp 191–200 Cormode G, McGregor A (2008) Approximation algorithms for clustering uncertain data. In: Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on principles of database systems pp 191–200
24.
go back to reference Foggia P, Percannella G, Sansone C, Vento M (2007) A graph-based clustering method and its applications. In: International symposium on brain, vision, and artificial intelligence. Springer, pp 277–287 Foggia P, Percannella G, Sansone C, Vento M (2007) A graph-based clustering method and its applications. In: International symposium on brain, vision, and artificial intelligence. Springer, pp 277–287
25.
go back to reference Pfeiffer, J. and Neville, J., (2011) Methods to determine node centrality and clustering in graphs with uncertain structure. In Proceedings of the International AAAI Conference on Web and Social Media (Vol. 5, No. 1, pp. 590-593). Pfeiffer, J. and Neville, J., (2011) Methods to determine node centrality and clustering in graphs with uncertain structure. In Proceedings of the International AAAI Conference on Web and Social Media (Vol. 5, No. 1, pp. 590-593).
26.
go back to reference Pelekis N, Kopanakis I, Kotsifakos EE et al (2011) Clustering uncertain trajectories. Knowl Inf Syst 28:117–147CrossRef Pelekis N, Kopanakis I, Kotsifakos EE et al (2011) Clustering uncertain trajectories. Knowl Inf Syst 28:117–147CrossRef
27.
go back to reference Di Mauro N, Taranto C, Esposito F (2014) Link classification with probabilistic graphs. J Intell Inf Syst 42:181–206CrossRef Di Mauro N, Taranto C, Esposito F (2014) Link classification with probabilistic graphs. J Intell Inf Syst 42:181–206CrossRef
28.
go back to reference Kollios G, Potamias M, Terzi E (2011) Clustering large probabilistic graphs. IEEE Trans Knowl Data Eng 25:325–336CrossRef Kollios G, Potamias M, Terzi E (2011) Clustering large probabilistic graphs. IEEE Trans Knowl Data Eng 25:325–336CrossRef
29.
go back to reference Symeonidis P, Iakovidou N, Mantas N, Manolopoulos Y (2013) From biological to social networks: link prediction based on multi-way spectral clustering. Data Knowl Eng 87:226–242CrossRef Symeonidis P, Iakovidou N, Mantas N, Manolopoulos Y (2013) From biological to social networks: link prediction based on multi-way spectral clustering. Data Knowl Eng 87:226–242CrossRef
30.
go back to reference Halim Z, Waqas M, Baig AR, Rashid A (2017) Efficient clustering of large uncertain graphs using neighborhood information. Int J Approx Reason 90:274–291MathSciNetCrossRef Halim Z, Waqas M, Baig AR, Rashid A (2017) Efficient clustering of large uncertain graphs using neighborhood information. Int J Approx Reason 90:274–291MathSciNetCrossRef
31.
go back to reference Dadaneh BZ, Markid HY, Zakerolhosseini A (2016) Unsupervised probabilistic feature selection using ant colony optimization. Expert Syst Appl 53:27–42CrossRef Dadaneh BZ, Markid HY, Zakerolhosseini A (2016) Unsupervised probabilistic feature selection using ant colony optimization. Expert Syst Appl 53:27–42CrossRef
32.
go back to reference Kassiano V, Gounaris A, Papadopoulos AN, Tsichlas K (2016) Mining uncertain graphs: an overview. In: International workshop of algorithmic aspects of cloud computing. Springer, pp 87–116 Kassiano V, Gounaris A, Papadopoulos AN, Tsichlas K (2016) Mining uncertain graphs: an overview. In: International workshop of algorithmic aspects of cloud computing. Springer, pp 87–116
33.
go back to reference Ceccarello M, Fantozzi C, Pietracaprina A et al (2017) Clustering uncertain graphs. In: Proceedings of the VLDB endowment 11, pp 472–484 Ceccarello M, Fantozzi C, Pietracaprina A et al (2017) Clustering uncertain graphs. In: Proceedings of the VLDB endowment 11, pp 472–484
34.
go back to reference Han K, Gui F, Xiao X et al (2019) Efficient and effective algorithms for clustering uncertain graphs. In: Proceedings of the VLDB endowment 12, pp 667–680 Han K, Gui F, Xiao X et al (2019) Efficient and effective algorithms for clustering uncertain graphs. In: Proceedings of the VLDB endowment 12, pp 667–680
35.
go back to reference Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Networks 16:645–678CrossRef Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Networks 16:645–678CrossRef
36.
go back to reference Buhmann JM (2003) Data clustering and learning. In: The handbook of brain theory and neural networks, pp 278–281 Buhmann JM (2003) Data clustering and learning. In: The handbook of brain theory and neural networks, pp 278–281
37.
go back to reference Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv (CSUR) 31:264–323CrossRef Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv (CSUR) 31:264–323CrossRef
38.
go back to reference Hussain SF, Iqbal S (2018) CCGA: co-similarity based co-clustering using genetic algorithm. Appl Soft Comput 72:30–42CrossRef Hussain SF, Iqbal S (2018) CCGA: co-similarity based co-clustering using genetic algorithm. Appl Soft Comput 72:30–42CrossRef
39.
go back to reference Gambardella LM, Dorigo M (2000) An ant colony system hybridized with a new local search for the sequential ordering problem. Informs J Comput 12:237–255MathSciNetCrossRef Gambardella LM, Dorigo M (2000) An ant colony system hybridized with a new local search for the sequential ordering problem. Informs J Comput 12:237–255MathSciNetCrossRef
40.
go back to reference Stutzle T, Hoos H (1997) Max-min ant system and local search for combinatorial optimization. In: 2nd international conference on metaheuristics, Sophie-Antipolis, France Stutzle T, Hoos H (1997) Max-min ant system and local search for combinatorial optimization. In: 2nd international conference on metaheuristics, Sophie-Antipolis, France
41.
go back to reference Chiaravalloti AD, Greco G, Guzzo A, Pontieri L (2006) An information-theoretic framework for high-order co-clustering of heterogeneous objects. Lect Notes Comput Sci 4212:598CrossRef Chiaravalloti AD, Greco G, Guzzo A, Pontieri L (2006) An information-theoretic framework for high-order co-clustering of heterogeneous objects. Lect Notes Comput Sci 4212:598CrossRef
42.
go back to reference Davis JV, Kulis B, Jain P et al (2007) Information-theoretic metric learning. In: Proceedings of the 24th international conference on Machine learning. p 216 Davis JV, Kulis B, Jain P et al (2007) Information-theoretic metric learning. In: Proceedings of the 24th international conference on Machine learning. p 216
43.
go back to reference Shang C, Li M, Feng S et al (2013) Feature selection via maximizing global information gain for text classification. Knowl Based Syst 54:298–309CrossRef Shang C, Li M, Feng S et al (2013) Feature selection via maximizing global information gain for text classification. Knowl Based Syst 54:298–309CrossRef
45.
go back to reference Krogan NJ, Cagney G, Yu H et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440:637–643CrossRef Krogan NJ, Cagney G, Yu H et al (2006) Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440:637–643CrossRef
46.
go back to reference Dhillon IS (2001) Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining, pp 269–274 Dhillon IS (2001) Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining, pp 269–274
47.
go back to reference Hussain SF (2019) A novel robust kernel for classifying high-dimensional data using support vector machines. Expert Syst Appl 131:116–131CrossRef Hussain SF (2019) A novel robust kernel for classifying high-dimensional data using support vector machines. Expert Syst Appl 131:116–131CrossRef
48.
go back to reference Glenn TC, Zare A, Gader PD (2014) Bayesian fuzzy clustering. IEEE Trans Fuzzy Syst 23:1545–1561CrossRef Glenn TC, Zare A, Gader PD (2014) Bayesian fuzzy clustering. IEEE Trans Fuzzy Syst 23:1545–1561CrossRef
49.
go back to reference Hussain SF, Pervaiz A, Hussain M (2020) Co-clustering optimization using artificial bee colony (ABC) algorithm. Appl Soft Comput 97:106725CrossRef Hussain SF, Pervaiz A, Hussain M (2020) Co-clustering optimization using artificial bee colony (ABC) algorithm. Appl Soft Comput 97:106725CrossRef
50.
go back to reference Li M (2015) Efficiency improvement of ant colony optimization in solving the moderate LTSP. J Syst Eng Electron 26(6):1300–1308CrossRef Li M (2015) Efficiency improvement of ant colony optimization in solving the moderate LTSP. J Syst Eng Electron 26(6):1300–1308CrossRef
Metadata
Title
Clustering uncertain graphs using ant colony optimization (ACO)
Authors
Syed Fawad Hussain
Ifra Arif Butt
Muhammad Hanif
Sajid Anwar
Publication date
06-03-2022
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 14/2022
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-022-07063-1

Other articles of this Issue 14/2022

Neural Computing and Applications 14/2022 Go to the issue

Premium Partner