Skip to main content

2016 | OriginalPaper | Buchkapitel

Soft and Adaptive Aggregation of Heterogeneous Graphs with Heterogeneous Attributes

verfasst von : Amine Louati, Marie-Aude Aufaure, Etienne Cuvelier, Bruno Pimentel

Erschienen in: Semantic Web Collaborative Spaces

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the enterprise context, people need to exploit, interpret and mainly visualize different types of interactions between heterogeneous objects. Graph model is an appropriate way to represent those interactions. Nodes represent the individuals or objects and edges represent the relationships between them. However, extracted graphs are in general heterogeneous and large sized which makes it difficult to visualize and to analyze easily. An adaptive aggregation operation is needed to have more understandable graphs in order to allow users discovering underlying information and hidden relationships between objects. Existing graph summarization approaches such as k-SNAP are carried out in homogeneous graphs where nodes are described by the same list of attributes that represent only one community. The aim of this work is to propose a general tool for graph aggregation which addresses both homogeneous and heterogeneous graphs. To do that, we develop a new soft and adaptive approach to aggregate heterogeneous graphs (i.e., composed of different node attributes and different relationship types) using the definition of Rough Set Theory (RST) combined with Formal Concept Analysis (FCA), the well known K-Medoids and the hierarchical clustering methods. Aggregated graphs are produced according to user-selected node attributes and relationships. To evaluate the quality of the obtained summaries, we propose two quality measures that evaluate respectively the similarity and the separability in groups based on the notion of common neighbor nodes. Experimental results demonstrate that our approach is effective for its ability to produce a high quality solution with relevant interpretations.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Freeman, L.: A set of measures of centrality based upon betweenness. Sociometry 40, 35–41 (1977)CrossRef Freeman, L.: A set of measures of centrality based upon betweenness. Sociometry 40, 35–41 (1977)CrossRef
2.
Zurück zum Zitat Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. JProc. Natl. Acad. Sci. USA 99(12), 7821–7826 (2002)MathSciNetCrossRefMATH Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. JProc. Natl. Acad. Sci. USA 99(12), 7821–7826 (2002)MathSciNetCrossRefMATH
3.
Zurück zum Zitat Newman, M., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E. 69(2), 026113 (2004)CrossRef Newman, M., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E. 69(2), 026113 (2004)CrossRef
6.
Zurück zum Zitat Yan, X., Han, J.: gspan: Graph-based substructure pattern mining. In: ICDM, pp. 721–724(2002) Yan, X., Han, J.: gspan: Graph-based substructure pattern mining. In: ICDM, pp. 721–724(2002)
7.
Zurück zum Zitat Sun, Y., Aggarwal, C.C., Han, J.: Relation strength-aware clustering of heterogeneous information networks with incomplete attributes. Proc. VLDB Endow. 5(5), 394–405 (2012)CrossRef Sun, Y., Aggarwal, C.C., Han, J.: Relation strength-aware clustering of heterogeneous information networks with incomplete attributes. Proc. VLDB Endow. 5(5), 394–405 (2012)CrossRef
8.
Zurück zum Zitat Tian, Y., Hankins, R.A., Pate, l.J.M.: Efficient aggregation for graph summarization. In: SIGMOD, pp. 567–580. ACM (2008) Tian, Y., Hankins, R.A., Pate, l.J.M.: Efficient aggregation for graph summarization. In: SIGMOD, pp. 567–580. ACM (2008)
9.
Zurück zum Zitat Soussi, R., Aufaure, M.A., Zghal, H.B.: Towards social network extraction using a graph database. In: DBKDA, pp. 28–34. IEEE Computer Society (2010) Soussi, R., Aufaure, M.A., Zghal, H.B.: Towards social network extraction using a graph database. In: DBKDA, pp. 28–34. IEEE Computer Society (2010)
11.
Zurück zum Zitat MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Cam, L.M.L., Neyman, J. (eds.): Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press (1967) MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Cam, L.M.L., Neyman, J. (eds.): Proc. of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press (1967)
12.
Zurück zum Zitat Newman, M.E.J.: Detecting community structure in networks. Eur. Phys. J. B 38, 321–330 (2004)CrossRef Newman, M.E.J.: Detecting community structure in networks. Eur. Phys. J. B 38, 321–330 (2004)CrossRef
13.
Zurück zum Zitat Rodrigues Jr., J.F., Traina, A.J.M., Faloutsos, C., Traina Jr., C.: Supergraph visualization. In: ISM 2006: Proceedings of the Eighth IEEE International Symposium on Multimedia, pp. 227–234. IEEE Computer Society (2006) Rodrigues Jr., J.F., Traina, A.J.M., Faloutsos, C., Traina Jr., C.: Supergraph visualization. In: ISM 2006: Proceedings of the Eighth IEEE International Symposium on Multimedia, pp. 227–234. IEEE Computer Society (2006)
14.
Zurück zum Zitat Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE TPAMI 22(8), 888–905 (2000)CrossRef Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE TPAMI 22(8), 888–905 (2000)CrossRef
15.
Zurück zum Zitat Ng, A., Jordan, M., Weiss, Y., Dietterich, T., Becker, S., Ghahramani, Z.: Advances in Neural Information Processing Systems. MIT Press, Cambridge (2002) Ng, A., Jordan, M., Weiss, Y., Dietterich, T., Becker, S., Ghahramani, Z.: Advances in Neural Information Processing Systems. MIT Press, Cambridge (2002)
17.
Zurück zum Zitat Chakrabarti, D., Faloutsos, C., Zhan, Y.: Visualization of large networks with min-cut plots, A-plots and R-MAT. Int. J. Hum.-Comput. Stud. 65(5), 434–445 (2007)CrossRef Chakrabarti, D., Faloutsos, C., Zhan, Y.: Visualization of large networks with min-cut plots, A-plots and R-MAT. Int. J. Hum.-Comput. Stud. 65(5), 434–445 (2007)CrossRef
18.
Zurück zum Zitat Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998)CrossRef Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393(6684), 440–442 (1998)CrossRef
19.
Zurück zum Zitat Ren, X., Wang, Y., Yu, X., Yan, J., Chen, Z., Han, J.: Heterogeneous graph-based intent learning with queries, web pages and wikipedia concepts. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 23–32. ACM (2014) Ren, X., Wang, Y., Yu, X., Yan, J., Chen, Z., Han, J.: Heterogeneous graph-based intent learning with queries, web pages and wikipedia concepts. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 23–32. ACM (2014)
20.
Zurück zum Zitat Wei, L., Qi, J.J.: Relation between concept lattice reduction and rough set reduction. Knowl.-Based Syst. 23(8), 934–938 (2010)CrossRef Wei, L., Qi, J.J.: Relation between concept lattice reduction and rough set reduction. Knowl.-Based Syst. 23(8), 934–938 (2010)CrossRef
21.
Zurück zum Zitat Shi, C., Niu, Z., Wang, T.: Considering the relationship between RST and FCA. In: WKDD, pp. 224–227. IEEE Computer Society (2010) Shi, C., Niu, Z., Wang, T.: Considering the relationship between RST and FCA. In: WKDD, pp. 224–227. IEEE Computer Society (2010)
22.
Zurück zum Zitat Stumme, G.: Formal concept analysis. In: Handbook on Ontologies 2009, pp. 177–199 (2009) Stumme, G.: Formal concept analysis. In: Handbook on Ontologies 2009, pp. 177–199 (2009)
23.
Zurück zum Zitat Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)CrossRef Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)CrossRef
24.
Zurück zum Zitat Duda, R.O., Stork, D.G., Hart, P.E.: Pattern Classification. Wiley, New York; Chichester (2000)MATH Duda, R.O., Stork, D.G., Hart, P.E.: Pattern Classification. Wiley, New York; Chichester (2000)MATH
25.
Zurück zum Zitat Gan, G., Ma, C., Wu, J.: Data Clustering - Theory, Algorithms, and Applications. SIAM, Philadelphia (2007)CrossRefMATH Gan, G., Ma, C., Wu, J.: Data Clustering - Theory, Algorithms, and Applications. SIAM, Philadelphia (2007)CrossRefMATH
26.
Zurück zum Zitat Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell (1981)CrossRefMATH Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell (1981)CrossRefMATH
27.
Zurück zum Zitat Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)CrossRef Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)CrossRef
28.
Zurück zum Zitat Salvador, S., Chan, P.: Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In: ICTAI, pp. 576–584. IEEE Computer Society (2004) Salvador, S., Chan, P.: Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms. In: ICTAI, pp. 576–584. IEEE Computer Society (2004)
29.
Zurück zum Zitat Adamic, L.A., Glance, N.: The political blogosphere and the 2004 U.S. election: Divided they blog. In: LinkKDD, pp. 36–43. ACM (2005) Adamic, L.A., Glance, N.: The political blogosphere and the 2004 U.S. election: Divided they blog. In: LinkKDD, pp. 36–43. ACM (2005)
Metadaten
Titel
Soft and Adaptive Aggregation of Heterogeneous Graphs with Heterogeneous Attributes
verfasst von
Amine Louati
Marie-Aude Aufaure
Etienne Cuvelier
Bruno Pimentel
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-32667-2_7

Premium Partner