Skip to main content
Top
Published in: Cluster Computing 3/2019

01-03-2018

Efficient computation of the transitive closure size

Authors: Xian Tang, Ziyang Chen, Kai Li, Xiang Liu

Published in: Cluster Computing | Special Issue 3/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Given a directed graph and a node u, the transitive closure (TC) of u(TC(u)) is the set of nodes that u can reach in the graph, and the size of u’s transitive closure is to get the number of nodes in TC(u). Considering that computing the size of TC (TC-size computation or detection) is very important in many applications, and existing approaches on TC-size computation cannot scale to large graphs appeared in recent applications, we propose a path decomposition based algorithm, namely buTC, for TC-size computation with linear space complexity. buTC first gets a set of paths by path decomposition, then process nodes in each path in bottom-up manner. The benefit of buTC is that it can utilize the reachable relationship between nodes in a path to avoid the repeatedly visiting of many nodes, such that to guarantee that after processing all nodes in a single path, all involved edges are visited only once. Considering that buTC processes each path independently, we further propose an optimized algorithm, namely buTC+, to avoid the redundant operation of buTC. buTC+ does not need to do path decomposition, it uses a stack to buffer processed nodes with unprocessed in-neighbors, such that to utilize the reachable relationship between nodes in different paths of buTC to avoid the redundant computation. We conduct rich experiment on 26 real datasets and 5 synthetic datasets. The experimental results confirm that both buTC and buTC+ has better space and time scalability, and can be scaled to large graphs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aho, A.V., Garey, M.R., Ullman, J.D.: The transitive reduction of a directed graph. SIAM J. Comput. 1(2), 131–137 (1972)MathSciNetCrossRef Aho, A.V., Garey, M.R., Ullman, J.D.: The transitive reduction of a directed graph. SIAM J. Comput. 1(2), 131–137 (1972)MathSciNetCrossRef
2.
go back to reference Alves, C.E.R., Cáceres, E.N., de Castro, A.A., Song, S.W., Szwarcfiter, J.L.: Parallel transitive closure algorithm. J. Braz. Comp. Soc. 19(2), 161–166 (2013)MathSciNetCrossRef Alves, C.E.R., Cáceres, E.N., de Castro, A.A., Song, S.W., Szwarcfiter, J.L.: Parallel transitive closure algorithm. J. Braz. Comp. Soc. 19(2), 161–166 (2013)MathSciNetCrossRef
3.
go back to reference Chen, Y., Chen, Y.: Decomposing dags into spanning trees: A new way to compress transitive closures. In: Proceedings of the 27th International Conference on Data Engineering, ICDE 2011, April 11–16, 2011, Hannover, Germany, pp. 1007–1018 (2011) Chen, Y., Chen, Y.: Decomposing dags into spanning trees: A new way to compress transitive closures. In: Proceedings of the 27th International Conference on Data Engineering, ICDE 2011, April 11–16, 2011, Hannover, Germany, pp. 1007–1018 (2011)
4.
go back to reference Cheng, J., Huang, S., Wu,H., Fu, A.W.: Tf-label: a topological-folding labeling scheme for reachability querying in a large graph. In: SIGMOD, pp. 193–204 (2013) Cheng, J., Huang, S., Wu,H., Fu, A.W.: Tf-label: a topological-folding labeling scheme for reachability querying in a large graph. In: SIGMOD, pp. 193–204 (2013)
5.
go back to reference Cohen, E.: Estimating the size of the transitive closure in linear time. In: 35th Annual Symposium on Foundations of Computer Science, Santa Fe, New Mexico, USA, 20–22 November 1994, pp. 190–200 (1994) Cohen, E.: Estimating the size of the transitive closure in linear time. In: 35th Annual Symposium on Foundations of Computer Science, Santa Fe, New Mexico, USA, 20–22 November 1994, pp. 190–200 (1994)
6.
go back to reference Dar, S.: Augmenting Databases with Generalized Transitive Closure. PhD thesis, Univ. ofWisconsin-Madison (1993) Dar, S.: Augmenting Databases with Generalized Transitive Closure. PhD thesis, Univ. ofWisconsin-Madison (1993)
7.
go back to reference Ding, C.H.Q., He, X., Peng, H.: Finding cliques in protein interaction networks via transitive closure of a weighted graph. In: Proceedings of the 5th international workshop on Bioinformatics, BIOKDD 2005, Chicago, Illinois, USA, August 21, 2005, pp. 69–75 (2005) Ding, C.H.Q., He, X., Peng, H.: Finding cliques in protein interaction networks via transitive closure of a weighted graph. In: Proceedings of the 5th international workshop on Bioinformatics, BIOKDD 2005, Chicago, Illinois, USA, August 21, 2005, pp. 69–75 (2005)
8.
go back to reference Fletcher, G.H.L., Gyssens, M., Leinders, D., den Bussche, J.V., Gucht, D.V., Vansummeren, S., Wu., Y.: The impact of transitive closure on the boolean expressiveness of navigational query languages on graphs. In: Foundations of Information and Knowledge Systems—7th International Symposium, FoIKS 2012, Kiel, Germany, March 5-9, 2012. Proceedings, pp. 124–143 (2012) Fletcher, G.H.L., Gyssens, M., Leinders, D., den Bussche, J.V., Gucht, D.V., Vansummeren, S., Wu., Y.: The impact of transitive closure on the boolean expressiveness of navigational query languages on graphs. In: Foundations of Information and Knowledge Systems—7th International Symposium, FoIKS 2012, Kiel, Germany, March 5-9, 2012. Proceedings, pp. 124–143 (2012)
9.
go back to reference Fletcher, G.H.L., Gyssens, M., Leinders, D., den Bussche, J.V., Gucht, D.V., Vansummeren, S., Wu, Y.: The impact of transitive closure on the expressiveness of navigational query languages on unlabeled graphs. Ann. Math. Artif. Intell. 73(1–2), 167–203 (2015)MathSciNetCrossRef Fletcher, G.H.L., Gyssens, M., Leinders, D., den Bussche, J.V., Gucht, D.V., Vansummeren, S., Wu, Y.: The impact of transitive closure on the expressiveness of navigational query languages on unlabeled graphs. Ann. Math. Artif. Intell. 73(1–2), 167–203 (2015)MathSciNetCrossRef
10.
go back to reference Gora lciková, A., Koubek, V.: A reduct-and-closure algorithm for graphs. In: Mathematical Foundations of Computer Science 1979, Proceedings, 8th Symposium, Olomouc, Czechoslovakia, September 3–7, 1979, pp. 301–307 (1979) Gora lciková, A., Koubek, V.: A reduct-and-closure algorithm for graphs. In: Mathematical Foundations of Computer Science 1979, Proceedings, 8th Symposium, Olomouc, Czechoslovakia, September 3–7, 1979, pp. 301–307 (1979)
11.
go back to reference Pfeiffer, J.J., Fond, T.L., Moreno, S., Neville, J.: Fast generation of large scale social networks while incorporating transitive closures. In: 2012 International Conference on Privacy, Security, Risk and Trust, PASSAT 2012, and 2012 International Confernece on Social Computing, SocialCom 2012, Amsterdam, Netherlands, September 3–5, 2012, pp. 154–165 (2012) Pfeiffer, J.J., Fond, T.L., Moreno, S., Neville, J.: Fast generation of large scale social networks while incorporating transitive closures. In: 2012 International Conference on Privacy, Security, Risk and Trust, PASSAT 2012, and 2012 International Confernece on Social Computing, SocialCom 2012, Amsterdam, Netherlands, September 3–5, 2012, pp. 154–165 (2012)
12.
go back to reference Jin, R., Ruan, N., Dey, S., Yu, j.X.: SCARAB: scaling reachability computation on large graphs. In SIGMOD, pp. 169–180 (2012) Jin, R., Ruan, N., Dey, S., Yu, j.X.: SCARAB: scaling reachability computation on large graphs. In SIGMOD, pp. 169–180 (2012)
13.
go back to reference Jin, R., Wang, G.: Simple, fast, and scalable reachability oracle. PVLDB 6(14), 1978–1989 (2013) Jin, R., Wang, G.: Simple, fast, and scalable reachability oracle. PVLDB 6(14), 1978–1989 (2013)
14.
go back to reference King, V.: Fully dynamic transitive closure. In: Encyclopedia of Algorithms, pp. 808–809 (2016)CrossRef King, V.: Fully dynamic transitive closure. In: Encyclopedia of Algorithms, pp. 808–809 (2016)CrossRef
15.
go back to reference Krommidas, I., Zaroliagis, C.D.: An experimental study of algorithms for fully dynamic transitive closure. ACM J. Exp. Algorithm 12, 16 (2008)MathSciNetMATH Krommidas, I., Zaroliagis, C.D.: An experimental study of algorithms for fully dynamic transitive closure. ACM J. Exp. Algorithm 12, 16 (2008)MathSciNetMATH
16.
go back to reference Niesink, P., Poulin, K., Sajna, M.: Computing transitive closure of bipolar weighted digraphs. Discret. Appl. Math. 161(1–2), 217–243 (2013)MathSciNetCrossRef Niesink, P., Poulin, K., Sajna, M.: Computing transitive closure of bipolar weighted digraphs. Discret. Appl. Math. 161(1–2), 217–243 (2013)MathSciNetCrossRef
17.
go back to reference Palkowski, M., Bielecki, W.: Optimizing numerical code by means of the transitive closure of dependence graphs. In: Proceedings of the 2017 Federated Conference on Computer Science and Information Systems, FedCSIS 2017, Prague, Czech Republic, September 3–6, 2017., pp. 523–526 (2017) Palkowski, M., Bielecki, W.: Optimizing numerical code by means of the transitive closure of dependence graphs. In: Proceedings of the 2017 Federated Conference on Computer Science and Information Systems, FedCSIS 2017, Prague, Czech Republic, September 3–6, 2017., pp. 523–526 (2017)
18.
19.
20.
go back to reference Seufert, S., Anand, A., Bedathur, S.J., Weikum, G.: FERRARI: flexible and efficient reachability range assignment for graph indexing. In: ICDE, pp. 1009–1020 (2013) Seufert, S., Anand, A., Bedathur, S.J., Weikum, G.: FERRARI: flexible and efficient reachability range assignment for graph indexing. In: ICDE, pp. 1009–1020 (2013)
21.
go back to reference Simon, K.: An improved algorithm for transitive closure on acyclic digraphs. Theor. Comput. Sci. 58, 325–346 (1988)MathSciNetCrossRef Simon, K.: An improved algorithm for transitive closure on acyclic digraphs. Theor. Comput. Sci. 58, 325–346 (1988)MathSciNetCrossRef
22.
go back to reference Su, J., Zhu, Q., Wei, H., Yu, J.X.: Reachability querying: can it be even faster? IEEE Trans. Knowl. Data Eng. 29(3), 683–697 (2017)CrossRef Su, J., Zhu, Q., Wei, H., Yu, J.X.: Reachability querying: can it be even faster? IEEE Trans. Knowl. Data Eng. 29(3), 683–697 (2017)CrossRef
24.
go back to reference ten Cate, B.: The expressivity of xpath with transitive closure. In: Proceedings of the Twenty-Fifth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 26–28, 2006, Chicago, Illinois, USA, pp. 328–337 (2006) ten Cate, B.: The expressivity of xpath with transitive closure. In: Proceedings of the Twenty-Fifth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 26–28, 2006, Chicago, Illinois, USA, pp. 328–337 (2006)
25.
go back to reference Veloso, R.R., Cerf, L., Junior, W.M., Zaki, M.J.: Reachability queries in very large graphs: A fast refined online search approach. In: EDBT, pp. 511–522 (2014) Veloso, R.R., Cerf, L., Junior, W.M., Zaki, M.J.: Reachability queries in very large graphs: A fast refined online search approach. In: EDBT, pp. 511–522 (2014)
26.
go back to reference Wan, J., Zhu, Q., Lei, D., Lu, J.: Outlier detection based on transitive closure. Intell. Data Anal. 19(1), 145–160 (2015)CrossRef Wan, J., Zhu, Q., Lei, D., Lu, J.: Outlier detection based on transitive closure. Intell. Data Anal. 19(1), 145–160 (2015)CrossRef
27.
go back to reference Wei, H., Yu, J.X., Lu, C., Jin, R.: Reachability querying: an independent permutation labeling approach. PVLDB 7(12), 1191–1202 (2014) Wei, H., Yu, J.X., Lu, C., Jin, R.: Reachability querying: an independent permutation labeling approach. PVLDB 7(12), 1191–1202 (2014)
28.
go back to reference Williams, V.V: Multiplying matrices faster than coppersmith-winograd. In STOC, pp. 887–898 (2012) Williams, V.V: Multiplying matrices faster than coppersmith-winograd. In STOC, pp. 887–898 (2012)
29.
go back to reference Yannakakis, M.: Graph-theoretic methods in database theory. In: Proceedings of the Ninth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, April 2–4, 1990, Nashville, Tennessee, USA, pp. 230–242 (1990) Yannakakis, M.: Graph-theoretic methods in database theory. In: Proceedings of the Ninth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, April 2–4, 1990, Nashville, Tennessee, USA, pp. 230–242 (1990)
30.
go back to reference Yano, Y., Akiba, T., Iwata, Y., Yoshida, Y.: Fast and scalable reachability queries on graphs by pruned labeling with landmarks and paths. In CIKM, pp. 1601–1606 (2013) Yano, Y., Akiba, T., Iwata, Y., Yoshida, Y.: Fast and scalable reachability queries on graphs by pruned labeling with landmarks and paths. In CIKM, pp. 1601–1606 (2013)
31.
go back to reference Yildirim, H., Chaoji, V., Zaki, M.J.: GRAIL: scalable reachability index for large graphs. PVLDB 3(1), 276–284 (2010) Yildirim, H., Chaoji, V., Zaki, M.J.: GRAIL: scalable reachability index for large graphs. PVLDB 3(1), 276–284 (2010)
32.
go back to reference Yildirim, H., Chaoji, V., Zaki, M.J.: GRAIL: a scalable index for reachability queries in very large graphs. VLDB J. 21(4), 509–534 (2012)CrossRef Yildirim, H., Chaoji, V., Zaki, M.J.: GRAIL: a scalable index for reachability queries in very large graphs. VLDB J. 21(4), 509–534 (2012)CrossRef
33.
go back to reference Zhou, J., Zhou, S., Yu, J.X., Wei, H., Chen, Z., Tang, X.: DAG reduction: Fast answering reachability queries. In: Proceedings of the 2017 ACM International Conference on Management of Data, SIGMOD Conference 2017, Chicago, IL, USA, May 14–19, 2017, pp. 375–390 (2017) Zhou, J., Zhou, S., Yu, J.X., Wei, H., Chen, Z., Tang, X.: DAG reduction: Fast answering reachability queries. In: Proceedings of the 2017 ACM International Conference on Management of Data, SIGMOD Conference 2017, Chicago, IL, USA, May 14–19, 2017, pp. 375–390 (2017)
34.
go back to reference Zhu, A.D., Lin, W., Wang, S., Xiao, X.: Reachability queries on large dynamic graphs: a total order approach. In: SIGMOD, pages 1323–1334 (2014) Zhu, A.D., Lin, W., Wang, S., Xiao, X.: Reachability queries on large dynamic graphs: a total order approach. In: SIGMOD, pages 1323–1334 (2014)
Metadata
Title
Efficient computation of the transitive closure size
Authors
Xian Tang
Ziyang Chen
Kai Li
Xiang Liu
Publication date
01-03-2018
Publisher
Springer US
Published in
Cluster Computing / Issue Special Issue 3/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-018-2278-9

Other articles of this Special Issue 3/2019

Cluster Computing 3/2019 Go to the issue

Premium Partner