Skip to main content
Top
Published in: World Wide Web 2/2020

13-01-2020

Dolha - an efficient and exact data structure for streaming graphs

Authors: Fan Zhang, Lei Zou, Li Zeng, Xiangyang Gou

Published in: World Wide Web | Issue 2/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A streaming graph is a graph formed by a sequence of incoming edges with time stamps. Unlike the static graphs, the streaming graph is highly dynamic and time-related. Streaming graphs in the real world, which are of the high volume and velocity, can be challenging to the classic graph data structures: data of internet traffic, social network communication, and financial transections, etc. The traditional graph storage models like the adjacency matrix and the adjacency list are no longer sufficient for the large amount data and high frequency updates. And most the streaming graph structures are only supports the specific graph algorithms. Here a new data structure is presented to meet the challenge: a double orthogonal list in hash table (Dolha) as a high speed and high memory efficiency graph structure. Dolha has constant time cost for single edge processing, and near-linear space cost. Moreover, time cost for neighborhood queries in Dolha is linear, which enables it to support most algorithms of graphs without extra cost. A persistent structure based on Dolha is also presented, to handle the sliding window update and time related queries.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
2.
go back to reference Bender, M., Hu, H.: An adaptive packed-memory array. ACM Trans. Database Syst. 32(4) (2007)CrossRef Bender, M., Hu, H.: An adaptive packed-memory array. ACM Trans. Database Syst. 32(4) (2007)CrossRef
3.
go back to reference Boldi, P., Rosa, M., Vigna, S.: Hyperanf: approximating the neighbourhood function of very large graphs on a budget. International World Wide Web Conferences, 625–634 (2011) Boldi, P., Rosa, M., Vigna, S.: Hyperanf: approximating the neighbourhood function of very large graphs on a budget. International World Wide Web Conferences, 625–634 (2011)
4.
go back to reference Broder, A.Z., Mitzenmacher, M.: Network applications of bloom filters: a survey. Internet Math. 1(4), 485–509 (2004)MathSciNetCrossRef Broder, A.Z., Mitzenmacher, M.: Network applications of bloom filters: a survey. Internet Math. 1(4), 485–509 (2004)MathSciNetCrossRef
6.
go back to reference Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in twitter: the million follower fallacy (2010) Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in twitter: the million follower fallacy (2010)
7.
go back to reference Chi, L., Li, B., Zhu, X., Pan, S., Chen, L.: Hashing for adaptive real-time graph stream classification with concept drifts. IEEE Trans. Sys. Man Cybern. 48(5), 1591–1604 (2018) Chi, L., Li, B., Zhu, X., Pan, S., Chen, L.: Hashing for adaptive real-time graph stream classification with concept drifts. IEEE Trans. Sys. Man Cybern. 48(5), 1591–1604 (2018)
8.
go back to reference Cormode, G., Muthukrishnan, S.: An improved data stream summary: The count-min sketch and its applications, latin american symposium on theoretical informatics, 29–38 (2004) Cormode, G., Muthukrishnan, S.: An improved data stream summary: The count-min sketch and its applications, latin american symposium on theoretical informatics, 29–38 (2004)
9.
go back to reference De Stefani, L., Epasto, A., Riondato, M., Upfal, E.: TRIÈST: counting local and global triangles in fully-dynamic streams with fixed memory size. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, ser. KDD ’16. ACM (2016) De Stefani, L., Epasto, A., Riondato, M., Upfal, E.: TRIÈST: counting local and global triangles in fully-dynamic streams with fixed memory size. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, ser. KDD ’16. ACM (2016)
12.
go back to reference Eswaran, D., Faloutsos, C., Guha, S.: Spotlight: detecting anomalies in streaming graphs. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1378–1386 (2018) Eswaran, D., Faloutsos, C., Guha, S.: Spotlight: detecting anomalies in streaming graphs. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1378–1386 (2018)
13.
go back to reference Gao, J., Zhou, C., Zhou, J., Yu, J.X.: Continuous pattern detection over billion-edge graph using distributed framework. In: Proc. 30th IEEE international conference on data engineering, pp 556–567 (2014) Gao, J., Zhou, C., Zhou, J., Yu, J.X.: Continuous pattern detection over billion-edge graph using distributed framework. In: Proc. 30th IEEE international conference on data engineering, pp 556–567 (2014)
14.
go back to reference Gao, L., Golab, L., Ozsu, M.T., Aluc, G.: Stream watdiv: a streaming rdf benchmark (3) (2018) Gao, L., Golab, L., Ozsu, M.T., Aluc, G.: Stream watdiv: a streaming rdf benchmark (3) (2018)
16.
go back to reference Guha, S., Andrew, M.: Graph synopses, sketches, and streams: a survey. PVLDB 5(12), 2030–2031 (2012) Guha, S., Andrew, M.: Graph synopses, sketches, and streams: a survey. PVLDB 5(12), 2030–2031 (2012)
17.
go back to reference Khan, A., Aggarwal, C.C.: Query-friendly compression of graph streams. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 130–137 (2016) Khan, A., Aggarwal, C.C.: Query-friendly compression of graph streams. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 130–137 (2016)
19.
go back to reference Liu, G., Liu, Y., Zheng, K., Liu, A., Li, Z., Wang, Y., Zhou, X.: MCS-GPM: multi-constrained simulation based graph pattern matching in contextual social graphs. IEEE Trans. Knowl. Data Eng. 30, 1050–1064 (2018)CrossRef Liu, G., Liu, Y., Zheng, K., Liu, A., Li, Z., Wang, Y., Zhou, X.: MCS-GPM: multi-constrained simulation based graph pattern matching in contextual social graphs. IEEE Trans. Knowl. Data Eng. 30, 1050–1064 (2018)CrossRef
20.
go back to reference Liu, G., Wang, Y., Orgun, M.: Optimal social trust path selection in complex social networks. PAAAI, 1391–1398 (2010) Liu, G., Wang, Y., Orgun, M.: Optimal social trust path selection in complex social networks. PAAAI, 1391–1398 (2010)
21.
go back to reference Liu, G., Wang, Y., Orgun, M.: Finding K optimal social trust paths for the selection of trustworthy service providers in complex social networks. IEEE Trans. Services Comput. 6(2) (2013) Liu, G., Wang, Y., Orgun, M.: Finding K optimal social trust paths for the selection of trustworthy service providers in complex social networks. IEEE Trans. Services Comput. 6(2) (2013)
22.
go back to reference Liu, G., Zheng, K., Wang, Y., Orgun, M., Liu, A., Zhao, L., Zhou, X.: Multi-constrained graph pattern matching in large-scale contextual social graphs. ICDE, 351–362 (2015) Liu, G., Zheng, K., Wang, Y., Orgun, M., Liu, A., Zhao, L., Zhou, X.: Multi-constrained graph pattern matching in large-scale contextual social graphs. ICDE, 351–362 (2015)
23.
go back to reference Liu, G., Zhu, F., Zheng, K., Liu, A., Li, Z., Zhao, L., Zhou, X.: TOSI: a trust-oriented social influence evaluation method in contextual social networks. Neurocomputing 210, 130–140 (2016)CrossRef Liu, G., Zhu, F., Zheng, K., Liu, A., Li, Z., Zhao, L., Zhou, X.: TOSI: a trust-oriented social influence evaluation method in contextual social networks. Neurocomputing 210, 130–140 (2016)CrossRef
24.
go back to reference Mcgregor, A.: Graph stream algorithms: a survey. SIGMOD Record 43(1), 9–20 (2014)CrossRef Mcgregor, A.: Graph stream algorithms: a survey. SIGMOD Record 43(1), 9–20 (2014)CrossRef
25.
go back to reference Pan, S., Wu, J., Zhu, X., Zhang, C.: Graph ensemble boosting for imbalanced noisy graph stream classification. IEEE Trans. Sys. Man Cybern. 45(5), 940–954 (2015) Pan, S., Wu, J., Zhu, X., Zhang, C.: Graph ensemble boosting for imbalanced noisy graph stream classification. IEEE Trans. Sys. Man Cybern. 45(5), 940–954 (2015)
26.
go back to reference Pigne, Y., Dutot, A., Guinand, F., Olivier, D.: Graphstream: a tool for bridging the gap between complex systems and dynamic graphs. EPNACS (2007) Pigne, Y., Dutot, A., Guinand, F., Olivier, D.: Graphstream: a tool for bridging the gap between complex systems and dynamic graphs. EPNACS (2007)
27.
go back to reference Qiu, X., Cen, W., Qian, Z., Peng, Y., Zhang, Y., Lin, X., Zhou, J.: Real-time constrained cycle detection in large dynamic graphs. Proceedings of the VLDB Endowment 11(12) (2018)CrossRef Qiu, X., Cen, W., Qian, Z., Peng, Y., Zhang, Y., Lin, X., Zhou, J.: Real-time constrained cycle detection in large dynamic graphs. Proceedings of the VLDB Endowment 11(12) (2018)CrossRef
28.
go back to reference Schank, T., Wagner, D.: Finding, counting and listing all triangles in large graphs, an experimental study. In: Nikoletseas, S.E. (ed.) Experimental and Efficient Algorithms. Lecture Notes in Computer Science, vol. 3503 (2005)CrossRef Schank, T., Wagner, D.: Finding, counting and listing all triangles in large graphs, an experimental study. In: Nikoletseas, S.E. (ed.) Experimental and Efficient Algorithms. Lecture Notes in Computer Science, vol. 3503 (2005)CrossRef
29.
go back to reference Stein, C., Drysdale, S., Borgart, K.: Probability Calculations in Hashing, in Discrete Mathematics for Computer Scientists, 1st edn., pp 245–254. Addison-Wesley, Reading (2010) Stein, C., Drysdale, S., Borgart, K.: Probability Calculations in Hashing, in Discrete Mathematics for Computer Scientists, 1st edn., pp 245–254. Addison-Wesley, Reading (2010)
30.
go back to reference Tang, N., Chen, Q., Mitra, P.: Graph stream summarization: from big bang to big crunch. SIGMOD, 1481–1496 (2016) Tang, N., Chen, Q., Mitra, P.: Graph stream summarization: from big bang to big crunch. SIGMOD, 1481–1496 (2016)
Metadata
Title
Dolha - an efficient and exact data structure for streaming graphs
Authors
Fan Zhang
Lei Zou
Li Zeng
Xiangyang Gou
Publication date
13-01-2020
Publisher
Springer US
Published in
World Wide Web / Issue 2/2020
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-019-00762-1

Other articles of this Issue 2/2020

World Wide Web 2/2020 Go to the issue

Premium Partner