Skip to main content
Erschienen in: World Wide Web 2/2020

13.01.2020

Dolha - an efficient and exact data structure for streaming graphs

verfasst von: Fan Zhang, Lei Zou, Li Zeng, Xiangyang Gou

Erschienen in: World Wide Web | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A streaming graph is a graph formed by a sequence of incoming edges with time stamps. Unlike the static graphs, the streaming graph is highly dynamic and time-related. Streaming graphs in the real world, which are of the high volume and velocity, can be challenging to the classic graph data structures: data of internet traffic, social network communication, and financial transections, etc. The traditional graph storage models like the adjacency matrix and the adjacency list are no longer sufficient for the large amount data and high frequency updates. And most the streaming graph structures are only supports the specific graph algorithms. Here a new data structure is presented to meet the challenge: a double orthogonal list in hash table (Dolha) as a high speed and high memory efficiency graph structure. Dolha has constant time cost for single edge processing, and near-linear space cost. Moreover, time cost for neighborhood queries in Dolha is linear, which enables it to support most algorithms of graphs without extra cost. A persistent structure based on Dolha is also presented, to handle the sliding window update and time related queries.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
2.
Zurück zum Zitat Bender, M., Hu, H.: An adaptive packed-memory array. ACM Trans. Database Syst. 32(4) (2007)CrossRef Bender, M., Hu, H.: An adaptive packed-memory array. ACM Trans. Database Syst. 32(4) (2007)CrossRef
3.
Zurück zum Zitat Boldi, P., Rosa, M., Vigna, S.: Hyperanf: approximating the neighbourhood function of very large graphs on a budget. International World Wide Web Conferences, 625–634 (2011) Boldi, P., Rosa, M., Vigna, S.: Hyperanf: approximating the neighbourhood function of very large graphs on a budget. International World Wide Web Conferences, 625–634 (2011)
4.
Zurück zum Zitat Broder, A.Z., Mitzenmacher, M.: Network applications of bloom filters: a survey. Internet Math. 1(4), 485–509 (2004)MathSciNetCrossRef Broder, A.Z., Mitzenmacher, M.: Network applications of bloom filters: a survey. Internet Math. 1(4), 485–509 (2004)MathSciNetCrossRef
6.
Zurück zum Zitat Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in twitter: the million follower fallacy (2010) Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in twitter: the million follower fallacy (2010)
7.
Zurück zum Zitat Chi, L., Li, B., Zhu, X., Pan, S., Chen, L.: Hashing for adaptive real-time graph stream classification with concept drifts. IEEE Trans. Sys. Man Cybern. 48(5), 1591–1604 (2018) Chi, L., Li, B., Zhu, X., Pan, S., Chen, L.: Hashing for adaptive real-time graph stream classification with concept drifts. IEEE Trans. Sys. Man Cybern. 48(5), 1591–1604 (2018)
8.
Zurück zum Zitat Cormode, G., Muthukrishnan, S.: An improved data stream summary: The count-min sketch and its applications, latin american symposium on theoretical informatics, 29–38 (2004) Cormode, G., Muthukrishnan, S.: An improved data stream summary: The count-min sketch and its applications, latin american symposium on theoretical informatics, 29–38 (2004)
9.
Zurück zum Zitat De Stefani, L., Epasto, A., Riondato, M., Upfal, E.: TRIÈST: counting local and global triangles in fully-dynamic streams with fixed memory size. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, ser. KDD ’16. ACM (2016) De Stefani, L., Epasto, A., Riondato, M., Upfal, E.: TRIÈST: counting local and global triangles in fully-dynamic streams with fixed memory size. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, ser. KDD ’16. ACM (2016)
12.
Zurück zum Zitat Eswaran, D., Faloutsos, C., Guha, S.: Spotlight: detecting anomalies in streaming graphs. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1378–1386 (2018) Eswaran, D., Faloutsos, C., Guha, S.: Spotlight: detecting anomalies in streaming graphs. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1378–1386 (2018)
13.
Zurück zum Zitat Gao, J., Zhou, C., Zhou, J., Yu, J.X.: Continuous pattern detection over billion-edge graph using distributed framework. In: Proc. 30th IEEE international conference on data engineering, pp 556–567 (2014) Gao, J., Zhou, C., Zhou, J., Yu, J.X.: Continuous pattern detection over billion-edge graph using distributed framework. In: Proc. 30th IEEE international conference on data engineering, pp 556–567 (2014)
14.
Zurück zum Zitat Gao, L., Golab, L., Ozsu, M.T., Aluc, G.: Stream watdiv: a streaming rdf benchmark (3) (2018) Gao, L., Golab, L., Ozsu, M.T., Aluc, G.: Stream watdiv: a streaming rdf benchmark (3) (2018)
16.
Zurück zum Zitat Guha, S., Andrew, M.: Graph synopses, sketches, and streams: a survey. PVLDB 5(12), 2030–2031 (2012) Guha, S., Andrew, M.: Graph synopses, sketches, and streams: a survey. PVLDB 5(12), 2030–2031 (2012)
17.
Zurück zum Zitat Khan, A., Aggarwal, C.C.: Query-friendly compression of graph streams. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 130–137 (2016) Khan, A., Aggarwal, C.C.: Query-friendly compression of graph streams. IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 130–137 (2016)
19.
Zurück zum Zitat Liu, G., Liu, Y., Zheng, K., Liu, A., Li, Z., Wang, Y., Zhou, X.: MCS-GPM: multi-constrained simulation based graph pattern matching in contextual social graphs. IEEE Trans. Knowl. Data Eng. 30, 1050–1064 (2018)CrossRef Liu, G., Liu, Y., Zheng, K., Liu, A., Li, Z., Wang, Y., Zhou, X.: MCS-GPM: multi-constrained simulation based graph pattern matching in contextual social graphs. IEEE Trans. Knowl. Data Eng. 30, 1050–1064 (2018)CrossRef
20.
Zurück zum Zitat Liu, G., Wang, Y., Orgun, M.: Optimal social trust path selection in complex social networks. PAAAI, 1391–1398 (2010) Liu, G., Wang, Y., Orgun, M.: Optimal social trust path selection in complex social networks. PAAAI, 1391–1398 (2010)
21.
Zurück zum Zitat Liu, G., Wang, Y., Orgun, M.: Finding K optimal social trust paths for the selection of trustworthy service providers in complex social networks. IEEE Trans. Services Comput. 6(2) (2013) Liu, G., Wang, Y., Orgun, M.: Finding K optimal social trust paths for the selection of trustworthy service providers in complex social networks. IEEE Trans. Services Comput. 6(2) (2013)
22.
Zurück zum Zitat Liu, G., Zheng, K., Wang, Y., Orgun, M., Liu, A., Zhao, L., Zhou, X.: Multi-constrained graph pattern matching in large-scale contextual social graphs. ICDE, 351–362 (2015) Liu, G., Zheng, K., Wang, Y., Orgun, M., Liu, A., Zhao, L., Zhou, X.: Multi-constrained graph pattern matching in large-scale contextual social graphs. ICDE, 351–362 (2015)
23.
Zurück zum Zitat Liu, G., Zhu, F., Zheng, K., Liu, A., Li, Z., Zhao, L., Zhou, X.: TOSI: a trust-oriented social influence evaluation method in contextual social networks. Neurocomputing 210, 130–140 (2016)CrossRef Liu, G., Zhu, F., Zheng, K., Liu, A., Li, Z., Zhao, L., Zhou, X.: TOSI: a trust-oriented social influence evaluation method in contextual social networks. Neurocomputing 210, 130–140 (2016)CrossRef
24.
Zurück zum Zitat Mcgregor, A.: Graph stream algorithms: a survey. SIGMOD Record 43(1), 9–20 (2014)CrossRef Mcgregor, A.: Graph stream algorithms: a survey. SIGMOD Record 43(1), 9–20 (2014)CrossRef
25.
Zurück zum Zitat Pan, S., Wu, J., Zhu, X., Zhang, C.: Graph ensemble boosting for imbalanced noisy graph stream classification. IEEE Trans. Sys. Man Cybern. 45(5), 940–954 (2015) Pan, S., Wu, J., Zhu, X., Zhang, C.: Graph ensemble boosting for imbalanced noisy graph stream classification. IEEE Trans. Sys. Man Cybern. 45(5), 940–954 (2015)
26.
Zurück zum Zitat Pigne, Y., Dutot, A., Guinand, F., Olivier, D.: Graphstream: a tool for bridging the gap between complex systems and dynamic graphs. EPNACS (2007) Pigne, Y., Dutot, A., Guinand, F., Olivier, D.: Graphstream: a tool for bridging the gap between complex systems and dynamic graphs. EPNACS (2007)
27.
Zurück zum Zitat Qiu, X., Cen, W., Qian, Z., Peng, Y., Zhang, Y., Lin, X., Zhou, J.: Real-time constrained cycle detection in large dynamic graphs. Proceedings of the VLDB Endowment 11(12) (2018)CrossRef Qiu, X., Cen, W., Qian, Z., Peng, Y., Zhang, Y., Lin, X., Zhou, J.: Real-time constrained cycle detection in large dynamic graphs. Proceedings of the VLDB Endowment 11(12) (2018)CrossRef
28.
Zurück zum Zitat Schank, T., Wagner, D.: Finding, counting and listing all triangles in large graphs, an experimental study. In: Nikoletseas, S.E. (ed.) Experimental and Efficient Algorithms. Lecture Notes in Computer Science, vol. 3503 (2005)CrossRef Schank, T., Wagner, D.: Finding, counting and listing all triangles in large graphs, an experimental study. In: Nikoletseas, S.E. (ed.) Experimental and Efficient Algorithms. Lecture Notes in Computer Science, vol. 3503 (2005)CrossRef
29.
Zurück zum Zitat Stein, C., Drysdale, S., Borgart, K.: Probability Calculations in Hashing, in Discrete Mathematics for Computer Scientists, 1st edn., pp 245–254. Addison-Wesley, Reading (2010) Stein, C., Drysdale, S., Borgart, K.: Probability Calculations in Hashing, in Discrete Mathematics for Computer Scientists, 1st edn., pp 245–254. Addison-Wesley, Reading (2010)
30.
Zurück zum Zitat Tang, N., Chen, Q., Mitra, P.: Graph stream summarization: from big bang to big crunch. SIGMOD, 1481–1496 (2016) Tang, N., Chen, Q., Mitra, P.: Graph stream summarization: from big bang to big crunch. SIGMOD, 1481–1496 (2016)
Metadaten
Titel
Dolha - an efficient and exact data structure for streaming graphs
verfasst von
Fan Zhang
Lei Zou
Li Zeng
Xiangyang Gou
Publikationsdatum
13.01.2020
Verlag
Springer US
Erschienen in
World Wide Web / Ausgabe 2/2020
Print ISSN: 1386-145X
Elektronische ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-019-00762-1

Weitere Artikel der Ausgabe 2/2020

World Wide Web 2/2020 Zur Ausgabe

Premium Partner