Skip to main content
Erschienen in: The Journal of Supercomputing 9/2021

19.02.2021

Graph threshold algorithm

verfasst von: Wookey Lee, Justin JongSu Song, Charles Cheolgi Lee, Tae-Chang Jo, James J. H. Lee

Erschienen in: The Journal of Supercomputing | Ausgabe 9/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Recently more and more information sources are connected together and become a sort of complex graphs that can be exploited not only as a structured and semi-structured data such as rdb or xml, RDF or NoSQL, but also as many kinds of unstructured data such as web, bioinformatics, genometrics, patents, social media, knowlege graphs, IoT, hidden graph from deep learning results. State of the art studies have suggested methods of presenting data as hyper-graphs in search queries and finding search results in subgraphs or graph embeddings rather than a list of individual node results. We study the problem of retrieving top-k graph results with the query relevances; that is, given a set of query keywords Q on a graph G, we aim to find a subgraph g of G such that g is highly related to Q and closely linked under g. In order to consider the relevant graph results and the connectivity simultaneously, we present an effective algorithm graph threshold algorithm (GTA) based on a threshold algorithm (TA) which works efficiently in non-graph structure. We show that GTA does not need unnecessary searches under given objective functions, and prove the existence of an upper bound of the size of subgraph for top-k results, called \({\textit{hop}}_{{\textit{max}}}\), which makes it efficient to find the combined results. Finally, we conduct the performance studies on real and synthetic graphs, which demonstrate that our algorithm significantly outperforms conventional approaches with respect to time complexity and cost consumption.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Abu-Aisheh Z, Raveaux R, Ramel J (2020) Efficient k-nearest neighbors search in graph space. Pattern Recognit Lett 134:77–86CrossRef Abu-Aisheh Z, Raveaux R, Ramel J (2020) Efficient k-nearest neighbors search in graph space. Pattern Recognit Lett 134:77–86CrossRef
2.
Zurück zum Zitat Amsterdamer Y, Kukliansky A, Milo T (2015) A natural language interface for querying general and individual knowledge. Proc VLDB Endow 8(12):1430–1441CrossRef Amsterdamer Y, Kukliansky A, Milo T (2015) A natural language interface for querying general and individual knowledge. Proc VLDB Endow 8(12):1430–1441CrossRef
3.
Zurück zum Zitat Anand D, Gadiya S, Sethi A (2020) Histographs: graphs in histopathology. In: Tomaszewski JE, Ward AD (eds) Medical Imaging 2020, SPIE Proceedings, vol 11320. SPIE, p 113200O Anand D, Gadiya S, Sethi A (2020) Histographs: graphs in histopathology. In: Tomaszewski JE, Ward AD (eds) Medical Imaging 2020, SPIE Proceedings, vol 11320. SPIE, p 113200O
4.
Zurück zum Zitat Chen J, Li B, Wang J, Zhao Y, Yao L, Xiong Y (2020) Knowledge graph enhanced third-party library recommendation for mobile application development. IEEE Access 8:42436–42446CrossRef Chen J, Li B, Wang J, Zhao Y, Yao L, Xiong Y (2020) Knowledge graph enhanced third-party library recommendation for mobile application development. IEEE Access 8:42436–42446CrossRef
5.
Zurück zum Zitat Chen X, Pan L (2018) A survey of graph cuts/graph search based medical image segmentation. IEEE Rev Biomed Eng 11:112–124CrossRef Chen X, Pan L (2018) A survey of graph cuts/graph search based medical image segmentation. IEEE Rev Biomed Eng 11:112–124CrossRef
6.
Zurück zum Zitat Cheng J, Zeng X, Yu JX (2013) Top-\(k\) graph pattern matching over large graphs. In: ICDE. IEEE, pp 1033–1044 Cheng J, Zeng X, Yu JX (2013) Top-\(k\) graph pattern matching over large graphs. In: ICDE. IEEE, pp 1033–1044
7.
Zurück zum Zitat Choi D, Park CS, Chung YD (2019) Progressive top-\(k\) subarray query processing in array databases. Proc VLDB Endow 12(9):989–1001CrossRef Choi D, Park CS, Chung YD (2019) Progressive top-\(k\) subarray query processing in array databases. Proc VLDB Endow 12(9):989–1001CrossRef
8.
Zurück zum Zitat Christmann P, Saha Roy R, Abujabal A, Singh J, Weikum G (2019) Look before you hop: conversational question answering over knowledge graphs using judicious context expansion. In: CIKM, pp 729–738 Christmann P, Saha Roy R, Abujabal A, Singh J, Weikum G (2019) Look before you hop: conversational question answering over knowledge graphs using judicious context expansion. In: CIKM, pp 729–738
9.
Zurück zum Zitat Ding B, Yu JX, Wang S, Qin L, Zhang X, Lin, X (2007) Finding top-\(k\) min-cost connected trees in databases. In: IEEE International Conference on Data Engineering. IEEE, pp 836–845 Ding B, Yu JX, Wang S, Qin L, Zhang X, Lin, X (2007) Finding top-\(k\) min-cost connected trees in databases. In: IEEE International Conference on Data Engineering. IEEE, pp 836–845
10.
Zurück zum Zitat Ding X, Jia J, Li J, Liu J, Jin H (2014) Top-\(k\) similarity matching in large graphs with attributes. In: International Conference on Database Systems for Advanced Applications. Springer, Berlin, pp 156–170 Ding X, Jia J, Li J, Liu J, Jin H (2014) Top-\(k\) similarity matching in large graphs with attributes. In: International Conference on Database Systems for Advanced Applications. Springer, Berlin, pp 156–170
11.
Zurück zum Zitat Eom CS, Lee CC, Lee W, Leung CK (2020) Effective privacy preserving data publishing by vectorization. Inf Sci 527:311–328CrossRef Eom CS, Lee CC, Lee W, Leung CK (2020) Effective privacy preserving data publishing by vectorization. Inf Sci 527:311–328CrossRef
12.
13.
Zurück zum Zitat Fagin R, Lotem A, Naor M (2003) Optimal aggregation algorithms for middleware. J Comput Syst Sci 66(4):614–656MathSciNetCrossRef Fagin R, Lotem A, Naor M (2003) Optimal aggregation algorithms for middleware. J Comput Syst Sci 66(4):614–656MathSciNetCrossRef
14.
Zurück zum Zitat Han X, Yue L, Dong Y, Xu Q, Xie G, Xu X (2020) Efficient hybrid algorithm based on moth search and fireworks algorithm for solving numerical and constrained engineering optimization problems. J Supercomput 76(12):9404–9429CrossRef Han X, Yue L, Dong Y, Xu Q, Xie G, Xu X (2020) Efficient hybrid algorithm based on moth search and fireworks algorithm for solving numerical and constrained engineering optimization problems. J Supercomput 76(12):9404–9429CrossRef
15.
Zurück zum Zitat He H, Wang H, Yang J, Yu PS (2007) Blinks: ranked keyword searches on graphs. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. ACM, pp 305–316 He H, Wang H, Yang J, Yu PS (2007) Blinks: ranked keyword searches on graphs. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. ACM, pp 305–316
16.
Zurück zum Zitat Ho VT, Pal K, Kleer N, Berberich K, Weikum G (2020) Entities with quantities: extraction, search, and ranking. In: WSDM, pp 833–836 Ho VT, Pal K, Kleer N, Berberich K, Weikum G (2020) Entities with quantities: extraction, search, and ranking. In: WSDM, pp 833–836
17.
Zurück zum Zitat Hormozi N (2019) Disambiguation and result expansion in keyword search over relational databases. In: International Conference on Data Engineering (ICDE). IEEE, pp 2101–2105 Hormozi N (2019) Disambiguation and result expansion in keyword search over relational databases. In: International Conference on Data Engineering (ICDE). IEEE, pp 2101–2105
18.
Zurück zum Zitat Hristidis V, Papakonstantinou Y, Gravano L (2003) Efficient IR-style keyword search over relational databases. In: International Conference on Very Large Databases. Elsevier, Amsterdam, pp 850–861 Hristidis V, Papakonstantinou Y, Gravano L (2003) Efficient IR-style keyword search over relational databases. In: International Conference on Very Large Databases. Elsevier, Amsterdam, pp 850–861
19.
Zurück zum Zitat Ilyas IF, Beskales G, Soliman MA (2008) A survey of top-\(k\) query processing techniques in relational database systems. ACM Comput Surv 40(4):11:1–11:58CrossRef Ilyas IF, Beskales G, Soliman MA (2008) A survey of top-\(k\) query processing techniques in relational database systems. ACM Comput Surv 40(4):11:1–11:58CrossRef
20.
Zurück zum Zitat Jenkins P, Zhao J, Vinicombe H, Subramanian A, Prasad A, Dobi A, Li E, Guo Y (2020) Natural language annotations for search engine optimization. In: WWW, pp 2856–2862 Jenkins P, Zhao J, Vinicombe H, Subramanian A, Prasad A, Dobi A, Li E, Guo Y (2020) Natural language annotations for search engine optimization. In: WWW, pp 2856–2862
21.
Zurück zum Zitat Jin J, Luo J, Khemmarat S, Dong F, Gao L (2019) GStar: an efficient framework for answering top-\(k\) star queries on billion-node knowledge graphs. World Wide Web 22(4):1611–1638CrossRef Jin J, Luo J, Khemmarat S, Dong F, Gao L (2019) GStar: an efficient framework for answering top-\(k\) star queries on billion-node knowledge graphs. World Wide Web 22(4):1611–1638CrossRef
22.
Zurück zum Zitat Kargar M, An A (2011) Keyword search in graphs: finding r-cliques. Proc VLDB 4(10):681–692CrossRef Kargar M, An A (2011) Keyword search in graphs: finding r-cliques. Proc VLDB 4(10):681–692CrossRef
23.
Zurück zum Zitat Kasneci G, Ramanath M, Sozio M, Suchanek FM, Weikum G (2009) Star: Steiner-tree approximation in relationship graphs. In: IEEE International Conference on Data Engineering. IEEE, pp 868–879 Kasneci G, Ramanath M, Sozio M, Suchanek FM, Weikum G (2009) Star: Steiner-tree approximation in relationship graphs. In: IEEE International Conference on Data Engineering. IEEE, pp 868–879
24.
Zurück zum Zitat Kim J, Lee W, Song JJ, Lee SB (2017) Optimized combinatorial clustering for stochastic processes. Clust Comput 20(2):1135–1148CrossRef Kim J, Lee W, Song JJ, Lee SB (2017) Optimized combinatorial clustering for stochastic processes. Clust Comput 20(2):1135–1148CrossRef
25.
26.
Zurück zum Zitat Kwon HY, Whang KY (2016) Scalable and efficient processing of top-\(k\) multiple-type integrated queries. World Wide Web 19(6):1051–1075CrossRef Kwon HY, Whang KY (2016) Scalable and efficient processing of top-\(k\) multiple-type integrated queries. World Wide Web 19(6):1051–1075CrossRef
27.
Zurück zum Zitat Lee W, Leung CKS (2009) Structural top-\(k\) web navigation with inclusive query. In: IEEE International Conference on Industrial Technology. IEEE, pp 1–6 Lee W, Leung CKS (2009) Structural top-\(k\) web navigation with inclusive query. In: IEEE International Conference on Industrial Technology. IEEE, pp 1–6
28.
Zurück zum Zitat Lee W, Leung CK, Lee JJ (2011) Mobile web navigation in digital ecosystems using rooted directed trees. IEEE Trans Ind Electron 58(6):2154–2162CrossRef Lee W, Leung CK, Lee JJ (2011) Mobile web navigation in digital ecosystems using rooted directed trees. IEEE Trans Ind Electron 58(6):2154–2162CrossRef
29.
Zurück zum Zitat Li WS, Candan KS, Vu Q, Agrawal D (2001) Retrieving and organizing web pages by “information unit”. In: WWW’01, pp 230–244 Li WS, Candan KS, Vu Q, Agrawal D (2001) Retrieving and organizing web pages by “information unit”. In: WWW’01, pp 230–244
30.
Zurück zum Zitat Li G, Ooi BC, Feng J, Wang J, Zhou L (2008) Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, pp 903–914 Li G, Ooi BC, Feng J, Wang J, Zhou L (2008) Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. ACM, pp 903–914
31.
Zurück zum Zitat Lin Z, Yang D, Yin X (2020) Patient similarity via joint embeddings of medical knowledge graph and medical entity descriptions. IEEE Access 8:156663–156676CrossRef Lin Z, Yang D, Yin X (2020) Patient similarity via joint embeddings of medical knowledge graph and medical entity descriptions. IEEE Access 8:156663–156676CrossRef
32.
Zurück zum Zitat Luo C, Carey MJ (2020) LSM-based storage techniques: a survey. VLDB J 29(1):393–418CrossRef Luo C, Carey MJ (2020) LSM-based storage techniques: a survey. VLDB J 29(1):393–418CrossRef
33.
Zurück zum Zitat Luo Y, Lin X, Wang W, Zhou X (2007) Spark: top-\(k\) keyword query in relational databases. In: ACM SIGMOD International Conference on Management of Data, SIGMOD’07. ACM, New York, pp 115–126 Luo Y, Lin X, Wang W, Zhou X (2007) Spark: top-\(k\) keyword query in relational databases. In: ACM SIGMOD International Conference on Management of Data, SIGMOD’07. ACM, New York, pp 115–126
34.
Zurück zum Zitat Mohanty M, Ramanath M (2019) Insta-search: towards effective exploration of knowledge graphs. In: International Conference on Information and Knowledge Management, pp 2909–2912 Mohanty M, Ramanath M (2019) Insta-search: towards effective exploration of knowledge graphs. In: International Conference on Information and Knowledge Management, pp 2909–2912
35.
Zurück zum Zitat Mottin D, Lissandrini M, Velegrakis Y, Palpanas T (2016) Exemplar queries: a new way of searching. VLDB J Int J Very Large Data Bases 25(6):741–765CrossRef Mottin D, Lissandrini M, Velegrakis Y, Palpanas T (2016) Exemplar queries: a new way of searching. VLDB J Int J Very Large Data Bases 25(6):741–765CrossRef
36.
Zurück zum Zitat Shang J, Wang C, Wang C, Guo G, Qian J (2020) An attribute-based community search method with graph refining. J Supercomput 76(10):7777–7804CrossRef Shang J, Wang C, Wang C, Guo G, Qian J (2020) An attribute-based community search method with graph refining. J Supercomput 76(10):7777–7804CrossRef
37.
Zurück zum Zitat Shimomura LC, Oyamada RS, Vieira MR, Kaster DS (2021) A survey on graph-based methods for similarity searches in metric spaces. Inf Syst 95:101507CrossRef Shimomura LC, Oyamada RS, Vieira MR, Kaster DS (2021) A survey on graph-based methods for similarity searches in metric spaces. Inf Syst 95:101507CrossRef
38.
Zurück zum Zitat Srinidhi CL, Aparna P, Rajan J (2019) Automated method for retinal artery/vein separation via graph search metaheuristic approach. IEEE Trans Image Process 28(6):2705–2718MathSciNetCrossRef Srinidhi CL, Aparna P, Rajan J (2019) Automated method for retinal artery/vein separation via graph search metaheuristic approach. IEEE Trans Image Process 28(6):2705–2718MathSciNetCrossRef
39.
Zurück zum Zitat Sun Z, Tang J, Du P, Deng ZH, Nie JY (2019) Divgraphpointer: a graph pointer network for extracting diverse keyphrases. In: SIGIR, pp 755–764 Sun Z, Tang J, Du P, Deng ZH, Nie JY (2019) Divgraphpointer: a graph pointer network for extracting diverse keyphrases. In: SIGIR, pp 755–764
40.
Zurück zum Zitat Tang B, Mouratidis K, Yiu ML, Chen Z (2019) Creating top ranking options in the continuous option and preference space. Proc VLDB Endow 12(10):1181–1194CrossRef Tang B, Mouratidis K, Yiu ML, Chen Z (2019) Creating top ranking options in the continuous option and preference space. Proc VLDB Endow 12(10):1181–1194CrossRef
41.
Zurück zum Zitat Tiakas E, Valkanas G, Papadopoulos AN, Manolopoulos Y, Gunopulos D (2016) Processing top-\(k\) dominating queries in metric spaces. ACM Trans Database Syst 40(4):23:1–23:38MathSciNetCrossRef Tiakas E, Valkanas G, Papadopoulos AN, Manolopoulos Y, Gunopulos D (2016) Processing top-\(k\) dominating queries in metric spaces. ACM Trans Database Syst 40(4):23:1–23:38MathSciNetCrossRef
42.
Zurück zum Zitat Tian Q, Li J, Liu H (2019) A method for guaranteeing wireless communication based on a combination of deep and shallow learning. IEEE Access 7:38688–38695CrossRef Tian Q, Li J, Liu H (2019) A method for guaranteeing wireless communication based on a combination of deep and shallow learning. IEEE Access 7:38688–38695CrossRef
43.
Zurück zum Zitat Yang S, Han F, Wu Y, Yan X (2016) Fast top-\(k\) search in knowledge graphs. In: 2016 IEEE 32nd International Conference on Data Engineering (ICDE). IEEE, pp 990–1001 Yang S, Han F, Wu Y, Yan X (2016) Fast top-\(k\) search in knowledge graphs. In: 2016 IEEE 32nd International Conference on Data Engineering (ICDE). IEEE, pp 990–1001
44.
Zurück zum Zitat Zou L, Huang R, Wang H, Yu JX, He W, Zhao D (2014) Natural language question answering over RDF: a graph data driven approach. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. ACM, pp 313–324 Zou L, Huang R, Wang H, Yu JX, He W, Zhao D (2014) Natural language question answering over RDF: a graph data driven approach. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. ACM, pp 313–324
Metadaten
Titel
Graph threshold algorithm
verfasst von
Wookey Lee
Justin JongSu Song
Charles Cheolgi Lee
Tae-Chang Jo
James J. H. Lee
Publikationsdatum
19.02.2021
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 9/2021
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-021-03665-z

Weitere Artikel der Ausgabe 9/2021

The Journal of Supercomputing 9/2021 Zur Ausgabe

Premium Partner