skip to main content
research-article

Big graphs: challenges and opportunities

Published:01 August 2022Publication History
Skip Abstract Section

Abstract

Big data is typically characterized with 4V's: Volume, Velocity, Variety and Veracity. When it comes to big graphs, these challenges become even more staggering. Each and every of the 4V's raises new questions, from theory to systems and practice. Is it possible to parallelize sequential graph algorithms and guarantee the correctness of the parallelized computations? Given a computational problem, does there exist a parallel algorithm for it that guarantees to reduce parallel runtime when more machines are used? Is there a systematic method for developing incremental algorithms with effectiveness guarantees in response to frequent updates? Is it possible to write queries across relational databases and semistructured graphs in SQL? Can we unify logic rules and machine learning, to improve the quality of graph-structured data, and deduce associations between entities? This paper aims to incite interest and curiosity in these topics. It raises as many questions as it answers.

References

  1. 2020. GraphScope. https://graphscope.io/.Google ScholarGoogle Scholar
  2. 2021. DBLP collaboration network. https://snap.stanford.edu/data/com-DBLP.html.Google ScholarGoogle Scholar
  3. 2022. IMDB. https://www.imdb.com/interfaces.Google ScholarGoogle Scholar
  4. 2022. Neo4J Project. http://neo4j.org/.Google ScholarGoogle Scholar
  5. 2022. Wikipedia. https://www.wikipedia.org.Google ScholarGoogle Scholar
  6. Azza Abouzeid, Kamil Bajda-Pawlikowski, Daniel J. Abadi, Alexander Rasin, and Avi Silberschatz. 2009. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads. PVLDB 2, 1 (2009), 922--933.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Umut A. Acar. 2005. Self-Adjusting Computation. Ph.D. Dissertation. CMU.Google ScholarGoogle Scholar
  8. Christopher P Adams and Van V Brantner. 2006. Estimating the cost of new drug development: is it really 802 million? Health affairs 25, 2 (2006), 420--428.Google ScholarGoogle Scholar
  9. Sajad Ahmadian, Nima Joorabloo, Mahdi Jalili, Majid Meghdadi, Mohsen Afsharchi, and Yongli Ren. 2018. A temporal clustering approach for social recommender systems. In ASONAM. IEEE, 1139--1144.Google ScholarGoogle Scholar
  10. Zhiyuan Ai, Mingxing Zhang, Yongwei Wu, Xuehai Qian, Kang Chen, and Weimin Zheng. 2017. Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O. In USENIX. 125--137.Google ScholarGoogle Scholar
  11. João Paulo Aires and Felipe Meneguzzi. 2017. Norm Conflict Identification Using Deep Learning. In AAMAS Workshops. 194--207.Google ScholarGoogle Scholar
  12. Arvind Arasu, Michaela Götz, and Raghav Kaushik. 2010. On active learning of record matching packages. In SIGMOD. 783--794.Google ScholarGoogle Scholar
  13. Arvind Arasu, Christopher Ré, and Dan Suciu. 2009. Large-Scale Deduplication with Constraints Using Dedupalog. In ICDE. 952--963.Google ScholarGoogle Scholar
  14. Marcelo Arenas, Leopoldo Bertossi, and Jan Chomicki. 1999. Consistent Query Answers in Inconsistent Databases. In PODS. 68--79.Google ScholarGoogle Scholar
  15. Zeinab Bahmani, Leopoldo E. Bertossi, and Nikolaos Vasiloglou. 2017. ERBlox: Combining matching dependencies with machine learning for entity resolution. Int. J. Approx. Reasoning 83 (2017), 118--141.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Jørgen Bang-Jensen and Gregory Z. Gutin. 2009. Digraphs - Theory, Algorithms and Applications, Second Edition. Springer.Google ScholarGoogle Scholar
  17. Nurken Berdigaliyev and Mohamad Aljofan. 2020. An overview of drug discovery and development. Future Medicinal Chemistry 12, 10 (2020), 939--947.Google ScholarGoogle ScholarCross RefCross Ref
  18. Rianne van den Berg, Thomas N Kipf, and Max Welling. 2017. Graph convolutional matrix completion. arXiv preprint arXiv:1706.02263 (2017).Google ScholarGoogle Scholar
  19. James Bergstra and Yoshua Bengio. 2012. Random search for hyper-parameter optimization. JMLR 13, 1 (2012), 281--305.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Leopoldo E. Bertossi, Solmaz Kolahi, and Laks V. S. Lakshmanan. 2013. Data Cleaning and Query Answering with Matching Dependencies and Matching Functions. Theory Comput. Syst. 52, 3 (2013), 441--482.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Indrajit Bhattacharya and Lise Getoor. 2006. Entity resolution in graphs. Mining graph data (2006).Google ScholarGoogle Scholar
  22. Dmitry Bogdanov, Martín Haro, Ferdinand Fuhrmann, Anna Xambó, Emilia Gómez, and Perfecto Herrera. 2013. Semantic audio content-based music recommendation and visualization based on user preference examples. Information Processing & Management 49, 1 (2013), 13--33.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Kurt D. Bollacker, Colin Evans, Praveen K. Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: A collaboratively created graph database for structuring human knowledge. In SIGMOD. 1247--1250.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Florian Bourse, Marc Lelarge, and Milan Vojnovic. 2014. Balanced graph edge partition. In SIGKDD. 1456--1465.Google ScholarGoogle Scholar
  25. Jin-yi Cai, Martin Fürer, and Neil Immerman. 1992. An optimal lower bound on the number of variables for graph identifications. Comb. 12, 4 (1992), 389--410.Google ScholarGoogle ScholarCross RefCross Ref
  26. Yufei Cai, Paolo G. Giarrusso, Tillmann Rendel, and Klaus Ostermann. 2014. A theory of changes for higher-order languages: Incrementalizing λ-calculi by static differentiation. In PLDI. 145--155.Google ScholarGoogle Scholar
  27. Zhuhua Cai, Dionysios Logothetis, and Georgos Siganos. 2012. Facilitating real-time graph mining. In CloudDB.Google ScholarGoogle Scholar
  28. Riccardo Cappuzzo, Paolo Papotti, and Saravanan Thirumuruganathan. 2020. Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks. In SIGMOD. 1335--1349.Google ScholarGoogle Scholar
  29. Dawei Chen, Cheng Soon Ong, and Lexing Xie. 2016. Learning points and routes to recommend trajectories. In CIKM. 2227--2232.Google ScholarGoogle Scholar
  30. Xu Chen, Yongfeng Zhang, and Zheng Qin. 2019. Dynamic explainable recommendation based on neural attentive models. In AAAI, Vol. 33. 53--60.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Zhaoqiang Chen, Qun Chen, Fengfeng Fan, Yanyan Wang, Zhuo Wang, Youcef Nafa, Zhanhuai Li, Hailong Liu, and Wei Pan. 2018. Enabling quality control for entity resolution: A human and machine cooperation framework. In ICDE. IEEE, 1156--1167.Google ScholarGoogle Scholar
  32. Wei-Ta Chu and Ya-Lun Tsai. 2017. A hybrid recommendation system considering visual information for predicting favorite restaurants. World Wide Web 20, 6 (2017), 1313--1331.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, and Shuai Ma. 2007. Improving Data Quality: Consistency and Accuracy. In VLDB. 315--326.Google ScholarGoogle Scholar
  34. A Dairam, Edith M Antunes, KS Saravanan, and Santylal Daya. 2006. Non-steroidal anti-inflammatory agents, tolmetin and sulindac, inhibit liver tryptophan 2, 3-dioxygenase activity and alter brain neurotransmitter levels. Life sciences 79, 24 (2006), 2269--2274.Google ScholarGoogle ScholarCross RefCross Ref
  35. Sanjib Das, Paul Suganthan G. C., AnHai Doan, Jeffrey F. Naughton, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra, and Youngchoon Park. 2017. Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services. In SIGMOD. 1431--1446.Google ScholarGoogle Scholar
  36. Roshan Dathathri, Gurbinder Gill, Loc Hoang, Hoang-Vu Dang, Alex Brooks, Nikoli Dryden, Marc Snir, and Keshav Pingali. 2018. Gluon: A Communication-Optimizing Substrate for Distributed Heterogeneous Graph Analytics. In PLDI. 752--768.Google ScholarGoogle Scholar
  37. Ankur Dave, Alekh Jindal, Li Erran Li, Reynold Xin, Joseph Gonzalez, and Matei Zaharia. 2016. GraphFrames: an integrated API for mixing graph and relational queries. In GRADES. 2.Google ScholarGoogle Scholar
  38. Allan Peter Davis, Cynthia J Grondin, Robin J Johnson, Daniela Sciaky, Jolene Wiegers, Thomas C Wiegers, and Carolyn J Mattingly. 2021. Comparative toxicogenomics database (CTD): update 2021. Nucleic acids research 49, D1 (2021), D1138--D1143.Google ScholarGoogle Scholar
  39. Amol Deshpande. 2018. In situ graph querying and analytics with graphgen: Extended abstract. In GRADES. 2:1--2:2.Google ScholarGoogle Scholar
  40. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT.Google ScholarGoogle Scholar
  41. J A. Dimasi. 2001. New drug development in the United States from 1963 to 1999. In Clinical pharmacology and therapeutics vol. 69,5.Google ScholarGoogle Scholar
  42. Mohamad Dolatshah, Mathew Teoh, Jiannan Wang, and Jian Pei. 2018. Cleaning Crowdsourced Labels Using Oracles For Statistical Classification. PVLDB 12, 4 (2018), 376--389.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks. In KDD.Google ScholarGoogle Scholar
  44. Jennie Duggan, Aaron J. Elmore, Michael Stonebraker, Magdalena Balazinska, Bill Howe, Jeremy Kepner, Sam Madden, David Maier, Tim Mattson, and Stanley B. Zdonik. 2015. The BigDAWG Polystore System. SIGMOD Rec. 44, 2 (2015), 11--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, and Nan Tang. 2018. Distributed Representations of Tuples for Entity Resolution. PVLDB 11, 11 (2018), 1454--1467.Google ScholarGoogle Scholar
  46. Federico Errica, Marco Podda, Davide Bacciu, and Alessio Micheli. 2020. A Fair Comparison of Graph Neural Networks for Graph Classification. In ICLR.Google ScholarGoogle Scholar
  47. Grace Fan, Wenfei Fan, Yuanhao Li, Ping Lu, Chao Tian, and Jingren Zhou. 2020. Extending Graph Patterns with Conditions. In SIGMOD. 715--729.Google ScholarGoogle Scholar
  48. Jing Fan, Adalbert Gerald Soosai Raj, and Jignesh M. Patel. 2015. The Case Against Specialized Graph Analytics Engines. In CIDR.Google ScholarGoogle Scholar
  49. Wenfei Fan, Zhe Fan, Chao Tian, and Xin Luna Dong. 2015. Keys for graphs. PVLDB 8, 12 (2015), 1590--1601.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Wenfei Fan, Wenzhi Fu, Ruochun Jin, Ping Lu, and Chao Tian. 2022. Discovering Association Rules from Big Graphs. PVLDB 15, 7 (2022), 1479--1492.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Wenfei Fan, Hong Gao, Xibei Jia, Jianzhong Li, and Shuai Ma. 2011. Dynamic constraints for record matching. VLDB J. 20, 4 (2011), 495--520.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Wenfei Fan, Ling Ge, Ruochun Jin, Ping Lu, and Wenyuan Yu. 2022. Linking Entities across Relations and Graphs. In ICDE. IEEE.Google ScholarGoogle Scholar
  53. Wenfei Fan and Floris Geerts. 2012. Foundations of Data Quality Management. Morgan & Claypool Publishers.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Wenfei Fan, Floris Geerts, Xibei Jia, and Anastasios Kementsietsidis. 2008. Conditional Functional Dependencies for Capturing Data Inconsistencies. ACM Trans. Database Syst. 33, 1 (2008), 25:1--25:49.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Wenfei Fan, Floris Geerts, Nan Tang, and Wenyuan Yu. 2014. Conflict resolution with data currency and consistency. J. Data and Information Quality 5, 1--2 (2014), 6:1--6:37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Wenfei Fan, Tao He, Longbin Lai, Xue Li, Yong Li, Zhao Li, Zhengping Qian, Chao Tian, Lei Wang, Jingbo Xu, Youyang Yao, Qiang Yin, Wenyuan Yu, Kai Zeng, Kun Zhao, Jingren Zhou, Diwen Zhu, and Rong Zhu. 2021. GraphScope: A Unified Engine For Big Graph Processing. PVLDB 14, 12 (2021), 2879--2892.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Wenfei Fan, Chunming Hu, Xueli Liu, and Ping Lu. 2020. Discovering Graph Functional Dependencies. ACM Trans. Database Syst. 45, 3 (2020), 15:1--15:42.Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Wenfei Fan, Chunming Hu, and Chao Tian. 2017. Incremental Graph Computations: Doable and Undoable. In SIGMOD. 155--169.Google ScholarGoogle Scholar
  59. Wenfei Fan, Ruochun Jin, Muyang Liu, Ping Lu, Xiaojian Luo, Ruiqi Xu, Qiang Yin, Wenyuan Yu, and Jingren Zhou. 2020. Application Driven Graph Partitioning. In SIGMOD. 1765--1779.Google ScholarGoogle Scholar
  60. Wenfei Fan, Ruochun Jin, Muyang Liu, Ping Lu, Chao Tian, and Jingren Zhou. 2020. Capturing Associations in Graphs. PVLDB 13, 11 (2020), 1863--1876.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Wenfei Fan, Ruochun Jin, Ping Lu, Chao Tian, and Ruiqi Xu. 2022. Towards Event Prediction in Temporal Graphs. PVLDB 15, 9 (2022), 1861--1874.Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. Wenfei Fan, Yuanhao Li, Muyang Liu, and Can Lu. 2021. Making Graphs Compact by Lossless Contraction. In SIGMOD. 472--484.Google ScholarGoogle Scholar
  63. Wenfei Fan, Yuanhao Li, Muyang Liu, and Can Lu. 2022. A Hierarchical Contraction Scheme for Querying Big Graphs. In SIGMOD. 1726--1740.Google ScholarGoogle Scholar
  64. Wenfei Fan, Muyang Liu, Chao Tian, Ruiqi Xu, and Jingren Zhou. 2020. Incrementalization of Graph Partitioning Algorithms. PVLDB 13, 8 (2020), 1261--1274.Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Wenfei Fan, Xueli Liu, Ping Lu, and Chao Tian. 2018. Catching Numeric Inconsistencies in Graphs. In SIGMOD. 381--393.Google ScholarGoogle Scholar
  66. Wenfei Fan and Ping Lu. 2019. Dependencies for Graphs. ACM Trans. Database Syst. 44, 2 (2019), 5:1--5:40.Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. Wenfei Fan, Ping Lu, Xiaojian Luo, Jingbo Xu, Qiang Yin, Wenyuan Yu, and Ruiqi Xu. 2018. Adaptive Asynchronous Parallelization of Graph Algorithms. In SIGMOD. 1141--1156.Google ScholarGoogle Scholar
  68. Wenfei Fan, Ping Lu, and Chao Tian. 2020. Unifying logic rules and machine learning for entity enhancing. Sci. China Inf. Sci. 63, 7 (2020).Google ScholarGoogle Scholar
  69. Wenfei Fan, Ping Lu, Chao Tian, and Jingren Zhou. 2019. Deducing Certain Fixes to Graphs. PVLDB 12, 7 (2019), 752--765.Google ScholarGoogle ScholarDigital LibraryDigital Library
  70. Wenfei Fan, Ping Lu, Wenyuan Yu, Jingbo Xu, Qiang Yin, Xiaojian Luo, Jingren Zhou, and Ruochun Jin. 2020. Adaptive Asynchronous Parallelization of Graph Algorithms. ACM Trans. Database Syst. 45, 2 (2020), 6:1--6:45.Google ScholarGoogle ScholarDigital LibraryDigital Library
  71. Wenfei Fan and Chao Tian. 2022. Incremental Graph Computations: Doable and Undoable. ACM Trans. Database Syst. 47, 2 (2022), 6:1--6:44.Google ScholarGoogle ScholarDigital LibraryDigital Library
  72. Wenfei Fan, Chao Tian, Yanghao Wang, and Qiang Yin. 2021. Discrepancy Detection and Incremental Detection. PVLDB 14, 8 (2021), 1351--1364.Google ScholarGoogle ScholarDigital LibraryDigital Library
  73. Wenfei Fan, Chao Tian, Ruiqi Xu, Qiang Yin, Wenyuan Yu, and Jingren Zhou. 2021. Incrementalizing Graph Algorithms. In SIGMOD. 459--471.Google ScholarGoogle Scholar
  74. Wenfei Fan, Xin Wang, and Yinghui Wu. 2013. Incremental graph pattern matching. ACM Trans. Database Syst. 38, 3 (2013).Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. Wenfei Fan, Xin Wang, and Yinghui Wu. 2014. Distributed graph simulation: Impossibility and possibility. PVLDB 7, 12 (2014), 1083--1094.Google ScholarGoogle ScholarDigital LibraryDigital Library
  76. Wenfei Fan, Yinghui Wu, and Jingbo Xu. 2016. Functional dependencies for graphs. In SIGMOD. 1843--1857.Google ScholarGoogle Scholar
  77. Wenfei Fan, Jingbo Xu, Yinghui Wu, Wenyuan Yu, Jiaxin Jiang, Zeyu Zheng, Bohan Zhang, Yang Cao, and Chao Tian. 2017. Parallelizing Sequential Graph Computations. In SIGMOD. 495--510.Google ScholarGoogle Scholar
  78. Wenfei Fan, Wenyuan Yu, Jingbo Xu, Jingren Zhou, Xiaojian Luo, Qiang Yin, Ping Lu, Yang Cao, and Ruiqi Xu. 2018. Parallelizing Sequential Graph Computations. ACM Trans. Database Syst. 43, 4 (2018), 18:1--18:39.Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. Chris Fotis, Asier Antoranz, Dimitris Hatziavramidis, Theodore Sakellaropoulos, and Leonidas G. Alexopoulos. 2017. Pathway-based technologies for early drug discovery. Drug Discovery Today (2017).Google ScholarGoogle Scholar
  80. Michael L Fredman and Robert Endre Tarjan. 1987. Fibonacci heaps and their uses in improved network optimization algorithms. JACM 34, 3 (1987).Google ScholarGoogle Scholar
  81. Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. End-to-end multi-perspective matching for entity resolution. In IJCAI. 4961--4967.Google ScholarGoogle Scholar
  82. Xinyu Fu, Jiani Zhang, Ziqiao Meng, and Irwin King. 2020. MAGNN: Metapath aggregated graph neural network for heterogeneous graph embedding. In WWW. 2331--2341.Google ScholarGoogle Scholar
  83. Michael Garey and David Johnson. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman.Google ScholarGoogle ScholarDigital LibraryDigital Library
  84. Gartner. 2018. How to create a business case for data quality improvement. https://www.gartner.com/smarterwithgartner/how-to-create-a-business-case-for-data-quality-improvement/.Google ScholarGoogle Scholar
  85. Gurbinder Gill, Roshan Dathathri, Loc Hoang, Ramesh Peri, and Keshav Pingali. 2020. Single Machine Graph Analytics on Massive Datasets Using Intel Optane DC Persistent Memory. PVLDB 13, 8 (2020), 1304--1318.Google ScholarGoogle ScholarDigital LibraryDigital Library
  86. Lukasz Golab, Howard Karloff, Flip Korn, Divesh Srivastava, and Bei Yu. 2008. On generating near-optimal tableaux for conditional functional dependencies. PVLDB 1, 1 (2008), 376--390.Google ScholarGoogle ScholarDigital LibraryDigital Library
  87. Joseph E. Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, and Carlos Guestrin. 2012. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs. In USENIX. 17--30.Google ScholarGoogle Scholar
  88. Joseph E. Gonzalez, Reynold S. Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, and Ion Stoica. 2014. GraphX: Graph Processing in a Distributed Dataflow Framework. In OSDI. 599--613.Google ScholarGoogle Scholar
  89. Raymond Greenlaw, H. James Hoover, and Walter L. Ruzzo. 1995. Limits to Parallel Computation: P-Completeness Theory. Oxford University Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  90. Martin Grohe. 2020. word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data. In PODS. ACM, 1--16.Google ScholarGoogle Scholar
  91. Songtao Guo, Xin Luna Dong, Divesh Srivastava, and Remi Zajac. 2010. Record Linkage with Uniqueness Constraints and Erroneous Values. PVLDB 3, 1 (2010), 417--428.Google ScholarGoogle ScholarDigital LibraryDigital Library
  92. Daniel Halperin, Victor Teixeira de Almeida, Lee Lee Choo, Shumo Chu, Paraschos Koutris, Dominik Moritz, Jennifer Ortiz, Vaspol Ruamviboonsuk, Jingjing Wang, Andrew Whitaker, Shengliang Xu, Magdalena Balazinska, Bill Howe, and Dan Suciu. 2014. Demonstration of the Myria big data management service. In SIGMOD. 881--884.Google ScholarGoogle Scholar
  93. Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW. 173--182.Google ScholarGoogle Scholar
  94. Xinran He, Junfeng Pan, Ou Jin, Tianbing Xu, Bo Liu, Tao Xu, Yanxin Shi, Antoine Atallah, Ralf Herbrich, Stuart Bowers, and Joaquin Quiñonero Candela. 2014. Practical lessons from predicting clicks on ads at facebook. In Proceedings of the Eighth International Workshop on Data Mining for Online Advertising. 1--9.Google ScholarGoogle ScholarDigital LibraryDigital Library
  95. Alireza Heidari, Joshua McGrath, Ihab F Ilyas, and Theodoros Rekatsinas. 2019. HoloDetect: Few-Shot Learning for Error Detection. In SIGMOD. 829--846.Google ScholarGoogle Scholar
  96. M. R. Henzinger, T. Henzinger, and P. Kopke. 1995. Computing simulations on finite and infinite graphs. In FOCS. 453--462.Google ScholarGoogle Scholar
  97. Linus Hermansson, Tommi Kerola, Fredrik Johansson, Vinay Jethava, and Devdatt Dubhashi. 2013. Entity disambiguation in anonymized graphs using graph kernels. In CIKM. 1037--1046.Google ScholarGoogle Scholar
  98. Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. 2013. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract. In IJCAI. 3161--3165.Google ScholarGoogle Scholar
  99. John E. Hopcroft and Richard M. Karp. 1973. An n5/2 Algorithm for Maximum Matchings in Bipartite Graphs. SIAM J. Comput. 2, 4 (1973), 225--231.Google ScholarGoogle ScholarDigital LibraryDigital Library
  100. Boyi Hou, Qun Chen, Yanyan Wang, Youcef Nafa, and Zhanhua Li. 2022. Gradual Machine Learning for Entity Resolution. TKDE 34, 4 (2022), 1803--1814.Google ScholarGoogle ScholarCross RefCross Ref
  101. Robert Isele, Anja Jentzsch, and Christian Bizer. 2010. Silk Server - Adding missing Links while consuming Linked Data. In COLD, Vol. 665.Google ScholarGoogle Scholar
  102. Glen Jeh and Jennifer Widom. 2002. Simrank: A measure of structural-context similarity. In KDD. 538--543.Google ScholarGoogle ScholarDigital LibraryDigital Library
  103. Alekh Jindal, Samuel Madden, Malú Castellanos, and Meichun Hsu. 2015. Graph analytics using Vertica relational database. In BigData. IEEE Computer Society, 1191--1200.Google ScholarGoogle Scholar
  104. Alekh Jindal, Praynaa Rawlani, Eugene Wu, Samuel Madden, Amol Deshpande, and Mike Stonebraker. 2014. VERTEXICA: Your Relational Friend for Graph Analytics! PVLDB 7, 13 (2014), 1669--1672.Google ScholarGoogle ScholarDigital LibraryDigital Library
  105. N. D. Jones. 1996. An Introduction to Partial Evaluation. Comput. Surveys 28, 3 (1996), 480--503.Google ScholarGoogle ScholarDigital LibraryDigital Library
  106. Richard M. Karp and Vijaya Ramachandran. 1988. A Survey of Parallel Algorithms for Shared-Memory Machines. Technical Report UCB/CSD-88-408. EECS Department, University of California, Berkeley. http://www.eecs.berkeley.edu/Pubs/TechRpts/1988/5865.htmlGoogle ScholarGoogle Scholar
  107. George Karypis and Vipin Kumar. 1998. Multilevel k-way Partitioning Scheme for Irregular Graphs. J. Parallel Distributed Comput. 48, 1 (1998), 96--129.Google ScholarGoogle ScholarDigital LibraryDigital Library
  108. Jungo Kasai, Kun Qian, Sairam Gurajada, Yunyao Li, and Lucian Popa. 2019. Low-resource deep entity resolution with transfer and active learning. arXiv preprint arXiv:1906.08042 (2019).Google ScholarGoogle Scholar
  109. Jeremy Kepner, William Arcand, William Bergeron, Nadya T. Bliss, Robert Bond, Chansup Byun, Gary Condon, Kenneth Gregson, Matthew Hubbell, Jonathan Kurz, Andrew McCabe, Peter Michaleas, Andrew Prout, Albert Reuther, Antonio Rosa, and Charles Yee. 2012. Dynamic distributed dimensional data model (D4M) database and computation system. In ICASSP. IEEE, 5349--5352.Google ScholarGoogle Scholar
  110. Zuhair Khayyat, Ihab F. Ilyas, Alekh Jindal, Samuel Madden, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, and Si Yin. 2015. BigDansing: A System for Big Data Cleansing. In SIGMOD. 1215--1230.Google ScholarGoogle Scholar
  111. Mijung Kim and K Selçuk Candan. 2012. SBV-Cut: Vertex-cut based graph partitioning using structural balance vertices. DKE 72 (2012), 285--303.Google ScholarGoogle ScholarDigital LibraryDigital Library
  112. Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, and José Pereira. 2016. The CloudMdsQL, Multistore System. In SIGMOD. ACM, 2113--2116.Google ScholarGoogle Scholar
  113. Pradap Konda, Sanjib Das, Paul Suganthan G. C., AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Han Li, Fatemah Panahi, Haojun Zhang, Jeffrey F. Naughton, Shishir Prasad, Ganesh Krishnan, Rohit Deep, and Vijay Raghavendra. 2016. Magellan: Toward building entity matching management systems. PVLDB 9, 12 (2016), 1197--1208.Google ScholarGoogle ScholarDigital LibraryDigital Library
  114. Hanna Köpcke, Andreas Thor, and Erhard Rahm. 2010. Evaluation of entity resolution approaches on real-world match problems. PVLDB 3, 1 (2010), 484--493.Google ScholarGoogle ScholarDigital LibraryDigital Library
  115. Yehuda Koren. 2008. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In SIGKDD. 426--434.Google ScholarGoogle Scholar
  116. Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30--37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  117. Christos Koutras, Marios Fragkoulis, Asterios Katsifodimos, and Christoph Lofi. 2020. REMA: Graph Embeddings-based Relational Schema Matching. In EDBT/ICDT, Vol. 2578.Google ScholarGoogle Scholar
  118. Clyde P. Kruskal, Larry Rudolph, and Marc Snir. 1990. A Complexity Theory of Efficient Parallel Algorithms. Theor. Comput. Sci. 71, 1 (1990), 95--132.Google ScholarGoogle ScholarDigital LibraryDigital Library
  119. Mitsuru Kusumoto, Takanori Maehara, and Ken-ichi Kawarabayashi. 2014. Scalable similarity search for SimRank. In SIGMOD. 325--336.Google ScholarGoogle Scholar
  120. Selasi Kwashie, Lin Liu, Jixue Liu, Markus Stumptner, Jiuyong Li, and Lujing Yang. 2019. Certus: An effective entity resolution approach with graph differential dependencies (GDDs). PVLDB 12, 6 (2019), 653--666.Google ScholarGoogle ScholarDigital LibraryDigital Library
  121. Aapo Kyrola, Guy E. Blelloch, and Carlos Guestrin. 2012. GraphChi: Large-scale graph computation on just a PC. In OSDI. 31--46.Google ScholarGoogle Scholar
  122. Jeanne C Latourelle, Merete Dybdahl, Anita L Destefano, Richard H Myers, and Timothy L Lash. 2010. Risk of Parkinson's disease after tamoxifen treatment. BMC neurology 10, 1 (2010), 1--7.Google ScholarGoogle Scholar
  123. Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef, Sören Auer, and Christian Bizer. 2015. DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web 6, 2 (2015), 167--195.Google ScholarGoogle ScholarCross RefCross Ref
  124. Bing Li, Wei Wang, Yifang Sun, Linhan Zhang, Muhammad Asif Ali, and Yi Wang. 2020. GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks.. In AAAI. 8172--8179.Google ScholarGoogle Scholar
  125. Yu Li, Hiroyuki Kuwahara, Peng Yang, Le Song, and Xin Gao. 2019. PGCN: Disease gene prioritization by disease and gene embedding through graph convolutional neural networks. biorxiv (2019), 532226.Google ScholarGoogle Scholar
  126. Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, and Wang-Chiew Tan. 2020. Deep Entity Matching with Pre-Trained Language Models. PVLDB 14, 1 (2020), 50--60.Google ScholarGoogle ScholarDigital LibraryDigital Library
  127. Leonid Libkin. 2004. Elements of Finite Model Theory. Springer.Google ScholarGoogle ScholarDigital LibraryDigital Library
  128. Greg Linden, Brent Smith, and Jeremy York. 2003. Amazon. com recommendations: Item-to-item collaborative filtering. IEEE Internet computing 7, 1 (2003), 76--80.Google ScholarGoogle Scholar
  129. Yanhong A. Liu. 2000. Efficiency by Incrementalization: An Introduction. High. Order Symb. Comput. 13, 4 (2000), 289--313.Google ScholarGoogle ScholarDigital LibraryDigital Library
  130. Beth Logan and Ariel Salomon. 2001. A content-based music similarity function. Cambridge Research Labs-Tech Report (2001).Google ScholarGoogle Scholar
  131. Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, and Joseph M. Hellerstein. 2012. Distributed GraphLab: A Framework for Machine Learning in the Cloud. PVLDB 5, 8 (2012), 716--727.Google ScholarGoogle ScholarDigital LibraryDigital Library
  132. Xin Luo, Mengchu Zhou, Yunni Xia, and Qingsheng Zhu. 2014. An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. IEEE Transactions on Industrial Informatics 10, 2 (2014), 1273--1284.Google ScholarGoogle ScholarCross RefCross Ref
  133. Steffen Maass, Changwoo Min, Sanidhya Kashyap, Woon-Hak Kang, Mohan Kumar, and Taesoo Kim. 2017. Mosaic: Processing a Trillion-Edge Graph on a Single Machine. In EuroSys. ACM, 527--543.Google ScholarGoogle Scholar
  134. Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, and Nan Tang. 2019. Raha: A Configuration-Free Error Detection System. In SIGMOD. 865--882.Google ScholarGoogle Scholar
  135. Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. 2010. Pregel: A system for large-scale graph processing. In SIGMOD. 135--146.Google ScholarGoogle ScholarDigital LibraryDigital Library
  136. Bernard Mans and Luke Mathieson. 2017. Incremental Problems in the Parameterized Complexity Setting. Theory Comput. Syst. 60, 1 (2017), 3--19.Google ScholarGoogle ScholarDigital LibraryDigital Library
  137. Mugilan Mariappan and Keval Vora. 2019. GraphBolt: Dependency-Driven Synchronous Processing of Streaming Graphs. In EuroSys. 25:1--25:16.Google ScholarGoogle Scholar
  138. Brian McFee, Luke Barrington, and Gert Lanckriet. 2012. Learning content similarity for music recommendation. IEEE transactions on audio, speech, and language processing 20, 8 (2012), 2207--2218.Google ScholarGoogle ScholarDigital LibraryDigital Library
  139. Frank McSherry, Michael Isard, and Derek Gordon Murray. 2015. Scalability! But at what COST?. In HotOS.Google ScholarGoogle Scholar
  140. Frank McSherry, Derek Gordon Murray, Rebecca Isaacs, and Michael Isard. 2013. Differential Dataflow. In CIDR.Google ScholarGoogle Scholar
  141. Alberto O. Mendelzon and Peter T. Wood. 1995. Finding Regular Simple Paths in Graph Databases. SIAM J. Comput. 24, 6 (1995), 1235--1258.Google ScholarGoogle ScholarDigital LibraryDigital Library
  142. Franck Michel, Johan Montagnat, and Catherine Faron Zucker. 2014. A survey of RDB to RDF translation approaches and tools. https://hal.archives-ouvertes.fr/hal-00903568/file/Rapport_Rech_I3S_v2_-_Michel_et_al_2013_-_A_survey_of_RDB_to_RDF_translation_approaches_and_tools.pdf.Google ScholarGoogle Scholar
  143. Robin Milner. 1989. Communication and Concurrency. Prentice Hall.Google ScholarGoogle ScholarDigital LibraryDigital Library
  144. Peter Bro Miltersen, Sairam Subramanian, Jeffrey Scott Vitter, and Roberto Tamassia. 1994. Complexity Models for Incremental Computation. Theor. Comput. Sci. 130, 1 (1994), 203--236.Google ScholarGoogle ScholarDigital LibraryDigital Library
  145. Sidharth Mudgal, Han Li, Theodoros Rekatsinas, AnHai Doan, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, and Vijay Raghavendra. 2018. Deep Learning for Entity Matching: A Design Space Exploration. In SIGMOD. 19--34.Google ScholarGoogle Scholar
  146. Fatemeh Nargesian, Erkang Zhu, Renée J. Miller, Ken Q. Pu, and Patricia C. Arocena. 2019. Data Lake Management: Challenges and Opportunities. PVLDB 12, 12 (2019), 1986--1989.Google ScholarGoogle ScholarDigital LibraryDigital Library
  147. Axel-Cyrille Ngonga Ngomo and Sören Auer. 2011. LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data. In IJCAI. 2312--2317.Google ScholarGoogle Scholar
  148. Donald Nguyen, Andrew Lenharth, and Keshav Pingali. 2013. A lightweight infrastructure for graph analytics. In SOSP. 456--471.Google ScholarGoogle Scholar
  149. Phuc Nguyen, Ikuya Yamada, Natthawut Kertkeidkachorn, Ryutaro Ichise, and Hideaki Takeda. 2020. MTab4Wikidata at SemTab 2020: Tabular Data Annotation with Wikidata. In SemTabISWC, Vol. 2775. 86--95.Google ScholarGoogle Scholar
  150. George Papadakis, Leonidas Tsekouras, Emmanouil Thanos, George Giannakopoulos, Themis Palpanas, and Manolis Koubarakis. 2018. The return of JedAI: End-to-end entity resolution for structured and semi-structured data. PVLDB 11, 12 (2018), 1950--1953.Google ScholarGoogle ScholarDigital LibraryDigital Library
  151. Christos H. Papadimitriou. 1994. Computational complexity. Addison-Wesley.Google ScholarGoogle Scholar
  152. Kun Qian, Lucian Popa, and Prithviraj Sen. 2017. Active Learning for Large-Scale Entity Resolution. In CIKM. 1379--1388.Google ScholarGoogle Scholar
  153. Abdul Quamar and Amol Deshpande. 2016. NScaleSpark: subgraph-centric graph analytics on Apache Spark. In NDA. ACM, 5:1--5:8.Google ScholarGoogle Scholar
  154. G. Ramalingam and Thomas Reps. 1996. An incremental algorithm for a generalization of the shortest-path problem. J. Algorithms 21, 2 (1996), 267--305.Google ScholarGoogle ScholarDigital LibraryDigital Library
  155. G. Ramalingam and Thomas Reps. 1996. On the computational complexity of dynamic graph problems. ACM Trans. Database Syst. 158, 1--2 (1996), 233--277.Google ScholarGoogle Scholar
  156. G. Ramalingam and Thomas W. Reps. 1996. An Incremental Algorithm for a Generalization of the Shortest-Path Problem. J. Algorithms 21, 2 (1996), 267--305.Google ScholarGoogle ScholarDigital LibraryDigital Library
  157. Thomas C. Redman. 2016. Bad Data Costs the U.S. $3 Trillion Per Year. Harvard Business Review. https://hbr.org/2016/09/bad-data-costs-the-u-s-3-trillion-per-year.Google ScholarGoogle Scholar
  158. Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In EMNLP-IJCNLP. 3980--3990.Google ScholarGoogle Scholar
  159. Theodoros Rekatsinas, Xu Chu, Ihab F. Ilyas, and Christopher Ré. 2017. Holo-Clean: Holistic Data Repairs with Probabilistic Inference. PVLDB 10, 11 (2017), 1190--1201.Google ScholarGoogle ScholarDigital LibraryDigital Library
  160. Amitabha Roy, Ivo Mihailovic, and Willy Zwaenepoel. 2013. X-stream: Edge-centric graph processing using streaming partitions. In SOSP. ACM, 472--488.Google ScholarGoogle ScholarDigital LibraryDigital Library
  161. Alieh Saeedi, Eric Peukert, and Erhard Rahm. 2018. Using link features for entity clustering in knowledge graphs. In ESWC. 576--592.Google ScholarGoogle Scholar
  162. R Sandyk and MA Gillman. 1985. Acute exacerbation of Parkinson's disease with sulindac. Annals of neurology 17, 1 (1985), 104--105.Google ScholarGoogle ScholarCross RefCross Ref
  163. Jan Schluter and Christian Osendorfer. 2011. Music similarity estimation with the mean-covariance restricted Boltzmann machine. In ICMLA, Vol. 2. IEEE, 118--123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  164. Shilad Sen, Jesse Vig, and John Riedl. 2009. Tagommenders: connecting users to items through tags. In WWW. 671--680.Google ScholarGoogle Scholar
  165. Bin Shao, Haixun Wang, and Yatao Li. 2013. Trinity: a distributed graph engine on a memory cloud. In SIGMOD. 505--516.Google ScholarGoogle Scholar
  166. Shenzhen Institute of Computing Sciences. 2022. Fishing Fort. https://en.sics.ac.cn/col84/index.Google ScholarGoogle Scholar
  167. Juan Shu, Yu Li, Sheng Wang, Bowei Xi, and Jianzhu Ma. 2021. Disease gene prediction with privileged information and heteroscedastic dropout. Bioinformatics 37, Supplement_1 (2021), i410--i417.Google ScholarGoogle ScholarCross RefCross Ref
  168. Benjamin A. Steer, Alhamza Alnaimi, Marco A. B. F. G. Lotz, Félix Cuadrado, Luis M. Vaquero, and Joan Varvenne. 2017. Cytosm: Declarative Property Graph Queries Without Data Migration. In GRADES. 4:1--4:6.Google ScholarGoogle Scholar
  169. Fabian M. Suchanek, Serge Abiteboul, and Pierre Senellart. 2011. PARIS: Probabilistic Alignment of Relations, Instances, and Schema. PVLDB 5, 3 (2011), 157--168.Google ScholarGoogle ScholarDigital LibraryDigital Library
  170. Huifeng Sun, Yong Peng, Junliang Chen, Chuanchang Liu, and Yuzhuo Sun. 2011. A New Similarity Measure Based on Adjusted Euclidean Distance for Memory-based Collaborative Filtering. JSW 6, 6 (2011), 993--1000.Google ScholarGoogle ScholarCross RefCross Ref
  171. Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu, and Tianyi Wu. 2011. Path-Sim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks. PVLDB 4, 11 (2011), 992--1003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  172. Zhu Sun, Qing Guo, Jie Yang, Hui Fang, Guibing Guo, Jie Zhang, and Robin Burke. 2019. Research commentary on recommendations with side information: A survey and research directions. Electronic Commerce Research and Applications 37 (2019).Google ScholarGoogle Scholar
  173. Katia P. Sycara. 1993. Machine learning for intelligent support of conflict resolution. Decision Support Systems 10, 2 (1993), 121--136.Google ScholarGoogle ScholarDigital LibraryDigital Library
  174. Bo-Tao Tan, Li Wang, Sen Li, Zai-Yun Long, Ya-Min Wu, and Yuan Liu. 2015. Retinoic acid induced the differentiation of neural stem cells from embryonic spinal cord into functional neurons in vitro. International journal of clinical and experimental pathology 8, 7 (2015).Google ScholarGoogle Scholar
  175. Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Samuel Madden, and Mourad Ouzzani. 2021. RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation. PVLDB 14, 8 (2021), 1254--1261.Google ScholarGoogle ScholarDigital LibraryDigital Library
  176. Robert Endre Tarjan. 1972. Depth-First Search and Linear Graph Algorithms. SIAM J. Comput. 1, 2 (1972), 146--160.Google ScholarGoogle ScholarDigital LibraryDigital Library
  177. Tim Teitelbaum and Thomas W. Reps. 1981. The Cornell Program Synthesizer: A Syntax-Directed Programming Environment. Commun. ACM 24, 9 (1981), 563--573.Google ScholarGoogle ScholarDigital LibraryDigital Library
  178. Yuanyuan Tian, Andrey Balmin, Severin Andreas Corsten, and John McPherson Shirish Tatikonda. 2013. From "Think Like a Vertex" to "Think Like a Graph". PVLDB 7, 7 (2013), 193--204.Google ScholarGoogle ScholarDigital LibraryDigital Library
  179. Rakshit Trivedi, Bunyamin Sisman, Jun Ma, Christos Faloutsos, Hongyuan Zha, and Xin Luna Dong. 2018. Linknbed: Multi-graph representation learning with entity linkage. In ACL.Google ScholarGoogle Scholar
  180. Shalini Tyagi and Ernesto Jimenez-Ruiz. 2020. LexMa: Tabular data to knowledge graph matching using lexical techniques. In CEUR Workshop Proceedings, Vol. 2775. 59--64.Google ScholarGoogle Scholar
  181. Farman Ullah, Ghulam Sarwar, Sung Chang Lee, Yun Kyung Park, Kyeong Deok Moon, and Jin Tae Kim. 2012. Hybrid recommender system with temporal information. In ICOIN. IEEE, 421--425.Google ScholarGoogle Scholar
  182. Leslie G. Valiant. 1990. A Bridging Model for Parallel Computation. Commun. ACM 33, 8 (1990), 103--111.Google ScholarGoogle ScholarDigital LibraryDigital Library
  183. Aäron Van Den Oord, Sander Dieleman, and Benjamin Schrauwen. 2013. Deep content-based music recommendation. In NIPS. Neural Information Processing Systems Foundation (NIPS), 2643--2651.Google ScholarGoogle Scholar
  184. Larysa Visengeriyeva and Ziawasch Abedjan. 2018. Metadata-driven error detection. In SSDBM. 1:1--1:12.Google ScholarGoogle Scholar
  185. Keval Vora, Rajiv Gupta, and Guoqing (Harry) Xu. 2017. KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations. In ASPLOS.Google ScholarGoogle Scholar
  186. W3C. 2012. Relational Databases to RDF (RDB2RDF).Google ScholarGoogle Scholar
  187. Guozhang Wang, Wenlei Xie, Alan J. Demers, and Johannes Gehrke. 2013. Asynchronous Large-Scale Graph Processing Made Easy. In CIDR.Google ScholarGoogle Scholar
  188. Xiaochan Wang, Yuchong Gong, Jing Yi, and Wen Zhang. 2019. Predicting gene-disease associations from the heterogeneous network using graph embedding. In IEEE International conference on bioinformatics and biomedicine (BIBM). 504--511.Google ScholarGoogle ScholarCross RefCross Ref
  189. Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019. Neural graph collaborative filtering. In SIGIR. 165--174.Google ScholarGoogle Scholar
  190. Xinxi Wang and Ye Wang. 2014. Improving content-based and hybrid music recommendation using deep learning. In ACM Multimedia. 627--636.Google ScholarGoogle Scholar
  191. Yue Wang, Zhe Wang, Ziyuan Zhao, Zijian Li, Xun Jian, Hao Xin, Lei Chen, Jianchun Song, Zhenhong Chen, and Meng Zhao. 2022. Effective Similarity Search on Heterogeneous Networks: A Meta-path Free Approach. TKDE 34, 7 (2022), 3225--3240.Google ScholarGoogle Scholar
  192. Duncan J Watts and Steven H Strogatz. 1998. Collective dynamics of 'small-world' networks. Nature 393, 6684 (1998), 440--442.Google ScholarGoogle Scholar
  193. Steven Euijong Whang and Hector Garcia-Molina. 2013. Joint entity resolution on multiple datasets. The VLDB Journal 22, 6 (2013), 773--795.Google ScholarGoogle ScholarDigital LibraryDigital Library
  194. Charith Wickramaarachchi, Charalampos Chelmis, and Viktor K. Prasanna. 2015. Empowering Fast Incremental Computation over Large Scale Dynamic Graphs. In IPDPS.Google ScholarGoogle Scholar
  195. Renzhi Wu, Sanya Chaba, Saurabh Sawlani, Xu Chu, and Saravanan Thirumuruganathan. 2020. ZeroER: Entity Resolution using Zero Labeled Examples. In SIGMOD. 1149--1164.Google ScholarGoogle Scholar
  196. Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Rui Yan, and Dongyan Zhao. 2019. Relation-Aware Entity Alignment for Heterogeneous Knowledge Graphs. In IJCAI. 5278--5284.Google ScholarGoogle Scholar
  197. Chenning Xie, Rong Chen, Haibing Guan, Binyu Zang, and Haibo Chen. 2015. SYNC or ASYNC: Time to fuse for distributed graph-parallel computation. In PPOPP. 194--204.Google ScholarGoogle Scholar
  198. Xianghao Xu, Fang Wang, Hong Jiang, Yongli Cheng, Dan Feng, and Yongxuan Zhang. 2020. A Hybrid Update Strategy for I/O-Efficient Out-of-Core Graph Processing. IEEE Trans. Parallel Distributed Syst. 31, 8 (2020), 1767--1782.Google ScholarGoogle ScholarCross RefCross Ref
  199. Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In SIGKDD. 974--983.Google ScholarGoogle Scholar
  200. Weiren Yu, Xuemin Lin, Wenjie Zhang, Jian Pei, and Julie A McCann. 2019. SimRank*: effective and scalable pairwise similarity search based on graph topology. The VLDB Journal 28, 3 (2019), 401--426.Google ScholarGoogle ScholarDigital LibraryDigital Library
  201. Timothy A. K. Zakian, Ludovic A. R. Capelli, and Zhenjiang Hu. 2019. Incrementalization of Vertex-Centric Programs. In IPDPS.Google ScholarGoogle Scholar
  202. Xiangxiang Zeng, Xinqi Tu, Yuansheng Liu, Xiangzheng Fu, and Yansen Su. 2022. Toward better drug discovery with knowledge graph. Current opinion in structural biology 72 (2022), 114--126.Google ScholarGoogle Scholar
  203. Baichuan Zhang and Mohammad Al Hasan. 2017. Name disambiguation in anonymized graphs using network embedding. In CIKM. 1239--1248.Google ScholarGoogle Scholar
  204. Bingjun Zhang, Jialie Shen, Qiaoliang Xiang, and Ye Wang. 2009. Compositemap: a novel framework for music similarity measure. In SIGIR. 403--410.Google ScholarGoogle Scholar
  205. Dongxiang Zhang, Long Guo, Xiangnan He, Jie Shao, Sai Wu, and Heng Tao Shen. 2018. A Graph-Theoretic Fusion Framework for Unsupervised Entity Resolution. In ICDE. 713--724.Google ScholarGoogle Scholar
  206. Qingheng Zhang, Zequn Sun, Wei Hu, Muhao Chen, Lingbing Guo, and Yuzhong Qu. 2019. Multi-view Knowledge Graph Embedding for Entity Alignment. In IJCAI. 5429--5435.Google ScholarGoogle Scholar
  207. Shuo Zhang, Edgar Meij, Krisztian Balog, and Ridho Reinanda. 2020. Novel Entity Discovery from Web Tables. In WWW. 1298--1308.Google ScholarGoogle Scholar
  208. Chen Zhao and Yeye He. 2019. Auto-EM: End-to-end Fuzzy Entity-Matching using Pre-trained Deep Models and Transfer Learning. In WWW. 2413--2424.Google ScholarGoogle Scholar
  209. Jie Zhao, Manish Kumar, Jeevan Sharma, and Zhihai Yuan. 2021. Arbutin effectively ameliorates the symptoms of Parkinson's disease: The role of adenosine receptors and cyclic adenosine monophosphate. Neural regeneration research 16, 10 (2021), 2030.Google ScholarGoogle Scholar
  210. Kangfei Zhao and Jeffrey Xu Yu. 2017. All-in-One: Graph Processing in RDBMSs Revisited. In SIGMOD. 1165--1180.Google ScholarGoogle Scholar
  211. Liang Zhao. 2021. Event Prediction in the Big Data Era: A Systematic Survey. ACM Comput. Surv. 54, 5 (2021), 94:1--94:37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  212. Zhongying Zhao, Xuejian Zhang, Hui Zhou, Chao Li, Maoguo Gong, and Yongqing Wang. 2020. HetNERec: Heterogeneous network embedding based recommendation. Knowledge-Based Systems 204 (2020).Google ScholarGoogle Scholar
  213. Lei Zheng, Vahid Noroozi, and Philip S Yu. 2017. Joint deep modeling of users and items using reviews for recommendation. In WSDM. 425--434.Google ScholarGoogle Scholar
  214. Minpeng Zhu and Tore Risch. 2011. Querying combined cloud-based and relational databases. In CSC. 330--335.Google ScholarGoogle Scholar
  215. Xiaowei Zhu, Wentao Han, and Wenguang Chen. 2015. GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning. In USENIX ATC. 375--386.Google ScholarGoogle Scholar
  216. Cai-Nicolas Ziegler, Georg Lausen, and Lars Schmidt-Thieme. 2004. Taxonomy-driven computation of product recommendations. In CIKM. 406--415.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image Proceedings of the VLDB Endowment
    Proceedings of the VLDB Endowment  Volume 15, Issue 12
    August 2022
    551 pages
    ISSN:2150-8097
    Issue’s Table of Contents

    Publisher

    VLDB Endowment

    Publication History

    • Published: 1 August 2022
    Published in pvldb Volume 15, Issue 12

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader