Big graphs: challenges and opportunities

Author:
Wenfei Fan

University of Edinburgh, Beihang University

Proceedings of the VLDB Endowment, Volume 15, Issue 12, pp. 3782–3797. https://doi.org/10.14778/3554821.3554899

Published: 01 August 2022

Abstract

Big data is typically characterized by the 4V's: Volume, Velocity, Variety, and Veracity. When it comes to big graphs, these challenges become even more staggering. Each of the 4V's raises new questions, from theory to systems and practice. Is it possible to parallelize sequential graph algorithms and guarantee the correctness of the parallelized computations? Given a computational problem, does there exist a parallel algorithm for it that is guaranteed to reduce parallel runtime when more machines are used? Is there a systematic method for developing incremental algorithms with effectiveness guarantees in response to frequent updates? Is it possible to write queries across relational databases and semistructured graphs in SQL? Can we unify logic rules and machine learning to improve the quality of graph-structured data and to deduce associations between entities? This paper aims to incite interest and curiosity in these topics. It raises as many questions as it answers.

References

2020. GraphScope. https://graphscope.io/.Google Scholar
2021. DBLP collaboration network. https://snap.stanford.edu/data/com-DBLP.html.Google Scholar
2022. IMDB. https://www.imdb.com/interfaces.Google Scholar
2022. Neo4J Project. http://neo4j.org/.Google Scholar
2022. Wikipedia. https://www.wikipedia.org.Google Scholar
Azza Abouzeid, Kamil Bajda-Pawlikowski, Daniel J. Abadi, Alexander Rasin, and Avi Silberschatz. 2009. HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads. PVLDB 2, 1 (2009), 922--933.Google ScholarDigital Library
Umut A. Acar. 2005. Self-Adjusting Computation. Ph.D. Dissertation. CMU.Google Scholar
Christopher P Adams and Van V Brantner. 2006. Estimating the cost of new drug development: is it really 802 million? Health affairs 25, 2 (2006), 420--428.Google Scholar
Sajad Ahmadian, Nima Joorabloo, Mahdi Jalili, Majid Meghdadi, Mohsen Afsharchi, and Yongli Ren. 2018. A temporal clustering approach for social recommender systems. In ASONAM. IEEE, 1139--1144.Google Scholar
Zhiyuan Ai, Mingxing Zhang, Yongwei Wu, Xuehai Qian, Kang Chen, and Weimin Zheng. 2017. Squeezing out All the Value of Loaded Data: An Out-of-core Graph Processing System with Reduced Disk I/O. In USENIX. 125--137.Google Scholar
João Paulo Aires and Felipe Meneguzzi. 2017. Norm Conflict Identification Using Deep Learning. In AAMAS Workshops. 194--207.Google Scholar
Arvind Arasu, Michaela Götz, and Raghav Kaushik. 2010. On active learning of record matching packages. In SIGMOD. 783--794.Google Scholar
Arvind Arasu, Christopher Ré, and Dan Suciu. 2009. Large-Scale Deduplication with Constraints Using Dedupalog. In ICDE. 952--963.Google Scholar
Marcelo Arenas, Leopoldo Bertossi, and Jan Chomicki. 1999. Consistent Query Answers in Inconsistent Databases. In PODS. 68--79.Google Scholar
Zeinab Bahmani, Leopoldo E. Bertossi, and Nikolaos Vasiloglou. 2017. ERBlox: Combining matching dependencies with machine learning for entity resolution. Int. J. Approx. Reasoning 83 (2017), 118--141.Google ScholarDigital Library
Jørgen Bang-Jensen and Gregory Z. Gutin. 2009. Digraphs - Theory, Algorithms and Applications, Second Edition. Springer.Google Scholar
Nurken Berdigaliyev and Mohamad Aljofan. 2020. An overview of drug discovery and development. Future Medicinal Chemistry 12, 10 (2020), 939--947.Google ScholarCross Ref
Rianne van den Berg, Thomas N Kipf, and Max Welling. 2017. Graph convolutional matrix completion. arXiv preprint arXiv:1706.02263 (2017).Google Scholar
James Bergstra and Yoshua Bengio. 2012. Random search for hyper-parameter optimization. JMLR 13, 1 (2012), 281--305.Google ScholarDigital Library
Leopoldo E. Bertossi, Solmaz Kolahi, and Laks V. S. Lakshmanan. 2013. Data Cleaning and Query Answering with Matching Dependencies and Matching Functions. Theory Comput. Syst. 52, 3 (2013), 441--482.Google ScholarDigital Library
Indrajit Bhattacharya and Lise Getoor. 2006. Entity resolution in graphs. Mining graph data (2006).Google Scholar
Dmitry Bogdanov, Martín Haro, Ferdinand Fuhrmann, Anna Xambó, Emilia Gómez, and Perfecto Herrera. 2013. Semantic audio content-based music recommendation and visualization based on user preference examples. Information Processing & Management 49, 1 (2013), 13--33.Google ScholarDigital Library
Kurt D. Bollacker, Colin Evans, Praveen K. Paritosh, Tim Sturge, and Jamie Taylor. 2008. Freebase: A collaboratively created graph database for structuring human knowledge. In SIGMOD. 1247--1250.Google ScholarDigital Library
Florian Bourse, Marc Lelarge, and Milan Vojnovic. 2014. Balanced graph edge partition. In SIGKDD. 1456--1465.Google Scholar
Jin-yi Cai, Martin Fürer, and Neil Immerman. 1992. An optimal lower bound on the number of variables for graph identifications. Comb. 12, 4 (1992), 389--410.Google ScholarCross Ref
Yufei Cai, Paolo G. Giarrusso, Tillmann Rendel, and Klaus Ostermann. 2014. A theory of changes for higher-order languages: Incrementalizing λ-calculi by static differentiation. In PLDI. 145--155.Google Scholar
Zhuhua Cai, Dionysios Logothetis, and Georgos Siganos. 2012. Facilitating real-time graph mining. In CloudDB.Google Scholar
Riccardo Cappuzzo, Paolo Papotti, and Saravanan Thirumuruganathan. 2020. Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks. In SIGMOD. 1335--1349.Google Scholar
Dawei Chen, Cheng Soon Ong, and Lexing Xie. 2016. Learning points and routes to recommend trajectories. In CIKM. 2227--2232.Google Scholar
Xu Chen, Yongfeng Zhang, and Zheng Qin. 2019. Dynamic explainable recommendation based on neural attentive models. In AAAI, Vol. 33. 53--60.Google ScholarDigital Library
Zhaoqiang Chen, Qun Chen, Fengfeng Fan, Yanyan Wang, Zhuo Wang, Youcef Nafa, Zhanhuai Li, Hailong Liu, and Wei Pan. 2018. Enabling quality control for entity resolution: A human and machine cooperation framework. In ICDE. IEEE, 1156--1167.Google Scholar
Wei-Ta Chu and Ya-Lun Tsai. 2017. A hybrid recommendation system considering visual information for predicting favorite restaurants. World Wide Web 20, 6 (2017), 1313--1331.Google ScholarDigital Library
Gao Cong, Wenfei Fan, Floris Geerts, Xibei Jia, and Shuai Ma. 2007. Improving Data Quality: Consistency and Accuracy. In VLDB. 315--326.Google Scholar
A Dairam, Edith M Antunes, KS Saravanan, and Santylal Daya. 2006. Non-steroidal anti-inflammatory agents, tolmetin and sulindac, inhibit liver tryptophan 2, 3-dioxygenase activity and alter brain neurotransmitter levels. Life sciences 79, 24 (2006), 2269--2274.Google ScholarCross Ref
Sanjib Das, Paul Suganthan G. C., AnHai Doan, Jeffrey F. Naughton, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, Vijay Raghavendra, and Youngchoon Park. 2017. Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services. In SIGMOD. 1431--1446.Google Scholar
Roshan Dathathri, Gurbinder Gill, Loc Hoang, Hoang-Vu Dang, Alex Brooks, Nikoli Dryden, Marc Snir, and Keshav Pingali. 2018. Gluon: A Communication-Optimizing Substrate for Distributed Heterogeneous Graph Analytics. In PLDI. 752--768.Google Scholar
Ankur Dave, Alekh Jindal, Li Erran Li, Reynold Xin, Joseph Gonzalez, and Matei Zaharia. 2016. GraphFrames: an integrated API for mixing graph and relational queries. In GRADES. 2.Google Scholar
Allan Peter Davis, Cynthia J Grondin, Robin J Johnson, Daniela Sciaky, Jolene Wiegers, Thomas C Wiegers, and Carolyn J Mattingly. 2021. Comparative toxicogenomics database (CTD): update 2021. Nucleic acids research 49, D1 (2021), D1138--D1143.Google Scholar
Amol Deshpande. 2018. In situ graph querying and analytics with graphgen: Extended abstract. In GRADES. 2:1--2:2.Google Scholar
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT.Google Scholar
J A. Dimasi. 2001. New drug development in the United States from 1963 to 1999. In Clinical pharmacology and therapeutics vol. 69,5.Google Scholar
Mohamad Dolatshah, Mathew Teoh, Jiannan Wang, and Jian Pei. 2018. Cleaning Crowdsourced Labels Using Oracles For Statistical Classification. PVLDB 12, 4 (2018), 376--389.Google ScholarDigital Library
Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks. In KDD.Google Scholar
Jennie Duggan, Aaron J. Elmore, Michael Stonebraker, Magdalena Balazinska, Bill Howe, Jeremy Kepner, Sam Madden, David Maier, Tim Mattson, and Stanley B. Zdonik. 2015. The BigDAWG Polystore System. SIGMOD Rec. 44, 2 (2015), 11--16.Google ScholarDigital Library
Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, and Nan Tang. 2018. Distributed Representations of Tuples for Entity Resolution. PVLDB 11, 11 (2018), 1454--1467.Google Scholar
Federico Errica, Marco Podda, Davide Bacciu, and Alessio Micheli. 2020. A Fair Comparison of Graph Neural Networks for Graph Classification. In ICLR.Google Scholar
Grace Fan, Wenfei Fan, Yuanhao Li, Ping Lu, Chao Tian, and Jingren Zhou. 2020. Extending Graph Patterns with Conditions. In SIGMOD. 715--729.Google Scholar
Jing Fan, Adalbert Gerald Soosai Raj, and Jignesh M. Patel. 2015. The Case Against Specialized Graph Analytics Engines. In CIDR.Google Scholar
Wenfei Fan, Zhe Fan, Chao Tian, and Xin Luna Dong. 2015. Keys for graphs. PVLDB 8, 12 (2015), 1590--1601.Google ScholarDigital Library
Wenfei Fan, Wenzhi Fu, Ruochun Jin, Ping Lu, and Chao Tian. 2022. Discovering Association Rules from Big Graphs. PVLDB 15, 7 (2022), 1479--1492.Google ScholarDigital Library
Wenfei Fan, Hong Gao, Xibei Jia, Jianzhong Li, and Shuai Ma. 2011. Dynamic constraints for record matching. VLDB J. 20, 4 (2011), 495--520.Google ScholarDigital Library
Wenfei Fan, Ling Ge, Ruochun Jin, Ping Lu, and Wenyuan Yu. 2022. Linking Entities across Relations and Graphs. In ICDE. IEEE.Google Scholar
Wenfei Fan and Floris Geerts. 2012. Foundations of Data Quality Management. Morgan & Claypool Publishers.Google ScholarDigital Library
Wenfei Fan, Floris Geerts, Xibei Jia, and Anastasios Kementsietsidis. 2008. Conditional Functional Dependencies for Capturing Data Inconsistencies. ACM Trans. Database Syst. 33, 1 (2008), 25:1--25:49.Google ScholarDigital Library
Wenfei Fan, Floris Geerts, Nan Tang, and Wenyuan Yu. 2014. Conflict resolution with data currency and consistency. J. Data and Information Quality 5, 1--2 (2014), 6:1--6:37.Google ScholarDigital Library
Wenfei Fan, Tao He, Longbin Lai, Xue Li, Yong Li, Zhao Li, Zhengping Qian, Chao Tian, Lei Wang, Jingbo Xu, Youyang Yao, Qiang Yin, Wenyuan Yu, Kai Zeng, Kun Zhao, Jingren Zhou, Diwen Zhu, and Rong Zhu. 2021. GraphScope: A Unified Engine For Big Graph Processing. PVLDB 14, 12 (2021), 2879--2892.Google ScholarDigital Library
Wenfei Fan, Chunming Hu, Xueli Liu, and Ping Lu. 2020. Discovering Graph Functional Dependencies. ACM Trans. Database Syst. 45, 3 (2020), 15:1--15:42.Google ScholarDigital Library
Wenfei Fan, Chunming Hu, and Chao Tian. 2017. Incremental Graph Computations: Doable and Undoable. In SIGMOD. 155--169.Google Scholar
Wenfei Fan, Ruochun Jin, Muyang Liu, Ping Lu, Xiaojian Luo, Ruiqi Xu, Qiang Yin, Wenyuan Yu, and Jingren Zhou. 2020. Application Driven Graph Partitioning. In SIGMOD. 1765--1779.Google Scholar
Wenfei Fan, Ruochun Jin, Muyang Liu, Ping Lu, Chao Tian, and Jingren Zhou. 2020. Capturing Associations in Graphs. PVLDB 13, 11 (2020), 1863--1876.Google ScholarDigital Library
Wenfei Fan, Ruochun Jin, Ping Lu, Chao Tian, and Ruiqi Xu. 2022. Towards Event Prediction in Temporal Graphs. PVLDB 15, 9 (2022), 1861--1874.Google ScholarDigital Library
Wenfei Fan, Yuanhao Li, Muyang Liu, and Can Lu. 2021. Making Graphs Compact by Lossless Contraction. In SIGMOD. 472--484.Google Scholar
Wenfei Fan, Yuanhao Li, Muyang Liu, and Can Lu. 2022. A Hierarchical Contraction Scheme for Querying Big Graphs. In SIGMOD. 1726--1740.Google Scholar
Wenfei Fan, Muyang Liu, Chao Tian, Ruiqi Xu, and Jingren Zhou. 2020. Incrementalization of Graph Partitioning Algorithms. PVLDB 13, 8 (2020), 1261--1274.Google ScholarDigital Library
Wenfei Fan, Xueli Liu, Ping Lu, and Chao Tian. 2018. Catching Numeric Inconsistencies in Graphs. In SIGMOD. 381--393.Google Scholar
Wenfei Fan and Ping Lu. 2019. Dependencies for Graphs. ACM Trans. Database Syst. 44, 2 (2019), 5:1--5:40.Google ScholarDigital Library
Wenfei Fan, Ping Lu, Xiaojian Luo, Jingbo Xu, Qiang Yin, Wenyuan Yu, and Ruiqi Xu. 2018. Adaptive Asynchronous Parallelization of Graph Algorithms. In SIGMOD. 1141--1156.Google Scholar
Wenfei Fan, Ping Lu, and Chao Tian. 2020. Unifying logic rules and machine learning for entity enhancing. Sci. China Inf. Sci. 63, 7 (2020).Google Scholar
Wenfei Fan, Ping Lu, Chao Tian, and Jingren Zhou. 2019. Deducing Certain Fixes to Graphs. PVLDB 12, 7 (2019), 752--765.Google ScholarDigital Library
Wenfei Fan, Ping Lu, Wenyuan Yu, Jingbo Xu, Qiang Yin, Xiaojian Luo, Jingren Zhou, and Ruochun Jin. 2020. Adaptive Asynchronous Parallelization of Graph Algorithms. ACM Trans. Database Syst. 45, 2 (2020), 6:1--6:45.Google ScholarDigital Library
Wenfei Fan and Chao Tian. 2022. Incremental Graph Computations: Doable and Undoable. ACM Trans. Database Syst. 47, 2 (2022), 6:1--6:44.Google ScholarDigital Library
Wenfei Fan, Chao Tian, Yanghao Wang, and Qiang Yin. 2021. Discrepancy Detection and Incremental Detection. PVLDB 14, 8 (2021), 1351--1364.Google ScholarDigital Library
Wenfei Fan, Chao Tian, Ruiqi Xu, Qiang Yin, Wenyuan Yu, and Jingren Zhou. 2021. Incrementalizing Graph Algorithms. In SIGMOD. 459--471.Google Scholar
Wenfei Fan, Xin Wang, and Yinghui Wu. 2013. Incremental graph pattern matching. ACM Trans. Database Syst. 38, 3 (2013).Google ScholarDigital Library
Wenfei Fan, Xin Wang, and Yinghui Wu. 2014. Distributed graph simulation: Impossibility and possibility. PVLDB 7, 12 (2014), 1083--1094.Google ScholarDigital Library
Wenfei Fan, Yinghui Wu, and Jingbo Xu. 2016. Functional dependencies for graphs. In SIGMOD. 1843--1857.Google Scholar
Wenfei Fan, Jingbo Xu, Yinghui Wu, Wenyuan Yu, Jiaxin Jiang, Zeyu Zheng, Bohan Zhang, Yang Cao, and Chao Tian. 2017. Parallelizing Sequential Graph Computations. In SIGMOD. 495--510.Google Scholar
Wenfei Fan, Wenyuan Yu, Jingbo Xu, Jingren Zhou, Xiaojian Luo, Qiang Yin, Ping Lu, Yang Cao, and Ruiqi Xu. 2018. Parallelizing Sequential Graph Computations. ACM Trans. Database Syst. 43, 4 (2018), 18:1--18:39.Google ScholarDigital Library
Chris Fotis, Asier Antoranz, Dimitris Hatziavramidis, Theodore Sakellaropoulos, and Leonidas G. Alexopoulos. 2017. Pathway-based technologies for early drug discovery. Drug Discovery Today (2017).Google Scholar
Michael L Fredman and Robert Endre Tarjan. 1987. Fibonacci heaps and their uses in improved network optimization algorithms. JACM 34, 3 (1987).Google Scholar
Cheng Fu, Xianpei Han, Le Sun, Bo Chen, Wei Zhang, Suhui Wu, and Hao Kong. 2019. End-to-end multi-perspective matching for entity resolution. In IJCAI. 4961--4967.Google Scholar
Xinyu Fu, Jiani Zhang, Ziqiao Meng, and Irwin King. 2020. MAGNN: Metapath aggregated graph neural network for heterogeneous graph embedding. In WWW. 2331--2341.Google Scholar
Michael Garey and David Johnson. 1979. Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman.Google ScholarDigital Library
Gartner. 2018. How to create a business case for data quality improvement. https://www.gartner.com/smarterwithgartner/how-to-create-a-business-case-for-data-quality-improvement/.Google Scholar
Gurbinder Gill, Roshan Dathathri, Loc Hoang, Ramesh Peri, and Keshav Pingali. 2020. Single Machine Graph Analytics on Massive Datasets Using Intel Optane DC Persistent Memory. PVLDB 13, 8 (2020), 1304--1318.Google ScholarDigital Library
Lukasz Golab, Howard Karloff, Flip Korn, Divesh Srivastava, and Bei Yu. 2008. On generating near-optimal tableaux for conditional functional dependencies. PVLDB 1, 1 (2008), 376--390.Google ScholarDigital Library
Joseph E. Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, and Carlos Guestrin. 2012. PowerGraph: Distributed Graph-Parallel Computation on Natural Graphs. In USENIX. 17--30.Google Scholar
Joseph E. Gonzalez, Reynold S. Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, and Ion Stoica. 2014. GraphX: Graph Processing in a Distributed Dataflow Framework. In OSDI. 599--613.Google Scholar
Raymond Greenlaw, H. James Hoover, and Walter L. Ruzzo. 1995. Limits to Parallel Computation: P-Completeness Theory. Oxford University Press.Google ScholarDigital Library
Martin Grohe. 2020. word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data. In PODS. ACM, 1--16.Google Scholar
Songtao Guo, Xin Luna Dong, Divesh Srivastava, and Remi Zajac. 2010. Record Linkage with Uniqueness Constraints and Erroneous Values. PVLDB 3, 1 (2010), 417--428.Google ScholarDigital Library
Daniel Halperin, Victor Teixeira de Almeida, Lee Lee Choo, Shumo Chu, Paraschos Koutris, Dominik Moritz, Jennifer Ortiz, Vaspol Ruamviboonsuk, Jingjing Wang, Andrew Whitaker, Shengliang Xu, Magdalena Balazinska, Bill Howe, and Dan Suciu. 2014. Demonstration of the Myria big data management service. In SIGMOD. 881--884.Google Scholar
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW. 173--182.Google Scholar
Xinran He, Junfeng Pan, Ou Jin, Tianbing Xu, Bo Liu, Tao Xu, Yanxin Shi, Antoine Atallah, Ralf Herbrich, Stuart Bowers, and Joaquin Quiñonero Candela. 2014. Practical lessons from predicting clicks on ads at facebook. In Proceedings of the Eighth International Workshop on Data Mining for Online Advertising. 1--9.Google ScholarDigital Library
Alireza Heidari, Joshua McGrath, Ihab F Ilyas, and Theodoros Rekatsinas. 2019. HoloDetect: Few-Shot Learning for Error Detection. In SIGMOD. 829--846.Google Scholar
M. R. Henzinger, T. Henzinger, and P. Kopke. 1995. Computing simulations on finite and infinite graphs. In FOCS. 453--462.Google Scholar
Linus Hermansson, Tommi Kerola, Fredrik Johansson, Vinay Jethava, and Devdatt Dubhashi. 2013. Entity disambiguation in anonymized graphs using graph kernels. In CIKM. 1037--1046.Google Scholar
Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. 2013. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract. In IJCAI. 3161--3165.Google Scholar
John E. Hopcroft and Richard M. Karp. 1973. An n^5/2 Algorithm for Maximum Matchings in Bipartite Graphs. SIAM J. Comput. 2, 4 (1973), 225--231.Google ScholarDigital Library
Boyi Hou, Qun Chen, Yanyan Wang, Youcef Nafa, and Zhanhua Li. 2022. Gradual Machine Learning for Entity Resolution. TKDE 34, 4 (2022), 1803--1814.Google ScholarCross Ref
Robert Isele, Anja Jentzsch, and Christian Bizer. 2010. Silk Server - Adding missing Links while consuming Linked Data. In COLD, Vol. 665.Google Scholar
Glen Jeh and Jennifer Widom. 2002. Simrank: A measure of structural-context similarity. In KDD. 538--543.Google ScholarDigital Library
Alekh Jindal, Samuel Madden, Malú Castellanos, and Meichun Hsu. 2015. Graph analytics using Vertica relational database. In BigData. IEEE Computer Society, 1191--1200.Google Scholar
Alekh Jindal, Praynaa Rawlani, Eugene Wu, Samuel Madden, Amol Deshpande, and Mike Stonebraker. 2014. VERTEXICA: Your Relational Friend for Graph Analytics! PVLDB 7, 13 (2014), 1669--1672.Google ScholarDigital Library
N. D. Jones. 1996. An Introduction to Partial Evaluation. Comput. Surveys 28, 3 (1996), 480--503.Google ScholarDigital Library
Richard M. Karp and Vijaya Ramachandran. 1988. A Survey of Parallel Algorithms for Shared-Memory Machines. Technical Report UCB/CSD-88-408. EECS Department, University of California, Berkeley. http://www.eecs.berkeley.edu/Pubs/TechRpts/1988/5865.htmlGoogle Scholar
George Karypis and Vipin Kumar. 1998. Multilevel k-way Partitioning Scheme for Irregular Graphs. J. Parallel Distributed Comput. 48, 1 (1998), 96--129.Google ScholarDigital Library
Jungo Kasai, Kun Qian, Sairam Gurajada, Yunyao Li, and Lucian Popa. 2019. Low-resource deep entity resolution with transfer and active learning. arXiv preprint arXiv:1906.08042 (2019).Google Scholar
Jeremy Kepner, William Arcand, William Bergeron, Nadya T. Bliss, Robert Bond, Chansup Byun, Gary Condon, Kenneth Gregson, Matthew Hubbell, Jonathan Kurz, Andrew McCabe, Peter Michaleas, Andrew Prout, Albert Reuther, Antonio Rosa, and Charles Yee. 2012. Dynamic distributed dimensional data model (D4M) database and computation system. In ICASSP. IEEE, 5349--5352.Google Scholar
Zuhair Khayyat, Ihab F. Ilyas, Alekh Jindal, Samuel Madden, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, and Si Yin. 2015. BigDansing: A System for Big Data Cleansing. In SIGMOD. 1215--1230.Google Scholar
Mijung Kim and K Selçuk Candan. 2012. SBV-Cut: Vertex-cut based graph partitioning using structural balance vertices. DKE 72 (2012), 285--303.Google ScholarDigital Library
Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, and José Pereira. 2016. The CloudMdsQL, Multistore System. In SIGMOD. ACM, 2113--2116.Google Scholar
Pradap Konda, Sanjib Das, Paul Suganthan G. C., AnHai Doan, Adel Ardalan, Jeffrey R. Ballard, Han Li, Fatemah Panahi, Haojun Zhang, Jeffrey F. Naughton, Shishir Prasad, Ganesh Krishnan, Rohit Deep, and Vijay Raghavendra. 2016. Magellan: Toward building entity matching management systems. PVLDB 9, 12 (2016), 1197--1208.Google ScholarDigital Library
Hanna Köpcke, Andreas Thor, and Erhard Rahm. 2010. Evaluation of entity resolution approaches on real-world match problems. PVLDB 3, 1 (2010), 484--493.Google ScholarDigital Library
Yehuda Koren. 2008. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In SIGKDD. 426--434.Google Scholar
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30--37.Google ScholarDigital Library
Christos Koutras, Marios Fragkoulis, Asterios Katsifodimos, and Christoph Lofi. 2020. REMA: Graph Embeddings-based Relational Schema Matching. In EDBT/ICDT, Vol. 2578.Google Scholar
Clyde P. Kruskal, Larry Rudolph, and Marc Snir. 1990. A Complexity Theory of Efficient Parallel Algorithms. Theor. Comput. Sci. 71, 1 (1990), 95--132.Google ScholarDigital Library
Mitsuru Kusumoto, Takanori Maehara, and Ken-ichi Kawarabayashi. 2014. Scalable similarity search for SimRank. In SIGMOD. 325--336.Google Scholar
Selasi Kwashie, Lin Liu, Jixue Liu, Markus Stumptner, Jiuyong Li, and Lujing Yang. 2019. Certus: An effective entity resolution approach with graph differential dependencies (GDDs). PVLDB 12, 6 (2019), 653--666.Google ScholarDigital Library
Aapo Kyrola, Guy E. Blelloch, and Carlos Guestrin. 2012. GraphChi: Large-scale graph computation on just a PC. In OSDI. 31--46.Google Scholar
Jeanne C Latourelle, Merete Dybdahl, Anita L Destefano, Richard H Myers, and Timothy L Lash. 2010. Risk of Parkinson's disease after tamoxifen treatment. BMC neurology 10, 1 (2010), 1--7.Google Scholar
Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef, Sören Auer, and Christian Bizer. 2015. DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web 6, 2 (2015), 167--195.Google ScholarCross Ref
Bing Li, Wei Wang, Yifang Sun, Linhan Zhang, Muhammad Asif Ali, and Yi Wang. 2020. GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks.. In AAAI. 8172--8179.Google Scholar
Yu Li, Hiroyuki Kuwahara, Peng Yang, Le Song, and Xin Gao. 2019. PGCN: Disease gene prioritization by disease and gene embedding through graph convolutional neural networks. biorxiv (2019), 532226.Google Scholar
Yuliang Li, Jinfeng Li, Yoshihiko Suhara, AnHai Doan, and Wang-Chiew Tan. 2020. Deep Entity Matching with Pre-Trained Language Models. PVLDB 14, 1 (2020), 50--60.Google ScholarDigital Library
Leonid Libkin. 2004. Elements of Finite Model Theory. Springer.Google ScholarDigital Library
Greg Linden, Brent Smith, and Jeremy York. 2003. Amazon. com recommendations: Item-to-item collaborative filtering. IEEE Internet computing 7, 1 (2003), 76--80.Google Scholar
Yanhong A. Liu. 2000. Efficiency by Incrementalization: An Introduction. High. Order Symb. Comput. 13, 4 (2000), 289--313.Google ScholarDigital Library
Beth Logan and Ariel Salomon. 2001. A content-based music similarity function. Cambridge Research Labs-Tech Report (2001).Google Scholar
Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, and Joseph M. Hellerstein. 2012. Distributed GraphLab: A Framework for Machine Learning in the Cloud. PVLDB 5, 8 (2012), 716--727.Google ScholarDigital Library
Xin Luo, Mengchu Zhou, Yunni Xia, and Qingsheng Zhu. 2014. An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. IEEE Transactions on Industrial Informatics 10, 2 (2014), 1273--1284.Google ScholarCross Ref
Steffen Maass, Changwoo Min, Sanidhya Kashyap, Woon-Hak Kang, Mohan Kumar, and Taesoo Kim. 2017. Mosaic: Processing a Trillion-Edge Graph on a Single Machine. In EuroSys. ACM, 527--543.Google Scholar
Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, and Nan Tang. 2019. Raha: A Configuration-Free Error Detection System. In SIGMOD. 865--882.Google Scholar
Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. 2010. Pregel: A system for large-scale graph processing. In SIGMOD. 135--146.Google ScholarDigital Library
Bernard Mans and Luke Mathieson. 2017. Incremental Problems in the Parameterized Complexity Setting. Theory Comput. Syst. 60, 1 (2017), 3--19.Google ScholarDigital Library
Mugilan Mariappan and Keval Vora. 2019. GraphBolt: Dependency-Driven Synchronous Processing of Streaming Graphs. In EuroSys. 25:1--25:16.Google Scholar
Brian McFee, Luke Barrington, and Gert Lanckriet. 2012. Learning content similarity for music recommendation. IEEE transactions on audio, speech, and language processing 20, 8 (2012), 2207--2218.Google ScholarDigital Library
Frank McSherry, Michael Isard, and Derek Gordon Murray. 2015. Scalability! But at what COST?. In HotOS.Google Scholar
Frank McSherry, Derek Gordon Murray, Rebecca Isaacs, and Michael Isard. 2013. Differential Dataflow. In CIDR.Google Scholar
Alberto O. Mendelzon and Peter T. Wood. 1995. Finding Regular Simple Paths in Graph Databases. SIAM J. Comput. 24, 6 (1995), 1235--1258.Google ScholarDigital Library
Franck Michel, Johan Montagnat, and Catherine Faron Zucker. 2014. A survey of RDB to RDF translation approaches and tools. https://hal.archives-ouvertes.fr/hal-00903568/file/Rapport_Rech_I3S_v2_-_Michel_et_al_2013_-_A_survey_of_RDB_to_RDF_translation_approaches_and_tools.pdf.Google Scholar
Robin Milner. 1989. Communication and Concurrency. Prentice Hall.Google ScholarDigital Library
Peter Bro Miltersen, Sairam Subramanian, Jeffrey Scott Vitter, and Roberto Tamassia. 1994. Complexity Models for Incremental Computation. Theor. Comput. Sci. 130, 1 (1994), 203--236.Google ScholarDigital Library
Sidharth Mudgal, Han Li, Theodoros Rekatsinas, AnHai Doan, Youngchoon Park, Ganesh Krishnan, Rohit Deep, Esteban Arcaute, and Vijay Raghavendra. 2018. Deep Learning for Entity Matching: A Design Space Exploration. In SIGMOD. 19--34.Google Scholar
Fatemeh Nargesian, Erkang Zhu, Renée J. Miller, Ken Q. Pu, and Patricia C. Arocena. 2019. Data Lake Management: Challenges and Opportunities. PVLDB 12, 12 (2019), 1986--1989.Google ScholarDigital Library
Axel-Cyrille Ngonga Ngomo and Sören Auer. 2011. LIMES - A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data. In IJCAI. 2312--2317.Google Scholar
Donald Nguyen, Andrew Lenharth, and Keshav Pingali. 2013. A lightweight infrastructure for graph analytics. In SOSP. 456--471.Google Scholar
Phuc Nguyen, Ikuya Yamada, Natthawut Kertkeidkachorn, Ryutaro Ichise, and Hideaki Takeda. 2020. MTab4Wikidata at SemTab 2020: Tabular Data Annotation with Wikidata. In SemTabISWC, Vol. 2775. 86--95.Google Scholar
George Papadakis, Leonidas Tsekouras, Emmanouil Thanos, George Giannakopoulos, Themis Palpanas, and Manolis Koubarakis. 2018. The return of JedAI: End-to-end entity resolution for structured and semi-structured data. PVLDB 11, 12 (2018), 1950--1953.Google ScholarDigital Library
Christos H. Papadimitriou. 1994. Computational complexity. Addison-Wesley.Google Scholar
Kun Qian, Lucian Popa, and Prithviraj Sen. 2017. Active Learning for Large-Scale Entity Resolution. In CIKM. 1379--1388.Google Scholar
Abdul Quamar and Amol Deshpande. 2016. NScaleSpark: subgraph-centric graph analytics on Apache Spark. In NDA. ACM, 5:1--5:8.Google Scholar
G. Ramalingam and Thomas Reps. 1996. An incremental algorithm for a generalization of the shortest-path problem. J. Algorithms 21, 2 (1996), 267--305.Google ScholarDigital Library
G. Ramalingam and Thomas Reps. 1996. On the computational complexity of dynamic graph problems. ACM Trans. Database Syst. 158, 1--2 (1996), 233--277.Google Scholar
G. Ramalingam and Thomas W. Reps. 1996. An Incremental Algorithm for a Generalization of the Shortest-Path Problem. J. Algorithms 21, 2 (1996), 267--305.Google ScholarDigital Library
Thomas C. Redman. 2016. Bad Data Costs the U.S. $3 Trillion Per Year. Harvard Business Review. https://hbr.org/2016/09/bad-data-costs-the-u-s-3-trillion-per-year.Google Scholar
Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In EMNLP-IJCNLP. 3980--3990.
Theodoros Rekatsinas, Xu Chu, Ihab F. Ilyas, and Christopher Ré. 2017. HoloClean: Holistic Data Repairs with Probabilistic Inference. PVLDB 10, 11 (2017), 1190--1201.
Amitabha Roy, Ivo Mihailovic, and Willy Zwaenepoel. 2013. X-stream: Edge-centric graph processing using streaming partitions. In SOSP. ACM, 472--488.
Alieh Saeedi, Eric Peukert, and Erhard Rahm. 2018. Using link features for entity clustering in knowledge graphs. In ESWC. 576--592.
R Sandyk and MA Gillman. 1985. Acute exacerbation of Parkinson's disease with sulindac. Annals of neurology 17, 1 (1985), 104--105.
Jan Schluter and Christian Osendorfer. 2011. Music similarity estimation with the mean-covariance restricted Boltzmann machine. In ICMLA, Vol. 2. IEEE, 118--123.
Shilad Sen, Jesse Vig, and John Riedl. 2009. Tagommenders: connecting users to items through tags. In WWW. 671--680.
Bin Shao, Haixun Wang, and Yatao Li. 2013. Trinity: a distributed graph engine on a memory cloud. In SIGMOD. 505--516.
Shenzhen Institute of Computing Sciences. 2022. Fishing Fort. https://en.sics.ac.cn/col84/index.
Juan Shu, Yu Li, Sheng Wang, Bowei Xi, and Jianzhu Ma. 2021. Disease gene prediction with privileged information and heteroscedastic dropout. Bioinformatics 37, Supplement_1 (2021), i410--i417.
Benjamin A. Steer, Alhamza Alnaimi, Marco A. B. F. G. Lotz, Félix Cuadrado, Luis M. Vaquero, and Joan Varvenne. 2017. Cytosm: Declarative Property Graph Queries Without Data Migration. In GRADES. 4:1--4:6.
Fabian M. Suchanek, Serge Abiteboul, and Pierre Senellart. 2011. PARIS: Probabilistic Alignment of Relations, Instances, and Schema. PVLDB 5, 3 (2011), 157--168.
Huifeng Sun, Yong Peng, Junliang Chen, Chuanchang Liu, and Yuzhuo Sun. 2011. A New Similarity Measure Based on Adjusted Euclidean Distance for Memory-based Collaborative Filtering. JSW 6, 6 (2011), 993--1000.
Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S. Yu, and Tianyi Wu. 2011. PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks. PVLDB 4, 11 (2011), 992--1003.
Zhu Sun, Qing Guo, Jie Yang, Hui Fang, Guibing Guo, Jie Zhang, and Robin Burke. 2019. Research commentary on recommendations with side information: A survey and research directions. Electronic Commerce Research and Applications 37 (2019).
Katia P. Sycara. 1993. Machine learning for intelligent support of conflict resolution. Decision Support Systems 10, 2 (1993), 121--136.
Bo-Tao Tan, Li Wang, Sen Li, Zai-Yun Long, Ya-Min Wu, and Yuan Liu. 2015. Retinoic acid induced the differentiation of neural stem cells from embryonic spinal cord into functional neurons in vitro. International journal of clinical and experimental pathology 8, 7 (2015).
Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Samuel Madden, and Mourad Ouzzani. 2021. RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation. PVLDB 14, 8 (2021), 1254--1261.
Robert Endre Tarjan. 1972. Depth-First Search and Linear Graph Algorithms. SIAM J. Comput. 1, 2 (1972), 146--160.
Tim Teitelbaum and Thomas W. Reps. 1981. The Cornell Program Synthesizer: A Syntax-Directed Programming Environment. Commun. ACM 24, 9 (1981), 563--573.
Yuanyuan Tian, Andrey Balmin, Severin Andreas Corsten, Shirish Tatikonda, and John McPherson. 2013. From "Think Like a Vertex" to "Think Like a Graph". PVLDB 7, 7 (2013), 193--204.
Rakshit Trivedi, Bunyamin Sisman, Jun Ma, Christos Faloutsos, Hongyuan Zha, and Xin Luna Dong. 2018. LinkNBed: Multi-graph representation learning with entity linkage. In ACL.
Shalini Tyagi and Ernesto Jimenez-Ruiz. 2020. LexMa: Tabular data to knowledge graph matching using lexical techniques. In CEUR Workshop Proceedings, Vol. 2775. 59--64.
Farman Ullah, Ghulam Sarwar, Sung Chang Lee, Yun Kyung Park, Kyeong Deok Moon, and Jin Tae Kim. 2012. Hybrid recommender system with temporal information. In ICOIN. IEEE, 421--425.
Leslie G. Valiant. 1990. A Bridging Model for Parallel Computation. Commun. ACM 33, 8 (1990), 103--111.
Aäron Van Den Oord, Sander Dieleman, and Benjamin Schrauwen. 2013. Deep content-based music recommendation. In NIPS. Neural Information Processing Systems Foundation (NIPS), 2643--2651.
Larysa Visengeriyeva and Ziawasch Abedjan. 2018. Metadata-driven error detection. In SSDBM. 1:1--1:12.
Keval Vora, Rajiv Gupta, and Guoqing (Harry) Xu. 2017. KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations. In ASPLOS.
W3C. 2012. Relational Databases to RDF (RDB2RDF).
Guozhang Wang, Wenlei Xie, Alan J. Demers, and Johannes Gehrke. 2013. Asynchronous Large-Scale Graph Processing Made Easy. In CIDR.
Xiaochan Wang, Yuchong Gong, Jing Yi, and Wen Zhang. 2019. Predicting gene-disease associations from the heterogeneous network using graph embedding. In IEEE International conference on bioinformatics and biomedicine (BIBM). 504--511.
Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019. Neural graph collaborative filtering. In SIGIR. 165--174.
Xinxi Wang and Ye Wang. 2014. Improving content-based and hybrid music recommendation using deep learning. In ACM Multimedia. 627--636.
Yue Wang, Zhe Wang, Ziyuan Zhao, Zijian Li, Xun Jian, Hao Xin, Lei Chen, Jianchun Song, Zhenhong Chen, and Meng Zhao. 2022. Effective Similarity Search on Heterogeneous Networks: A Meta-path Free Approach. TKDE 34, 7 (2022), 3225--3240.
Duncan J Watts and Steven H Strogatz. 1998. Collective dynamics of 'small-world' networks. Nature 393, 6684 (1998), 440--442.
Steven Euijong Whang and Hector Garcia-Molina. 2013. Joint entity resolution on multiple datasets. The VLDB Journal 22, 6 (2013), 773--795.
Charith Wickramaarachchi, Charalampos Chelmis, and Viktor K. Prasanna. 2015. Empowering Fast Incremental Computation over Large Scale Dynamic Graphs. In IPDPS.
Renzhi Wu, Sanya Chaba, Saurabh Sawlani, Xu Chu, and Saravanan Thirumuruganathan. 2020. ZeroER: Entity Resolution using Zero Labeled Examples. In SIGMOD. 1149--1164.
Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang, Rui Yan, and Dongyan Zhao. 2019. Relation-Aware Entity Alignment for Heterogeneous Knowledge Graphs. In IJCAI. 5278--5284.
Chenning Xie, Rong Chen, Haibing Guan, Binyu Zang, and Haibo Chen. 2015. SYNC or ASYNC: Time to fuse for distributed graph-parallel computation. In PPOPP. 194--204.
Xianghao Xu, Fang Wang, Hong Jiang, Yongli Cheng, Dan Feng, and Yongxuan Zhang. 2020. A Hybrid Update Strategy for I/O-Efficient Out-of-Core Graph Processing. IEEE Trans. Parallel Distributed Syst. 31, 8 (2020), 1767--1782.
Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In SIGKDD. 974--983.
Weiren Yu, Xuemin Lin, Wenjie Zhang, Jian Pei, and Julie A McCann. 2019. SimRank*: effective and scalable pairwise similarity search based on graph topology. The VLDB Journal 28, 3 (2019), 401--426.
Timothy A. K. Zakian, Ludovic A. R. Capelli, and Zhenjiang Hu. 2019. Incrementalization of Vertex-Centric Programs. In IPDPS.
Xiangxiang Zeng, Xinqi Tu, Yuansheng Liu, Xiangzheng Fu, and Yansen Su. 2022. Toward better drug discovery with knowledge graph. Current opinion in structural biology 72 (2022), 114--126.
Baichuan Zhang and Mohammad Al Hasan. 2017. Name disambiguation in anonymized graphs using network embedding. In CIKM. 1239--1248.
Bingjun Zhang, Jialie Shen, Qiaoliang Xiang, and Ye Wang. 2009. Compositemap: a novel framework for music similarity measure. In SIGIR. 403--410.
Dongxiang Zhang, Long Guo, Xiangnan He, Jie Shao, Sai Wu, and Heng Tao Shen. 2018. A Graph-Theoretic Fusion Framework for Unsupervised Entity Resolution. In ICDE. 713--724.
Qingheng Zhang, Zequn Sun, Wei Hu, Muhao Chen, Lingbing Guo, and Yuzhong Qu. 2019. Multi-view Knowledge Graph Embedding for Entity Alignment. In IJCAI. 5429--5435.
Shuo Zhang, Edgar Meij, Krisztian Balog, and Ridho Reinanda. 2020. Novel Entity Discovery from Web Tables. In WWW. 1298--1308.
Chen Zhao and Yeye He. 2019. Auto-EM: End-to-end Fuzzy Entity-Matching using Pre-trained Deep Models and Transfer Learning. In WWW. 2413--2424.
Jie Zhao, Manish Kumar, Jeevan Sharma, and Zhihai Yuan. 2021. Arbutin effectively ameliorates the symptoms of Parkinson's disease: The role of adenosine receptors and cyclic adenosine monophosphate. Neural regeneration research 16, 10 (2021), 2030.
Kangfei Zhao and Jeffrey Xu Yu. 2017. All-in-One: Graph Processing in RDBMSs Revisited. In SIGMOD. 1165--1180.
Liang Zhao. 2021. Event Prediction in the Big Data Era: A Systematic Survey. ACM Comput. Surv. 54, 5 (2021), 94:1--94:37.
Zhongying Zhao, Xuejian Zhang, Hui Zhou, Chao Li, Maoguo Gong, and Yongqing Wang. 2020. HetNERec: Heterogeneous network embedding based recommendation. Knowledge-Based Systems 204 (2020).
Lei Zheng, Vahid Noroozi, and Philip S Yu. 2017. Joint deep modeling of users and items using reviews for recommendation. In WSDM. 425--434.
Minpeng Zhu and Tore Risch. 2011. Querying combined cloud-based and relational databases. In CSC. 330--335.
Xiaowei Zhu, Wentao Han, and Wenguang Chen. 2015. GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning. In USENIX ATC. 375--386.
Cai-Nicolas Ziegler, Georg Lausen, and Lars Schmidt-Thieme. 2004. Taxonomy-driven computation of product recommendations. In CIKM. 406--415.

Published in
Proceedings of the VLDB Endowment Volume 15, Issue 12
August 2022
551 pages
ISSN: 2150-8097
Editors: Fatma Özcan (Google), Juliana Freire (New York University), Xuemin Lin (University of New South Wales)
Publisher
VLDB Endowment
Publication History
- Published: 1 August 2022