research-article

On triangulation-based dense neighborhood graph discovery

Authors:
Nan Wang

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

,
Jingbo Zhang

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

,
Kian-Lee Tan

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

,
Anthony K. H. Tung

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

Proceedings of the VLDB Endowment Volume 4 Issue 2pp 58–68https://doi.org/10.14778/1921071.1921073

Published:01 November 2010Publication History

Proceedings of the VLDB Endowment

Abstract

This paper introduces a new definition of dense subgraph pattern, the DN -graph. DN -graph considers both the size of the substructure and the minimum level of interactions between any pair of the vertices.

The mining of DN -graphs inherits the difficulty of finding clique, the fully-connected subgraphs. We thus opt for approximately locating the DN -graphs using the state-of-the-art graph triangulation methods. Our solution consists of a family of algorithms, each of which targets a different problem setting. These algorithms are iterative, and utilize repeated scans through the triangles in the graph to approximately locate the DN -graphs. Each scan on the graph triangles improves the results. Since the triangles are not physically materialized, the algorithms have small memory footprint.

With our solution, the users can adopt a "pay as you go" approach. They have the flexibility to terminate the mining process once they are satisfied with the quality of the results. As a result, our algorithms can cope with semi-streaming environment where the graph edges cannot fit into main memory. Results of extensive performance study confirmed our claims.

References

Netflix Prize Data Set, 2009 (accessed July 23, 2009).Google Scholar
J. Abello, M. Resende, and R. Sudarsky. Massive quasi-clique detection. In Proc. 5th Latin American Symposium on Theoretical Informatics, pages 598--612, 2002. Google Scholar
I. Akihiro, W. Takashi, and M. Hiroshi. Complete mining of frequent patterns from graphs: Mining graph data. Machine Learning, 50(3):321--354, 2003. Google Scholar
P. Aloy, B. BäPttcher, H. Ceulemans, C. Leutwein, C. Mellwig, S. Fischer, A. C. Gavin, P. Bork, S. G. Furga, L. Serrano, and R. D. Russell. Structure-based assembly of protein complexes in yeast. Science, 303(5666):2026--2029, 2004.Google Scholar
L. Becchetti, P. Boldi, C. Castillo, and A. Gionis. Efficient semi-streaming algorithms for local triangle counting in massive graphs. In ACM KDD '08, pages 16--24, New York, NY, USA, 2008. Google Scholar
V. Boginski, S. Butenko, and P. Pardalos. Mining market data: a network approach. Computer Operational Research, 33(11):3171--3184, 2006. Google Scholar
M. Brockington and J. Culberson. Camouflaging independent sets in quasi-random graphs. DIMACS Series, 26:75--88, 1994.Google Scholar
D. Gibson, R. Kumar, and A. Tomkins. Discovering large dense subgraphs in massive graphs. In VLDB'05, pages 721--732, 2005. Google Scholar
J. Han, N. Stefanovic, and K. Koperski. Selective materialization: An efficient method for spatial data cube construction. In PAKDD'98 {Lecture Notes in Artificial Intelligence, 1394, Springer Verlag, 1998}, pages 144--158, 1998. Google Scholar
H. Hu, X. Yan, Y. Huang, J. Han, and X. Zhou. Mining coherent dense subgraphs across massive biological networks for functional discovery. Bioinformatics, 21:213--221, 2005. Google Scholar
C. F. J. Rivas. Proteincprotein interactions essentials: Key concepts to building and analyzing interactome networks. PLoS Computational Biology, 6:6, 2010.Google Scholar
R. Karp. Reducibility among combinatorial problems. The Journal of Symbolic Logic, 40:618--619, 1975.Google Scholar
M. Latapy. Practical algorithms for triangle computations in very large (sparse (power-law)) graphs. Journal of Theoretical Computer Science, 407:458--473, 2008. Google Scholar
T. Schank and D. Wagner. Finding, counting and listing all triangles in large graphs, an experimental study. In WEA, pages 606--609, 2005. Google Scholar
N. Wang, S. Parthasarathy, K. L. Tan, and A. Tung. Csv: visualizing and mining cohesive subgraphs. In SIGMOD'08, pages 445--458, 2008. Google Scholar
I. Xenarios and L. S. etc. DIP, the database of interacting proteins: A research tool for studying cellular networks of protein interactions. Nucleic Acids Research, 30(1):303--305, 2002.Google Scholar

Index Terms

On triangulation-based dense neighborhood graph discovery
1. Information systems
  1. Data management systems
    1. Database design and models
    2. Database management system engines
2. Mathematics of computing
  1. Discrete mathematics
    1. Graph theory

Recommendations

Lossless graph summarization using dense subgraphs discovery
IMCOM '15: Proceedings of the 9th International Conference on Ubiquitous Information Management and Communication

Dense subgraph discovery, in a large graph, is useful to solve the community search problem. Motivated from this, we propose a graph summarization method where we search and aggregate dense subgraphs into super nodes. Since the dense subgraphs have high ...
Read More
A new closure concept preserving graph Hamiltonicity and based on neighborhood equivalence

A graph is Hamiltonian if it contains a cycle which goes through all vertices exactly once. Determining if a graph is Hamiltonian is known as an NP-complete problem, and no satisfactory characterization for these graphs has been found. In 1976, Bondy ...
Read More
Dense graphs are antimagic

An antimagic labeling of graph a with m edges and n vertices is a bijection from the set of edges to the integers 1,…,m such that all n vertex sums are pairwise distinct, where a vertex sum is the sum of labels of all edges incident with the same ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

Proceedings of the VLDB Endowment Volume 4, Issue 2
November 2010
105 pages
ISSN:2150-8097
Issue’s Table of Contents
Sponsors
In-Cooperation
Publisher
VLDB Endowment
Publication History
- Published: 1 November 2010
Published in pvldb Volume 4, Issue 2
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 48
  Total Citations
  View Citations
- 281
  Total Downloads
- Downloads (Last 12 months)14
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

On triangulation-based dense neighborhood graph discovery

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

Lossless graph summarization using dense subgraphs discovery

A new closure concept preserving graph Hamiltonicity and based on neighborhood equivalence

Dense graphs are antimagic

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

On triangulation-based dense neighborhood graph discovery

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

Lossless graph summarization using dense subgraphs discovery

A new closure concept preserving graph Hamiltonicity and based on neighborhood equivalence

Dense graphs are antimagic

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media