research-article

GPU-based matrix multiplication methods for social networks analysis

Authors:
Yong-Yeon Jo (Hanyang University, Korea), Sang-Wook Kim (Hanyang University, Korea), Duck-Ho Bae (Samsung Electronics, Korea)

RACS '14: Proceedings of the 2014 Conference on Research in Adaptive and Convergent Systems, October 2014, Pages 309–313. https://doi.org/10.1145/2663761.2664192

Published: 05 October 2014

ABSTRACT

Matrix multiplication is a building block for social network analysis. Various methods for GPU-based matrix multiplication have been proposed recently, including several from NVIDIA, one of the major GPU manufacturers. In this paper, we introduce these methods and evaluate their performance through extensive experiments on synthetic and real-world datasets. Our results can help practitioners choose the best method for analyzing real-world social networks.
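
To make the first claim concrete, here is a minimal sketch of one way matrix multiplication shows up in social network analysis: squaring a graph's adjacency matrix counts two-step paths, which for a friendship graph is the number of common friends between two users. This is a CPU-side illustration in plain Python, not one of the GPU methods the paper evaluates. The function name `sparse_matmul`, the dict-of-dicts storage, and the toy graph are all assumptions made for this example.

```python
# Illustrative sketch only: a CPU-side sparse matrix product, not one of the
# GPU methods evaluated in the paper. All names here are hypothetical.
from collections import defaultdict


def sparse_matmul(a, b):
    """Multiply two sparse matrices stored as {row: {col: value}} dicts."""
    result = defaultdict(dict)
    for i, row in a.items():
        for k, a_ik in row.items():
            # Only rows of b that match a nonzero column of a contribute.
            for j, b_kj in b.get(k, {}).items():
                result[i][j] = result[i].get(j, 0) + a_ik * b_kj
    return dict(result)


# Undirected friendship graph with edges 0-1, 0-2, 1-2, 2-3,
# stored as a symmetric adjacency matrix.
adjacency = {
    0: {1: 1, 2: 1},
    1: {0: 1, 2: 1},
    2: {0: 1, 1: 1, 3: 1},
    3: {2: 1},
}

# Entry (i, j) of A*A counts the length-2 paths from i to j, i.e. the
# number of friends that users i and j have in common.
two_hop = sparse_matmul(adjacency, adjacency)
```

In this toy graph, `two_hop[0][3]` is 1 because users 0 and 3 share exactly one friend (user 2), and each diagonal entry `two_hop[i][i]` equals the degree of user i. Real social graphs tend to be very sparse, so skipping zero entries as above usually matters far more than it does on this four-node example.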


Index Terms

1. Information systems
  1. Information systems applications


Published in
RACS '14: Proceedings of the 2014 Conference on Research in Adaptive and Convergent Systems
October 2014
386 pages
ISBN: 9781450330602
DOI: 10.1145/2663761
Conference Chairs: Chao Lu (Towson University), Esmaeil Nadimi (University of Southern Denmark, Denmark)
Program Chairs: Sung-Ryul Kim (Konkuk University, Korea), Wei Wang (San Diego State University)
Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
CUDA
GPU
matrix multiplication

Acceptance Rates
RACS '14 Paper Acceptance Rate: 59 of 251 submissions, 24%. Overall Acceptance Rate: 393 of 1,581 submissions, 25%.