Skip to main content
Top
Published in: The Journal of Supercomputing 4/2017

12-07-2016

Resolving small random symmetric linear systems on graphics processing units

Authors: L. A. Abbas-Turki, S. Graillat

Published in: The Journal of Supercomputing | Issue 4/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper focuses on the resolution of a large number of small random symmetric linear systems and its parallel implementation in single precision on graphics processing units (GPUs). The computations involved by each linear system are independent from the others, and the number of unknowns does not exceed 64. For this purpose, we present the adaptation to our context of largely used methods that include: LDLt factorization, Householder reduction to a tridiagonal matrix, parallel cyclic reduction (PCR) that is not a power of two and the divide and conquer algorithm for tridiagonal eigenproblems. We not only detail the implementation and optimization of each method, but we also compare the sustainability of each solution and its performance which include both parallel complexity and cache memory occupation. In the context of solving a large number of small random linear systems on GPUs with no information about their conditioning, our research indicates that the best strategy requires the use of Householder tridiagonalization + PCR followed if necessary by a divide and conquer diagonalization.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Abbas-Turki LA, Bouselmi AI, Mikou MA (2014) Toward a coherent Monte Carlo simulation of CVA. Monte Carlo Methods Appl 20(3):195–216CrossRefMATHMathSciNet Abbas-Turki LA, Bouselmi AI, Mikou MA (2014) Toward a coherent Monte Carlo simulation of CVA. Monte Carlo Methods Appl 20(3):195–216CrossRefMATHMathSciNet
3.
go back to reference Abbas-Turki LA, Vialle S, Lapeyre B, Mercier P (2014) Pricing derivatives on graphics processing units using Monte Carlo simulation. Concurr Comput Pract Exp 26(9):1679–1697CrossRef Abbas-Turki LA, Vialle S, Lapeyre B, Mercier P (2014) Pricing derivatives on graphics processing units using Monte Carlo simulation. Concurr Comput Pract Exp 26(9):1679–1697CrossRef
4.
go back to reference Ballard G, Demmel J, Holtz O, schwartz O (2010) Communication-optimal parallel and sequential Cholesky decomposition. SIAM J Sci Comput 32(6):3495–3523CrossRefMATHMathSciNet Ballard G, Demmel J, Holtz O, schwartz O (2010) Communication-optimal parallel and sequential Cholesky decomposition. SIAM J Sci Comput 32(6):3495–3523CrossRefMATHMathSciNet
5.
go back to reference Brigo D, Morini M, Pallavicini A (2013) Counterparty Credit Risk, Collateral and Funding: With Pricing Cases For All Asset Classes. Wiley, New YorkCrossRefMATH Brigo D, Morini M, Pallavicini A (2013) Counterparty Credit Risk, Collateral and Funding: With Pricing Cases For All Asset Classes. Wiley, New YorkCrossRefMATH
6.
go back to reference Brigo D, Pallavicini A (2008) Counterparty risk and contingent CDS under correlation between interest-rates and default. Risk Mag (February) 84–88 Brigo D, Pallavicini A (2008) Counterparty risk and contingent CDS under correlation between interest-rates and default. Risk Mag (February) 84–88
7.
go back to reference Cesari G et al (2009) Modelling, pricing and hedging counterparty credit exposure, Springer Finance, New York Cesari G et al (2009) Modelling, pricing and hedging counterparty credit exposure, Springer Finance, New York
8.
go back to reference Cho H, Yoon PA (2014) A Memory-efficient algorithm for large-scale symmetric tridiagonal eigenvalue problem on multi-GPU systems. Int’l Conf. Par. and Dist. Proc. Tech. and Appl., pp 568–573 Cho H, Yoon PA (2014) A Memory-efficient algorithm for large-scale symmetric tridiagonal eigenvalue problem on multi-GPU systems. Int’l Conf. Par. and Dist. Proc. Tech. and Appl., pp 568–573
9.
go back to reference Clément E, Lamberton D, Protter P (2002) An analysis of a least squares regression algorithm for American option pricing. Financ Stoch 17:448–471MATH Clément E, Lamberton D, Protter P (2002) An analysis of a least squares regression algorithm for American option pricing. Financ Stoch 17:448–471MATH
10.
go back to reference Crépey S, Bielecki TR (2014) Counterparty risk and funding. a tale of two puzzles. CRC Press, Boca RatonMATH Crépey S, Bielecki TR (2014) Counterparty risk and funding. a tale of two puzzles. CRC Press, Boca RatonMATH
11.
go back to reference Crépey S, Grbac Z, Ngor N, Skovmand D (2014) A Lévy HJM multiple-curve model with application to CVA computation. Quant Financ 15(3):1–19 Crépey S, Grbac Z, Ngor N, Skovmand D (2014) A Lévy HJM multiple-curve model with application to CVA computation. Quant Financ 15(3):1–19
14.
go back to reference Demmel JW, Marques OA, Parlett BN, Vömel C (2008) Performance and accuracy of LAPACK’s symmetric tridiagonal eigensolvers. SIAM J Sci Comput 30(3):1508–1526CrossRefMATHMathSciNet Demmel JW, Marques OA, Parlett BN, Vömel C (2008) Performance and accuracy of LAPACK’s symmetric tridiagonal eigensolvers. SIAM J Sci Comput 30(3):1508–1526CrossRefMATHMathSciNet
15.
go back to reference Fujii M, Takahashi A (2015) Perturbative expansion technique for non-linear FBSDEs with interacting particle method. Asia-Pac Financ Mark 22(3):283–304. doi:10.1007/s10690-015-9201-7 Fujii M, Takahashi A (2015) Perturbative expansion technique for non-linear FBSDEs with interacting particle method. Asia-Pac Financ Mark 22(3):283–304. doi:10.​1007/​s10690-015-9201-7
16.
go back to reference Goddeke D, Strzodka R (2010) Cyclic reduction tridiagonal solvers on GPUs applied to mixed precision multigrid. IEEE Trans Parallel Distrib Syst 22(1):22–32CrossRef Goddeke D, Strzodka R (2010) Cyclic reduction tridiagonal solvers on GPUs applied to mixed precision multigrid. IEEE Trans Parallel Distrib Syst 22(1):22–32CrossRef
17.
go back to reference Gordy MB, Juneja S (2010) Nested simulation in portfolio risk measurement. Manag Sci 56(10):1833–1848CrossRefMATH Gordy MB, Juneja S (2010) Nested simulation in portfolio risk measurement. Manag Sci 56(10):1833–1848CrossRefMATH
18.
go back to reference Gragg WB, Thornton JR, Warner DD (1992) Parallel divide and conquer algorithms for the symmetric tridiagonal eigenproblem and bidiagonal singular value problem. Model Simul 23(1):49–56 Gragg WB, Thornton JR, Warner DD (1992) Parallel divide and conquer algorithms for the symmetric tridiagonal eigenproblem and bidiagonal singular value problem. Model Simul 23(1):49–56
19.
go back to reference Gu M, Eisenstat S (1992) A stable algorithm for the rank-1 modification of the symmetric eigenproblem. Computer Science Dept. Report YALEU/DCS/RR-916, Yale University, New Haven Gu M, Eisenstat S (1992) A stable algorithm for the rank-1 modification of the symmetric eigenproblem. Computer Science Dept. Report YALEU/DCS/RR-916, Yale University, New Haven
20.
go back to reference Gu M, Eisenstat S (1995) A divide-and-conquer algorithm for the symmetric tridiagonal eigenproblem. SIAM J Matrix Anal Appl 16:172–191CrossRefMATHMathSciNet Gu M, Eisenstat S (1995) A divide-and-conquer algorithm for the symmetric tridiagonal eigenproblem. SIAM J Matrix Anal Appl 16:172–191CrossRefMATHMathSciNet
21.
go back to reference Hockney RW, Jesshope CR (1981) Parallel computers: architecture, programming and algorithms. Adam Hilger Ltd, EnglandMATH Hockney RW, Jesshope CR (1981) Parallel computers: architecture, programming and algorithms. Adam Hilger Ltd, EnglandMATH
22.
go back to reference Henry-Labordère P (2012) Cutting CVA’s complexity. Risk Mag (July) 2012:67–73 Henry-Labordère P (2012) Cutting CVA’s complexity. Risk Mag (July) 2012:67–73
26.
go back to reference Li R-C (1994) Solving secular equations stably and efficiently. Computer Science Dept. Technical Report CS-94-260, University of Tennessee, Knoxville, (LAPACK Working Note 89.) Li R-C (1994) Solving secular equations stably and efficiently. Computer Science Dept. Technical Report CS-94-260, University of Tennessee, Knoxville, (LAPACK Working Note 89.)
27.
go back to reference Longstaff FA, Schwartz ES (2001) Valuing American options by simulation: a simple least-squares approach. Rev Financ Stud 14(1):113–147CrossRef Longstaff FA, Schwartz ES (2001) Valuing American options by simulation: a simple least-squares approach. Rev Financ Stud 14(1):113–147CrossRef
29.
go back to reference Press WH, Teukolsky SA, Vetterling WT, Flannery BP (2002) Numerical Recipes in C++: the art of scientific computing. Cambridge University Press, CambridgeMATH Press WH, Teukolsky SA, Vetterling WT, Flannery BP (2002) Numerical Recipes in C++: the art of scientific computing. Cambridge University Press, CambridgeMATH
30.
go back to reference Volkov V, Demmel J (2008) LU, QR and Cholesky factorizations using vector capabilities of GPUs, Technical Report No. UCB/EECS-2008-49, University of California, Berkeley Volkov V, Demmel J (2008) LU, QR and Cholesky factorizations using vector capabilities of GPUs, Technical Report No. UCB/EECS-2008-49, University of California, Berkeley
31.
32.
go back to reference Zhang Y, Cohen J, Owens JD (2010) Fast tridiagonal solvers on the GPU. Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp 127–136 Zhang Y, Cohen J, Owens JD (2010) Fast tridiagonal solvers on the GPU. Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp 127–136
Metadata
Title
Resolving small random symmetric linear systems on graphics processing units
Authors
L. A. Abbas-Turki
S. Graillat
Publication date
12-07-2016
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 4/2017
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-016-1813-9

Other articles of this Issue 4/2017

The Journal of Supercomputing 4/2017 Go to the issue

Premium Partner