Skip to main content
Top
Published in: Foundations of Computational Mathematics 5/2015

01-10-2015

Improved Bounds on Sample Size for Implicit Matrix Trace Estimators

Authors: Farbod Roosta-Khorasani, Uri Ascher

Published in: Foundations of Computational Mathematics | Issue 5/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This article is concerned with Monte Carlo methods for the estimation of the trace of an implicitly given matrix \(A\) whose information is only available through matrix-vector products. Such a method approximates the trace by an average of \(N\) expressions of the form \( \mathbf{w} ^t (A \mathbf{w} )\), with random vectors \( \mathbf{w} \) drawn from an appropriate distribution. We prove, discuss and experiment with bounds on the number of realizations \(N\) required to guarantee a probabilistic bound on the relative error of the trace estimation upon employing Rademacher (Hutchinson), Gaussian and uniform unit vector (with and without replacement) probability distributions. In total, one necessary bound and six sufficient bounds are proved, improving upon and extending similar estimates obtained in the seminal work of Avron and Toledo (JACM 58(2). Article 8, 2011) in several dimensions. We first improve their bound on \(N\) for the Hutchinson method, dropping a term that relates to \(\mathrm{rank}(A)\) and making the bound comparable with that for the Gaussian estimator. We further prove new sufficient bounds for the Hutchinson, Gaussian and unit vector estimators, as well as a necessary bound for the Gaussian estimator, which depend more specifically on properties of matrix \(A\). As such, they may suggest the type of matrix for which one distribution or another provides a particularly effective or relatively ineffective stochastic estimation method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference M. Abramowitz. Handbook of Mathematical Functions, with Formulas, Graphs, and Mathematical Tables. Dover, 1974. M. Abramowitz. Handbook of Mathematical Functions, with Formulas, Graphs, and Mathematical Tables. Dover, 1974.
2.
go back to reference D. Achlioptas. Database-friendly random projections. In ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 01, volume 20, pages 274–281, 2001. D. Achlioptas. Database-friendly random projections. In ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 01, volume 20, pages 274–281, 2001.
3.
go back to reference H. Avron. Counting triangles in large graphs using randomized matrix trace estimation. Workshop on Large-scale Data Mining: Theory and Applications, 2010. H. Avron. Counting triangles in large graphs using randomized matrix trace estimation. Workshop on Large-scale Data Mining: Theory and Applications, 2010.
4.
go back to reference H. Avron and S. Toledo. Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix. JACM, 58(2), 2011. Article 8. H. Avron and S. Toledo. Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix. JACM, 58(2), 2011. Article 8.
5.
go back to reference Z. Bai, M. Fahey, and G. Golub. Some large scale matrix computation problems. J. Comput. Appl. Math., 74:71–89, 1996. Z. Bai, M. Fahey, and G. Golub. Some large scale matrix computation problems. J. Comput. Appl. Math., 74:71–89, 1996.
6.
go back to reference C. Bekas, E. Kokiopoulou, and Y. Saad. An estimator for the diagonal of a matrix. Appl. Numer. Math., 57:12141229, 2007. C. Bekas, E. Kokiopoulou, and Y. Saad. An estimator for the diagonal of a matrix. Appl. Numer. Math., 57:12141229, 2007.
7.
go back to reference K. van den Doel and U. Ascher. Adaptive and stochastic algorithms for EIT and DC resistivity problems with piecewise constant solutions and many measurements. SIAM J. Scient. Comput., 34: doi:10.1137/110826692, 2012. K. van den Doel and U. Ascher. Adaptive and stochastic algorithms for EIT and DC resistivity problems with piecewise constant solutions and many measurements. SIAM J. Scient. Comput., 34: doi:10.​1137/​110826692, 2012.
8.
go back to reference G. H. Golub, M. Heath, and G. Wahba. Generalized cross validation as a method for choosing a good ridge parameter. Technometrics, 21:215–223, 1979. G. H. Golub, M. Heath, and G. Wahba. Generalized cross validation as a method for choosing a good ridge parameter. Technometrics, 21:215–223, 1979.
9.
go back to reference E. Haber, M. Chung, and F. Herrmann. An effective method for parameter estimation with PDE constraints with multiple right-hand sides. SIAM J. Optimization, 22:739–757, 2012. E. Haber, M. Chung, and F. Herrmann. An effective method for parameter estimation with PDE constraints with multiple right-hand sides. SIAM J. Optimization, 22:739–757, 2012.
10.
go back to reference M. F. Hutchinson. A stochastic estimator of the trace of the influence matrix for laplacian smoothing splines. J. Comm. Stat. Simul., 19:433–450, 1990. M. F. Hutchinson. A stochastic estimator of the trace of the influence matrix for laplacian smoothing splines. J. Comm. Stat. Simul., 19:433–450, 1990.
11.
go back to reference T. Van Leeuwen, S. Aravkin, and F. Herrmann. Seismic waveform inversion by stochastic optimization. Hindawi Intl. J. Geophysics, 2011: doi:10.1155/2011/689041, 2012. T. Van Leeuwen, S. Aravkin, and F. Herrmann. Seismic waveform inversion by stochastic optimization. Hindawi Intl. J. Geophysics, 2011: doi:10.​1155/​2011/​689041, 2012.
12.
go back to reference A. Mood, F. A. Graybill, and D. C. Boes. Introduction to the Theory of Statistics. McGraw-Hill; 3rdedition, 1974. A. Mood, F. A. Graybill, and D. C. Boes. Introduction to the Theory of Statistics. McGraw-Hill; 3rdedition, 1974.
13.
go back to reference F. Roosta-Khorasani, K. van den Doel, and U. Ascher. Stochastic algorithms for inverse problems involving PDEs and many measurements. SIAM J. Scient. Comput., 2014. To appear. F. Roosta-Khorasani, K. van den Doel, and U. Ascher. Stochastic algorithms for inverse problems involving PDEs and many measurements. SIAM J. Scient. Comput., 2014. To appear.
14.
go back to reference R. J. Serfling. Probability inequalities for the sum in sampling without replacement. Annals of Statistics, 2:39–48, 1974. R. J. Serfling. Probability inequalities for the sum in sampling without replacement. Annals of Statistics, 2:39–48, 1974.
15.
go back to reference A. Shapiro, D. Dentcheva, and D. Ruszczynski. Lectures on Stochastic Programming: Modeling and Theory. Philadelphia: SIAM, 2009. A. Shapiro, D. Dentcheva, and D. Ruszczynski. Lectures on Stochastic Programming: Modeling and Theory. Philadelphia: SIAM, 2009.
16.
go back to reference G. J. Székely and N. K. Bakirov. Extremal probabilities for Gaussian quadratic forms. Probab. Theory Related Fields, 126:184–202, 2003. G. J. Székely and N. K. Bakirov. Extremal probabilities for Gaussian quadratic forms. Probab. Theory Related Fields, 126:184–202, 2003.
17.
go back to reference J. Tropp. Column subset selection, matrix factorization, and eigenvalue optimization. SODA, pages 978–986, 2009. SIAM. J. Tropp. Column subset selection, matrix factorization, and eigenvalue optimization. SODA, pages 978–986, 2009. SIAM.
18.
go back to reference J. Young and D. Ridzal. An application of random projection to parameter estimation in partial differential equations. SIAM J. Scient. Comput., 34:A2344–A2365, 2012. J. Young and D. Ridzal. An application of random projection to parameter estimation in partial differential equations. SIAM J. Scient. Comput., 34:A2344–A2365, 2012.
Metadata
Title
Improved Bounds on Sample Size for Implicit Matrix Trace Estimators
Authors
Farbod Roosta-Khorasani
Uri Ascher
Publication date
01-10-2015
Publisher
Springer US
Published in
Foundations of Computational Mathematics / Issue 5/2015
Print ISSN: 1615-3375
Electronic ISSN: 1615-3383
DOI
https://doi.org/10.1007/s10208-014-9220-1

Other articles of this Issue 5/2015

Foundations of Computational Mathematics 5/2015 Go to the issue

Premium Partner