Skip to main content
Erschienen in: International Journal of Parallel Programming 5/2017

11.10.2016

A Decomposition of the Tikhonov Regularization Functional Oriented to Exploit Hybrid Multilevel Parallelism

verfasst von: Rossella Arcucci, Luisa D’Amore, Luisa Carracciuolo, Giuseppe Scotti, Giuliano Laccetti

Erschienen in: International Journal of Parallel Programming | Ausgabe 5/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We introduce a decomposition of the Tikhonov Regularization (TR) functional which split this operator into several TR functionals, suitably modified in order to enforce the matching of their solutions. As a consequence, instead of solving one problem we can solve several problems reproducing the initial one at smaller dimensions. Such approach leads to a reduction of the time complexity of the resulting algorithm. Since the subproblems are solved in parallel, this decomposition also leads to a reduction of the overall execution time. Main outcome of the decomposition is that the parallel algorithm is oriented to exploit the highest performance of parallel architectures where concurrency is implemented both at the coarsest and finest levels of granularity. Performance analysis is discussed in terms of the algorithm and software scalability. Validation is performed on a reference parallel architecture made of a distributed memory multiprocessor and a Graphic Processing Unit. Results are presented on the Data Assimilation problem, for oceanographic models.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
A partitioned algorithm is a scalar (or point) algorithm in which the operations have been grouped and reordered into matrix operations. A block algorithm is a generalization of a scalar algorithm in which the basic scalar operations become matrix operations, and a matrix property based on the nonzero structure becomes the corresponding property blockwise (LAPACK contains only partitioned algorithms that is, the main computations are block oriented and implemented by using BLAS-3) [15].
 
Literatur
1.
Zurück zum Zitat Antonelli, L., Carracciuolo, L., Ceccarelli, M., D’Amore, L., Murli, A.: Total variation regularization for edge preserving 3D SPECT imaging in high performance computing environments. Lecture Notes in Computer Science, Vol. 2330 LNCS, Issue PART 2, 171–180, (2002) Antonelli, L., Carracciuolo, L., Ceccarelli, M., D’Amore, L., Murli, A.: Total variation regularization for edge preserving 3D SPECT imaging in high performance computing environments. Lecture Notes in Computer Science, Vol. 2330 LNCS, Issue PART 2, 171–180, (2002)
2.
Zurück zum Zitat Arcucci, R., D’Amore, L., Carracciuolo, L.: On the problem-decomposition of scalable 4D-Var Data Assimilation models, International Conference on High Performance Computing and Simulation (HPCS), pp. 589–594. ISBN 978-1-4673-7812-3, (2015) Arcucci, R., D’Amore, L., Carracciuolo, L.: On the problem-decomposition of scalable 4D-Var Data Assimilation models, International Conference on High Performance Computing and Simulation (HPCS), pp. 589–594. ISBN 978-1-4673-7812-3, (2015)
3.
Zurück zum Zitat Campagna, R., D’Amore, L., Murli, A.: An efficient algorithm for regularization of Laplace transform inversion in real case. J. Comput. Appl. Math. 210(1–2), 84–98 (2007)MathSciNetCrossRefMATH Campagna, R., D’Amore, L., Murli, A.: An efficient algorithm for regularization of Laplace transform inversion in real case. J. Comput. Appl. Math. 210(1–2), 84–98 (2007)MathSciNetCrossRefMATH
4.
Zurück zum Zitat Carracciuolo, L., D’Amore, L., Murli, A.: Towards a parallel component for imaging in PETSc programming environment: a case study in 3-D echocardiography. Parallel Comput. 32(1), 67–83 (2006)MathSciNetCrossRef Carracciuolo, L., D’Amore, L., Murli, A.: Towards a parallel component for imaging in PETSc programming environment: a case study in 3-D echocardiography. Parallel Comput. 32(1), 67–83 (2006)MathSciNetCrossRef
5.
Zurück zum Zitat Chung, J., Nagy, J.G.: An efficient iterative approach for large scale separable nonlinear inverse problems. SIAM J. Sci. Comput. 31(6), 4654–4674 (2010)MathSciNetCrossRefMATH Chung, J., Nagy, J.G.: An efficient iterative approach for large scale separable nonlinear inverse problems. SIAM J. Sci. Comput. 31(6), 4654–4674 (2010)MathSciNetCrossRefMATH
6.
Zurück zum Zitat D’Amore, L., Mele, V., Laccetti, G., Murli, A.: Mathematical Approach to the Performance Evaluation of Matrix Multiply Algorithm. Lecture Notes in Computer Science, Vol. 9574, pp. 25–34 (2016) D’Amore, L., Mele, V., Laccetti, G., Murli, A.: Mathematical Approach to the Performance Evaluation of Matrix Multiply Algorithm. Lecture Notes in Computer Science, Vol. 9574, pp. 25–34 (2016)
7.
Zurück zum Zitat D’Amore, L., Arcucci, R., Carracciuolo, L., Murli, A.: A scalable approach to variational data assimilation. J. Sci. Comput. 2, 239–257 (2014)MathSciNetCrossRefMATH D’Amore, L., Arcucci, R., Carracciuolo, L., Murli, A.: A scalable approach to variational data assimilation. J. Sci. Comput. 2, 239–257 (2014)MathSciNetCrossRefMATH
8.
Zurück zum Zitat D’Amore, L., Arcucci, R., Carracciuolo, L., Murli, A.: DD-oceanvar: a domain decomposition fully parallel data assimilation software in mediterranean sea. Procedia Comp Sci. 18, 1235–1244 (2013)CrossRef D’Amore, L., Arcucci, R., Carracciuolo, L., Murli, A.: DD-oceanvar: a domain decomposition fully parallel data assimilation software in mediterranean sea. Procedia Comp Sci. 18, 1235–1244 (2013)CrossRef
9.
Zurück zum Zitat D’Amore, L., Arcucci, R., Marcellino, L., Murli, A.: HPC computation issues of the incremental 3D variational data assimilation scheme in OceanVar software. J. Numer. Anal. Ind. Appl. Math. 7(3–4), 91–105 (2012)MathSciNetMATH D’Amore, L., Arcucci, R., Marcellino, L., Murli, A.: HPC computation issues of the incremental 3D variational data assimilation scheme in OceanVar software. J. Numer. Anal. Ind. Appl. Math. 7(3–4), 91–105 (2012)MathSciNetMATH
10.
Zurück zum Zitat D’Amore, L., Casaburi, D., Galletti, A., Marcellino, L., Murli, A.: Integration of emerging computer technologies for an efficient image sequences analysis. Integr. Comput. Aided Eng. 18(4), 365–378 (2011) D’Amore, L., Casaburi, D., Galletti, A., Marcellino, L., Murli, A.: Integration of emerging computer technologies for an efficient image sequences analysis. Integr. Comput. Aided Eng. 18(4), 365–378 (2011)
11.
Zurück zum Zitat D’Amore, L., Laccetti, G., Romano, D., Scotti, G.: Towards a parallel component in a GPU-CUDA environment: a case study with the L-BFGS Harwell routine. J. Comput. Math. 93(1), 59–76 (2015)MATH D’Amore, L., Laccetti, G., Romano, D., Scotti, G.: Towards a parallel component in a GPU-CUDA environment: a case study with the L-BFGS Harwell routine. J. Comput. Math. 93(1), 59–76 (2015)MATH
12.
Zurück zum Zitat D’Amore, L., Campagna, R., Galletti, A., Marcellino, L., Murli, A.: A smoothing spline that approximates Laplace transform functions only known on measurements on the real axis. Inverse Probl. 28(2), 37 (2012)MathSciNetMATH D’Amore, L., Campagna, R., Galletti, A., Marcellino, L., Murli, A.: A smoothing spline that approximates Laplace transform functions only known on measurements on the real axis. Inverse Probl. 28(2), 37 (2012)MathSciNetMATH
13.
Zurück zum Zitat D’Amore, L., Murli, A.: Regularization of a fourier series method for the Laplace transform inversion with real data authors of document. Inverse Probl. 18(4), 1185–1205 (2002)CrossRefMATH D’Amore, L., Murli, A.: Regularization of a fourier series method for the Laplace transform inversion with real data authors of document. Inverse Probl. 18(4), 1185–1205 (2002)CrossRefMATH
14.
Zurück zum Zitat D’Amore, L., Marcellino, L., Murli, A.: Image sequence inpainting: towards numerical software for detection and removal of local missing data via motion estimation. J. Comput. Appl. Math. 198(2), 396–413 (2007)MathSciNetCrossRefMATH D’Amore, L., Marcellino, L., Murli, A.: Image sequence inpainting: towards numerical software for detection and removal of local missing data via motion estimation. J. Comput. Appl. Math. 198(2), 396–413 (2007)MathSciNetCrossRefMATH
15.
Zurück zum Zitat Demmel, J.W., Higham, N.J., Schreiber, R.: Block LU Factorization. RIACS Technical Report no. 92–03, (1992) Demmel, J.W., Higham, N.J., Schreiber, R.: Block LU Factorization. RIACS Technical Report no. 92–03, (1992)
16.
Zurück zum Zitat ETP4HPC Agenda, European Technology Platform for High Performance Computing. Strategic research agenda achieving HPC leadership in Europe, (2013) ETP4HPC Agenda, European Technology Platform for High Performance Computing. Strategic research agenda achieving HPC leadership in Europe, (2013)
18.
Zurück zum Zitat Freitag, M.A., Nichols, N.K., Budd, C.J.: L1-regularisation for ill-posed problems in variational data assimilation. PAMM Proc. Appl. Math. Mech. 10(1), 665–668 (2010)CrossRef Freitag, M.A., Nichols, N.K., Budd, C.J.: L1-regularisation for ill-posed problems in variational data assimilation. PAMM Proc. Appl. Math. Mech. 10(1), 665–668 (2010)CrossRef
19.
Zurück zum Zitat Gallopoulos, E., Simoncini, V.: Iterative solution of multiple linear systems: In: Topping B.H.V., Papadrakaki M. (eds.) Theory, practice, parallelism, and applicationsIn Advances in Parallel and Vector Processing for Structural Mechanics, Proc. Second Intl. Conf. Computational Structures Technology, pp. 4751.Civil-Comp Press, Edinburgh (1994) Gallopoulos, E., Simoncini, V.: Iterative solution of multiple linear systems: In: Topping B.H.V., Papadrakaki M. (eds.) Theory, practice, parallelism, and applicationsIn Advances in Parallel and Vector Processing for Structural Mechanics, Proc. Second Intl. Conf. Computational Structures Technology, pp. 4751.Civil-Comp Press, Edinburgh (1994)
20.
Zurück zum Zitat Hansen, P.C.: Rank Deficient and Discrete Ill-posed Problems. SIAM, Philadelphia (1998)CrossRef Hansen, P.C.: Rank Deficient and Discrete Ill-posed Problems. SIAM, Philadelphia (1998)CrossRef
21.
Zurück zum Zitat Kalnay, E.: Atmospheric Modeling, Data Assimilation and Predictability. Cambridge University Press, Cambridge (2003) Kalnay, E.: Atmospheric Modeling, Data Assimilation and Predictability. Cambridge University Press, Cambridge (2003)
22.
Zurück zum Zitat Laccetti, G., Lapegna, M., Mele, V., Romano, D., Murli, A.: A double adaptive algorithm for multidimensional integration on multicore based HPC systems. Int. J. of Parallel Progam. 40(4), 397–409 (2012)CrossRef Laccetti, G., Lapegna, M., Mele, V., Romano, D., Murli, A.: A double adaptive algorithm for multidimensional integration on multicore based HPC systems. Int. J. of Parallel Progam. 40(4), 397–409 (2012)CrossRef
23.
25.
Zurück zum Zitat Murli, A., D’Amore, L., Laccetti, G., Gregoretti, F., Oliva, G.: A multi-grained distributed implementation of the parallel block conjugate gradient algorithm. Concurrency Comput. Pract. Exp. 22(15), 2053–2072 (2010) Murli, A., D’Amore, L., Laccetti, G., Gregoretti, F., Oliva, G.: A multi-grained distributed implementation of the parallel block conjugate gradient algorithm. Concurrency Comput. Pract. Exp. 22(15), 2053–2072 (2010)
26.
Zurück zum Zitat Murli, A., Boccia, V., Carracciuolo, L., D’Amore, L., Laccetti, G., Lapegna, M.: Monitoring and migration of a PETSc-based parallel application for medical imaging in a grid computing PSE. IFIP Int. Federation Inf. Process. 239, 421–432 (2007)CrossRef Murli, A., Boccia, V., Carracciuolo, L., D’Amore, L., Laccetti, G., Lapegna, M.: Monitoring and migration of a PETSc-based parallel application for medical imaging in a grid computing PSE. IFIP Int. Federation Inf. Process. 239, 421–432 (2007)CrossRef
27.
Zurück zum Zitat Nichols, N.K.: Mathematical concepts of data assimilation. In: Lahoz, W., Khattatov, B., Menard, R. (eds.) Data assimilation: Making Sense of Observations, pp. 13–40. Springer, Berlin (2010)CrossRef Nichols, N.K.: Mathematical concepts of data assimilation. In: Lahoz, W., Khattatov, B., Menard, R. (eds.) Data assimilation: Making Sense of Observations, pp. 13–40. Springer, Berlin (2010)CrossRef
29.
Zurück zum Zitat O’Leary, D.P., Simmons, J.A.: A bidiagonalization-regularization procedure for large scale discretizations of ill-posed problems. SIAM J. Sci. Stat. Comput. 2, 474489 (1981)MathSciNetMATH O’Leary, D.P., Simmons, J.A.: A bidiagonalization-regularization procedure for large scale discretizations of ill-posed problems. SIAM J. Sci. Stat. Comput. 2, 474489 (1981)MathSciNetMATH
30.
Zurück zum Zitat Paige, C.C., Saunders, M.A.: LSQR: an algorithm for sparse linear equations and sparse least squares. ACM Trans. Math. Softw. 8, 4371 (1982)MathSciNetMATH Paige, C.C., Saunders, M.A.: LSQR: an algorithm for sparse linear equations and sparse least squares. ACM Trans. Math. Softw. 8, 4371 (1982)MathSciNetMATH
35.
Zurück zum Zitat Reichel, L., Baglamana, J.: Decomposition methods for large linear discrete ill-posed problems. J. Comput. Appl. Math 198(2), 333–343 (2007)MathSciNetCrossRef Reichel, L., Baglamana, J.: Decomposition methods for large linear discrete ill-posed problems. J. Comput. Appl. Math 198(2), 333–343 (2007)MathSciNetCrossRef
36.
Zurück zum Zitat Tikhonov, A.N., Solution of incorrectly formulated problems and the regularization method, Dokl. Akad. Nauk. SSSR 151, : 501504 = Soviet Math. Dokl. 4(1963), 1035–1038 (1963) Tikhonov, A.N., Solution of incorrectly formulated problems and the regularization method, Dokl. Akad. Nauk. SSSR 151, : 501504 = Soviet Math. Dokl. 4(1963), 1035–1038 (1963)
Metadaten
Titel
A Decomposition of the Tikhonov Regularization Functional Oriented to Exploit Hybrid Multilevel Parallelism
verfasst von
Rossella Arcucci
Luisa D’Amore
Luisa Carracciuolo
Giuseppe Scotti
Giuliano Laccetti
Publikationsdatum
11.10.2016
Verlag
Springer US
Erschienen in
International Journal of Parallel Programming / Ausgabe 5/2017
Print ISSN: 0885-7458
Elektronische ISSN: 1573-7640
DOI
https://doi.org/10.1007/s10766-016-0460-3

Weitere Artikel der Ausgabe 5/2017

International Journal of Parallel Programming 5/2017 Zur Ausgabe

Premium Partner