Skip to main content
Top
Published in: The Journal of Supercomputing 1/2021

29-04-2020

Tunable approximations to control time-to-solution in an HPC molecular docking Mini-App

Authors: Davide Gadioli, Gianluca Palermo, Stefano Cherubin, Emanuele Vitali, Giovanni Agosta, Candida Manelfi, Andrea R. Beccari, Carlo Cavazzoni, Nico Sanna, Cristina Silvano

Published in: The Journal of Supercomputing | Issue 1/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The drug discovery process involves several tasks to be performed in vivo, in vitro and in silico. Molecular docking is a task typically performed in silico. It aims at finding the three-dimensional pose of a given molecule when it interacts with the target protein binding site. This task is often done for virtual screening a huge set of molecules to find the most promising ones, which will be forwarded to the later stages of the drug discovery process. Given the huge complexity of the problem, molecular docking cannot be solved by exploring the entire space of the ligand poses. State-of-the-art approaches face the problem by sampling the space of the ligand poses to generate results in a reasonable time budget. In this work, we improve the geometric approach to molecular docking by introducing tunable approximations. In particular, we analysed and enriched the original implementation with tunable software knobs to explore and control the performance-accuracy trade-offs. We modelled time-to-solution of the virtual screening task as a function of software knobs, input data features, and available computational resources. Therefore, the application can autotune its configuration according to a user-defined time budget. We used a Mini-App derived by LiGenDock—a state-of-the-art molecular docking application—to validate the proposed approach. We run the enhanced Mini-App on a high-performance computing system by using a very large database of pockets and ligands. The proposed approach exposes a time-to-solution interval spanning more than one order of magnitude with accuracy degradation up to 30%, more in general providing different accuracy levels according to the needs of the virtual screening campaign.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ansel J, Wong YL, Chan C, Olszewski M, Edelman A, Amarasinghe S (2011) Language and compiler support for auto-tuning variable-accuracy algorithms. In: Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization. IEEE Computer Society, pp 85–96 Ansel J, Wong YL, Chan C, Olszewski M, Edelman A, Amarasinghe S (2011) Language and compiler support for auto-tuning variable-accuracy algorithms. In: Proceedings of the 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization. IEEE Computer Society, pp 85–96
3.
go back to reference Beato C, Beccari AR, Cavazzoni C, Lorenzi S, Costantino G (2013) Use of experimental design to optimize docking performance: the case of ligendock, the docking module of ligen, a new de novo design program. J Chem Inf Model 53(6):1503–1517CrossRef Beato C, Beccari AR, Cavazzoni C, Lorenzi S, Costantino G (2013) Use of experimental design to optimize docking performance: the case of ligendock, the docking module of ligen, a new de novo design program. J Chem Inf Model 53(6):1503–1517CrossRef
4.
go back to reference Beccari AR, Cavazzoni C, Beato C, Costantino G (2013) Ligen: a high performance workflow for chemistry driven de novo design. J Chem Inf Model 53(6):1518–1527CrossRef Beccari AR, Cavazzoni C, Beato C, Costantino G (2013) Ligen: a high performance workflow for chemistry driven de novo design. J Chem Inf Model 53(6):1518–1527CrossRef
5.
go back to reference Beccari AR, Gemei M, Monte ML, Menegatti N, Fanton M, Pedretti A, Bovolenta S, Nucci C, Molteni A, Rossignoli A, Brandolini L, Taddei A, Za L, Liberati C, Vistoli G (2017) Novel selective, potent naphthyl trpm8 antagonists identified through a combined ligand- and structure-based virtual screening approach. Sci Rep 7:1–15CrossRef Beccari AR, Gemei M, Monte ML, Menegatti N, Fanton M, Pedretti A, Bovolenta S, Nucci C, Molteni A, Rossignoli A, Brandolini L, Taddei A, Za L, Liberati C, Vistoli G (2017) Novel selective, potent naphthyl trpm8 antagonists identified through a combined ligand- and structure-based virtual screening approach. Sci Rep 7:1–15CrossRef
7.
go back to reference Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucl Acids Res 28:235–242CrossRef Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucl Acids Res 28:235–242CrossRef
9.
go back to reference Brady GP, Stouten PF (2000) Fast prediction and visualization of protein binding pockets with pass. J Comput Aided Mol Des 14(4):383–401CrossRef Brady GP, Stouten PF (2000) Fast prediction and visualization of protein binding pockets with pass. J Comput Aided Mol Des 14(4):383–401CrossRef
11.
go back to reference Chippa VK, Chakradhar ST, Roy K, Raghunathan A (2013) Analysis and characterization of inherent application resilience for approximate computing. In: Proceedings of the 50th Annual Design Automation Conference, DAC ’13. ACM, New York, pp 113:1–113:9. https://doi.org/10.1145/2463209.2488873 Chippa VK, Chakradhar ST, Roy K, Raghunathan A (2013) Analysis and characterization of inherent application resilience for approximate computing. In: Proceedings of the 50th Annual Design Automation Conference, DAC ’13. ACM, New York, pp 113:1–113:9. https://​doi.​org/​10.​1145/​2463209.​2488873
12.
go back to reference Ding Y, Ansel J, Veeramachaneni K, Shen X, O’Reilly UM, Amarasinghe S (2015) Autotuning algorithmic choice for input sensitivity. In: ACM SIGPLAN Notices, vol 50. ACM, pp 379–390 Ding Y, Ansel J, Veeramachaneni K, Shen X, O’Reilly UM, Amarasinghe S (2015) Autotuning algorithmic choice for input sensitivity. In: ACM SIGPLAN Notices, vol 50. ACM, pp 379–390
13.
go back to reference Esmaeilzadeh H, Sampson A, Ceze L, Burger D (2012) Architecture support for disciplined approximate programming. In: Proceedings of the Seventeenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XVII. ACM, New York, pp 301–312. https://doi.org/10.1145/2150976.2151008 Esmaeilzadeh H, Sampson A, Ceze L, Burger D (2012) Architecture support for disciplined approximate programming. In: Proceedings of the Seventeenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XVII. ACM, New York, pp 301–312. https://​doi.​org/​10.​1145/​2150976.​2151008
14.
go back to reference Everitt C (2001) Interactive order-independent transparency. White Pap nVIDIA 2(6):7 Everitt C (2001) Interactive order-independent transparency. White Pap nVIDIA 2(6):7
15.
go back to reference Ewing TJ, Makino S, Skillman AG, Kuntz ID (2001) DOCK 4.0: search strategies for automated molecular docking of flexible molecule databases. J Comput Aided Mol Des 15(5):411–428CrossRef Ewing TJ, Makino S, Skillman AG, Kuntz ID (2001) DOCK 4.0: search strategies for automated molecular docking of flexible molecule databases. J Comput Aided Mol Des 15(5):411–428CrossRef
17.
go back to reference Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, Repasky MP, Knoll EH, Shelley M, Perry JK, Shaw DE, Francis P, Shenkin PS (2004) Glide: a new approach for rapid, accurate docking and scoring: 1. Method and assessment of docking accuracy. J Med Chem 47(7):1739–1749. https://doi.org/10.1021/jm0306430CrossRef Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, Repasky MP, Knoll EH, Shelley M, Perry JK, Shaw DE, Francis P, Shenkin PS (2004) Glide: a new approach for rapid, accurate docking and scoring: 1. Method and assessment of docking accuracy. J Med Chem 47(7):1739–1749. https://​doi.​org/​10.​1021/​jm0306430CrossRef
18.
go back to reference Frigo M, Johnson SG (2005) The design and implementation of FFTW3. Proc IEEE 93(2):216–231CrossRef Frigo M, Johnson SG (2005) The design and implementation of FFTW3. Proc IEEE 93(2):216–231CrossRef
19.
go back to reference Fuller-Rowell T (1984) A two-dimensional, high-resolution, nested-grid model of the thermosphere: 1. Neutral response to an electric field “spike”. J Geophys Res Space Phys 89(A5):2971–2990CrossRef Fuller-Rowell T (1984) A two-dimensional, high-resolution, nested-grid model of the thermosphere: 1. Neutral response to an electric field “spike”. J Geophys Res Space Phys 89(A5):2971–2990CrossRef
20.
go back to reference Fuller-Rowell T (1985) A two-dimensional, high-resolution, nested-grid model of the thermosphere: 2. Response of the thermosphere to narrow and broad electrodynamic features. J Geophys Res Space Phys 90(A7):6567–6586CrossRef Fuller-Rowell T (1985) A two-dimensional, high-resolution, nested-grid model of the thermosphere: 2. Response of the thermosphere to narrow and broad electrodynamic features. J Geophys Res Space Phys 90(A7):6567–6586CrossRef
24.
25.
go back to reference Hoffmann H, Sidiroglou S, Carbin M, Misailovic S, Agarwal A, Rinard M (2011) Dynamic knobs for responsive power-aware computing. In: Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XVI. ACM, New York, pp 199–212. https://doi.org/10.1145/1950365.1950390 Hoffmann H, Sidiroglou S, Carbin M, Misailovic S, Agarwal A, Rinard M (2011) Dynamic knobs for responsive power-aware computing. In: Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XVI. ACM, New York, pp 199–212. https://​doi.​org/​10.​1145/​1950365.​1950390
26.
go back to reference Jain AN (2007) Surflex-Dock 2.1: robust performance from ligand energetic modeling, ring flexibility, and knowledge-based search. J Comput Aided Mol Des 21(5):281–306CrossRef Jain AN (2007) Surflex-Dock 2.1: robust performance from ligand energetic modeling, ring flexibility, and knowledge-based search. J Comput Aided Mol Des 21(5):281–306CrossRef
27.
go back to reference Jessup ER, Motter P, Norris B, Sood K (2016) Performance-based numerical solver selection in the lighthouse framework. SIAM J Sci Comput 38:S750–S771MathSciNetMATHCrossRef Jessup ER, Motter P, Norris B, Sood K (2016) Performance-based numerical solver selection in the lighthouse framework. SIAM J Sci Comput 38:S750–S771MathSciNetMATHCrossRef
28.
go back to reference Jiang F, Kim SH (1991) “Soft docking”: matching of molecular surface cubes. J Mol Biol 219(1):79–102CrossRef Jiang F, Kim SH (1991) “Soft docking”: matching of molecular surface cubes. J Mol Biol 219(1):79–102CrossRef
29.
go back to reference Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development and validation of a genetic algorithm for flexible docking. J Mol Biol 267(3):727–748CrossRef Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development and validation of a genetic algorithm for flexible docking. J Mol Biol 267(3):727–748CrossRef
33.
go back to reference Lionta E, Spyrou G, Vassilatis DK, Cournia Z (2014) Structure-based virtual screening for drug discovery: principles, applications and recent advances. Curr Top Med Chem 14(16):1923–1938CrossRef Lionta E, Spyrou G, Vassilatis DK, Cournia Z (2014) Structure-based virtual screening for drug discovery: principles, applications and recent advances. Curr Top Med Chem 14(16):1923–1938CrossRef
34.
go back to reference Liu M, Wang S (1999) MCDOCK: a Monte Carlo simulation approach to the molecular docking problem. J Comput Aided Mol Des 13(5):435–451CrossRef Liu M, Wang S (1999) MCDOCK: a Monte Carlo simulation approach to the molecular docking problem. J Comput Aided Mol Des 13(5):435–451CrossRef
36.
go back to reference Malossi ACI, Schaffner M, Molnos A, Gammaitoni L, Tagliavini G, Emerson A, Tomás A, Nikolopoulos DS, Flamand E, Wehn N (2018) The transprecision computing paradigm: concept, design, and applications. In: Design, Automation Test in Europe Conference Exhibition (DATE), pp 1105–1110. https://doi.org/10.23919/DATE.2018.8342176 Malossi ACI, Schaffner M, Molnos A, Gammaitoni L, Tagliavini G, Emerson A, Tomás A, Nikolopoulos DS, Flamand E, Wehn N (2018) The transprecision computing paradigm: concept, design, and applications. In: Design, Automation Test in Europe Conference Exhibition (DATE), pp 1105–1110. https://​doi.​org/​10.​23919/​DATE.​2018.​8342176
40.
41.
go back to reference Palma PN, Krippahl L, Wampler JE, Moura JJ (2000) Bigger: a new (soft) docking algorithm for predicting protein interactions. Proteins Struct Funct Bioinform 39(4):372–384CrossRef Palma PN, Krippahl L, Wampler JE, Moura JJ (2000) Bigger: a new (soft) docking algorithm for predicting protein interactions. Proteins Struct Funct Bioinform 39(4):372–384CrossRef
42.
go back to reference Püschel M, Moura JM, Singer B, Xiong J, Johnson J, Padua D, Veloso M, Johnson RW (2004) Spiral: a generator for platform-adapted libraries of signal processing algorithms. Int J High Perform Comput Appl 18(1):21–45CrossRef Püschel M, Moura JM, Singer B, Xiong J, Johnson J, Padua D, Veloso M, Johnson RW (2004) Spiral: a generator for platform-adapted libraries of signal processing algorithms. Int J High Perform Comput Appl 18(1):21–45CrossRef
43.
go back to reference Samadi M, Lee J, Jamshidi DA, Hormati A, Mahlke S (2013) Sage: self-tuning approximation for graphics engines. In: Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-46. ACM, New York, pp 13–24. https://doi.org/10.1145/2540708.2540711 Samadi M, Lee J, Jamshidi DA, Hormati A, Mahlke S (2013) Sage: self-tuning approximation for graphics engines. In: Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO-46. ACM, New York, pp 13–24. https://​doi.​org/​10.​1145/​2540708.​2540711
44.
go back to reference Schellhammer I, Rarey M (2004) Flexx-scan: fast, structure-based virtual screening. Proteins Struc Funct Bioinform 57(3):504–517CrossRef Schellhammer I, Rarey M (2004) Flexx-scan: fast, structure-based virtual screening. Proteins Struc Funct Bioinform 57(3):504–517CrossRef
45.
go back to reference Sidiroglou-Douskos S, Misailovic S, Hoffmann H, Rinard M (2011) Managing performance vs. accuracy trade-offs with loop perforation. In: Proceedings of the 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering, ESEC/FSE ’11. ACM, New York, pp 124–134. https://doi.org/10.1145/2025113.2025133 Sidiroglou-Douskos S, Misailovic S, Hoffmann H, Rinard M (2011) Managing performance vs. accuracy trade-offs with loop perforation. In: Proceedings of the 19th ACM SIGSOFT Symposium and the 13th European Conference on Foundations of Software Engineering, ESEC/FSE ’11. ACM, New York, pp 124–134. https://​doi.​org/​10.​1145/​2025113.​2025133
46.
go back to reference Silvano C, Agosta G, Cherubin S, Gadioli D, Palermo G, Bartolini A, Benini L, Martinovič J, Palkovič M, Slaninová K, Bispo J, Cardoso MP, Abreu R, Pinto P, Cavazzoni C, Sanna N, Beccari AR, Cmar R, Rohou E (2016) The ANTAREX approach to autotuning and adaptivity for energy efficient HPC systems. In: Proceedings of the ACM International Conference on Computing Frontiers, CF ’16. ACM, New York, pp 288–293. https://doi.org/10.1145/2903150.2903470 Silvano C, Agosta G, Cherubin S, Gadioli D, Palermo G, Bartolini A, Benini L, Martinovič J, Palkovič M, Slaninová K, Bispo J, Cardoso MP, Abreu R, Pinto P, Cavazzoni C, Sanna N, Beccari AR, Cmar R, Rohou E (2016) The ANTAREX approach to autotuning and adaptivity for energy efficient HPC systems. In: Proceedings of the ACM International Conference on Computing Frontiers, CF ’16. ACM, New York, pp 288–293. https://​doi.​org/​10.​1145/​2903150.​2903470
47.
go back to reference Srinivasan J, Castellino A, Bradley EK, Eksterowicz JE, Grootenhuis PDJ, Putta S, Stanton RV (2002) Evaluation of a novel shape-based computational filter for lead evolution: application to thrombin inhibitors. J Med Chem 45(12):2494–2500. https://doi.org/10.1021/jm010494qCrossRef Srinivasan J, Castellino A, Bradley EK, Eksterowicz JE, Grootenhuis PDJ, Putta S, Stanton RV (2002) Evaluation of a novel shape-based computational filter for lead evolution: application to thrombin inhibitors. J Med Chem 45(12):2494–2500. https://​doi.​org/​10.​1021/​jm010494qCrossRef
48.
go back to reference Sui X, Lenharth A, Fussell DS, Pingali K (2016) Proactive control of approximate programs. ACM SIGOPS Oper Syst Rev 50(2):607–621CrossRef Sui X, Lenharth A, Fussell DS, Pingali K (2016) Proactive control of approximate programs. ACM SIGOPS Oper Syst Rev 50(2):607–621CrossRef
49.
go back to reference Thomsen R, Christensen MH (2006) MolDock: a new technique for high-accuracy molecular docking. J Med Chem 49(11):3315–3321CrossRef Thomsen R, Christensen MH (2006) MolDock: a new technique for high-accuracy molecular docking. J Med Chem 49(11):3315–3321CrossRef
51.
go back to reference Ullrich PA, Jablonowski C (2011) An analysis of 1d finite-volume methods for geophysical problems on refined grids. J Comput Phys 230(3):706–725MathSciNetMATHCrossRef Ullrich PA, Jablonowski C (2011) An analysis of 1d finite-volume methods for geophysical problems on refined grids. J Comput Phys 230(3):706–725MathSciNetMATHCrossRef
52.
go back to reference Vassiliadis V, Chalios C, Parasyris K, Antonopoulos CD, Lalis S, Bellas N, Vandierendonck H, Nikolopoulos DS (2016) Exploiting significance of computations for energy-constrained approximate computing. Int J Parallel Program 44:1078–1098CrossRef Vassiliadis V, Chalios C, Parasyris K, Antonopoulos CD, Lalis S, Bellas N, Vandierendonck H, Nikolopoulos DS (2016) Exploiting significance of computations for energy-constrained approximate computing. Int J Parallel Program 44:1078–1098CrossRef
54.
55.
go back to reference Vuduc R, Demmel JW, Yelick KA (2005) OSKI: a library of automatically tuned sparse matrix kernels. J Phys Conf Ser 16:521CrossRef Vuduc R, Demmel JW, Yelick KA (2005) OSKI: a library of automatically tuned sparse matrix kernels. J Phys Conf Ser 16:521CrossRef
56.
go back to reference Wang W, Killeen TL, Burns AG, Roble RG (1999) A high-resolution, three-dimensional, time dependent, nested grid model of the coupled thermosphere-ionosphere. J Atmos Sol Terr Phys 61(5):385–397CrossRef Wang W, Killeen TL, Burns AG, Roble RG (1999) A high-resolution, three-dimensional, time dependent, nested grid model of the coupled thermosphere-ionosphere. J Atmos Sol Terr Phys 61(5):385–397CrossRef
58.
go back to reference Whaley RC, Dongarra JJ (1998) Automatically tuned linear algebra software. In: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing. IEEE Computer Society, pp 1–27 Whaley RC, Dongarra JJ (1998) Automatically tuned linear algebra software. In: Proceedings of the 1998 ACM/IEEE Conference on Supercomputing. IEEE Computer Society, pp 1–27
Metadata
Title
Tunable approximations to control time-to-solution in an HPC molecular docking Mini-App
Authors
Davide Gadioli
Gianluca Palermo
Stefano Cherubin
Emanuele Vitali
Giovanni Agosta
Candida Manelfi
Andrea R. Beccari
Carlo Cavazzoni
Nico Sanna
Cristina Silvano
Publication date
29-04-2020
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 1/2021
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03295-x

Other articles of this Issue 1/2021

The Journal of Supercomputing 1/2021 Go to the issue

Premium Partner