Issue 3-4/2011
ISC'11
Content (21 Articles)
Astrophysical particle simulations with large custom GPU clusters on three continents
R. Spurzem, P. Berczik, I. Berentzen, K. Nitadori, T. Hamada, G. Marcus, A. Kugel, R. Männer, J. Fiestas, R. Banerjee, R. Klessen
Optimized HPL for AMD GPU and multi-core CPU usage
Matthias Bach, Matthias Kretz, Volker Lindenstruth, David Rohr
Simulation of bevel gear cutting with GPGPUs—performance and productivity
Sandra Wienke, Dmytro Plotnikov, Dieter an Mey, Christian Bischof, Ario Hardjosuwito, Christof Gorgels, Christian Brecher
Predictive analysis of a hydrodynamics application on large-scale CMP clusters
J. A. Davis, G. R. Mudalige, S. D. Hammond, J. A. Herdman, I. Miller, S. A. Jarvis
Shared-memory, distributed-memory, and mixed-mode parallelisation of a CFD simulation code
Adrian Jackson, M. Sergio Campobasso
Wavelet-based adaptive multi-resolution solver on heterogeneous parallel architecture for computational fluid dynamics
L. H. Han, T. Indinger, X. Y. Hu, N. A. Adams
Automatic code generation and tuning for stencil kernels on modern shared memory architectures
Matthias Christen, Olaf Schenk, Helmar Burkhart
Designing and dynamically load balancing hybrid LU for multi/many-core
Michael Deisher, Mikhail Smelyanskiy, Brian Nickerson, Victor W. Lee, Michael Chuvelev, Pradeep Dubey
Unbalanced tree search on a manycore system using the GPI programming model
Rui Machado, Carsten Lojewski, Salvador Abreu, Franz-Josef Pfreundt
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT
Krishna Kandalla, Hari Subramoni, Karen Tomko, Dmitry Pekurovsky, Sayantan Sur, Dhabaleswar K. Panda
Mapping communication layouts to network hardware characteristics on massive-scale blue gene systems
Pavan Balaji, Rinku Gupta, Abhinav Vishnu, Pete Beckman
MVAPICH2-GPU: optimized GPU to GPU communication for InfiniBand clusters
Hao Wang, Sreeram Potluri, Miao Luo, Ashish Kumar Singh, Sayantan Sur, Dhabaleswar K. Panda
The development of Mellanox/NVIDIA GPUDirect over InfiniBand—a new model for GPU to GPU communications
Gilad Shainer, Ali Ayoub, Pak Lui, Tong Liu, Michael Kagan, Christian R. Trott, Greg Scantlen, Paul S. Crozier
A system level view of Petascale I/O on IBM Blue Gene/P
Wolfgang Frings, Michael Hennecke
Baler: deterministic, lossless log message clustering tool
Narate Taerat, Jim Brandt, Ann Gentile, Matthew Wong, Chokchai Leangsuksun
Fault oblivious high performance computing with dynamic task replication and substitution
Yevgeniy Vorobeychik, Jackson R. Mayo, Robert C. Armstrong, Ronald G. Minnich, Don W. Rudish
Ultra low latency market data feed on IBM PowerENTM
Davide Pasetto, Karol Lynch, Robert Tucker, Brendan Maguire, Fabrizio Petrini, Hubertus Franke
A system architecture supporting high-performance and cloud computing in an academic consortium environment
Michael Oberg, Matthew Woitaszek, Theron Voran, Henry M. Tufo
Experiments with the Fresh Breeze tree-based memory model
Jack B. Dennis, Guang R. Gao, Xiao X. Meng