Skip to main content
main-content
Top

Hint

Swipe to navigate through the articles of this issue

25-04-2018 | S.I.: Emerging Intelligent Algorithms for Edge-of-Things Computing | Issue 5/2019 Open Access

Neural Computing and Applications 5/2019

Using hardware counter-based performance model to diagnose scaling issues of HPC applications

Journal:
Neural Computing and Applications > Issue 5/2019
Authors:
Nan Ding, Shiming Xu, Zhenya Song, Baoquan Zhang, Jingmei Li, Zhigao Zheng

Abstract

Performance diagnosing for HPC applications can be extremely difficult due to their complicated performance behaviors. One hand, developers used to identify the potential performance bottlenecks by conducting detailed instrumentation, which may introduce significant performance overheads or even performance deviations. On the other hand, developers can only conduct small numbers of application runs for profiling the performance with the limitations on both computing resources and time duration. Meanwhile, the performance bottlenecks of HPC applications may vary with the degree of parallelism. To address these challenges, our paper proposes a systematic performance diagnosing method focusing on building an accurate and interpretable performance model with performance counters. Our method is able to diagnose the HPC application scaling issues by predicting its runtime and performance behaviors in different functions. After applying this modeling method on three real-world HPC applications, HOMME, CICE and OpenFoam, our evaluations show that our diagnosing method based on the performance model has the ability to diagnose the potential scaling issues, which is typically missed by the traditional performance diagnosing method and achieves about 10% prediction errors in a scale of 4096 MPI ranks on two problem sizes.
Literature
About this article

Other articles of this Issue 5/2019

Neural Computing and Applications 5/2019 Go to the issue

S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

Deployment of smart home management system at the edge: mechanisms and protocols

S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

An efficient cost-based algorithm for scheduling workflow tasks in cloud computing systems

S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

LION IDS: A meta-heuristics approach to detect DDoS attacks against Software-Defined Networks

Premium Partner

    Image Credits