Analyzing and Tuning SPECAccel Performance for GPU Workloads on Lenovo ThinkSystem Servers
Planning / Implementation
The use of high-performance GPU accelerators is especially important in High Performance Computing (HPC) workloads. As a result, it is critical to ensure that the GPUs deliver as much processing power as possible by adjusting key performance settings and measuring the results.
The SPECAccel benchmark uses the OpenCL and OpenACC programming paradigms to provide a comparative measure of parallel computing performance among systems equipped with an accelerator. Through analysis of runtime characteristics, this paper provides a best-practices recipe for Lenovo® ThinkSystem™ servers to obtain the best performance for GPU workloads, as well as for other applications that have characteristics similar to the SPECAccel benchmark.
This paper is intended for ThinkSystem end users and technical sales representatives who want to understand how to tune GPU performance. The paper assumes readers are familiar with Linux and have basic experience with programming languages such as C/C++.
Table of Contents
Accelerator Programming Paradigm
Analysis of CPU performance
Analysis of memory performance
Analysis of PCIe performance
Analysis of NVIDIA V100 GPU performance
Performance world records