Category: Profiling Tools
-
Nvidia Nsight Systems : Profiling for CUDA code
In this post we will look at steps involved in profiling of the CUDA code using Nvidia Nsight Systems. Let’s take a simple code which performs some array operations. To compile this code, we can use following command. Please note that I am using “-arch=sm_86” which instructs compiler to generate code for compute capability 8.6…
-
Intel Vtune Profiler GUI : Hotspots profiling for serial C code
In this post we will look at steps involved in profiling of the serial code using Intel Vtune Profiler GUI. Let’s take a simple code which performs some array operations. To Profile this code, we need to first compile this code. We can compile this code using GNU compiler or Intel Compiler. OR To Profile…
-
Intel Vtune Profiler Command Line Interface (CLI) : Hotspots profiling for serial C code
In this post we will look at steps involved in profiling of the serial code using Intel Vtune Profiler using command line interface. Let’s take a simple code which performs some array operations. To Profile this code, we need to first compile this code. We can compile this code using GNU compiler or Intel Compiler.…
-
Profiling OpenACC Code using NVPROF
Profiling your OpenACC code on a remote system can be tricky sometimes. Many times we try to profile the code in cluster environment where we need to use a job scheduler to submit our jobs. In such scenarios, command line based profiling comes handy. This tutorials provides some usage examples for NVIDIA’s command line profiler…
-
Profiling Serial C codes using GNU’s profiler – gprof
This post covers the steps to profile a C code using GNU’s profiler – gprof. Profiling your serial code is one of the most important step in writing parallel codes. We use profilers to find out the most time consuming parts of the code. Let us consider following sample C code. Fore this code, we…