Author: Mandar Gurav
-
NVIDIA Nsight Systems: Profiling for CUDA code
In this post we will look at the steps involved in profiling CUDA code using NVIDIA Nsight Systems. Let’s take a simple program that performs some array operations. To compile it, we can use the following command. Note that I am using “-arch=sm_86”, which instructs the compiler to generate code for compute capability 8.6…
-
Intel VTune Profiler GUI: Hotspots profiling for serial C code
In this post we will look at the steps involved in profiling serial code using the Intel VTune Profiler GUI. Let’s take a simple program that performs some array operations. To profile this code, we first need to compile it, using either the GNU compiler or the Intel compiler. OR To Profile…