Tag: Beginner
-
NVIDIA Nsight Systems: Profiling CUDA code
This post walks through the steps for profiling CUDA code with NVIDIA Nsight Systems. We start with a simple program that performs some array operations. To compile it, we can use the following command. Note that I am using “-arch=sm_86”, which instructs the compiler to generate code for compute capability 8.6…
-
Intel VTune Profiler GUI: Hotspots profiling for serial C code
This post walks through the steps for profiling serial code with the Intel VTune Profiler GUI. We start with a simple program that performs some array operations. Before profiling, the code must be compiled, using either the GNU compiler or the Intel compiler. OR To profile…
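The post's own listing is not reproduced in this excerpt. As a rough stand-in, a "simple code which performs some array operations" might look like the sketch below. The function names `init_arrays`, `add_arrays`, and `sum_array` are hypothetical and chosen only for illustration; they are not taken from the original post.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: fill a[i] = i and b[i] = 2*i as sample input. */
void init_arrays(double *a, double *b, size_t n)
{
    assert(a != NULL && b != NULL);
    for (size_t i = 0; i < n; i++) {
        a[i] = (double)i;
        b[i] = 2.0 * (double)i;
    }
}

/* Hypothetical kernel: element-wise sum c[i] = a[i] + b[i]. */
void add_arrays(const double *a, const double *b, double *c, size_t n)
{
    assert(a != NULL && b != NULL && c != NULL);
    for (size_t i = 0; i < n; i++) {
        c[i] = a[i] + b[i];
    }
}

/* Hypothetical reduction: return the sum of all n elements of c. */
double sum_array(const double *c, size_t n)
{
    assert(c != NULL);
    double total = 0.0;
    for (size_t i = 0; i < n; i++) {
        total += c[i];
    }
    return total;
}
```

Called from a `main` over a few million elements, separate loops like these give a profiler distinct functions to attribute time to. Building with debug symbols (for example `-g`) generally helps a profiler map hotspots back to source lines.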
-
Intel VTune Profiler Command Line Interface (CLI): Hotspots profiling for serial C code
This post walks through the steps for profiling serial code with the Intel VTune Profiler command line interface. We start with a simple program that performs some array operations. Before profiling, the code must be compiled, using either the GNU compiler or the Intel compiler…
-
Message Passing Interface (MPI): MPI_Recv example
This post covers a simple MPI send and receive operation, with the commands to compile and run the code and a sample of its output.
-
Message Passing Interface (MPI): MPI_Send example
This post covers a simple MPI send and receive operation, with the commands to compile and run the code and a sample of its output.