Category: OpenACC
-
Profiling OpenACC Code using NVPROF
Profiling your OpenACC code on a remote system can be tricky sometimes. Many times we try to profile the code in cluster environment where we need to use a job scheduler to submit our jobs. In such scenarios, command line based profiling comes handy. This tutorials provides some usage examples for NVIDIA’s command line profiler…
-
Compiling and Running OpenACC Fortran Codes using PGI Fortran
In this tutorial we will learn how to compile and execute an OpenACC Fortran code using PGI Fortran Compiler. Let’s look at the sample vector addition code parallelized using OpenACC Fortran based parallel loop construct. We can compile this code for Nvidia GPU using following command – Or Here, ‘-ta=tesla’ option informs compiler that compiler…