Author: Mandar Gurav
-
Compiling and Running OpenACC Fortran Codes using PGI Fortran
In this tutorial we will learn how to compile and execute an OpenACC Fortran code using PGI Fortran Compiler. Let’s look at the sample vector addition code parallelized using OpenACC Fortran based parallel loop construct. We can compile this code for Nvidia GPU using following command – Or Here, ‘-ta=tesla’ option informs compiler that compiler…
-
Profiling Serial C codes using GNU’s profiler – gprof
This post covers the steps to profile a C code using GNU’s profiler – gprof. Profiling your serial code is one of the most important step in writing parallel codes. We use profilers to find out the most time consuming parts of the code. Let us consider following sample C code. Fore this code, we…