I help organizations accelerate their scientific computing applications on CPU and GPU platforms. I mainly work on parallelization and optimization at the application level. My expertise lies in analyzing an application's algorithms and data structures and mapping them to the target architecture.
Apart from code development, I actively conduct training programs in parallel programming, including CUDA, OpenMP, MPI, OpenACC, and SYCL/DPC++.
My detailed profile is available here – https://in.linkedin.com/in/mandargurav
You can contact me here – Contact form