Hi Rick,
Thanks for the info and quick reply. In reading further, it looks like OpenMP 4.0 and above will work for offloading to the GPU instead of CPU. I'm looking into implementing this on our HPC environment (RedHat) and further down the line a docker or singularity container. For the container route I was initially looking at CentOS since it aligns better to RHEL but I'm not tied to it if you have any recommendations.
Just curious, is there any work on this front of adding a switch to some of the more computationally heavy components (e.g. nonlinear warping) to utilize a GPU if present otherwise fallback to CPU parallelization in OpenMP or using something like CUDA (NVIDIA cards only) or OpenACC(opens it up to AMD etc) directly if a GPU is detected?
Thanks,
Ajay
Edited 1 time(s). Last edit at 05/10/2022 03:45PM by AjaySK.