"ERROR: GPU library not compiled for this accelerator" with pascal p100 and cuda 8.0

Fabricio_Cannini · May 16, 2017, 1:42am

Hello all

I'm testing a compilation of the 4May17 version of lammps, and am facing the error on the subject. My environment is the following:

OS = centos 7.3
CC, CXX, FC = intel 17.0
MPI = impi 2017.3
CUDA = 8.0
GPUS = 4 pascal P100

I tried to set CUDA_ARCH to "-arch=sm_60" , "-arch=sm_61" and "-arch=sm_62", all give the same error.

I configured the build with the following command:

Stan_Moore · May 16, 2017, 4:49pm

Using CUDA_ARCH = -arch=sm_60 works fine for me with the GNU compiler. I think in general using the Intel compiler with CUDA is tricky. For more help you need to post your full gpu lib and LAMMPS Makefiles.

Stan

Nguyen_Dac_Trung · May 16, 2017, 5:05pm

I would add that, after you specify CUDA_ARCH = -arch=sm_60 in lib/gpu/Makefile.foo, do

make -f Makefile.foo clean

in lib/gpu, and rebuild libgpu.a with

make -f Makefile.foo -j4

Then in src/, relink the LAMMPS binary to ensure that the binary is linked to the updated libgpu.a.

Finally, double check if the binary is indeed launched on the compute node that has the P100 cards as libgpu.a is just built for.

Hope it helps,
-Trung

Fabricio_Cannini · May 16, 2017, 5:41pm

Paths are relative to the initial tarball directory.

TIA,
Fabricio

lib/gpu/Makefile.linux

Fabricio_Cannini · May 22, 2017, 3:23pm

Hi there

Just for the record, the combination of intel 17.0 + cuda 8.0 would probably not work, because the version is not officially supported by nvcc, but thanks anyway for the tips.

http://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#title-new-cuda-tools