Christophe,
I was able to address a memory issue that affected Kokkos EAM on the GPU, so performance is now better (~2x faster in the best case, on 2 K80 GPUs x 4 MPI); see https://github.com/lammps/lammps/pull/398. This will be merged into LAMMPS soon.
Stan