Kokkos GPU performance for the data Stillinger-Weber (SW)

Rengan,

FYI, both the vanilla LAMMPS and the Kokkos/CUDA versions of 3-body potentials such as Stillinger-Weber are now significantly faster, thanks to recent improvements in the code.

Stan