run lammps in parallel with cuda acceleration

Thanks Steve! I got your point.