Dear all,
I’m running into two different issues using and compiling USER-CUDA.
I think they may be related, and I hope you can give me some advice.
- A few months ago I installed lammps with the cuda package on a workstation
with a K20. Lammps version is 23-Nov-2013.
I was able to compile it, run a few test and one long simulation (4 days) using
eam. This is just to say that this version works.
Then, a few days ago I wanted to replicate some simulations I saw in a paper; I
wrote the attached script “cu_nanopillar…”.
I tested it on CPUs (1, 2 and 8 mpi tasks) on both the workstation and on my PC
(lammps 20-Mar-2014, cpu only), and it “works”.
The simulation box shrinks a little (~4 angstroms) along Z and then the system
heats up at the rate I am asking.
However, when I try to run it with suffix=cuda, the script runs fine, but the output
changes: the simulation box grows along Z very quickly, and the system deforms
a lot. I made a few test changing some relaxation time scales in the script (namely
in the fix npt), but it does not change the overall behavior of the system.
You can clearly see the effect by comparing the outputs: just let the simulation run
for ~3000 timesteps, and observe that in the CPU-case the box shrinks along z,
and grows when using “user-cuda”.
This is rather unexpected, and led me to issue #2.
- I wanted to make sure that there wasn’t any issue with my older version of
lammps, so I tried to compile two recent tarballs: the latest, 18-Apr-2014 and
the 20-Mar-2014 I was testing on my local machine (for non-cuda stuff).
Unfortunately, I couldn’t compile either of them. Apparently, the folder /lib/cuda
has not changed since November, and in fact liblammpscuda compiles apparently
ok, and the .a file looks identical between the different versions.
When I switch to /src and try to compile lammps with “make openmpi”, however,
the compilation fails in two different ways for the two versions.
I already made sure that the makefiles are identical to the original (Nov) one,
and besides, all lammps versions compile sanely without package user-cuda.
Also, I stripped off lammps of all packages except “manybody” and “user-cuda”,
to make sure there weren’t any interferences, but this does not make any difference.
The OS of the workstation is Fedora 18, with cuda v5.5.
I am attaching the screen of the compile failure for both versions.
As an added information, I also am unable to compile both versions, with “user-cuda”
on my desktop PC (it has a quadro 600).
I just updated to (K)ubuntu 14.04 from 13.10. The older version I had working is the
2013Mar11; this was working just a few days ago; also it still compiles correctly
on this setup (tested today).
So, does have any sense? Do you have any idea of what I’m doing wrong?
I hope I was clear enough, but let me know if I can tell you more details.
Also: thank you very much for your work and efforts!
Best,
Alessandro Sellerio
comperror_20140320_fc18.txt (3.84 KB)
cu_nanopillar_comments.lmp (2.75 KB)
comperror_20140418_fc18.txt (3.93 KB)
comperror_20140418_ubuntu.txt (6.86 KB)