Please recommend one GPU model for computing with LAMMPS on our machine

_Wenqiang_Liu · November 21, 2019, 7:45am

Hi,

Our group are going to purchase several machines. I have heard a lot about GPU speed-up and I want to have a try with that.

I have read the doc page of GPU package and searched on the mail list, but due to that I do not have sufficient knowledge on this field. I still do not have a clue that how to choose the proper model for us considering the price as well as the performance. I see several threads of the related topics on mail list, but the time those mails posted is several year ago.

My questions are:

1. When use GPUs for LAMMPS, it is recommended that to use double-precision calculations. The results would be not very accurate if single-precision is used. Is that right? How do I know whether double or single precision calculation should be used in one particular simulation?

2. Is that right to select the GPU model mainly based on the the rank of double-precision FLOPS?

3. I found several articles said that GPUs for the consumer market like GTX 2080 super have much higher price-to-performance ratio. Although the price of Telsa series is several times expensive. but the increase for the speed-up is much lower. Does such saying make sense for LAMMPS?

Finally, it would be much appreciated if you could recommend a GPU model with high price-to-performance ratio for us. We are going to purchase machines with SuperMicro X11DAi-N mother board shipped.

Thanks very much in advance!

akohlmey · November 21, 2019, 11:55am

Hi,

Our group are going to purchase several machines. I have heard a lot
about GPU speed-up and I want to have a try with that.

I have read the doc page of GPU package and searched on the mail list,
but due to that I do not have sufficient knowledge on this field. I
still do not have a clue that how to choose the proper model for us
considering the price as well as the performance. I see several threads
of the related topics on mail list, but the time those mails posted is
several year ago.

My questions are:

When use GPUs for LAMMPS, it is recommended that to use
double-precision calculations. The results would be not very accurate if
single-precision is used. Is that right?

it depends. different parts of the computation are affected by this differently.
in MD there is a lot of error cancellation going on. it often depends on how much, whether you can get away with single precision or not, and that depends on the system size, topology and geometry, and simulation settings. please note, that the GPU package also has the option of mixed precision, where the most precision sensitive parts of the computation are done in double precision, which can achieve nearly single precision speed with a significant decrease of errors.

How do I know whether double or
single precision calculation should be used in one particular simulation?

this is difficult to tell in general. one thing that is easy to verify is, that using any variable cell calculation (via fix npt or fix press/berendsen) or any calculation where you need to compute the pressure for you analysis, is better done in double precision, since the stress tensor calculation is particularly sensitive to precision settings except if you run simulations at very high pressure.

Is that right to select the GPU model mainly based on the the rank of
double-precision FLOPS?

no. there is also the memory bandwidth (clock and bus width), the amount of RAM and the GPU generation, and how the GPU is connected to the CPU.

I found several articles said that GPUs for the consumer market like
GTX 2080 super have much higher price-to-performance ratio. Although the
price of Telsa series is several times expensive. but the increase for
the speed-up is much lower. Does such saying make sense for LAMMPS?

this primarily applies to calculations in single or mixed precision. most consumer GPUs are (deliberately) crippled in either hardware or driver support to “encourage” folks to buy the significantly more expensive telsa models (or the even more expensive quadro models). there is significant resentment in the community of GPU accelerated MD users (and developers of some GPU MD codes) building because of that, and some developers are actively advocating to stay away from nvidia hardware. technically, the GPU package is capable of supporting non-nvidia GPUs when compiled in OpenCL mode, but there have been recent reports of incompatibilities between the OpenCL implementation of some vendors and the code in LAMMPS. also, there is some uncertainty in how far the support for non-CUDA GPU acceleration in KOKKOS has progressed. neither OpenCL on non-nvidia or KOKKOS on non-CUDA GPUs is currently part of the LAMMPS testing.

Finally, it would be much appreciated if you could recommend a GPU model
with high price-to-performance ratio for us. We are going to purchase
machines with SuperMicro X11DAi-N mother board shipped.

sorry, there is no easy choice. i would recommend you either try to use somebody else’s machine for testing or try to get a machine on loan to do some tests for your specific problems. how well a specific GPU works for a particular application is very difficult to predict without knowing what the exact use case is and what settings are required. see my discussion above. another issue is, that there are often limitations of the mainboard, case and power supply that restrict what kind of GPU (and how many) are possible. another factor is the availability of technical skills. using GPUs well requires more knowledge in compiling and running applications and installing/maintaining tools and divers than when using CPU-only machines. with the availability of CPUs with increasing numbers of cores per socket, the price/performance gap between GPUs and CPUs has been narrowing, even for cases that are well suited for GPU acceleration.

Axel.

Andrew_Jewett · November 22, 2019, 12:45am

Hi,

Our group are going to purchase several machines. I have heard a lot
about GPU speed-up and I want to have a try with that.

I have read the doc page of GPU package and searched on the mail list,
but due to that I do not have sufficient knowledge on this field. I
still do not have a clue that how to choose the proper model for us
considering the price as well as the performance. I see several threads
of the related topics on mail list, but the time those mails posted is
several year ago.

My questions are:

1. When use GPUs for LAMMPS, it is recommended that to use
double-precision calculations. The results would be not very accurate if
single-precision is used. Is that right?

it depends. different parts of the computation are affected by this differently.
in MD there is a lot of error cancellation going on. it often depends on how much, whether you can get away with single precision or not, and that depends on the system size, topology and geometry, and simulation settings. please note, that the GPU package also has the option of mixed precision, where the most precision sensitive parts of the computation are done in double precision, which can achieve nearly single precision speed with a significant decrease of errors.

How do I know whether double or
single precision calculation should be used in one particular simulation?

this is difficult to tell in general. one thing that is easy to verify is, that using any variable cell calculation (via fix npt or fix press/berendsen) or any calculation where you need to compute the pressure for you analysis, is better done in double precision, since the stress tensor calculation is particularly sensitive to precision settings except if you run simulations at very high pressure.

2. Is that right to select the GPU model mainly based on the the rank of
double-precision FLOPS?

no. there is also the memory bandwidth (clock and bus width), the amount of RAM and the GPU generation, and how the GPU is connected to the CPU.

3. I found several articles said that GPUs for the consumer market like
GTX 2080 super have much higher price-to-performance ratio. Although the
price of Telsa series is several times expensive. but the increase for
the speed-up is much lower. Does such saying make sense for LAMMPS?

this primarily applies to calculations in single or mixed precision. most consumer GPUs are
(deliberately) crippled in either hardware or driver support to "encourage" folks to buy the
significantly more expensive telsa models (or the even more expensive quadro models).
there is significant resentment in the community of GPU accelerated MD users (and
developers of some GPU MD codes) building because of that, and some developers are actively advocating to stay away from nvidia hardware. technically, the GPU package is capable of supporting non-nvidia
GPUs when compiled in OpenCL mode, but there have been recent reports of
incompatibilities between the OpenCL implementation of some vendors and the code
in LAMMPS. also, there is some uncertainty in how far the support for non-CUDA
GPU acceleration in KOKKOS has progressed. neither OpenCL on non-nvidia or
KOKKOS on non-CUDA GPUs is currently part of the LAMMPS testing.

I feel unqualified to add my opinion to this discussion, but there are
several papers describing the difference between the expensive Quadro
cards and the cheaper gaming cards. These papers discuss the
difference in calculation errors (memory errors), as well as running
molecular dynamics in single and double precision. It's apparently
not unusual to run simulations in single-precision:

http://www.wmd-lab.org/papers/2014_03_ECC_AMBER_Paper_10.1002_cpe.3232.pdf
https://www.xsede.org/documents/384387/561669/2013_XSEDE_ECC.pdf
https://arxiv.org/pdf/1507.00898.pdf

It has been suggested that if you purchase a cheap gaming video card,
you can test it for errors before using it to calculate simulation
results. You can download software to check the memory of your video
card. Here are some links on that topic:

https://foldingathome.org/2009/04/28/nvidia-gpu-memory-checker/
https://www.google.com/search?q=gpu+memory+test&oq=gpu+memory+test

Incidentally, I've yet to hear anyone advocate purchasing an AMD/ATI
graphics card for running molecular dynamics. I think Nvidia cards
are still the most common and best supported.

-andrew

_Wenqiang_Liu · November 22, 2019, 8:58am

Dear Andrew,

Thanks very much for your reply! I am very glad that you can share your experience with me and give me your advice. I will check those links.

Best Regards,

Liu

Stan_Moore · November 26, 2019, 10:25pm

also, there is some uncertainty in how far the support for non-CUDA GPU acceleration in KOKKOS has progressed.

Kokkos package in LAMMPS will certainly support AMD GPUs in the future.