Dear Axel,
First of all, thank you for your comment. Having a better understanding of the nature of the ReaxFF potential is very helpful for my simulations.
I have decided to give a try and switch to a full reaxFF system by adding the parameters I found for Argon particles with the ReaxFF potential (that I found in a publication).
So I created this “dummy file” with the previous reaxFF I was using that was developped for conditions similar to my simulations, and carefully appended the Argon part. Since the Argon isn’t bonding with the other particles of the file, I could simply append the initial parameters without having to dig with the bond terms and so on and so forth (I checked in the original ReaxFF file I took the parameter from and it is the same: Ar doesn’t bond nor have off diagonal terms, angles or Torsion terms with the other particles). I am still trying to use KOKKOS to run this simulations, and I recompiled in debug mode with the latest LAMMPS patch (30Jun2020).
The input lines of my script have barely changed, I just uncommented the 2 potential lines with the ReaxFF stand-alone potential, and commented the hybrid part. I also restored the fix qeq/reax with all atoms, since the Ar is now included in the ReaxFF file, which should give good performances (for that part at least):
pair_style reax/c NULL
pair_coeff * * ffield.reax.SiOCH_Ar_test Si Ar X X X X X X
(line 95 of the script)
fix reaxc1 all qeq/reax 10 0.0 10.0 1.0e-6 reax/c
(line 153)
I do not have a SegFault anymore but I obtain these kind of errors:
what(): cudaDeviceSynchronize() error( cudaErrorIllegalAddress): an illegal memory access was encountered /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/lib/kokkos/core/src/Cuda/Kokkos_Cuda_Instance.cpp:143
Traceback functionality not available
[gpu004:400526] *** Process received signal ***
[gpu004:400526] Signal: Aborted (6)
[gpu004:400526] Signal code: (-6)
[gpu004:400526] [ 0] /usr/lib64/libpthread.so.0(+0xf5d0)[0x2aaab350b5d0]
[gpu004:400526] [ 1] /usr/lib64/libc.so.6(gsignal+0x37)[0x2aaab3fe8207]
[gpu004:400526] [ 2] /usr/lib64/libc.so.6(abort+0x148)[0x2aaab3fe98f8]
[gpu004:400526] [ 3] /trinity/shared/apps/tr17.10/x86_64/gcc-7.2.0/lib64/libstdc++.so.6(_ZN9__gnu_cxx27__verbose_terminate_handlerEv+0x125)[0x2aaab37a9225]
[gpu004:400526] [ 4] /trinity/shared/apps/tr17.10/x86_64/gcc-7.2.0/lib64/libstdc++.so.6(+0x8eff6)[0x2aaab37a6ff6]
[gpu004:400526] [ 5] /trinity/shared/apps/tr17.10/x86_64/gcc-7.2.0/lib64/libstdc++.so.6(+0x8f041)[0x2aaab37a7041]
[gpu004:400526] [ 6] /trinity/shared/apps/tr17.10/x86_64/gcc-7.2.0/lib64/libstdc++.so.6(+0x8f284)[0x2aaab37a7284]
[gpu004:400526] [ 7] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x29b4b46]
[gpu004:400526] [ 8] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x29bca31]
[gpu004:400526] [ 9] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x54f681]
[gpu004:400526] [10] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x29bc916]
[gpu004:400526] [11] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x29be6b3]
[gpu004:400526] [12] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x29b0b5f]
[gpu004:400526] [13] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x29b30ef]
[gpu004:400526] [14] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x751fef]
[gpu004:400526] [15] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x749702]
[gpu004:400526] [16] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x9a2049]
[gpu004:400526] [17] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x99d718]
[gpu004:400526] [18] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x8268cb]
[gpu004:400526] [19] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0xa04c17]
[gpu004:400526] [20] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x534dcc]
[gpu004:400526] [21] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x41c6db]
[gpu004:400526] [22] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x4105a6]
[gpu004:400526] [23] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x40cfa8]
[gpu004:400526] [24] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x40b50f]
[gpu004:400526] [25] /usr/lib64/libc.so.6(__libc_start_main+0xf5)[0x2aaab3fd43d5]
[gpu004:400526] [26] /home/int/sam/defoort/lammps/lammps-patch_30Jun2020/build_07072020/lmp_mpi[0x40b3d9]
I also wanted to comment on the fact that the script I previously sent can run fine on CPU (with for instance the icc_openmpi compiled version) even with the faulty fix qeq/reax command. It is solely when I try to switch to GPU that the errors appears. I (now) understand way more why the script wasn’t a good model but why can it work with CPU and not GPU ? Is it because the GPU compilation are more sensitive to bad parameters / have a different way of setting the neighbor lists ?
To answer your question for chemical interactions, we plan to observe oxidations / water - Silicon interactions in the future, and we tought that would be better described with ReaxFF than Tersoff / Stillinger-Weber / ZBL. That’s why I am trying to use this potential. But the tight-binding approach is a clever suggestion.
Thank you again for the comments and all the help,
Best regards,
Grégoire