Failed: Connection reset by peer (104)

I recently ran a simulation using LAMMPS. The script works when the number of run steps is less than about 9e8. With longer runs, however, the simulation usually stops without any LAMMPS error and shows only “failed: Connection reset by peer (104)” (see the figure).

How should I fix it? If you need more information, please let me know. Thanks.

Hello,
This is not a LAMMPS error but a network issue. I suppose you are running LAMMPS on a server?
Simon

I agree that this is not a LAMMPS issue, and I am unsure whether offering off-topic advice is appropriate here. That said, I suppose the preferred solution is to use a job scheduling system (such as SLURM) on the remote server. If that is not applicable, I would recommend reading about the screen command as a potential solution.

-Brad

You are using TCP/IP communication for a parallel LAMMPS run. That is likely going to be very inefficient. The error indicates that you may be overloading the network.

It is unlikely to be fixed from the LAMMPS side.

Whenever you report an issue, you should include: the exact LAMMPS version you are using, how it was compiled or installed, what platform you are running on, what your command line was, and what your input is.
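
As a rough illustration of the checklist above, the details could be collected into a short text block before posting. This is only a sketch: the function name build_report and all placeholder values are hypothetical, and the LAMMPS version still has to be copied by hand (for example from the first line of the log file).

```python
import platform
import shlex


def build_report(lammps_version, install_method, command_line, input_file):
    """Return a plain-text summary of the details worth including in a report.

    All arguments are supplied by the user; only the platform string is
    detected automatically, via the standard-library platform module.
    """
    lines = [
        f"LAMMPS version: {lammps_version}",
        f"Compiled/installed: {install_method}",
        f"Platform: {platform.platform()}",
        # shlex.join quotes arguments so the line can be pasted into a shell.
        f"Command line: {shlex.join(command_line)}",
        f"Input file: {input_file}",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical placeholder values; replace them with the real ones.
    print(build_report(
        lammps_version="<copy from the top of the log file>",
        install_method="<e.g. built from source, or installed as a package>",
        command_line=["mpirun", "-np", "4", "lmp", "-in", "in.script"],
        input_file="in.script",
    ))
```

The printed block can be pasted into the post together with the input script itself.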

Yes, I run my script on my lab's server.

Thanks for the advice. I will try to find someone to set up a job scheduling system.

I would be massively surprised if using a job scheduler could fix a networking issue.
