This is a bad property to describe the speed of the calculation, since it depends on the simulation settings like the choice of timestep. A better property is the atom-step/s property also output by LAMMPS. This one only depends on the choice of model, so it much more comparable. There is a discussion of how to compare and document performance in the LAMMPS manual at: 9.2. Measuring performance — LAMMPS documentation
Before figuring out the optimal performance for your specific system, you should use one of the input decks in the bench folder of the LAMMPS distribution (make sure you read the README file) and provide some performance information for that. You also need to provide the usual info that should always be reported like:
- exact LAMMPS version
- compilation settings
- compiler and toolkit versions
- host operating system
- full command line
For example, it is not even clear from your message whether you are using the GPU or the KOKKOS package and whether you run with single, double, or mixed precision settings (which has a significant impact on performance since your GPU has - like all consumer GPUs from Nvidia - a deliberately crippled floating-point unit with very restricted double precision performance.