is it because of fftw? (i am suspecting it because it worked well for lj)
impossible to say with this. the error indicates that something that
is linked into your executable was compiled with optimization for a
different CPU than you are running on.
without trapping the failing instruction in a core dump or a debugger
and getting a stack trace, it is impossible to say for certain which
part of the executable is causing this.
you should review your compilation flagss and ask your local HPC
staff, that are operating that cray machine, for assistance.
also, if you suspect the FFT library, then it would have no relation
at all to the GPU package. the GPU package doesn't use FFTs at all.
indeed, there is no indication in what you quote, that any GPU code is
causing issues here.