I am running calculations with VASP. The jobs ran fine for about one day; after that, some jobs started failing with the following error:
[firstname.lastname@example.org] HYD_pmcd_pmiserv_send_signal (./pm/pmiserv/pmiserv_cb.c:221): assert (!closed) failed
[email@example.com] ui_cmd_cb (./pm/pmiserv/pmiserv_pmci.c:128): unable to send SIGUSR1 downstream
[firstname.lastname@example.org] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
[email@example.com] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:388): error waiting for event
[firstname.lastname@example.org] main (./ui/mpich/mpiexec.c:745): process manager error waiting for completion
IMPORTANT: The failure is intermittent. Some jobs finish without error, while others abort with the error above. What should I do?
Could you provide the output of
env | grep I_MPI
Then please compile the test program $I_MPI_ROOT/test/test.c with
mpiicc -o test.x test.c
Then set I_MPI_DEBUG=5 and run the executable, e.g.
mpirun -verbose -np 10 ./test.x
and please post the output here.
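
If it is easier, the steps above could be collected into one small script so that everything ends up in a single file you can attach. This is only a sketch: the log file name impi_diag.log is a placeholder I made up, the script compiles test.c directly from $I_MPI_ROOT/test instead of from the current directory, and it assumes the Intel MPI environment is already loaded in your shell or job script. If that environment is missing, it skips the compile and run steps and says so in the log.

```shell
#!/bin/sh
# Sketch: gather the requested Intel MPI diagnostics into one log file.
# Assumption: the Intel MPI environment is already loaded, so I_MPI_ROOT,
# mpiicc and mpirun are available. The log file name is a placeholder.
LOG="${LOG:-impi_diag.log}"

{
  echo "== env | grep I_MPI =="
  env | grep I_MPI || echo "(no I_MPI variables are set)"

  if [ -n "$I_MPI_ROOT" ] && command -v mpiicc >/dev/null 2>&1; then
    echo "== mpiicc -o test.x test.c =="
    mpiicc -o test.x "$I_MPI_ROOT/test/test.c" || echo "(compilation failed)"
  else
    echo "(Intel MPI environment not found; skipping compile and run)"
  fi

  if [ -x ./test.x ]; then
    echo "== mpirun -verbose -np 10 ./test.x with I_MPI_DEBUG=5 =="
    # Set the debug level only for this one command.
    I_MPI_DEBUG=5 mpirun -verbose -np 10 ./test.x
  fi
} > "$LOG" 2>&1

echo "Diagnostics written to $LOG"
```

Run it on the same nodes, and through the same scheduler, where the VASP jobs fail, since the failure is intermittent and may depend on where the job lands.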