Intel® MPI Library
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.

MPI_Parallel

KaYi_T_
Beginner
509 Views

Hello,

   When I run the VASP calculation software, the calculation job was "okay" during one day. After that, some jobs show the following error:

[mpiexec@node46.chess.com] HYD_pmcd_pmiserv_send_signal (./pm/pmiserv/pmiserv_cb.c:221): assert (!closed) failed

[mpiexec@node46.chess.com] ui_cmd_cb (./pm/pmiserv/pmiserv_pmci.c:128): unable to send SIGUSR1 downstream
[mpiexec@node46.chess.com] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
[mpiexec@node46.chess.com] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:388): error waiting for event
[mpiexec@node46.chess.com] main (./ui/mpich/mpiexec.c:745): process manager error waiting for completion

 

     IMPORTANT: sometimes some of jobs are finished without error. some of them cannot be completed with above error. What should I do?

0 Kudos
1 Reply
Mark_L_Intel
Moderator
509 Views

 

Hi,

Could you provide outputs of

which mpirun

env | grep I_MPI

then please compile this code $I_MPI_ROOT/test/test.c as

mpiicc -o test.x test.c

And then after setting I_MPI_DEBUG=5, you can run executable as, e.g. 

mpirun -verbose -np 10 ./test.x

and then please provide output

Mark

0 Kudos
Reply