KaYi_T_
Beginner

MPI_Parallel

Hello,

I am running VASP calculations. The jobs ran without problems for about a day, but since then some jobs fail with the following error:

[mpiexec@node46.chess.com] HYD_pmcd_pmiserv_send_signal (./pm/pmiserv/pmiserv_cb.c:221): assert (!closed) failed

[mpiexec@node46.chess.com] ui_cmd_cb (./pm/pmiserv/pmiserv_pmci.c:128): unable to send SIGUSR1 downstream
[mpiexec@node46.chess.com] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
[mpiexec@node46.chess.com] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:388): error waiting for event
[mpiexec@node46.chess.com] main (./ui/mpich/mpiexec.c:745): process manager error waiting for completion

 

IMPORTANT: Some jobs still finish without error, while others fail with the error above. What should I do?

1 Reply
Mark_L_Intel
Employee

 

Hi,

Could you provide the output of the following two commands:

which mpirun

env | grep I_MPI

Then please compile the test code $I_MPI_ROOT/test/test.c as

mpiicc -o test.x test.c

After setting I_MPI_DEBUG=5, run the executable as, e.g.,

mpirun -verbose -np 10 ./test.x

and please post the output.
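
For convenience, the steps above could be gathered into one small script. This is only a minimal sketch, assuming the Intel MPI environment is already loaded in the shell (so that I_MPI_ROOT is set). The function names show_impi_env and collect_mpi_diag are hypothetical helpers made up for this illustration, and the process count of 10 is just the example value used above.

```shell
#!/bin/sh
# Sketch only: bundles the diagnostics requested above.
# show_impi_env and collect_mpi_diag are hypothetical helper names,
# not part of Intel MPI.

# Print the I_MPI variables currently set, or a note if there are none.
show_impi_env() {
    env | grep I_MPI || echo "no I_MPI variables set"
}

# Run every requested step in order, skipping steps whose tools are missing.
collect_mpi_diag() {
    echo "== which mpirun =="
    command -v mpirun || echo "mpirun not found in PATH"

    echo "== env | grep I_MPI =="
    show_impi_env

    echo "== compile and run test.c =="
    if command -v mpiicc >/dev/null 2>&1 && [ -f "${I_MPI_ROOT:-}/test/test.c" ]; then
        # Build the test program shipped with Intel MPI.
        mpiicc -o test.x "$I_MPI_ROOT/test/test.c" || return 1
        # I_MPI_DEBUG=5 is set only for this one mpirun invocation.
        I_MPI_DEBUG=5 mpirun -verbose -np 10 ./test.x
    else
        echo "mpiicc or \$I_MPI_ROOT/test/test.c not found, skipping"
    fi
}
```

After sourcing the script, calling `collect_mpi_diag > mpi_diag.log 2>&1` would write everything to a single file (the name mpi_diag.log is arbitrary) that can be attached to a reply.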

Mark
