Intel® oneAPI HPC Toolkit
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.

PMPI_waitany fata; error

wangz6
Beginner
278 Views

Hello team, I am using the intel/mpi/64/2019/5.075 mpi lib in my school cluster; this cluster is Cray cs 400 and using craype-network-infiniband for the communication; I currently received the following errors:

  " Fatal error in PMPI_Waitall: Other MPI error, error stack:

   MPIDI_OFI_handle_cq_error(991)"

 

This error seems randomly happened during the run time; 

Thank you for helping me to trouble shoot.

 

BEST

Labels (1)
0 Kudos
3 Replies
HemanthCH_Intel
Moderator
250 Views

Hi,


Thanks for reaching out to us.


Could you please provide a sample reproducer code and the commands you used for reproducing the issue?

Also, please let us know how many nodes you are using for launching the MPI job?


Thanks & Regards,

Hemanth.


HemanthCH_Intel
Moderator
209 Views

Hi,

 

We have not heard back from you. Could you please provide the above requested details?

 

Thanks & Regards,

Hemanth.

 

HemanthCH_Intel
Moderator
182 Views

Hi,


We have not heard back from you. This thread will no longer be monitored by Intel. If you need further assistance, please post a new question.


Thanks & Regards,

Hemanth.


Reply