Intel® oneAPI HPC Toolkit
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.
Announcements
This community is designed for sharing of public information. Please do not share Intel or third-party confidential information here.
1910 Discussions

MPI process interactive start-up and fault tolerance

Jimmy821
Beginner
134 Views

I am relatively new to MPI programming.

I am wondering how I can start up each process manually within the same MPI communicator world space.

In addition, when only a single process fails, how can this be detected and relaunched automatically, without crashing the other processors and the host?

Hope somebody can advise on this two issues. Thanks

0 Kudos
1 Reply
Andrey_D_Intel
Employee
134 Views
Hi,

If you want to start MPI process from your program use the MPI_Comm_spawn() call. Refer to the MPI standard for more details.

The fault tolerance support is not required by current standard. So, you should learn the particular MPI implementation which you use if it provide any such functionality.

Best regards,
Andrey



Reply