Intel® MPI Library
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.
2190 Discussions

MPI process interactive start-up and fault tolerance

Jimmy821
Beginner
525 Views

I am relatively new to MPI programming.

I am wondering how I can start up each process manually within the same MPI communicator world space.

In addition, when only a single process fails, how can this be detected and relaunched automatically, without crashing the other processors and the host?

Hope somebody can advise on this two issues. Thanks

0 Kudos
1 Reply
Andrey_D_Intel
Employee
525 Views
Hi,

If you want to start MPI process from your program use the MPI_Comm_spawn() call. Refer to the MPI standard for more details.

The fault tolerance support is not required by current standard. So, you should learn the particular MPI implementation which you use if it provide any such functionality.

Best regards,
Andrey



0 Kudos
Reply