Hi,
Is there support for MPI::ERRORS_THROW_EXCEPTIONS?
I notice that no exception is caught when there is network loss.
Thanks.
5 Replies
Hi Jimmy,
Please take a look at the example: here
If you do everything correctly but still cannot catch an exception, that probably means the MPI function doesn't return an error code.
Regards!
Dmitry
Hi,
Could you please clarify which MPI implementation we are talking about? In the Intel MPI Library, MPI::ERRORS_THROW_EXCEPTIONS is supported according to the MPI standard specification.
Best regards,
Andrey
I am using Intel MPI 4.0. I am running 3 instances of my application on the same computer. To test the exception handling, I forcefully terminate one instance of the application.
However, it appears that the catch blocks of the 2 other instances are not triggered. I use standard MPI functions such as MPI_TEST, MPI_BCAST, MPI_IRECV, MPI_SEND, MPI_PEEK.
Can I additionally ask how to use the I_MPI_TCP_NETMASK flag in a configuration file? I could not get it to work in any way.
Thanks!
Just a quick update... I realised that I am able to catch an exception caused by a software error, for example a mismatch in data size.
However, my objective is to catch errors caused by a network disconnection or by other nodes hanging abruptly. From what I have read, this is not possible, because mpiexec does not trap these errors and instead terminates all running processes.
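To illustrate the difference, here is a minimal sketch in plain C++ with no MPI dependency. Every name in it (`fake_recv`, `checked_recv`, `CommException`, the error codes) is a hypothetical stand-in and not part of Intel MPI or the MPI standard. It only models the behaviour described in this thread: a catch block can run only when the underlying call actually returns an error code, as it does for a size mismatch. A call that never returns, or a process that is terminated from outside, never reaches the `throw`.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical error codes; real MPI error classes are defined by the library.
enum ErrorCode { kSuccess = 0, kErrTruncate = 1 };

// Hypothetical exception type carrying the error code of the failed call.
class CommException : public std::runtime_error {
public:
    explicit CommException(int code)
        : std::runtime_error("communication error " + std::to_string(code)),
          code_(code) {}
    int code() const { return code_; }
private:
    int code_;
};

// Simulated receive: reports an error code when the incoming message
// is larger than the buffer (a software error such as a size mismatch).
int fake_recv(int buffer_size, int message_size) {
    return message_size > buffer_size ? kErrTruncate : kSuccess;
}

// Wrapper that turns a non-success return code into an exception.
// Without a returned error code there is nothing to throw.
void checked_recv(int buffer_size, int message_size) {
    const int rc = fake_recv(buffer_size, message_size);
    if (rc != kSuccess) {
        throw CommException(rc);
    }
}

// Returns true when the catch block ran, false when the call succeeded.
bool recv_triggers_catch(int buffer_size, int message_size) {
    try {
        checked_recv(buffer_size, message_size);
    } catch (const CommException&) {
        return true;
    }
    return false;
}
```

Calling `recv_triggers_catch(4, 8)` goes through the catch block because the simulated call reports an error, while `recv_triggers_catch(8, 4)` does not.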
Hi Jimmy,
Please read clause 5 of the Reference Manual, which covers fault tolerance; this may be your case (or perhaps you are talking about checkpoints).
Mpiexec does not catch errors! Mpiexec aborts an application if one of the processes has been aborted because of an error.
Regards!
Dmitry