Figura__Ed
Beginner
337 Views

Disconnecting VPN when running MPMD execution crashes

I've read the thread: https://community.intel.com/t5/Intel-oneAPI-HPC-Toolkit/Disconnect-VPN-causes-parallel-computing-wit...

My problem is slightly different: I have a product that runs in MPMD mode, e.g. mpiexec -localonly -n 1 <main-prog>.exe : -n <X> <worker-prog>.exe

If the VPN is disconnected while this is running, the application terminates ("connection to proxy 0 failed", etc.).

I can reproduce using the following: mpiexec -localonly -n 4 IMB-MPI1.exe : -n 2 IMB-MPI1.exe

We are currently running Intel MPI 5.0 (I know, very out of date, but there was a historical reason to do with dynamic spawning, which we no longer use) and will be looking to upgrade. I have installed 2019U7 to try it, but the issue still exists. For our users on laptops, the VPN disconnection problem is almost a 'killer issue'. Any advice is greatly appreciated.

8 Replies
PrasanthD_intel
Moderator
313 Views

Hi Ed,


As noted in the thread you mentioned, the VPN disconnect issue has been resolved in the latest version.

Since you get the error when using argument sets, we tried that as well, but did not see any error after disconnecting the VPN.

After upgrading to MPI 2019u7, did you complete the prerequisite steps described in the Getting Started guide (https://software.intel.com/content/www/us/en/develop/documentation/get-started-with-mpi-for-windows/...)?

If so, could you please provide the command line so we can debug from our side?

Also, if you are using dynamic processes, upgrade to the newer version, 2019u8, in which PMI2 spawn support was added (https://software.intel.com/content/www/us/en/develop/documentation/mpi-developer-reference-linux/top...).


Regards

Prasanth 


Figura__Ed
Beginner
302 Views

Thx for getting back. We're in the middle of a release so I'm a little limited in time at the moment.

Like I said, when I executed the shipped exe as follows:

    mpiexec -localonly -n 4 IMB-MPI1.exe : -n 2 IMB-MPI1.exe

and disconnected the VPN, then I got a crash.

However, executing the following works fine:

    mpiexec -localonly -n 4 IMB-MPI1.exe

 

Figura__Ed
Beginner
278 Views

Note: I am running on Windows 10 (not Linux).

PrasanthD_intel
Moderator
257 Views

Hi Ed,


We have tried to reproduce the error on Windows 10.

Could you please provide the debug info by setting I_MPI_DEBUG=5?

set I_MPI_DEBUG=5
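
For anyone who wants to capture that debug output in a file while running the reproducer from this thread, here is a minimal Python sketch. It only assumes that `mpiexec` is on the PATH; the helper names (`build_mpmd_command`, `debug_env`, `run_with_debug`) and the log file name are made up for illustration and are not part of Intel MPI.

```python
import os
import subprocess


def build_mpmd_command(sections, launcher="mpiexec", global_opts=("-localonly",)):
    """Build an MPMD command line from (nprocs, executable) pairs.

    Argument sets are separated by ':' as in the commands quoted in this thread.
    """
    cmd = [launcher, *global_opts]
    for index, (nprocs, exe) in enumerate(sections):
        if index > 0:
            cmd.append(":")  # separator between argument sets
        cmd += ["-n", str(nprocs), exe]
    return cmd


def debug_env(level=5, base=None):
    """Return a copy of the environment with I_MPI_DEBUG set to the given level."""
    env = dict(os.environ if base is None else base)
    env["I_MPI_DEBUG"] = str(level)
    return env


def run_with_debug(sections, log_path="mpi_debug.log"):
    """Run the MPMD job with I_MPI_DEBUG=5 and write stdout and stderr to log_path.

    The log file name is a hypothetical choice; returns the launcher's exit code.
    """
    cmd = build_mpmd_command(sections)
    with open(log_path, "w") as log:
        proc = subprocess.run(cmd, env=debug_env(), stdout=log, stderr=subprocess.STDOUT)
    return proc.returncode
```

Calling `run_with_debug([(4, "IMB-MPI1.exe"), (2, "IMB-MPI1.exe")])` would launch the same 4 + 2 rank reproducer; disconnect the VPN while it runs and attach the resulting log to the thread.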

Also, if possible, please update to the latest version, 2019u8, and check.



Thanks

Prasanth


Figura__Ed
Beginner
249 Views

Hi. I can confirm that with 2019 Update 8, I no longer get the crash with the example I posted. I will test this with our software once I get a chance and post any follow ups.

PrasanthD_intel
Moderator
236 Views

Hi Ed,

Thanks for the confirmation. It should also work with the 2019u7 version.

In any case, if your initial query has been answered, please confirm so we can close the case.


Regards

Prasanth


Figura__Ed
Beginner
229 Views

Sure, close the case. If I have any further issues once I upgrade & integrate with our own code then I'll raise a different query. Thanks for your help.

PrasanthD_intel
Moderator
212 Views

Hi Ed,


Glad your problem is resolved.

If you require additional assistance from Intel, please start a new thread. Any further interaction in this thread will be considered community only.


Regards

Prasanth

