Yun_L_
Beginner

Intel MPI code fails with more than 1 node

I have a simple MPI program built with the Intel compiler and Intel MPI from Intel compiler studio 2016 and 2017. The jobs fail with the following debug errors. On 2 or more nodes the code runs extremely slowly and gets stuck for about 30 seconds at one stage; on a single node it runs without any problems. The same code compiled with gcc and Open MPI runs without any problems on any number of nodes.

Do you know what might be the problem? Thanks.

TimP
Black Belt

This question might get more expert attention on the companion cluster/HPC forum: https://software.intel.com/en-us/forums/intel-clusters-and-hpc-technology

jimdempseyatthecove
Black Belt

Your error.log appears to be the output of a successful "Hello World" test program.

>>The code will run extremely slowly, and get stuck for about 30 seconds at one stage if run on 2 or more nodes

That sounds like a coding error on your part: a handshake issue resulting in deadlock.

Jim Dempsey
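
For readers wondering what a handshake deadlock can look like: the poster's code is not shown in this thread, so the following is only a toy illustration of one common pattern, not a diagnosis. It is plain Python with threads, not MPI; `RendezvousChannel` and `exchange` are hypothetical names made up for this sketch. It models a blocking send that completes only when the other side receives. In real MPI a blocking send is allowed to return early when the library buffers the message, so code that relies on this can appear to work in one setup and stall in another.

```python
import queue
import threading


class RendezvousChannel:
    """Toy one-way channel: send() completes only once recv() has taken the message."""

    def __init__(self):
        self._slot = queue.Queue(maxsize=1)  # holds the message in flight
        self._ack = queue.Queue(maxsize=1)   # receiver's handshake reply

    def send(self, msg, timeout):
        self._slot.put(msg)
        try:
            self._ack.get(timeout=timeout)   # block until the receiver acknowledges
            return True
        except queue.Empty:
            return False                     # nobody received in time: stalled

    def recv(self, timeout):
        try:
            msg = self._slot.get(timeout=timeout)
        except queue.Empty:
            return None                      # nothing arrived in time
        self._ack.put(True)                  # complete the handshake
        return msg


def exchange(send_first, timeout=0.5):
    """Two ranks swap one message each.

    send_first[r] says whether rank r sends before it receives.
    Returns {rank: True if that rank finished both steps}.
    """
    chan = {0: RendezvousChannel(), 1: RendezvousChannel()}  # chan[r] carries r's outgoing message
    done = {0: False, 1: False}

    def rank(r):
        other = 1 - r
        text = f"hello from {r}"
        if send_first[r]:
            ok = chan[r].send(text, timeout) and chan[other].recv(timeout) is not None
        else:
            ok = chan[other].recv(timeout) is not None and chan[r].send(text, timeout)
        done[r] = ok

    threads = [threading.Thread(target=rank, args=(r,)) for r in (0, 1)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return done
```

Calling `exchange({0: True, 1: True})` reports both ranks as stalled, because each waits for a receive the other never reaches. Calling `exchange({0: True, 1: False})` lets both finish, because one rank receives first. In this toy the stall ends at the timeout; a real blocking call would simply hang.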
