ArthurRatz
Black Belt
101 Views

What could cause an MPI program to run slower on a multinode cluster?

Hello everyone,

I have a question about MPI program performance. I've developed an MPI program that processes a large amount of data (about 10^9 elements), and I've noticed that the more processes I create with the mpiexec utility, the longer the program takes to run. What could be causing this? When I run the program on a single compute node, it runs faster than when I run it on two compute nodes. Please help.

Regards, Arthur.

5 Replies
ArthurRatz
Black Belt

I normally use the following computational platform: 2 x Intel Core i7 - 4970 4.00 GHz, 32 GB RAM, network: 1 Gbps.
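
As a rough back-of-the-envelope check on whether the 1 Gbps network alone could account for the slowdown, the transfer time can be estimated as below. This is only a sketch: the element size is not stated in this thread (4 bytes is assumed here), the helper name `transfer_seconds` is hypothetical, and latency and protocol overhead are ignored, so real transfers would take longer.

```cpp
#include <cstdint>

// Estimates how long it takes to move `elements` values of
// `bytes_per_element` bytes each over a link of `link_gbps` gigabits/second.
// Assumption: the link is fully available and there is no latency or
// protocol overhead, so this is a lower bound, not a measurement.
double transfer_seconds(std::uint64_t elements,
                        std::uint64_t bytes_per_element,
                        double link_gbps) {
    const double bits = static_cast<double>(elements) *
                        static_cast<double>(bytes_per_element) * 8.0;
    return bits / (link_gbps * 1e9);
}
```

Under these assumptions, sending half of a 10^9-element array to the second node, `transfer_seconds(500000000, 4, 1.0)`, comes to about 16 seconds one way, and the same again to send the sorted half back. If the sort itself takes a comparable time or less, the network transfer could outweigh any gain from the extra node.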

ArthurRatz
Black Belt

My MPI program sorts a huge array of 10^9 elements by splitting the array into chunks, each of which is sorted by one of the processes created by the mpiexec utility. The actual sorting is performed with the tbb::parallel_sort routine, which is part of Threading Building Blocks (TBB).
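
The chunk-and-sort scheme described above can be sketched in a single process as follows. This is not the actual program: `std::sort` stands in for tbb::parallel_sort, a plain loop stands in for the MPI processes, the function name `chunked_sort` is hypothetical, and the final merge step is an assumption, since the post does not say how the sorted chunks are combined.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Sorts `data` by splitting it into `num_chunks` contiguous chunks, sorting
// each chunk independently, then merging the sorted chunks one at a time.
void chunked_sort(std::vector<int>& data, std::size_t num_chunks) {
    if (num_chunks == 0 || data.empty()) return;
    const std::size_t n = data.size();
    const std::size_t chunk = (n + num_chunks - 1) / num_chunks;  // ceiling division
    const auto first = data.begin();
    const auto at = [&](std::size_t i) {
        return first + static_cast<std::ptrdiff_t>(i);
    };

    // Step 1: sort each chunk on its own (one chunk per process in an MPI run).
    for (std::size_t begin = 0; begin < n; begin += chunk) {
        const std::size_t end = std::min(begin + chunk, n);
        std::sort(at(begin), at(end));
    }

    // Step 2 (assumed): merge each sorted chunk into the sorted prefix.
    for (std::size_t mid = chunk; mid < n; mid += chunk) {
        const std::size_t end = std::min(mid + chunk, n);
        std::inplace_merge(first, at(mid), at(end));
    }
}
```

In a multinode run, step 1 would require distributing the chunks over the network and step 2 would require collecting them again on one process, so both the data movement and the sequential merge are candidates worth timing separately, for example with MPI_Wtime.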

Barry_T_Intel
Employee

You should probably ask that question in the HPC forum: https://software.intel.com/en-us/forums/intel-clusters-and-hpc-technology . They deal with MPI issues.

I'd start with the Intel MPI Library Troubleshooting Guide: https://software.intel.com/en-us/forums/intel-clusters-and-hpc-technology/topic/563559 .

There's also a TBB forum: https://software.intel.com/en-us/forums/intel-threading-building-blocks

ArthurRatz
Black Belt

If anyone who is going to answer my question needs an executable to test on their side, I'm ready to provide one.
