Intel® oneAPI HPC Toolkit
Get help with building, analyzing, optimizing, and scaling high-performance computing (HPC) applications.
2058 Discussions

performance of Iallreduce on xeon phi

Zhoulong_J_Intel
Employee
263 Views

Hi, 

We are trying to use non blocking api(Iallreduce) on computation intensive program, we tried on two nodes(xeon phi) and find two nodes are not balance with intel trace analyzer tool, it said that one node spent more time on Iallreduce(sum?), We want to know whether we can create a thread and let the iallreduce/sum do in one specific core and let it parallel with user code(openmp)? or is there api or config in intel mpi can do this job? thanks

 

0 Kudos
2 Replies
James_S
Employee
263 Views

Hi Zhoulong, for hybird MPI/OpenMP programming, could you please refer Beginning Hybrid MPI/OpenMP Development and Running an MPI/OpenMP* Program. For process pining, please refer Environment Variables for Process Pinning and Interoperability with OpenMP API. Thank you.

0 Kudos
Zhoulong_J_Intel
Employee
263 Views

 

Thanks

 

Si, Zhuowei (Intel) wrote:

Hi Zhoulong, for hybird MPI/OpenMP programming, could you please refer Beginning Hybrid MPI/OpenMP Development and Running an MPI/OpenMP* Program. For process pining, please refer Environment Variables for Process Pinning and Interoperability with OpenMP API. Thank you.

 

0 Kudos
Reply