Intel® oneAPI DPC++/C++ Compiler
Talk to fellow users of Intel® oneAPI DPC++/C++ Compiler and companion tools like Intel® oneAPI DPC++ Library, Intel® DPC++ Compatibility Tool, and Intel® Distribution for GDB*
684 Discussions

A slower performance when using multi-devices

zchmacchiato
Beginner
1,271 Views

Hi,

 

I try to develop a code based on multi-devices (DPC++ & MPI), I use USM and shared memory. When I do the scaling work, I find that the multi-device performance is worse than the single device performance. I think the problem scale is large enough, so multi-device should work better. Does anyone have any advice for that?

 

By the way, is there a way to make sure that I am using 16 GPUs when I run the problem using "mpirun -np 16 ./main"? I output the name of the devices, but they have the same name which is the same brand I think.

 

Thanks,

Chunheng.

0 Kudos
5 Replies
SantoshY_Intel
Moderator
1,252 Views

Hi,

 

Thank you for posting in Intel Communities.

 

Could you please provide us with the following details?

  1. The operating system you are using.
  2. Intel MPI Library & DPC++ versions you are using.
  3. A sample reproducer code and steps to reproduce your issue from our end. (commands to compile & run the code on multi-devices)
  4. Name of GPU you are using & Environment details of your cluster.

 

>>"I find that the multi-device performance is worse than the single device performance."

Could you please let us know how you are measuring the performance?

 

Thanks & Regards,

Santosh

 

0 Kudos
zchmacchiato
Beginner
1,220 Views

I also attach my makefile here,

 

Chunheng.

0 Kudos
zchmacchiato
Beginner
1,238 Views

I attach my code below.

I run my code on ThetaGPU.

The system information is: #101-Ubuntu SMP Fri Oct 15 20:00:55 UTC 2021

The GPU I use is: Selected device: NVIDIA A100-SXM4-40GB

I am not quite sure about the DPC++ or OneAPI version, but it is for Ubuntu 18.04.

 

I measure the performance by mega lattice updates per second, run the solver by 100 times and get the average time.

 

Chunheng.

0 Kudos
JyotsnaK_Intel
Moderator
1,147 Views

Hi Chunheng,

Thank you for your inquiry. We offer support for hardware platforms that the Intel® oneAPI product supports. These platforms include those that are part of the Intel® Core™ processor family or higher, the Intel® Xeon® processor family, the Intel® Xeon® Scalable processor family, and others which can be found here – Intel® oneAPI Base Toolkit System Requirements, Intel® oneAPI HPC Toolkit System Requirements, Intel® oneAPI IoT Toolkit System Requirements

If you wish to use oneAPI on hardware that is not listed at one of the sites above, we encourage you to visit and contribute to the open oneAPI specification - https://www.oneapi.io/spec/

Best regards,

Jyotsna


0 Kudos
SantoshY_Intel
Moderator
1,093 Views

Hi,


We are closing this issue. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.


Thanks,

Santosh


0 Kudos
Reply