Intel® DevCloud
Help for those needing help starting or connecting to the Intel® DevCloud
Announcements
The Intel sign-in experience is changing in February to support enhanced security controls. If you sign in, click here for more information.
1218 Discussions

dual_gpu systems only have 1 gpu

Robert_C_Intel
Employee
465 Views

s011-n001 and s011-n002 are listed as dual_gpu, but they have only 1 gpu. For example:

 

uxxxxx@s011-n001:~$ sycl-ls
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
uxxxxx@s011-n001:~$
0 Kudos
1 Solution
JaideepK_Intel
Moderator
420 Views

Hi,

 

We are in the process of relabeling all GPU systems correctly. 

 

Thanks,

Jaideep 

 

View solution in original post

5 Replies
Robert_C_Intel
Employee
463 Views

s011-n004 has 2 gpu's:

 

uxxxxx@s011-n004:~$ sycl-ls
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[opencl:1] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[level_zero:1] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
uxxxxx@s011-n004:~$
Robert_C_Intel
Employee
450 Views

Now s011-n004 also only has 1 GPU.

All of these machines have an error at login:

/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device

 

s012-n002 should have 4 GPU's, but only reports 1. I get 3 errors at login:

########################################################################
# Date: Sun 13 Feb 2022 12:21:12 PM PST
# Job ID: 1850235.v-qsvr-1.aidevcloud
# User: uxxxxx
# Resources: neednodes=1:quad_gpu:ppn=2,nodes=1:quad_gpu:ppn=2,walltime=06:00:00
########################################################################

/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device

 

Maybe a reboot will fix them.

 

 

 

JaideepK_Intel
Moderator
433 Views

Hi,


Thank you for posting in Intel Communities.

We are able to reproduce your issue from our end. We are working on this internally and we will get back with an update.


Thanks,

Jaideep


JaideepK_Intel
Moderator
421 Views

Hi,

 

We are in the process of relabeling all GPU systems correctly. 

 

Thanks,

Jaideep 

 

JaideepK_Intel
Moderator
398 Views

Hi,


Glad to know that your issue is resolved. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.


Thanks,

Jaideep


Reply