Intel® DevCloud
Help for those needing help starting or connecting to the Intel® DevCloud
1586 Discussions

dual_gpu systems only have 1 gpu

Robert_C_Intel
Employee
907 Views

s011-n001 and s011-n002 are listed as dual_gpu, but they have only 1 gpu. For example:

 

uxxxxx@s011-n001:~$ sycl-ls
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
uxxxxx@s011-n001:~$
0 Kudos
1 Solution
JaideepK_Intel
Moderator
862 Views

Hi,

 

We are in the process of relabeling all GPU systems correctly. 

 

Thanks,

Jaideep 

 

View solution in original post

0 Kudos
5 Replies
Robert_C_Intel
Employee
905 Views

s011-n004 has 2 gpu's:

 

uxxxxx@s011-n004:~$ sycl-ls
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[opencl:1] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[level_zero:1] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
uxxxxx@s011-n004:~$
0 Kudos
Robert_C_Intel
Employee
892 Views

Now s011-n004 also only has 1 GPU.

All of these machines have an error at login:

/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device

 

s012-n002 should have 4 GPU's, but only reports 1. I get 3 errors at login:

########################################################################
# Date: Sun 13 Feb 2022 12:21:12 PM PST
# Job ID: 1850235.v-qsvr-1.aidevcloud
# User: uxxxxx
# Resources: neednodes=1:quad_gpu:ppn=2,nodes=1:quad_gpu:ppn=2,walltime=06:00:00
########################################################################

/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device

 

Maybe a reboot will fix them.

 

 

 

0 Kudos
JaideepK_Intel
Moderator
875 Views

Hi,


Thank you for posting in Intel Communities.

We are able to reproduce your issue from our end. We are working on this internally and we will get back with an update.


Thanks,

Jaideep


0 Kudos
JaideepK_Intel
Moderator
863 Views

Hi,

 

We are in the process of relabeling all GPU systems correctly. 

 

Thanks,

Jaideep 

 

0 Kudos
JaideepK_Intel
Moderator
840 Views

Hi,


Glad to know that your issue is resolved. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.


Thanks,

Jaideep


0 Kudos
Reply