Intel® DevCloud

dual_gpu systems only have 1 gpu

Robert_C_Intel
Employee

s011-n001 and s011-n002 are listed as dual_gpu, but each reports only 1 GPU. For example:

 

uxxxxx@s011-n001:~$ sycl-ls
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
uxxxxx@s011-n001:~$
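For anyone who wants to check a node without reading the listing by eye, here is a minimal sketch that counts GPU entries per backend in `sycl-ls` text output. It assumes the line format shown above (`[backend:index] KIND : description`); the function name `count_gpus` and the `SAMPLE` constant are hypothetical names introduced only for this illustration.

```python
import re
from collections import Counter

# Matches lines such as "[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]".
# The format is assumed from the output pasted in this thread.
DEVICE_LINE = re.compile(r"^\[(?P<backend>\w+):(?P<index>\d+)\]\s+(?P<kind>\w+)\s*:")


def count_gpus(sycl_ls_output: str) -> Counter:
    """Return the number of GPU entries per backend (e.g. opencl, level_zero)."""
    counts = Counter()
    for line in sycl_ls_output.splitlines():
        match = DEVICE_LINE.match(line.strip())
        if match and match.group("kind") == "GPU":
            counts[match.group("backend")] += 1
    return counts


# Output copied from s011-n001 above.
SAMPLE = """\
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
"""

if __name__ == "__main__":
    print(dict(count_gpus(SAMPLE)))
```

On a live node the same function could be fed the captured output of `sycl-ls`, for example via `subprocess.run(["sycl-ls"], capture_output=True, text=True).stdout`. Note that each physical GPU appears once per backend, so a dual_gpu node would be expected to show 2 under both `opencl` and `level_zero`.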
1 Solution
JaideepK_Intel
Moderator

Hi,

 

We are in the process of relabeling all GPU systems correctly. 

 

Thanks,

Jaideep 

 

5 Replies
Robert_C_Intel
Employee

s011-n004 has 2 GPUs:

 

uxxxxx@s011-n004:~$ sycl-ls
[opencl:0] ACC : Intel(R) FPGA Emulation Platform for OpenCL(TM) 1.2 [2021.13.11.0.23_160000]
[opencl:0] CPU : Intel(R) OpenCL 3.0 [2021.13.11.0.23_160000]
[opencl:0] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[opencl:1] GPU : Intel(R) OpenCL HD Graphics 3.0 [21.49.21786]
[level_zero:0] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[level_zero:1] GPU : Intel(R) Level-Zero 1.2 [1.2.21786]
[host:0] HOST: SYCL host platform 1.2 [1.2]
uxxxxx@s011-n004:~$
Robert_C_Intel
Employee

Now s011-n004 also reports only 1 GPU.

All of these machines print an error at login:

/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device

 

s012-n002 should have 4 GPUs, but it reports only 1. I get 3 errors at login:

########################################################################
# Date: Sun 13 Feb 2022 12:21:12 PM PST
# Job ID: 1850235.v-qsvr-1.aidevcloud
# User: uxxxxx
# Resources: neednodes=1:quad_gpu:ppn=2,nodes=1:quad_gpu:ppn=2,walltime=06:00:00
########################################################################

/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
/var/spool/torque/mom_priv/prologue.d//100-resetpcie.prologue: line 6: echo: write error: No such device
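The mismatch can also be checked mechanically by comparing the node property in the job banner with the number of GPUs actually reported. The sketch below assumes the `Resources:` line format shown above and assumes that `dual_gpu` means 2 GPUs and `quad_gpu` means 4, which matches the expectations stated in this thread; the names `EXPECTED_GPUS`, `expected_gpu_count`, and `is_short_of_gpus` are hypothetical.

```python
import re

# Assumed mapping from node property to GPU count, based on the
# expectations stated in this thread (not on any official table).
EXPECTED_GPUS = {"dual_gpu": 2, "quad_gpu": 4}


def expected_gpu_count(resources_line: str):
    """Extract the node property after 'nodes=<n>:' and look up its GPU count.

    Returns None when no property is found or the property is unknown.
    """
    match = re.search(r"nodes=\d+:(\w+)", resources_line)
    if match is None:
        return None
    return EXPECTED_GPUS.get(match.group(1))


def is_short_of_gpus(resources_line: str, reported_gpus: int) -> bool:
    """True when the node reports fewer GPUs than its property implies."""
    expected = expected_gpu_count(resources_line)
    return expected is not None and reported_gpus < expected


# Banner line copied from the s012-n002 login above.
BANNER = "# Resources: neednodes=1:quad_gpu:ppn=2,nodes=1:quad_gpu:ppn=2,walltime=06:00:00"

if __name__ == "__main__":
    print(expected_gpu_count(BANNER), is_short_of_gpus(BANNER, reported_gpus=1))
```

Here `reported_gpus` would be the per-backend GPU count taken from `sycl-ls` on the node.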

 

Maybe a reboot will fix them.

JaideepK_Intel
Moderator

Hi,


Thank you for posting in Intel Communities.

We were able to reproduce the issue on our end. We are working on it internally and will get back to you with an update.


Thanks,

Jaideep


JaideepK_Intel
Moderator

Hi,


Glad to know that your issue is resolved. If you need any additional information, please post a new question as this thread will no longer be monitored by Intel.


Thanks,

Jaideep

