Community
cancel
Showing results for 
Search instead for 
Did you mean: 
Highlighted
Novice
99 Views

Server not recognizing Intel Xeon Phi Coprocessor 7120P

Hello everyone,

I'm a PhD student and have 2 Xeon Phi 7120P Coprocessor which i want to use for computations. However, I'm facing problem in getting the 7120P coprocessor to be recognized by my server.

My server is a Huawei RH2288 V2-8S with 2 x Intel(R) Xeon(R) CPU E5-2680 0 @ 2.70GHz. The board details are below.

The card got detected once in CentOS 7 when i manually set the PCIe mode to Gen1 (dont know why), but once i rebooted the server, the card was gone. After that i tried several settings for PCIe (Auto/Gen1/Gen2/Gen3) but it simply doesn't detect the card. The blue light keeps blinking when the server is ON but it doesn't show in lspci.

After that i even tried different OS like windows server 2019, Ubuntu, CentOS8 but it simply doesn't show the card.

To check the card, i installed it in a Dell Precision Tower 5810 with Intel(R) Xeon(R) CPU E5-1620 v3 @ 3.50GHz and Dell baseboard 0K240Y and Intel chipset Haswell-E X99. First the pc did not boot but then i went in bios and set PCIe to Gen2 and it detected the 7120P coprocessor but after that when i tried to install the card, it would not respond. I noticed that the card was getting a bit hot in Dell system because 7120P does not have any cooling fan and can be used only in a server with active cooling. But the problem is server doesn't detect it.

Any help would be really appreciated.

My email is VS00350@surrey.ac.uk

Vikrant1_0-1605704845145.png

Vikrant1_1-1605705308952.png

 

0 Kudos
6 Replies
Highlighted
Moderator
82 Views

Hello Vikrant1,

 

Thank you for joining the community

 

You stated that you own 2 x Intel Xeon Phi Coprocessors 7120P. Is this issue happening with both cards or just one of them? Is it showing up in BIOS at least?  Are you booting the system in legacy mode or UEFI mode?

Besides that by any chance have you check with the server manufactured (Huawei and/or Dell) if this card is compatible with their systems?

We'll look forward for your updates

 

Regards

 

Jose A.

Intel Customer Support

For firmware updates and troubleshooting tips, visit:

https://intel.com/support/serverbios

 

0 Kudos
Highlighted
Novice
66 Views

Hi Jose,

Thank you for your reply.

Both the cards are 7120P and both show the same behaviour. They get detected in my Dell Precision Tower T5810 but since it does not have any cooling mechanism for passive cards so they become quite hot initially and then become cold. I'm assuming they auto shutdown because of no cooling.

Unfortunately the Huawei servers that we have no longer have any support available from Huawei, so we are completely on our own there.

The card got detected only once in the bios and the OS but once i rebooted the server, it was gone from both the bios and the OS. There is a blue light at the back of the card that keeps blinking. Not sure what this means.

The BIOS is currently set to dual EFI/Legacy (Auto select) mode. Can this have any effect on the Phi Coprocessor?

 

 

Regards,

Vikrant Singh

0 Kudos
Highlighted
Moderator
60 Views

Hello Vikrant1,


At this point it is difficult to tell what could be wrong with these cards. It seems unlikely that both cards are damaged and makes me think about a compatibility issue. By any chance do you have access to another different server (ideally an Intel Server) that you could try the cards on, so we can discard an actual hardware failure? I think you could request compatibility information to both Dell and Huawei for this card. The following site shows compatibility with Intel servers only though Intel® Xeon Phi™ Coprocessor 7120P (16GB, 1.238 GHz, 61 core) Product Specifications


Regards


Jose A.

Intel Customer Support Technician


0 Kudos
Highlighted
Novice
32 Views

Hi Jose,

Unfortunately, at the moment i just have access to these servers. I'm quite disappointed at the moment. Was really looking forward for a way to accelerate my computations with these cards.

 

 

Regards,

Vikrant

0 Kudos
Highlighted
Moderator
28 Views

Hello Vikrant1,


Let me check with our senior team if there is something else that we can try.


Regards


Jose A.

Intel Customer Support Technician


0 Kudos
Highlighted
Novice
9 Views

Hi Jose,

Many thanks for your help. Much appreciated.

I have added some snapshots of my BIOS, just in case you might want to have a look at them.

HyperThreading is Disabled and TurboMode is EnabledVikrant1_0-1606657764139.png

 

The server has only 1 PCIex16 slot linked to CPU0 (First CPU in the server) which is Port 3 and the XeonPhi 7120P.

Vikrant1_1-1606657847500.png

The respective PCIe port (3a) is set at Gen2 and PCI-E port max payload is set to 256B.

Vikrant1_2-1606657907096.png

Intel VT is disabled

Vikrant1_3-1606657957277.png

PCIe 64-bit Decode is enabled

Vikrant1_4-1606658028095.png

 

Regards,

Vikrant

0 Kudos