I'm a PhD student and have 2 Xeon Phi 7120P Coprocessor which i want to use for computations. However, I'm facing problem in getting the 7120P coprocessor to be recognized by my server.
My server is a Huawei RH2288 V2-8S with 2 x Intel(R) Xeon(R) CPU E5-2680 0 @ 2.70GHz. The board details are below.
The card got detected once in CentOS 7 when i manually set the PCIe mode to Gen1 (dont know why), but once i rebooted the server, the card was gone. After that i tried several settings for PCIe (Auto/Gen1/Gen2/Gen3) but it simply doesn't detect the card. The blue light keeps blinking when the server is ON but it doesn't show in lspci.
After that i even tried different OS like windows server 2019, Ubuntu, CentOS8 but it simply doesn't show the card.
To check the card, i installed it in a Dell Precision Tower 5810 with Intel(R) Xeon(R) CPU E5-1620 v3 @ 3.50GHz and Dell baseboard 0K240Y and Intel chipset Haswell-E X99. First the pc did not boot but then i went in bios and set PCIe to Gen2 and it detected the 7120P coprocessor but after that when i tried to install the card, it would not respond. I noticed that the card was getting a bit hot in Dell system because 7120P does not have any cooling fan and can be used only in a server with active cooling. But the problem is server doesn't detect it.
Any help would be really appreciated.
My email is VS00350@surrey.ac.uk
Thank you for joining the community
You stated that you own 2 x Intel Xeon Phi Coprocessors 7120P. Is this issue happening with both cards or just one of them? Is it showing up in BIOS at least? Are you booting the system in legacy mode or UEFI mode?
Besides that by any chance have you check with the server manufactured (Huawei and/or Dell) if this card is compatible with their systems?
We'll look forward for your updates
Intel Customer Support
For firmware updates and troubleshooting tips, visit:
Thank you for your reply.
Both the cards are 7120P and both show the same behaviour. They get detected in my Dell Precision Tower T5810 but since it does not have any cooling mechanism for passive cards so they become quite hot initially and then become cold. I'm assuming they auto shutdown because of no cooling.
Unfortunately the Huawei servers that we have no longer have any support available from Huawei, so we are completely on our own there.
The card got detected only once in the bios and the OS but once i rebooted the server, it was gone from both the bios and the OS. There is a blue light at the back of the card that keeps blinking. Not sure what this means.
The BIOS is currently set to dual EFI/Legacy (Auto select) mode. Can this have any effect on the Phi Coprocessor?
At this point it is difficult to tell what could be wrong with these cards. It seems unlikely that both cards are damaged and makes me think about a compatibility issue. By any chance do you have access to another different server (ideally an Intel Server) that you could try the cards on, so we can discard an actual hardware failure? I think you could request compatibility information to both Dell and Huawei for this card. The following site shows compatibility with Intel servers only though Intel® Xeon Phi™ Coprocessor 7120P (16GB, 1.238 GHz, 61 core) Product Specifications
Intel Customer Support Technician
Unfortunately, at the moment i just have access to these servers. I'm quite disappointed at the moment. Was really looking forward for a way to accelerate my computations with these cards.
Many thanks for your help. Much appreciated.
I have added some snapshots of my BIOS, just in case you might want to have a look at them.
HyperThreading is Disabled and TurboMode is Enabled
The server has only 1 PCIex16 slot linked to CPU0 (First CPU in the server) which is Port 3 and the XeonPhi 7120P.
The respective PCIe port (3a) is set at Gen2 and PCI-E port max payload is set to 256B.
Intel VT is disabled
PCIe 64-bit Decode is enabled