Intel® Xeon® Processor and Server Products
Intel® Xeon® Processors, Data Center Products including boards, integrated systems, and RAID Storage
Announcements
FPGA community forums and blogs have moved to the Altera Community. Existing Intel Community members can sign in with their current credentials.
5201 Discussions

Peculiar problem with Xeon W-1250 and graphics

tcsenter89
New Contributor I
2,518 Views

I have two Precision 3650 towers here that I am rebuilding. I found a curious problem affecting both systems exhibit identical symptom. The processor is a supported Xeon W-1250 (SRH48), Intel W580 chipset.

Using the iGPU Intel processor graphics (via onboard DP port), the system will spontaneously reset/reboot shortly after loading to the desktop. Usually within 90 seconds but has gone up to about 5 minutes. There is no crash dump or logs other than the 'Windows did not shut down properly' check bit error that gets reported on reboot. It does not happen early in the installation of Windows but it can happen late (when Windows is guiding you through SETUP questions)

When a graphics card (dGPU) is installed in the PCI-E x16 (PEG) slot, the system is 100% stable. But wait, there's more! It is 100% stable even when I am actually connected through the integrated graphics. i.e. no display connected to the dGPU. I noticed some of the benchmark apps were still defaulting to the dGPU for some workloads and outputting via the iGPU. So I disabled the dGPU in Device Manager, restarted and set primary display adapter in BIOS to Onboard, booted to Windows, checked the dGPU was still disabled, to ensure the iGPU (Intel processor integrated) would be used for everything.

I ran PCMark, 3DMark, processor stress, memory stress for several hours (cumulative), 100% stable. I confirmed the iGPU was being utilized for ALL workloads by using Task Manager and watching utilization of the various execution pipeline such as 3D, Video Decode, Video Encode, or Memory Copy. All showing utilization on the iGPU not the dGPU.

The dGPU I have been testing are both slot powered (under 60W board power), no aux power connector. Running all the benchmarks and stress tests on the (enabled) dGPU also is stable, BTW.

In sum/recap:

System with only iGPU = spontaneous reset/reboot

System with dGPU inserted into PCI-E x16 (PEG) whether utilized or not = 100% stable via iGPU (or dGPU)

Windows 10 (22H2) and 11 (24H2) exhibit same problem. Latest drivers or older drivers, same problem. Latest BIOS or the oldest BIOS I am permitted to revert to several versions ago, same problem. Tried all three BIOS settings for primary display adapter, Auto, Onboard, or dGPU. Multi-display support on or off. PCI-E (PEG) bifurcation Auto or x16. Disabled almost all onboard devices except for LAN and rear panel USB. Same problem. Kernel DMA protection in Windows ON or OFF, same problem. Core Isolation/Memory Integrity OFF. ReBAR OFF (always a safe choice). PCI decode above 4G ON or OFF, same problem.


Any ideas? I have a different supported processor coming to test but won't be here until probably Thursday 13th March.

0 Kudos
1 Solution
tcsenter89
New Contributor I
2,114 Views

That's it! I'm making an 'executive decision' - off to ewaste both mobos are going. I think there is some wonky component or VRM on the mobo. What is the interaction that causes it to be suppressed or masked when PEG slot is populated, I don't know. But time to move on.

View solution in original post

0 Kudos
10 Replies
NormanS_Intel
Moderator
2,447 Views

Hello tcsenter89,

 

Thank you for posting in the community!

 

To ensure you receive the most specialized assistance, we have a dedicated forum that addresses these specific concerns. Therefore, I will be moving this discussion to our Server Forum. This will allow our knowledgeable community and experts to provide you with timely and accurate solutions.

 

Best regards,

Norman S.

Intel Customer Support Engineer


0 Kudos
Ragulan_Intel
Employee
2,396 Views

Hello tcsenter89,


I hope this message finds you well.


Thank you for reaching out to the Intel Community!


To proceed further, we would appreciate it if you could confirm whether the Xeon W-1250 in question came with the Dell Precision 3650 or if it was purchased separately.


According to the spec sheet for the system, it appears that the processor came with the Precision. However, your confirmation would be helpful.


Additionally, have you contacted Dell to check for any known issues, firmware updates, or motherboard-related fixes for this behavior? If the issue is hardware-related, Dell can assist with further diagnostics or potential RMA options. If Dell determines it to be an Intel-specific issue, we will investigate further.



Thank you & Best Regards,


Ragulan_Intel


0 Kudos
tcsenter89
New Contributor I
2,375 Views

Thanks for the reply!

The towers I purchased as previously owned 'barebones' with no CPU, memory, etc. The processor was acquired separately. I only have the one Xeon but am awaiting another CPU model to try. There is no warranty or support contract remaining from DELL. But I have posted to DELL Community Forums in the Precision section, thus far have received no replies.

I also have tried different memory modules, single or two modules, and known-good ATX12V PSU. No change. When I pull the PCI-E graphics card from the PEG slot, spontaneous reboots come back. As long as one is inserted, everything runs and runs with no problem, including the Intel iGPU!

This has me wondering about the curious interaction happening here. I remember back in the LGA775/771 days there was the PCI-E ADD2 card for extending the chipset iGP for multi display output support. I recall reading that when this ADD2 card was inserted, there was PCI-E lane reversal/switching logic for the PEG interface (Northbridge) in order to route the IGP output to the ADD2 card.

I don't pretend to have engineering level knowledge of how that all worked. Mine is more of a chipset/platform block diagram or overview in the public 'technical brief' level knowledge, which isn't much. But I can't help but wonder if there is something going on between the PEG interface and iGP there. What it would be, I have no idea.

I guess I'll need to wait for that processor to arrive to learn more.

0 Kudos
Ragulan_Intel
Employee
2,368 Views

Hello tcsenter89,


Greetings!


Thank you for your feedback.


As your issue now appears to be related to the Intel® UHD Graphics P630 and the abnormal behavior of random reboots occurring when no discrete graphics card is attached via PCIe, we will move this thread to a dedicated team for assistance. This will ensure you receive specialized support and access to a forum that addresses these specific concerns. Our knowledgeable community and experts will be able to provide you with timely and accurate solutions.


Thank You & Best Regards,


Ragulan_Intel


0 Kudos
tcsenter89
New Contributor I
2,311 Views

UPDATE:


Installed a supported Rocket Lake i5-11600K to rule-out the Comet Lake Xeon W-1250. Same problem, but even worse! Unlike before, I can't even get it to load Windows SETUP. As soon as it starts loading the OS (WinPE) boot files = reboot! I have reset/cleared BIOS, load UEFI defaults, etc. No change.

When I insert a PCI Express graphics card, keep primary display adapter in BIOS to Onboard or Auto, with monitor connected to the onboard iGPU, everything works. OR when using the PCI Express graphics card, works great.

So it not the CPU. Something must be going on with the chipset/BIOS, low level PCI/PCIE resource configuration or assignment (firmware) in Dell's BIOS code. GAAAAHHHH!

 

Results from Dell's Diagnostics (BIOS based) Advanced Test image attached (in case anyone was going to ask).

0 Kudos
tcsenter89
New Contributor I
2,246 Views

Tried a different PSU (Antec) and that seemed to improve things, I could boot to the desktop and it ran great! I even ran 3DMark one pass and thought OMG it was the PSU all along? But 15 or 20 minutes of usage = reboot. The ONLY configuration that is stable is when a graphics card is inserted into the PEG slot.

 

I noted that when I was able to run that pass of 3DMark Night Raid with no graphics card inserted, the result was ~9600 on the Intel graphics. When I insert a graphics card BUT run the benchmark on the iGPU (Intel UHD), the result is lower ~8600. This I have verified twice now, in each configuration (when I was able to get that far without a graphics card inserted). Another interesting note is that when I changed the 'rendering device' from NVIDIA graphics card to the Intel graphics, the application warned "rendering device is not connected directly to the selected display" but in fact the display IS plugged to the onboard graphics port, which is the Intel UHD graphics. It further suggests to me there is some kind of PCIe routing, lane reversal or switching bug here.

 

So changing these things seems to be altering something, getting a lot further than half-way through OS loading but in the end, whether it is 5, 10, 15, or 20 minutes, it will reboot spontaneously. With graphics card inserted = UPTIME FOR HOURS AND HOURS.

0 Kudos
NormanS_Intel
Moderator
2,185 Views

Hello tcsenter89,


I wanted to let you know that I'm still looking into your inquiry. Please allow us some additional time, and I'll update you as soon as I have more information.


Best regards,

Norman S.

Intel Customer Support Engineer


0 Kudos
NormanS_Intel
Moderator
2,164 Views

Hello tcsenter89,

 

I've reviewed the case again, and it appears to be a potential hardware issue. Since you've tested different processors and the problem persists, it seems the processor is not the root cause. It might be related to the power supply. To further troubleshoot, please follow these articles:

 

 

Additionally, please share the system utility logs of your computer so I can thoroughly check your system configuration. You can attach the logs to this post if that's okay with you. Also, to clarify, I'd like to know the following:

 

  1. When did the issue start to occur?
  2. Was your system working fine before?
  3. What is the exact make and model of your power supply, and its wattage, since you mentioned using an Antec PSU?

 

I look forward to your response and apologize for any inconvenience this may have caused.

 

Best regards,

Norman S.

Intel Customer Support Engineer

 

0 Kudos
tcsenter89
New Contributor I
2,115 Views

That's it! I'm making an 'executive decision' - off to ewaste both mobos are going. I think there is some wonky component or VRM on the mobo. What is the interaction that causes it to be suppressed or masked when PEG slot is populated, I don't know. But time to move on.

0 Kudos
NormanS_Intel
Moderator
2,102 Views

Hello tcsenter89,


I'm sorry to hear about the persistent issues with your motherboard. It’s clear that you’ve invested considerable effort into troubleshooting, and I completely understand your decision to move forward.


I will proceed to close this inquiry. Should you need further assistance, please feel free to submit a new question, as this thread will no longer be monitored.


Thank you for your patience and perseverance throughout this process.


Best regards,

Norman S.

Intel Customer Support Engineer


0 Kudos
Reply