I have a 12900k, Gigabyte Z690I DDR4, GSkill 2x16GB DDR4 3600 , EVGA 3080 XC3, Samsung 980 Pro, Windows 11.
I get thousands of WHEA errors a minute for ven_8086&dev_ 460d&SUBSYS_86941043&REV_02 with my computer sitting idle. The rate increases if I begin to stress graphics. If I set the PCIe manually to gen 3 in the BIOS the errors stop completely.
I haven't had much luck troubleshooting this with Gigabyte. I see several others with similar configurations on Z690 boards across manufacturers with this issue, so doesn't seem to be board specific, other than Z690 with a PCIe gen 4 video card.
I get the same errors and believe it is a chipset driver issue for Z690 and Alder Lake. Fresh WIN 11 install, updated bios for ASUS Z690-A Gaming WIFI MOBO, 12900K, ASUS TUF OC 3090 GPU, Samsung 970 PLUS as Boot Drive in M.1 slot, and a Samsung 970 PLUS in 2nd M.1 slot.
Others experiencing the same thing. Only solution so far is putting GPU in 2nd PCI slot running at GEN 3 speeds (kinda beats the point of having a GEN 4 card). At this point I'm guessing this devolves into finger pointing between Intel and Motherboard manufacturers in regards to whose drivers are broke for this....insert Spiderman meme.
Someone with a similar issue here: https://community.intel.com/t5/Processors/WHEA-LOGGER-ID17-help/m-p/1328866#M54659
I’ve tried the following:
- disabling PCI-E link power state management seems to reduce the amount of WHEA's, but it still crashes under heavy load
- Next, I removed the GPU and ran the system off of the iGPU. No crashes, no WHEA's, running prime95 and cinebench without any freezing whatsoever.
- Next, I set the top PCIE slot at gen 3 in bios – same issue persists.
- Finally, plugging the GPU in the lower PCI-E slot works fine without issues. But now my ASUS 3090 TUF OC is running at PCIE 3 and lower bandwidth – this is a bandaid and not a fix.
Problem seems isolated to the top PCI-E slot. I tried reseating the card and still had the issue. Problem persists using a riser cable or direct plug in. Also, problem exists for many different people.
When I swap all components back to my backup computer (Z390 Gigabyte Designare, 9900K CPU) I have zero issues and full performance.
Event Viewer shows the following details
A corrected hardware error has occurred.
Component: PCI Express Root Port
Error Source: Advanced Error Reporting (PCI Express)
Primary Bus:Device:Function: 0x0:0x1:0x0
Secondary Bus:Device:Function: 0x0:0x0:0x0
Primary Device Name:PCI\VEN_8086&DEV_460D&SUBSYS_86941043&REV_02
Secondary Device Name:
Hello bigpcboy, I just wanted to check if the information posted by Grendel602 was useful for you and if you need further assistance on this matter?
Intel Customer Support Technician
Alberto, what I posted wasn't a fix, it was a bandaid. Putting the PCIE slot into GEN 3 is a temp fix, it should be running at GEN 4. Lots of people running different MOBO's (ASUS/Gigabyte) are running into this PEG 10 460D issue that produces WHEA 17 errors.
The only way people have stopped the errors is by putting their GPU's into GEN 3 mode in BIOs or some have been able to disable ASPM in BIOS and link state power management. Only going to GEN 3 works for me....or disabling native state power management in BIOS (but that produces artifacting and lower GPU performance. A lot of people are getting crashes as the errors build up over time.
More people with the exact same issue posting here:
The only two devices tied to my PEG 10 460D is NVIDIA HD Audio and the RTX 3090 itself....some people have their NVMe tied to that, but this is clearly not a device issue more so a PCIE Bus issue.
I am currently, on fresh WIN 11 install with new 980 PRO NVME drive with all current ASUS drivers, mobo bios, and windows updates. I still get the errors but it has stopped crashing immediately, but time will tell.
INTEL - Why does disabling PCIE Native Power Management stop the WHEA errors but result in instability, lower GPU performance and artifacting? Also, if leaving Native Power Managment enabled why does going to GEN 3 on the PCIE lane stop the errors?
Grendel602, Thank you very much for clarifying those details.
bigpcboy, Thank you very much for confirming that information.
bigpcboy, in order for us to provide the most accurate assistance on this topic, we just wanted to confirm a few details about your system:
Is this a new computer?
Did you build it?
When did you purchase the Intel® processor?
Was it working fine before?
When did the issue start?
Did you make any recent hardware/software changes that might cause this issue?
Which specific Windows* version are you using?
Does the problem happen at home or in the work environment?
Please attach the SSU report so we can verify further details about the components in your platform, check all the options in the report including the one that says "3rd party software logs":
By any chance, did you try the suggestions provided in the following link?
Any questions, please let me know.
Intel Customer Support Technician
Hello bigpcboy, I just wanted to check if you saw the information posted previously and if you need further assistance on this matter?
Intel Customer Support Technician
Hello bigpcboy, Since we have not heard back from you, we are closing the case, but if you have any additional questions, please post them on a new thread so we can further assist you with this matter.
Intel Customer Support Technician
I have this exact issue, with a Gigabyte Z690i Ultra DDR4 motherboard, 12700KF cpu, and MSI 3090 gpu. One thing I've noticed is that the people having this issue all run nvidia 3000 series video cards. I don't know if AMD gpus aren't affected, or that they are just more rare. Again, the only way to stop the errors is to switch the pcie slot to gen3, which is a band-aid, not a solution. Hopefully if a solution is found, it can be spread to all the motherboard manufacturers.
I have this issue as well, starting a couple of days ago. Same motherboard as TimberWolf1, 12600K CPU, Windows 11. Power supply is a less-than-one-year-old Corsair RM750, which is bigger than what the video card requires). There is a single Samsung 970 EVO SSD in the CPU M.2 slot. It's been working fine since I built the system in the middle of December. Friday I replaced the GTX 1070 I originally had there with a new Gigabyte RX 6800 (so here's one "not nVidia" case) and moved the computer to a different room. (It was treated gently, not dropped, etc.) Late Monday or early yesterday I noticed that it wasn't waking up from sleep, and Tuesday evening I looked at event viewer and saw tens of thousands of WHEA 17 events, with the primary device name of PCI\VEN_8086&DEV_460D&SUBSYS_50001458&REV_02. It seems like it only happens when gaming.
So far I have gone out to the Gigabyte website and applied every driver and BIOS update for the motherboard, gone to AMD's website and gotten the latest drivers, installed the Intell support assistant and let it do driver updates, and then ran Windows Update a few times. I also went into the BIOS and did "load optimized defaults". The only change I made in the BIOS other than that was to enable XMP so my RAM (2x16 G.Skill DDR4-3000) would run at rated speed.
I plan on removing and re-seating the CPU and video card today to see if that does anything. I may try dropping the 1070 back in to see what happens--the system wasn't crashing when it was in there, but that's not a tenable solution given how much the new card cost.
I also have the Z690I Aorus Ultra DDR4,12600K CPU, and Windows 11. My GPU is a RTX 2070 Super. I got that same device error for PCI\VEN_8086&DEV_460D&SUBSYS_50001458&REV_02. I don't get the error while gaming specifically. What happened about half an hour ago was my PC completely froze for about 10 seconds, then reset itself while I was using Google Chrome. My error log shows 12,441 WHEA-Logger errors occurred during the 4 minutes prior to the PC crashing.
I had been gaming for a few hours prior without issue. Everything is up to date and I changed my PCI-E settings to Gen 3 in BIOS, but I still get these crashes and errors. I've reached out to Gigabyte support but they have been unhelpful so far.
I'm having major problems with my Gigabyte Z690 UD and 12600K. Firstly it was my USB devices randomly disconnecting and now the audio is crackling and I'm getting stutters during gaming. I checked with Latency Mon and am getting high latency when this issue happens.
Same issue on Gigabyte z690i ULTRA DDR4 + 12700kf + Asus TUF 3080 v2.
Why is no one acknowledging this problem? Judging by the numerous of posts on Reddit, this problem has affected every owner of this motherboard.
ASUS ROG STRIX Z690-A Gaming WiFi D4 with 12700K and 3080, same issues, WHEA 1 and WHEA 17 errors.
By the way, in case this helps anyone:
In the BIOS with all settings under Platform Misc Configuration disabled I get no WHEA 1 or 17 errors, ie. disable all ASPM settings.
The culprit (but not concrete or limited to, just from little testing) seems to be "PEG - ASPM" for the WHEA 17 errors, massive instability and BSOD / hard crash.
"PCI Express Clock Gating" seems to cause the random WHEA 1 errors but only sometimes.
Apart from that I have a bunch of other errors for other hardware especially my I225-V ethernet (log is full of Event ID 32 from source e2fexpress errors regardless of which driver I use).
Not too happy with this system, was an expensive upgrade only to have an unstable hot mess, was previously on an Asus Maximus Hero and i7-4790K with the same 3080 GPU and it was rock solid.
My system uses an MSI Pro Z690 -A DDR4 Wifi, i9 12900KF, RTX 3090, 64GB GSKILL RIPJAW DDR4 3600 CL18.
If I'm doing a render, or any benchmark it runs fine, but if I'm doing a light task where the system isn't being stressed it freezes up. No BSOD, just frozen. Event viewer mentions a hardware failure, but no information beyond that .
I've tried different RAM, a different GPU, different storage with the same results. I've tried BIOS defaults, even disabling power saving features. I can only assume it's the board. I've ordered a higher end board and a replacement CPU to do further troubleshooting, but does anyone have any idea what it could be? Any help is appreciated.
It seems to be the motherboard, even though it's happening across multiple board makers. On my Gigabyte Aorus Z690-I ultra ddr4, manually setting the GPU PCIe slot to 3.0 makes all the errors go away, and over on the Gigabyte Reddit forum, someone who appears to be a Gigabyte employee says they're planning on replacing affected boards, but they're not ready yet.
I tried that and had no luck. What's weird is the system will be fine under a constant load like furmark or prime95, but intermittent loads like After Effects will eventually lead to freezing. I think it's definitely the board and it's not regulating power properly. My system is a Cyberpower PC and It's funny that they have a small sticker in the back that states "not fully tested".
Just to record that I'm having exactly the same error codes as the OP. My issue stops when I disconnect my RTX 2060. I'm using an i5-12600k on an Asus B660-i board, Win11 loaded on a Samsung 980 NVMe.