Processors
Intel® Processors, Tools, and Utilities
14395 Discussions

BOOT error WHEA on idle, 12700KF

a999
Beginner
1,775 Views

Hi,

I've been getting WHEA errors that WinDbg is telling me are BOOT errors. Strangely enough, this only happens while the system is idling - I've never had this happen while using the system. Every error has been accompanied by the system being unresponsive - no bugchecks or anything weird, the screen is just completely frozen.

The issue started happening after I installed new RAM (DDR4, XMP profile 4000mhz 19-19-19-39, ran in Gear1 CR2).

The CPU is overclocked a little bit (so it hits 5 allcore, I can provide detailed info if needed) but that shouldn't matter on idle, correct (using the Balanced power profile on Windows 11)?

My motherboard is a MSI PRO Z690-A DDR4 and the RAM model is Patriot PVS416G400C9K.

Here's the WinDbg output for the latest crash dump:

Mini Kernel Dump does not contain unloaded driver list For analysis of this file, run !analyze -v nt!LkmdTelCreateReport+0x1de: fffff804`47a344ea 0f1f440000 nop dword ptr [rax+rax] 6: kd> !analyze -v ******************************************************************************* * * * Bugcheck Analysis * * * ******************************************************************************* WHEA_UNCORRECTABLE_ERROR (124) A fatal hardware error has occurred. Parameter 1 identifies the type of error source that reported the error. Parameter 2 holds the address of the nt!_WHEA_ERROR_RECORD structure that describes the error condition. Try !errrec Address of the nt!_WHEA_ERROR_RECORD structure to get more details. Arguments: Arg1: 0000000000000007, BOOT Error Arg2: ffff81825c103020, Address of the nt!_WHEA_ERROR_RECORD structure. Arg3: 0000000000000000 Arg4: 0000000000000000 Debugging Details: ------------------ KEY_VALUES_STRING: 1 Key : Analysis.CPU.mSec Value: 1343 Key : Analysis.DebugAnalysisManager Value: Create Key : Analysis.Elapsed.mSec Value: 4637 Key : Analysis.Init.CPU.mSec Value: 311 Key : Analysis.Init.Elapsed.mSec Value: 20710 Key : Analysis.Memory.CommitPeak.Mb Value: 127 FILE_IN_CAB: WHEA-20211219-2030.dmp DUMP_FILE_ATTRIBUTES: 0x18 Kernel Generated Triage Dump Live Generated Dump BUGCHECK_CODE: 124 BUGCHECK_P1: 7 BUGCHECK_P2: ffff81825c103020 BUGCHECK_P3: 0 BUGCHECK_P4: 0 PROCESS_NAME: smss.exe STACK_TEXT: ffffa585`05a96e80 fffff804`47a06eff : ffff8182`5c103000 00000000`00000000 ffff8182`5c103020 00000055`d2a7fa00 : nt!LkmdTelCreateReport+0x1de ffffa585`05a973c0 fffff804`47a06df6 : ffff8182`5c103000 00000000`00000000 00000000`00000000 ffff8182`5bfd2d00 : nt!WheapReportLiveDump+0x7b ffffa585`05a97400 fffff804`4784c599 : 00000000`00000001 ffffa585`05a97aa0 00000000`00000000 00000000`000002f4 : nt!WheapReportDeferredLiveDumps+0x7a ffffa585`05a97430 fffff804`47739932 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000001 : nt!WheaCrashDumpInitializationComplete+0x59 ffffa585`05a97460 fffff804`47427f75 : ffff8182`5c597080 00000000`00000000 00000000`0000022c 00000000`00000001 : nt!NtSetSystemInformation+0xa72 ffffa585`05a97a20 00007ffe`d69a69d4 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiSystemServiceCopyEnd+0x25 00000055`d2a7f9a8 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : 0x00007ffe`d69a69d4 MODULE_NAME: GenuineIntel IMAGE_NAME: GenuineIntel.sys STACK_COMMAND: .cxr; .ecxr ; kb FAILURE_BUCKET_ID: LKD_0x124_7_GenuineIntel__UNKNOWN_IMAGE_GenuineIntel.sys OSPLATFORM_TYPE: x64 OSNAME: Windows 10 FAILURE_ID_HASH: {5ea80f6a-69bf-5d6f-8fd2-cd87deb91a03} Followup: MachineOwner --------- 6: kd> !errrec ffff81825c103020 =============================================================================== Common Platform Error Record @ ffff81825c103020 ------------------------------------------------------------------------------- Record Id : 01d7f506713867a9 Severity : Fatal (1) Length : 28896 Creator : Microsoft Notify Type : BOOT Error Record Timestamp : 12/19/2021 18:30:05 (UTC) Flags : 0x00000002 PreviousError =============================================================================== Section 0 : Firmware Error Record Reference ------------------------------------------------------------------------------- Descriptor @ ffff81825c1030a0 Section @ ffff81825c1032e0 Offset : 704 Length : 2592 Flags : 0x00000000 Severity : Fatal =============================================================================== Section 1 : Firmware Error Record Reference ------------------------------------------------------------------------------- Descriptor @ ffff81825c1030e8 Section @ ffff81825c103d00 Offset : 3296 Length : 544 Flags : 0x00000000 Severity : Fatal =============================================================================== Section 2 : Firmware Error Record Reference ------------------------------------------------------------------------------- Descriptor @ ffff81825c103130 Section @ ffff81825c103f20 Offset : 3840 Length : 72 Flags : 0x00000000 Severity : Fatal

At this point I'm beginning to think it's the PSU being faulty (Fractal Design Newton R3 800W) - I've been having complete system shutdowns in heavy CPU+GPU workloads with no bugchecks/whatnot.

 

Attached is the System Support Utility log.

0 Kudos
9 Replies
JosueO_Intel
Moderator
1,731 Views

Hello a999,


Thank you for posting on the Intel communities. In order to have a better understanding of the issue, please share with us the following information: 


  1. Have you tested a different RAM?
  2. Have you tested a different motherboard?
  3. Have you tested using only 1 DIMM of RAM?



Regards, 


Josue O.  

Intel Customer Support Technician



0 Kudos
a999
Beginner
1,726 Views

Hi,

I've tried different RAM and didn't have the issue - the current RAM shouldn't be defective, I've tested it using TestMem5 several times. Haven't tried a different motherboard/using 1 RAM stick.

0 Kudos
JosueO_Intel
Moderator
1,719 Views

Hello a999,


Thank you for the information provided, in this case, please try testing the RAM modules in another system and check if the issue persists. 


Also, please try using only 1 DIMM of RAM and check if the issue persists. 


Regards, 


Josue O.  

Intel Customer Support Technician



0 Kudos
a999
Beginner
1,695 Views

Hi,

 

tested the DIMMs in another PC - no signs of such instability. Haven't tested with 1 DIMM, though.

0 Kudos
JosueO_Intel
Moderator
1,691 Views

Hello a999,


Thank you for the information provided, please also test using 1 DIMM and let us know if the issue persists. 


Also, please share with us the model of the RAM that works fine with the system.

Regards, 


Josue O.  

Intel Customer Support Technician



0 Kudos
a999
Beginner
1,649 Views

Hi,

The memory that worked fine was the Corsair Vengeance Pro 3600mhz CL18-22-22-42. Could the tighter timings/higher clocks of the new memory be the issue?

0 Kudos
JosueO_Intel
Moderator
1,644 Views

Hello a999,


Thank you for the information provided, in this case, it is important for you to know that the maximum DDR4 memory speed supported by the processor is 3200 MT/s, please try setting the RAM to that speed and test the system. 


Regard,


Josue O.  

Intel Customer Support Technician



0 Kudos
JosueO_Intel
Moderator
1,626 Views

Hello a999,


Were you able to check the previous post?  

Let us know if you still need assistance.  



Regard,


Josue O.  

Intel Customer Support Technician



0 Kudos
JosueO_Intel
Moderator
1,595 Views

Hello a999,


We have not heard back from you, so we will close this thread. In case the issue persists, our recommendation is to check compatibility between the RAM with the other components of the system since this could affect its proper functioning, for this, please contact the ram manufacturer and check with them for any compatibility issues. 


 If you need any additional information, please submit a new question as this thread will no longer be monitored. 


Regard,


Josue O.  

Intel Customer Support Technician



0 Kudos
Reply