Processors
Intel® Processors, Tools, and Utilities
16001 Discussions

13900-K causing fatal hardware crash?

Guywhoneedshelp3452
207 Views

Hello,

 

My system has run flawlessly for two years. Out of nowhere yesterday, while I was doing nothing other than typing a URL into Brave Browser, the entire device froze--everything on my desktop froze, mouse and keyboard were unresponsive (clicking Num Lock wouldn't turn the corresponding light on the keyboard on), and even the lights on my CPU cooler and RAM sticks stopped their usual changing RGB colors and had their colors frozen in a static state. I had to click the reset button on my PC, after which the PC turned off and turned itself back on and booted normally.

WHEA Logger logged an "Event ID: 1": "A fatal hardware error has occurred. A record describing the condition is contained in the data section of this event." This is the only time that event has appeared in Event Viewer.

I have the details from the dump file below, which suggests that something related to my 13900-K is the problem, but first here are my system specs:

OS: Windows 10 Version 10.0.19045 Build 19045

Processor: 13th Gen Intel(R) Core(TM) i9-13900K, 3000 Mhz, 24 Core(s), 32 Logical Processor(s)

Motherboard: PRO Z690-A WIFI (MS-7D25)

RAM: 64GB 5600MHz DDR5

GPU: Palit RTX 4090

 

Here is the dump file:

*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
nt!_WHEA_ERROR_RECORD structure that describes the error condition. Try !errrec Address of the nt!_WHEA_ERROR_RECORD structure to get more details.
Arguments:
Arg1: 0000000000000007, BOOT Error
Arg2: ffffcc049d926150, Address of the nt!_WHEA_ERROR_RECORD structure.
Arg3: 0000000000000000
Arg4: 0000000000000000

Debugging Details:
------------------

Mini Kernel Dump does not contain unloaded driver list
Mini Kernel Dump does not contain unloaded driver list
Mini Kernel Dump does not contain unloaded driver list

KEY_VALUES_STRING: 1

    Key  : Analysis.CPU.mSec
    Value: 765

    Key  : Analysis.Elapsed.mSec
    Value: 3601

    Key  : Analysis.IO.Other.Mb
    Value: 4

    Key  : Analysis.IO.Read.Mb
    Value: 1

    Key  : Analysis.IO.Write.Mb
    Value: 19

    Key  : Analysis.Init.CPU.mSec
    Value: 328

    Key  : Analysis.Init.Elapsed.mSec
    Value: 89680

    Key  : Analysis.Memory.CommitPeak.Mb
    Value: 83

    Key  : Analysis.Version.DbgEng
    Value: 10.0.27725.1000

    Key  : Analysis.Version.Description
    Value: 10.2408.27.01 amd64fre

    Key  : Analysis.Version.Ext
    Value: 1.2408.27.1

    Key  : Bugcheck.Code.LegacyAPI
    Value: 0x124

    Key  : Bugcheck.Code.TargetModel
    Value: 0x124

    Key  : Dump.Attributes.AsUlong
    Value: 18

    Key  : Dump.Attributes.KernelGeneratedTriageDump
    Value: 1

    Key  : Failure.Bucket
    Value: LKD_0x124_7_GenuineIntel__UNKNOWN_IMAGE_GenuineIntel.sys

    Key  : Failure.Hash
    Value: {5ea80f6a-69bf-5d6f-8fd2-cd87deb91a03}


BUGCHECK_CODE:  124

BUGCHECK_P1: 7

BUGCHECK_P2: ffffcc049d926150

BUGCHECK_P3: 0

BUGCHECK_P4: 0

FILE_IN_CAB:  WHEA-20241122-1525.dmp

DUMP_FILE_ATTRIBUTES: 0x18
  Kernel Generated Triage Dump
  Live Generated Dump

FAULTING_THREAD:  ffffcc04a9859080

PROCESS_NAME:  smss.exe

STACK_TEXT:  
fffff080`edb3a0d0 fffff805`81b621cf     : ffffcc04`9d926130 00000000`00000000 ffffcc04`9d926150 00000000`00000022 : nt!LkmdTelCreateReport+0x139
fffff080`edb3a610 fffff805`81b620c6     : ffffcc04`9d926130 fffff805`00000000 00000085`00000000 00000085`491ff490 : nt!WheapReportLiveDump+0x7b
fffff080`edb3a650 fffff805`819d7941     : 00000000`00000001 fffff080`edb3aac0 00000085`491ff490 00000000`00000300 : nt!WheapReportDeferredLiveDumps+0x7a
fffff080`edb3a680 fffff805`817dd8f7     : 00000000`00000000 ffffcc04`a7f91030 00000000`00000103 00000000`00000000 : nt!WheaCrashDumpInitializationComplete+0x59
fffff080`edb3a6b0 fffff805`81612505     : ffffcc04`a9850000 ffffcc04`a98576b0 fffff080`edb3aac0 ffffcc04`00000000 : nt!NtSetSystemInformation+0x1f7
fffff080`edb3aa40 00007ffa`8add0a64     : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiSystemServiceCopyEnd+0x25
00000085`491ff438 00000000`00000000     : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : 0x00007ffa`8add0a64


MODULE_NAME: GenuineIntel

IMAGE_NAME:  GenuineIntel.sys

STACK_COMMAND:  .process /r /p 0xffffcc04a901f0c0; .thread 0xffffcc04a9859080 ; kb

FAILURE_BUCKET_ID:  LKD_0x124_7_GenuineIntel__UNKNOWN_IMAGE_GenuineIntel.sys

OSPLATFORM_TYPE:  x64

OSNAME:  Windows 10

FAILURE_ID_HASH:  {5ea80f6a-69bf-5d6f-8fd2-cd87deb91a03}

Followup:     MachineOwner
---------

There doesn't appear to be anything wrong with the computer apart from the fact that this happened. I ran some CPU benchmarks a few times--3DMark CPU Profile and Time Spy Extreme--and everything seemed normal.

In hopes of avoiding the crash in future, I uninstalled iCUE software as I read this can create problems like this. I also disabled the 13900-K's integrated graphics in Device Manager, as the 4090 video card is the only thing that should be handling video processing anyway. Does anything in the dump file suggest it could be the integrated graphics driver or component that caused this?

0 Kudos
3 Replies
RandyT_Intel
Moderator
123 Views

Hello Guywhoneedshelp3452,

 

I recommend trying the following troubleshooting step to address the issues you've been experiencing: 

 

The error WHEA_UNCORRECTABLE_ERROR"stands for Windows Hardware Error Architecture. If you receive a blue screen error (also known as a stop code) with WHEA_UNCORRECTABLE_ERROR, it means that a hardware issue has occurred on your computer. Issues could be related to such things as hard drives, processor, power supply, bad or corrupted memory (DRAM). It also could be that the device drivers are outdated or corrupted. Another reason could be voltage changes.

 

To fix it, try the following tips in order, as the first or any following step might resolve your issue. 

 

  • Reboot the system and enter BIOS settings, set the BIOS to its default settings, and reboot system.
  • Remove any hardware or drivers that were installed recently to see if they are causing the issue.
  • If unable to start with normal mode, start Windows in safe mode.
    • Check if there are any driver errors in the device manager and update device driver for the devices that are showing errors. 
  • Get the latest updates with Windows Update. Go to Settings → Update & security → Windows Update, and then select Check for updates. When done, reboot your system.
  • Check if there are any hardware issues.
    • Try to boot with minimal hardware settings (for example, one DRAM, less hardware components on your system).
    • Test DRAM: Try Windows 10 app called Windows Memory Diagnostic to check for memory issues.
    • Test the processor, using the Intel® Processor Diagnostic Tool which can be downloaded here. For more information about the tool, see The Intel® Processor Diagnostic Tool.
    • Check the power supply.
    • Try to test other hardware in the system. (For example, if another working hard drive is available, then use it instead.) 

 

ReferenceHow to Resolve Blue Screen Error with WHEA_UNCORRECTABLE_ERROR in...

 

Please let me know the results after you've had a chance to try this.

 

Best regards, 

 

Randy T. 

Intel Customer Support Technician 


0 Kudos
Guywhoneedshelp3452
110 Views

Thanks for your reply. Not long after I made my OP, which there was a big delay on because it was erroneously marked as spam, I got a BSOD while playing a game. It was a CLOCK_WATCHDOG_TIMEOUT BSOD. At this point I was sure the CPU was dying, which seems to be frequently happening to 13900K CPUs. Unfortunately I was unaware of the tendency for 13900Ks to burn themselves out until I started having these problems, and that there were BIOS fixes--but it was too late.

 

I bought a new motherboard and a 14900K, and updated the BIOS to the latest version with the stability fixes as soon as the new hardware was installed. I haven't had a BSOD or any PC instability since.

0 Kudos
RandyT_Intel
Moderator
96 Views

Hello Guywhoneedshelp3452,


Thank you for your update and I apologize for previous OP. I'm glad to hear that you haven't experienced any BSODs or PC instability since upgrading to the new motherboard and 14900K and updating the BIOS.


To confirm, has this completely resolved the issue for you? If so, I will proceed with closing this case. If you encounter any further issues, please feel free to create a new ticket and post again.


Thank you for your patience and understanding.


Best regards, 

 

Randy T. 

Intel Customer Support Technician 


0 Kudos
Reply