Server Products
Data Center Products including boards, integrated systems, Intel® Xeon® Processors, RAID Storage, and Intel® Xeon® Processors
4778 Discussions

Decoding System Event Log data to get more information

MGarg1
Beginner
3,020 Views

Hello

I have a server workstation board and I am experiencing a system freeze with a CPU_CATERR -

I am trying to understand the data in the system event log to see if it will point me in the right direction:

I tried using the document here:

https://www.intel.com/content/www/us/en/support/articles/000006888/server-products.html System Event Log Troubleshooting Guides for Intel® Server Boards

but to no avail.

BMC-Owned Sensors (GID = 0020h) Section 3.1 Table 5 Does not have an entry for 0xDA.

Can anyone shed some light on this?

Board Has A C612 Chipset with 2 xeon E5 V4 2630 Processors on it.

Complete Info From the SEL Data:

04 00 02 2C A1 18

5A 20 00 04 07 DA

6F 00 FF FF

SENSOR # 0xDA

CPU_CATERR

Event Description IERR

RECORD TYPE 0x02

Thank You

0 Kudos
14 Replies
idata
Employee
1,455 Views

Hello Michael,

 

 

Thank you for contacting Intel(R) Technical Support.

 

 

In an effort to better assist can you please provide the Model the Intel(R) server board you are using?

 

 

Looking forward to your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
0 Kudos
MGarg1
Beginner
1,455 Views

Hello Jeremiah,

Its an Asus board - Asus z10pe-d16 ws

Thank you

0 Kudos
idata
Employee
1,455 Views

Hello Michael,

 

 

Thank you for your quick response.

 

 

I will go ahead and look into this error and get back with you as soon as possible.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
0 Kudos
idata
Employee
1,455 Views

Hello Michael,

 

 

I looked up for this error: CPU_CATERR. It stands for Catastrophic Error. This error is generated by the processor.

 

I would suggest flashing the firmware on this ASUS board.

 

 

If the problem persists I would suggest checking with the board manufacturer.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

Intel(R) Technical Support

 

 

0 Kudos
idata
Employee
1,455 Views

Hello Michael,

 

 

I hope you are doing well today.

 

I'm following up with you see if the instructions provided helps or if you still need assistance from us.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support.

 

 

0 Kudos
MGarg1
Beginner
1,455 Views

I am currently communicating with Asus trying to get more detail about the error.

While its obvious that its a CPU related error, it is odd that I am able to run extreme stability tests on the machine with no issue, and then have the board lock up when doing almost nothing.

Their tech support is.... challenging.... at least to get to anyone with technical knowledge beyond "flash the bios" which I have done and has not helped.

If I find a solution, I will follow up here -

Thanks for following up -

0 Kudos
idata
Employee
1,455 Views

Hello Michael,

 

 

Thank you for your quick response.

 

I was investigating more about this error (CATERROR), and I found that when Catastrophic Error Signal stays asserted, it is a sign that something serious has gone wrong with the hardware.

 

 

Suggestions of what you can do to troubleshoot this error are:

 

 

- Check for the other errors near the time of the CATERR event.

 

- Verify all peripherals are plugged in and operating correctly, particularly Hard Drives, Optical Drives, and I/O.

 

- We already mentioned updating the server board firmware in a previous post.

 

- You mentioned that the server is running two Intel(R) XEON Processors. Can try one CPU at time to discard that they are faulty?

 

 

Please try these steps and let me know your results. More than glad to assist you.

 

 

regards,

 

 

Jeremiah A.

 

Intel(R) Technical Support

 

 

 

0 Kudos
idata
Employee
1,455 Views

Hello Michael,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if the information provided helped or if you still need assistance with this issue.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
0 Kudos
idata
Employee
1,455 Views

Hello Michael,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if the information provided helped or if you still need assistance with this issue.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
0 Kudos
MGarg1
Beginner
1,455 Views

HI -

Thanks for following up - Im still waiting on some info from Asus -

I am going to try one CPU at a time and a single Ram module to isolate - I just have not done it yet since I need several hours of downtime to try it -

Thank You

0 Kudos
idata
Employee
1,455 Views

Hi,

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
0 Kudos
idata
Employee
1,455 Views

Hello,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if you were run the troubleshooting.

 

 

Please let me know your results.

 

 

regards,

 

Jeremiah A.

 

0 Kudos
idata
Employee
1,455 Views

Hello,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if you were run the troubleshooting.

 

 

Please let me know your results.

 

 

regards,

 

Jeremiah A.
0 Kudos
idata
Employee
1,455 Views

Hello,

 

 

I have not heard back from you regarding this case. I will proceed in closing this case. Please feel free to contact us for future assistance with Intel(R) Products. More than glad to assist you.

 

 

regards,

 

 

Jeremiah A.

 

Intel(R) Technical Support
0 Kudos
Reply