Community
cancel
Showing results for 
Search instead for 
Did you mean: 
MGarg1
Beginner
1,710 Views

Decoding System Event Log data to get more information

Hello

I have a server workstation board and I am experiencing a system freeze with a CPU_CATERR -

I am trying to understand the data in the system event log to see if it will point me in the right direction:

I tried using the document here:

https://www.intel.com/content/www/us/en/support/articles/000006888/server-products.html System Event Log Troubleshooting Guides for Intel® Server Boards

but to no avail.

BMC-Owned Sensors (GID = 0020h) Section 3.1 Table 5 Does not have an entry for 0xDA.

Can anyone shed some light on this?

Board Has A C612 Chipset with 2 xeon E5 V4 2630 Processors on it.

Complete Info From the SEL Data:

04 00 02 2C A1 18

5A 20 00 04 07 DA

6F 00 FF FF

SENSOR # 0xDA

CPU_CATERR

Event Description IERR

RECORD TYPE 0x02

Thank You

Tags (1)
0 Kudos
14 Replies
idata
Community Manager
145 Views

Hello Michael,

 

 

Thank you for contacting Intel(R) Technical Support.

 

 

In an effort to better assist can you please provide the Model the Intel(R) server board you are using?

 

 

Looking forward to your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
MGarg1
Beginner
145 Views

Hello Jeremiah,

Its an Asus board - Asus z10pe-d16 ws

Thank you

idata
Community Manager
145 Views

Hello Michael,

 

 

Thank you for your quick response.

 

 

I will go ahead and look into this error and get back with you as soon as possible.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
idata
Community Manager
145 Views

Hello Michael,

 

 

I looked up for this error: CPU_CATERR. It stands for Catastrophic Error. This error is generated by the processor.

 

I would suggest flashing the firmware on this ASUS board.

 

 

If the problem persists I would suggest checking with the board manufacturer.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

Intel(R) Technical Support

 

 

idata
Community Manager
145 Views

Hello Michael,

 

 

I hope you are doing well today.

 

I'm following up with you see if the instructions provided helps or if you still need assistance from us.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support.

 

 

MGarg1
Beginner
145 Views

I am currently communicating with Asus trying to get more detail about the error.

While its obvious that its a CPU related error, it is odd that I am able to run extreme stability tests on the machine with no issue, and then have the board lock up when doing almost nothing.

Their tech support is.... challenging.... at least to get to anyone with technical knowledge beyond "flash the bios" which I have done and has not helped.

If I find a solution, I will follow up here -

Thanks for following up -

idata
Community Manager
145 Views

Hello Michael,

 

 

Thank you for your quick response.

 

I was investigating more about this error (CATERROR), and I found that when Catastrophic Error Signal stays asserted, it is a sign that something serious has gone wrong with the hardware.

 

 

Suggestions of what you can do to troubleshoot this error are:

 

 

- Check for the other errors near the time of the CATERR event.

 

- Verify all peripherals are plugged in and operating correctly, particularly Hard Drives, Optical Drives, and I/O.

 

- We already mentioned updating the server board firmware in a previous post.

 

- You mentioned that the server is running two Intel(R) XEON Processors. Can try one CPU at time to discard that they are faulty?

 

 

Please try these steps and let me know your results. More than glad to assist you.

 

 

regards,

 

 

Jeremiah A.

 

Intel(R) Technical Support

 

 

 

idata
Community Manager
145 Views

Hello Michael,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if the information provided helped or if you still need assistance with this issue.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
idata
Community Manager
145 Views

Hello Michael,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if the information provided helped or if you still need assistance with this issue.

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
MGarg1
Beginner
145 Views

HI -

Thanks for following up - Im still waiting on some info from Asus -

I am going to try one CPU at a time and a single Ram module to isolate - I just have not done it yet since I need several hours of downtime to try it -

Thank You

idata
Community Manager
145 Views

Hi,

 

 

Please let me know your results.

 

 

regards,

 

 

Jeremiah A.

 

 

Intel(R) Technical Support
idata
Community Manager
145 Views

Hello,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if you were run the troubleshooting.

 

 

Please let me know your results.

 

 

regards,

 

Jeremiah A.

 

idata
Community Manager
145 Views

Hello,

 

 

I hope you are doing well today.

 

 

I'm following up with you to see if you were run the troubleshooting.

 

 

Please let me know your results.

 

 

regards,

 

Jeremiah A.
idata
Community Manager
145 Views

Hello,

 

 

I have not heard back from you regarding this case. I will proceed in closing this case. Please feel free to contact us for future assistance with Intel(R) Products. More than glad to assist you.

 

 

regards,

 

 

Jeremiah A.

 

Intel(R) Technical Support