How to find out the reason for a CATERR?

SamuelLi
Beginner

Is there any document that explains the possible causes of a CATERR?

Jocelyn_Intel
Employee

Hello, @SamuelLi  

 

Thank you for posting on the Intel® communities.  

 

Let me do some research so I can help you out. As soon as I have any information, I will get back to you. 

 

Best regards,  

Jocelyn M.   

Intel Customer Support Technician. 


Jocelyn_Intel
Employee

Hello, @SamuelLi  

 

Thank you for your time. 

 

CATERR stands for Catastrophic Error. The CATERR# signal indicates that the system has experienced a catastrophic error and cannot continue to operate. The processor asserts this signal for non-recoverable machine check errors and other unrecoverable internal errors. CATERR# signals the following types of errors:

Legacy MCERR: CATERR# is asserted for 16 BCLKs.
Legacy IERR: CATERR# remains asserted until a warm or cold reset.
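
To make the two cases concrete, here is a minimal Python sketch that classifies a captured CATERR# assertion by its duration. It is only an illustration: the function name, the 100 MHz BCLK default, and the 25% tolerance are assumptions and do not come from any Intel tool or document.

```python
from typing import Optional


def classify_caterr(asserted_ns: Optional[float],
                    bclk_mhz: float = 100.0,
                    tolerance: float = 0.25) -> str:
    """Classify a CATERR# assertion as MCERR, IERR, or unknown.

    asserted_ns: measured assertion time in nanoseconds, or None if the
                 signal stayed asserted until the platform was reset.
    bclk_mhz:    BCLK frequency in MHz (100 MHz is an assumed default).
    tolerance:   allowed relative deviation from 16 BCLKs (assumed value).
    """
    if asserted_ns is None:
        # Held until warm or cold reset: matches the legacy IERR behavior.
        return "IERR"

    # One BCLK period in ns is 1000 / MHz, so 16 BCLKs is 16 times that.
    expected_ns = 16 * 1000.0 / bclk_mhz
    if abs(asserted_ns - expected_ns) <= tolerance * expected_ns:
        # A pulse of roughly 16 BCLKs: matches the legacy MCERR behavior.
        return "MCERR"

    return "unknown"
```

For example, under these assumptions `classify_caterr(None)` reports an IERR, while a pulse close to 16 BCLK periods is reported as an MCERR.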

  

For more information, see the documentation section "Error and Thermal Protection Signals".


Please let me know if there is anything else I can help you with.


Best regards,  

Jocelyn M.   

Intel Customer Support Technician. 


SamuelLi
Beginner

Hi @Jocelyn_Intel,

 

Appreciate the information you shared with me.

 

Based on that information, to find the actual reason CATERR# was asserted, I need to check the registers related to legacy MCERRs (16 BCLK assertion) and legacy IERRs. Is my understanding correct?

 

If so, could you share the IDs of the related documents? Also, is there any application that can analyze the CATERR log downloaded from the BMC to identify the actual errors?

 

 

Best regards,

Samuel Li

 

Jocelyn_Intel
Employee

Hello, @SamuelLi  

 

Thank you for your reply. 

 

I will look into this and get back to you once I have more information.

 

Best regards,  

Jocelyn M.   

Intel Customer Support Technician. 


Jose_Intel
Employee

Hello SamuelLi  

 

Thank you for patiently waiting.


You can look up the MCE-related error-reporting registers in document ID 640686 (Raptor Lake, RPL S, EDS, Volume 2).
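
As a rough illustration of what checking those registers involves, here is a minimal Python sketch that splits a raw 64-bit IA32_MCi_STATUS value into the architectural fields described in the Machine-Check Architecture chapter of the Intel SDM. The function name, the dataclass, and the sample value in the usage note are hypothetical, and the model-specific bits for a given processor still have to be looked up in its own documentation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class McStatus:
    valid: bool               # bit 63 (VAL): register holds a valid error
    overflow: bool            # bit 62 (OVER): an earlier error was overwritten
    uncorrected: bool         # bit 61 (UC): error was not corrected
    enabled: bool             # bit 60 (EN): error reporting was enabled
    misc_valid: bool          # bit 59 (MISCV): IA32_MCi_MISC is valid
    addr_valid: bool          # bit 58 (ADDRV): IA32_MCi_ADDR is valid
    context_corrupt: bool     # bit 57 (PCC): processor context corrupted
    model_specific_code: int  # bits 31:16, meaning depends on the processor
    mca_error_code: int       # bits 15:0, architectural MCA error code


def decode_mc_status(raw: int) -> McStatus:
    """Decode the architectural fields of a raw IA32_MCi_STATUS value."""
    def bit(n: int) -> bool:
        return bool((raw >> n) & 1)

    return McStatus(
        valid=bit(63),
        overflow=bit(62),
        uncorrected=bit(61),
        enabled=bit(60),
        misc_valid=bit(59),
        addr_valid=bit(58),
        context_corrupt=bit(57),
        model_specific_code=(raw >> 16) & 0xFFFF,
        mca_error_code=raw & 0xFFFF,
    )
```

Calling `decode_mc_status(0xBE00000000800400)` on this made-up value would report a valid, uncorrected error with processor context corrupt set. The two code fields then need to be interpreted against the SDM tables and the EDS for the specific part.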


You can also use the Intel System Debugger / Crashlog Analyzer / Summarizer to obtain more information about the error.

 

Supermicro's teams also know how to use the crash dump to decode MCEs and IERRs. If you have more related questions, please let us know and we will be happy to assist; otherwise, we can close the thread.

 

Best regards,  

Jose B.   

Intel Customer Support Technician 


IsaacQ_Intel
Employee

Hello, SamuelLi

 

We hope you are doing fine.

 

Were you able to check the previous post?

Let us know if you still need assistance.

 

Best regards,

Isaac Q. 

Intel Customer Support Technician


Jose_Intel
Employee

Hello SamuelLi

 

We hope you are having a nice day. 

 

We have not heard back from you, so we will close this thread. If you need any additional information, please submit a new question, as this thread will no longer be monitored.

 

Best regards,

Jose B.  

Intel Customer Support Technician. 

