Community
cancel
Showing results for 
Search instead for 
Did you mean: 
BGuo2
Beginner
2,410 Views

what does correctable ecc asserted explicily mean?

I have "correctable ecc asserted" warning in the bmc of my server. This event probably lead to the server status light turned amber and blink. I wonder is this event mean only one bit error occurred in the dimm or the number of error occurred in that dimm exceeded the threshold? If it is the first case, I think it is ok, and won't lead to any server health problem. I hope someone can help me with this!

Tags (1)
0 Kudos
8 Replies
idata
Community Manager
239 Views

Hello Mr. Guo,

 

 

In regards to your question the BMC error messages could change from board to board and even with the firmware version. Could you please specify what board/chassis model you have on your server and what BIOS version is it currently running?

 

 

Regards

 

 

Jose H.
BGuo2
Beginner
239 Views

board:s2600cw2r

BIOS01010022

ME030103043

BMC015010802

FRUSDR114

thank you for your help!

idata
Community Manager
239 Views

Hello Mr. Guo,

Thanks for the info.

Are you seen this error message from the Server Health > DIMM Information tab? I couldn't find the exact description of the error even on the BMC user guide: http://download.intel.com/support/motherboards/server/sb/intel_rmm4_ibwc_userguide_r2_72.pdf http://download.intel.com/support/motherboards/server/sb/intel_rmm4_ibwc_userguide_r2_72.pdf

I will continue researching on this and will get back to you with updates

Regards

Jose H.

BGuo2
Beginner
239 Views

No. I saw this in the even log.

Sorry for this late reply, but I was on a business trip last week.

idata
Community Manager
239 Views

Mr. Guo,

 

 

Do you mind to share that event log file here? I would like to take a look at it.

 

 

Jose H.
idata
Community Manager
239 Views

Hello Mr. Guo,

 

 

Let me share with you the following info in regards to Correctable Error Correcting Code (ECC) or other correctable memory error for memory modules
  1. Decode DIMM error(s) using the https://www.intel.com/content/www/us/en/support/server-products/000023940.html System Information Retrieval Utility.
  2. Verify the DIMM is seated properly.
  3. Examine gold fingers on edge of the DIMM to ensure that the contacts are clean.
  4. Inspect the processor socket DIMM for any bent contacts/pins. If you find bent contacts/pins, replace the board.
  5. Consider replacing the DIMM as a preventive measure if the correctable error becomes uncorrectable.
Hope this helps.

 

 

Jose H.
idata
Community Manager
239 Views

Hello Mr. Guo,

 

 

Do you have updates in regards to this?

 

 

Just let me know.

 

 

Jose H.
idata
Community Manager
239 Views

Hello Mr. Guo,

 

 

I will proceed to mark this thread as closed. If you have further questions just create a new topic and we will be glad to assist you.

 

 

Regards

 

 

Jose H.
Reply