Unfortunately, My S2600CW server restarted 2 times unexpectedly several days ago. I want to get the reason, my DB server is not reliable any more.
From the BMC SEL Log, i found CPU ERR2 was noted. But there was no any other related SEL items about this issue.[Reset on ERR2 was enabled]
No PCIE related SEL...
Here is the BMC log:
first time:
From the SEL HEX log, (RID:0021), ED:81 02, i get CPU2 caused ERR2
second time:
Similarly, but i get SEL log ** ED:81 03, CPU1 & CPU2 cause the ERR2.
But it's not enough to get the really root cause.
Any experts can help to analyze the possible reason? Or how to get the root cause?
BR// Marvin
- 標籤:
- CPU
連結已複製
3 回應
Hello Longfei,
This is an indicator of a catastrophic error on CPU 2, and for these type of cases, a replacement is applied. However, before we proceed, is there a way for you to swap processors, run the log utility and check if the error follows the processor or if it stays on socket 2.
Once we get this information, the next step is to create a warranty case and replace the faulty component.