S5000SL, False CPU1 temperature error, possible fix?

RReis2
Beginner

Hello, sorry for my bad English.

I have an S5000SL with 2 CPUs, and CPU1 keeps returning this temperature error.

I have already changed the CPUs, and the error persists.

It occurs randomly, several times in a row.

SEL Viewer log:

18 | 07/17/2013-19:29:24 | OEM Event | Operating system bootup | OEM Event
19 | 07/17/2013-19:31:44 | Temperature / Proc1 VRD Hot (# 0xC8) | Informational event: Proc1 VRD Hot reports its limit has no longer been exceeded. | BMC - LUN # 0 (Channel # 00h)
20 | 07/17/2013-19:31:51 | Temperature / P1 Therm Ctrl % (# 0xC0) | Informational event: P1 Therm Ctrl % reports the sensor is no longer in a critically high state. Sensor reading: 43.29 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
21 | 07/17/2013-19:32:32 | Temperature / Proc1 VRD Hot (# 0xC8) | Warning event: Proc1 VRD Hot reports its limit has been exceeded. | BMC - LUN # 0 (Channel # 00h)
22 | 07/17/2013-19:32:43 | Temperature / P1 Therm Ctrl % (# 0xC0) | CRITICAL event: P1 Therm Ctrl % reports the sensor is high, critical, and going higher state. Sensor reading: 54.21 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
23 | 07/17/2013-19:32:43 | System Event / System Event (# 0x0B) | Informational event: System Event reports a PEF action has occurred - alert. | BMC - LUN # 0 (Channel # 00h)
24 | 07/17/2013-19:34:34 | Temperature / Proc1 VRD Hot (# 0xC8) | Informational event: Proc1 VRD Hot reports its limit has no longer been exceeded. | BMC - LUN # 0 (Channel # 00h)
25 | 07/17/2013-19:34:43 | Temperature / P1 Therm Ctrl % (# 0xC0) | Informational event: P1 Therm Ctrl % reports the sensor is no longer in a critically high state. Sensor reading: 47.97 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
26 | 07/17/2013-19:34:43 | Temperature / Proc1 VRD Hot (# 0xC8) | Warning event: Proc1 VRD Hot reports its limit has been exceeded. | BMC - LUN # 0 (Channel # 00h)
27 | 07/17/2013-19:34:46 | Temperature / P1 Therm Ctrl % (# 0xC0) | CRITICAL event: P1 Therm Ctrl % reports the sensor is high, critical, and going higher state. Sensor reading: 54.21 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
28 | 07/17/2013-19:34:46 | System Event / System Event (# 0x0B) | Informational event: System Event reports a PEF action has occurred - alert. | BMC - LUN # 0 (Channel # 00h)
29 | 07/17/2013-19:34:51 | Temperature / Proc1 VRD Hot (# 0xC8) | Informational event: Proc1 VRD Hot reports its limit has no longer been exceeded. | BMC - LUN # 0 (Channel # 00h)
30 | 07/17/2013-19:34:57 | Temperature / P1 Therm Ctrl % (# 0xC0) | Informational event: P1 Therm Ctrl % reports the sensor is no longer in a critically high state. Sensor reading: 43.29 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
31 | 07/17/2013-19:35:12 | Temperature / Proc1 VRD Hot (# 0xC8) | Warning event: Proc1 VRD Hot reports its limit has been exceeded. | BMC - LUN # 0 (Channel # 00h)
32 | 07/17/2013-19:35:21 | Temperature / P1 Therm Ctrl % (# 0xC0) | CRITICAL event: P1 Therm Ctrl % reports the sensor is high, critical, and going higher state. Sensor reading: 49.92 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
33 | 07/17/2013-19:35:21 | System Event / System Event (# 0x0B) | Informational event: System Event reports a PEF action has occurred - alert. | BMC - LUN # 0 (Channel # 00h)
34 | 07/17/2013-19:38:02 | Temperature / P1 Therm Ctrl % (# 0xC0) | Informational event: P1 Therm Ctrl % reports the sensor is no longer in a critically high state. Sensor reading: 44.07 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
35 | 07/17/2013-19:38:03 | Temperature / P1 Therm Ctrl % (# 0xC0) | CRITICAL event: P1 Therm Ctrl % reports the sensor is high, critical, and going higher state. Sensor reading: 50.31 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
36 | 07/17/2013-19:38:04 | System Event / System Event (# 0x0B) | Informational event: System Event reports a PEF action has occurred - alert. | BMC - LUN # 0 (Channel # 00h)
37 | 07/17/2013-19:38:26 | Temperature / Proc1 VRD Hot (# 0xC8) | Informational event: Proc1 VRD Hot reports its limit has no longer been exceeded. | BMC - LUN # 0 (Channel # 00h)
38 | 07/17/2013-19:38:34 | Temperature / P1 Therm Ctrl % (# 0xC0) | Informational event: P1 Therm Ctrl % reports the sensor is no longer in a critically high state. Sensor reading: 43.29 unspecified. Threshold value: 49.53 unspecified. | BMC - LUN # 0 (Channel # 00h)
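
To make it easier to see how the P1 Therm Ctrl % readings swing around the 49.53 threshold, here is a minimal Python sketch that pulls the reading/threshold pairs out of pasted log text. It assumes the log is saved as plain text with the "Sensor reading: ... Threshold value: ..." wording shown above; the names `READING`, `summarize`, and `sample` are made up for this illustration and are not part of any Intel tool.

```python
import re

# Matches the reading/threshold pair carried by the "P1 Therm Ctrl %" entries.
READING = re.compile(
    r"Sensor reading: ([\d.]+) unspecified\. "
    r"Threshold value: ([\d.]+) unspecified"
)

def summarize(log_text):
    """Return (reading, threshold, exceeded) for each entry that has a reading."""
    rows = []
    for match in READING.finditer(log_text):
        reading = float(match.group(1))
        threshold = float(match.group(2))
        rows.append((reading, threshold, reading > threshold))
    return rows

# Two description fields copied from the log above (records 20 and 22).
sample = (
    "Informational event: P1 Therm Ctrl % reports the sensor is no longer "
    "in a critically high state. Sensor reading: 43.29 unspecified. "
    "Threshold value: 49.53 unspecified.\n"
    "CRITICAL event: P1 Therm Ctrl % reports the sensor is high, critical, "
    "and going higher state. Sensor reading: 54.21 unspecified. "
    "Threshold value: 49.53 unspecified.\n"
)

for reading, threshold, exceeded in summarize(sample):
    state = "above" if exceeded else "below"
    print(f"{reading:.2f} is {state} the threshold {threshold:.2f}")
```

Entries without a reading (the Proc1 VRD Hot and System Event lines) are simply skipped, so the whole log can be fed in unchanged.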

Is there a way to fix this?

Rodrigo.

4 Replies
Edward_Z_Intel
Employee

What makes you believe those are false errors? Did you check if the fans are working, and the heatsink is installed properly? Which chassis are you using?

RReis2
Beginner

Yes, the heatsink is properly installed and the fans are OK.

Sometimes the fans speed up for a few seconds, sometimes for 5 minutes, sometimes for 10 minutes. Sometimes the server runs normally for hours or days. Sometimes the fans stay sped up for hours. It is very random.

The chassis is SC5400BRP.

Thanks for the feedback.

Edward_Z_Intel
Employee

I bet you also have the air duct installed properly, right? Which heatsink are you using?

I'd suggest you apply the latest BIOS/firmware (https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=19672&lang=eng) first. If possible, install Intel® Active System Console (https://downloadcenter.intel.com/Detail_Desc.aspx?DwnldID=19908) to monitor the CPU temperature.

RReis2
Beginner

Sorry for the delay...

http://imageshack.us/g/1/10244522/

They are copper heatsinks. I upgraded the BIOS and the BMC and selected my chassis with FSL48. It worked for a while, and then it again reported CPU1 at 99 °C / 210 °F.

I think it's a problem with the server board.
