Intel® Xeon® Processor and Server Products
Intel® Xeon® Processors, Data Center Products including boards, integrated systems, and RAID Storage
5181 讨论

[REGRESSION] Serious SAS slowdown after BIOS upgrade

VBerk1
新手
2,350 次查看

Hi,

We have a server and a storage system, as follows:

We've managed to get out some 2800MBps from this setup but had serious issues with thermal management, it easily went over 100 Celsius degrees under heavy load, so I decided to upgrade the BIOS (i.e. BIOS, ME, FRU/SDR, BMC, everything) to the https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&ProdId=3594&DwnldID=23008&ProductFamily=Server+Products&ProductLine=Intel%C2%AE+Server+Boards&ProductProduct=Intel%C2%AE+Server+Board+S4600LH+Family&lang=eng latest release. During the FRU/SDR upgrade it asked me several questions regarding if I want to enter a serial number and stuff, I answered all of those with no (if that matters in any way).

The good news is that since the upgrade thermal management seems to have been sorted out, now it greatly increases fan speed when it goes above 90C and cools down to 70-80ish, then the fans slow down and it starts over.

The bad news is that now I can't get out more than about 2100MBps from the storage. Many BIOS settings have been modified after the upgrade, so a related setting (maybe something with PCI-E) might be the cause. I'm yet to find it.

Anyway, we've performed several benchmarks before the upgrade, too. It produced similar speeds when I plugged in the 2 SAS cables into different converter boards in the JBOD (to test HA). So it may be that for some reason the RAID controller only speaks on one cable now. I have no idea how that could happen after a server mobo BIOS upgrade, but I can't rule it out either.

The other possible cause is, again, some bad PCI-E setting.

In any case, I'll try it out with the other JBOD (fortunately we have 2), with different cables, and in a different PCI-E slot.

A last resort option is a BIOS reset, but I'd prefer not to do that.

The system has 4x8 cores, so with HT it's 64 cores in total (0% typical CPU load, even during benchmarks) and 384GB RAM, so these aren't quite likely to be bottlenecks

Do you have any ideas what else may went wrong?

0 项奖励
1 解答
VBerk1
新手
1,252 次查看

Oh guess what, it's proved to be a BIOS regression. After like a day of struggling with other things, finally (and fearfully) I downgraded to the https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=22568&lang=eng penultimate release as a last resort, and now BOTH thermal management AND storage throughput is fine OOB.

Where can I report this bug to Intel? I don't want to mark this as an answer just yet, coz the problem's not solved, just worked around.

在原帖中查看解决方案

0 项奖励
3 回复数
VBerk1
新手
1,253 次查看

Oh guess what, it's proved to be a BIOS regression. After like a day of struggling with other things, finally (and fearfully) I downgraded to the https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=22568&lang=eng penultimate release as a last resort, and now BOTH thermal management AND storage throughput is fine OOB.

Where can I report this bug to Intel? I don't want to mark this as an answer just yet, coz the problem's not solved, just worked around.

0 项奖励
VBerk1
新手
1,252 次查看

NVM, I've contacted Intel via the http://www.intel.com/support/feedback.htm official contact page.

0 项奖励
VBerk1
新手
1,252 次查看

It's a http://www.intel.com/support/motherboards/server/sb/CS-033827.htm known bug. Until a fix is released, I'm advised to stick with v7.

0 项奖励
回复