Community
cancel
Showing results for 
Search instead for 
Did you mean: 
VBerk1
Novice
1,127 Views

[REGRESSION] Serious SAS slowdown after BIOS upgrade

Jump to solution

Hi,

We have a server and a storage system, as follows:

We've managed to get out some 2800MBps from this setup but had serious issues with thermal management, it easily went over 100 Celsius degrees under heavy load, so I decided to upgrade the BIOS (i.e. BIOS, ME, FRU/SDR, BMC, everything) to the https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&ProdId=3594&DwnldID=23008&ProductFamily=Serv... latest release. During the FRU/SDR upgrade it asked me several questions regarding if I want to enter a serial number and stuff, I answered all of those with no (if that matters in any way).

The good news is that since the upgrade thermal management seems to have been sorted out, now it greatly increases fan speed when it goes above 90C and cools down to 70-80ish, then the fans slow down and it starts over.

The bad news is that now I can't get out more than about 2100MBps from the storage. Many BIOS settings have been modified after the upgrade, so a related setting (maybe something with PCI-E) might be the cause. I'm yet to find it.

Anyway, we've performed several benchmarks before the upgrade, too. It produced similar speeds when I plugged in the 2 SAS cables into different converter boards in the JBOD (to test HA). So it may be that for some reason the RAID controller only speaks on one cable now. I have no idea how that could happen after a server mobo BIOS upgrade, but I can't rule it out either.

The other possible cause is, again, some bad PCI-E setting.

In any case, I'll try it out with the other JBOD (fortunately we have 2), with different cables, and in a different PCI-E slot.

A last resort option is a BIOS reset, but I'd prefer not to do that.

The system has 4x8 cores, so with HT it's 64 cores in total (0% typical CPU load, even during benchmarks) and 384GB RAM, so these aren't quite likely to be bottlenecks

Do you have any ideas what else may went wrong?

Tags (2)
0 Kudos

Accepted Solutions
VBerk1
Novice
29 Views

Oh guess what, it's proved to be a BIOS regression. After like a day of struggling with other things, finally (and fearfully) I downgraded to the https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=22568&lang=eng penultimate release as a last resort, and now BOTH thermal management AND storage throughput is fine OOB.

Where can I report this bug to Intel? I don't want to mark this as an answer just yet, coz the problem's not solved, just worked around.

View solution in original post

3 Replies
VBerk1
Novice
30 Views

Oh guess what, it's proved to be a BIOS regression. After like a day of struggling with other things, finally (and fearfully) I downgraded to the https://downloadcenter.intel.com/Detail_Desc.aspx?agr=Y&DwnldID=22568&lang=eng penultimate release as a last resort, and now BOTH thermal management AND storage throughput is fine OOB.

Where can I report this bug to Intel? I don't want to mark this as an answer just yet, coz the problem's not solved, just worked around.

View solution in original post

VBerk1
Novice
29 Views

NVM, I've contacted Intel via the http://www.intel.com/support/feedback.htm official contact page.

VBerk1
Novice
29 Views

It's a http://www.intel.com/support/motherboards/server/sb/CS-033827.htm known bug. Until a fix is released, I'm advised to stick with v7.