Server Products
Data Center Products including boards, integrated systems, Intel® Xeon® Processors, RAID Storage, and Intel® Xeon® Processors
4761 Discussions

S1200BTS in Supermicro SC846E16 got sudden death

SHu12
Beginner
2,236 Views

we have been selling the S1200BTS server board with our 8-bay, 12-bay, and 16-bay chassis well. We don't have 24-bay so we chose Supermicro SC846E16 as our default 24-bay server chassis. This chassis has been working great with the LGA1156 S3420GPLC we integrated. But this time, we put the S1200BTS in the SC846E16 to run the WSS2008 R2. Everything looked good and it started working as a NAS.

- BIOS: not yet updated to ver. 029

- CPU : E3-1200 3.1Ghz

- RAM: Kingston 2GB DDR3 ECC x 4

- HDD: HDS SATA 2TB 7200rpm x 24

- RAID: Areca 1880iX-12

- chassis : Supermicro SC846E16-R1200 , with SAS expander inside

However, every 30~48 hours the server got sudden death and we have to power off it directly for a cold boot.

The sudden death means,

- black screen, no signal, can't see any thing

- no response to the ping command from other computers, the NIC LED flashed though

- mouse and keyboard didn't work

- all the LEDs looked in normal state, the LEDs on the chassis, and the post code diagnostic LEDs on the S1200BTS

We checked the log from WSS2008 R2 OS and the application running but got nothing weird or abnormal. We built three servers with the same combination as stated above. All the three units got the same problem, all. Here were what we did to sort it out,

- replaced the server board with another new S1200BTS, still got the same in 48 hours (I have more than 10 S1200BTS)

- uninstall the application software and run again, still got hang with black screen in 48 hours

- replaced the 24-bay chassis with our own 8-bay and 12-bay, both worked very well over 5 days

- replaced the three S1200BTS with three S3420GPLC, all have been working very well over 7 days

any idea ?? please comment. Thanks.

0 Kudos
7 Replies
Edward_Z_Intel
Employee
434 Views

Could be power, thermal or vibration issue. You may try to start with minimal configuration first, and add components one at a time to isolate the cause.

DSilv11
Valued Contributor III
434 Views

Any thing reported in the SEL log?

0 Kudos
Edward_Z_Intel
Employee
434 Views

It's a "S" SKU, which doesn't have BMC...

0 Kudos
DSilv11
Valued Contributor III
434 Views

ahhh, missed that. Guess we are even.

0 Kudos
SHu12
Beginner
434 Views

Thank you, Edward. Thank you Creek.

Could be Power:

The three units of 24-bay server were put together in the same 19" 42U rack. So, we pulled power source from the next rack to support two of the three 24-bay servers. In the left rack, we have one 24-bay server and three 2U Dell servers. The other two 24-bay servers used the power cords from the right rack which contains nothing but the two 24-bay servers. 20 hours passed, the one using the power source of left rack got sudden death. After 6 hours, one of the 24-bay server using the power source from the right rack got sudden death. Holly, should not be the power source problem.

However it's a good idea to add components (like HDD I guessed) especially we prepared another two chassis (Supermicro SC846E16-R1200) which will be tested with S1200BTS and 24 x HDS 2TB 7.2k 6Gb SATA drives.

Anything in SEL ?

I can say it's not thing in the SEL related to the sudden death. No screen (black), nothing we can find or read but the LEDs (power, NIC, post code diagnostic) all looked normal. The dead one has the same LED status as the alive ones. But yes, we did see something after the re-power-on but were not telling anything about the hang/dead/black-screen/... I have read through all the logs including the huge stuff related to Microsoft, all didn't say anything about it. Nothing for us to investigate.

However, the weird thing was, just replaced the S1200BTS with S3420GPLC solved the problem. It's that simple. Replacing it with another two new S1200BTS didn't help. We found the systems were all not busy working when it got sudden death -- from the Windows system event logs. We doubt it could be the BIOS ver. I don't know, we will try the new version of BIOS 029 dedicated for S1200BTS (the ver. 030 is for S1200BTL only).

0 Kudos
SHu12
Beginner
434 Views

hi everyone, it has been a while and seems no progress on this issue. we at last replaced all those S1200BTS with new purchased S3420GPLC which is relatively reliable and more proven compatibleness we believe. And it wroks well without any strange things. So that business was secured finally and fortunately.

0 Kudos
SHu12
Beginner
434 Views

NOPE, even the S3400GPLC didn't solve the problem with the Supermicro SC846E16 chassis (with 6G SAS expander). It's 3 months passed, this combiniation did't win the customer satisfaction. Because, the new system got reboot sometimes. Reboot looks better than sudden death with black-screen. However, it's still a very big problem for a server (or servers) here. We have not yet resolved the issue. We heard rumours that,

"when you purchased the Supermicro chassis, just use their server board. Their own server board matches the special design of their power system."

I forgot to mention,

We built a new system with a new purchased Supermicro 846E16 24-bay chassis (with 6G SAS expander inside) and a new S5500-HCV with E5605 CPU. Yeah, a new set with latest firmware. We spent three day-and-night to test it with enthiusiasm. The new system reboot every night. We are sure now have to test with the Supermicro server board for this g.d. chassis.

Whatever, we just placed the order to buy Supermicro X9SCL server board. It probably would come in 2 weeks. We will see. No tears, all in the stomatch.

0 Kudos
Reply