Here is the problem: dashboard shows this message: "Check Storage Controller Status". When I click the SCM2 in management interface it shows: "Initializing (Offline)".
Our distribuitor ended up support of this model.
Thank you in advance!
Thank you for reaching us out, as you mention this product is in a En of Interactive Support fase since 2011, meaning that there is no longer support for it and resources are limited, however, I can provide you with some recommendations under no liability.
With that said, it is very important for you to confirm you have a valid backup of your data before to take any further action unless data loosing will have no real impact in your production environment.
First of, I would like to confirm the following information to have a better idea of what could be the problem:
Intel Customer Support
Hi Kenneth! Thank you for fast response!
I wrote down below required information:
1. I have a Intel® Modular Server Chassis MFSYS35 with 5 MFS5520VI Compute Modules.
2. I use buil-in chassis dual controller SAN and MFS5520VI HBAs which takes LUNs from it.
3. Previously I run Windows Server 2008R2 on it, but now all the compute modules are clean. I moved up all the data on spare servers.
4. There`s no RAID-controllers in the compute modules, HBAs only which takes LUNs from built-in SAN.
5. Built-in chassis SAN contains 6 3,5 SAS HUC101890CS4204 drives in RAID5.
6. At the moment servers are clean, I can`t do any measure of performance.
7. All drives are in good condition.
8. Chassis firmware: Current Build Version: 184.108.40.20620307.34729
MFS5520VI Compute Module:BMC Firmware: 1.27.1, BMC Boot: 0.28, BIOS: S5500.86B.01.00.0060.092120111445
Storage Control Module 1 Firmware ok 220.127.116.11
Storage Control Module 2 Firmware ok 18.104.22.168
9. I could provide Diagnostic report from the chassis itself: https://drive.google.com/open?id=0B4ywsAhL5S0MR255d0tDUTYza28
INTERNAL DIAGNOSTICS TEST
Test run: 10/19/2017 10:38:56
Device Present I2C Ping SNMP CIM HAPI VBMC PBMC
------- ------- ------ ------ ------ ------ ------ ------ ------
ESM1 yes PASS PASS PASS - - - -
SCM1 yes PASS - - - - - -
SCM2 yes [T/O] - - - - - -
VSCM yes - PASS PASS PASS - - -
FAN1 yes PASS - - - - - -
FAN2 yes PASS - - - - - -
IOFAN yes PASS - - - - - -
PS1 yes - - - - - - -
PS2 yes - - - - - - -
PS3 yes - - - - - - -
PS4 yes - - - - - - -
SERVER1 yes PASS PASS - - PASS PASS PASS
SERVER2 yes PASS PASS - - PASS PASS PASS
SERVER3 yes PASS PASS - - PASS PASS PASS
SERVER4 yes PASS PASS - - PASS PASS PASS
SERVER5 yes PASS PASS - - PASS PASS PASS
CHASSIS yes - - PASS - PASS PASS PASS
I just finished reviewing the logs, and for what I can see at some point the controller 2 was removed then reinserted and after that started to show as Offfline, It could be that after removal the hardware got physically damaged but also chances are that the firmware got corrupted and since the controller was active there were no changes to it, then when the controller was reinserted the corruption took effect, anyhow, based on the logs I can't be sure of any of the two scenarios and I can't find previous cases since the product has been out for a while.
So, with the information available so far, I would recommend to confirm hardware wise all components are properly seated and run the Firmware update, find the last package https://downloadcenter.intel.com/download/23202/Firmware-update-for-Intel-Modular-Server-System-MFSY... here, make sure to take a look on the release notes for the specific instructions, known issues and requirements, I am also thinking in the drive distribution, you say all your drives are in a single raid, what if the controller 2 is showing as offline because it has no drives assigned and all of the current raid is running in controller one? could you check that please,
I'll stay tuned to your comments.
I addition to the previous answer I would like to know:
I would like to know if you had the chance to review the emails sent to your address and if there are any updates from your side, or if the assistance is no longer needed and we are OK to set this case as closed, either way please let me know by replying to this email.
I'll stay tuned to your comments, best regards.
Intel Customer Support
Thanks for response.
I`ve got Complete System Diagnostics file and the screenshot of failed SCM in Management interface.
P.S I swapped controllers. Now the failed one is in SCM1 slot. I`ve tried to figure out is the problem related midplane/controller.
https://drive.google.com/open?id=1Ab6HcPc9EXacMbLZxe-NUenRtyceJktQ Intel_MFSYS35.zip - Google Drive
I noticed the majority of the errors are related to the controller. There is no information that would make us believe the issue is with the midplane.