After several years of operation, both Storage Controller Modules are suddenly showing as offline. Everything is operating normally except the storage console, and there is nothing in the event logs to indicate a problem with either controller. Can I safely reset the primary controller?
The firmware is not the freshest (Build 18.104.22.16800630.21247), but finding a window to shut everything down has proven difficult given the high utilization of this machine, and I'm hoping I don't have to power down this time.
You can safely reset the primary only if the dual SCMs are set up correctly for failover and the multipath drivers are installed. Otherwise, you'd have to shut down all the compute modules first.
Do any of the physical drives show PFA (Predictive Failure Analysis)?