Environmental shutdown due to fan failure on FAS9000
Applies to
- ONTAP 9
- AFF A700 / FAS9000
- SP firmware version 4.7 or later
Issue
A "multiple fan failure" event has powered off the controller, and the partner node has taken over.
- Check the EMS event log:
::> event log show -event *fan*
Thu Mar 05 11:20:05 CET [node_name: env_mgr: monitor.chassisFan.stop:error]: Chassis fan contains at least one stopped fan: FanB1 F1 at failed
Thu Mar 05 11:20:05 CET [node_name: env_mgr: monitor.chassisFan.stop:error]: Chassis fan contains at least one stopped fan: FanB1 F2 at failed
Thu Mar 05 11:20:05 CET [node_name: env_mgr: monitor.chassisFan.stop:error]: Chassis fan contains at least one stopped fan: FanB1 F3 at failed
Thu Mar 05 11:20:05 CET [node_name: env_mgr: monitor.chassisFan.stop:error]: Chassis fan contains at least one stopped fan: FanB1 F4 at failed
Thu Mar 05 11:20:07 CET [node_name: env_mgr: monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan failure: System will shut down in 2 minutes.
Thu Mar 05 11:20:35 CET [node_name: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: FanB1 F1
Thu Mar 05 11:20:35 CET [node_name: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: FanB1 F2
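When the event log is long, the EMS lines above can be summarized with a small script. The following is a minimal sketch, assuming the log has been saved as plain text in exactly the line format shown above. The function name `summarize_fan_events` and the regular expression are illustrative assumptions, not part of ONTAP or any NetApp tool.

```python
import re

# Assumed EMS line layout, taken from the sample output above:
# "<timestamp> [<node>: <source>: <event>:<severity>]: <message>"
EMS_LINE = re.compile(
    r"^(?P<ts>.+?) \[(?P<node>[^:]+): (?P<source>[^:]+): "
    r"(?P<event>[\w.]+):(?P<severity>\w+)\]: (?P<msg>.*)$"
)


def summarize_fan_events(lines):
    """Return (stopped_fans, shutdown_announced) for an iterable of EMS lines.

    Hypothetical helper: collects the fans named in monitor.chassisFan.stop
    events and notes whether a monitor.chassisFanFail.xMinShutdown event
    was logged.
    """
    stopped = []
    shutdown = False
    for line in lines:
        m = EMS_LINE.match(line.strip())
        if not m:
            continue  # skip lines that do not look like EMS entries
        if m["event"] == "monitor.chassisFan.stop":
            fan = re.search(r"stopped fan: (\w+ \w+)", m["msg"])
            if fan and fan.group(1) not in stopped:
                stopped.append(fan.group(1))
        elif m["event"] == "monitor.chassisFanFail.xMinShutdown":
            shutdown = True
    return stopped, shutdown


if __name__ == "__main__":
    sample = [
        "Thu Mar 05 11:20:05 CET [node_name: env_mgr: monitor.chassisFan.stop:error]: "
        "Chassis fan contains at least one stopped fan: FanB1 F1 at failed",
        "Thu Mar 05 11:20:07 CET [node_name: env_mgr: "
        "monitor.chassisFanFail.xMinShutdown:EMERGENCY]: "
        "Multiple Chassis Fan failure: System will shut down in 2 minutes.",
    ]
    print(summarize_fan_events(sample))
```

Several stopped fans in the same fan module, followed by a shutdown announcement, match the pattern this article describes.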
- Check the SP log output:
Mar 2 08:15:24 (none) : [477 WARNING][Porting/platform/PDKFan.c:1397]FanDaemon: Failure to read FanModuleData from fan module 1
Mar 2 08:15:24 (none) : [477 WARNING][Porting/platform/PDKFan.c:1416]FanDaemon: Fan module 1 marked bad after 3 consecutive failures
Mar 2 08:15:30 (none) : [477 WARNING][Porting/platform/PDKFan.c:1397]FanDaemon: Failure to read FanModuleData from fan module 1
Mar 2 08:15:30 (none) : [477 WARNING][Porting/platform/PDKFan.c:1416]FanDaemon: Fan module 1 marked bad after 3 consecutive failures
- Check the output of the following command:
::> system node run -node <node_name> environment status chassis all