SP HBTが停止したA900が応答しないノード
環境
- ONTAP 9
- A900
- FAS9500
- BMC 16.3、16.4、および16.5
問題
- SPハートビートの喪失/停止が原因でノードが停止した場合:
Sat Jan 27 07:42:58 -0800 [netapp-n01: spmgrd: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.
Sat Jan 27 07:55:13 -0800 [netapp-n01: spmgrd: callhome.sp.hbt.missed:notice]: Call home for SP HBT MISSED
Sat Jan 27 08:05:27 -0800 [netapp-n01: spmgrd: callhome.sp.hbt.stopped:alert]: Call home for SP HBT STOPPED
Sat Jan 27 08:08:32 -0800 [netapp-n01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.
Sat Jan 27 08:18:32 -0800 [netapp-n01: env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the SP)
- ノードが自動的にリカバリされず、BMCにアクセスできない