ONTAPによって報告された複数のSHELF_FAULTおよびSHELF COOLING UNIT FAILED
環境
- ONTAP 9
- DS460C (DS460-12)、NS224NSM100
問題
- 複数のSHELF_FAULTとSHELF COOLING UNIT FAILEDがAutoSupportsに表示されます:
HA Group Notification (SHELF_FAULT) ERROR.
HA Group Notification (SHELF COOLING UNIT FAILED) EMERGENCY
- シェルフ障害およびシェルフ冷却ユニット障害エラーは、短時間で回復して正常な状態になります。
例:
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 1: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 2: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 3: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 4: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 5: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 6: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 7: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 8: normal status.
[?] Mon Feb 20 19:48:00 +0800 [n19911002-01: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
[?] Tue Feb 21 10:36:51 +0800 [n19911002-01: statd: monitor.shelf.fault.ok:notice]: Fault previously reported on disk storage shelf attached to channel 0a has been corrected.
[?] Tue Feb 21 10:37:00 +0800 [n19911002-01: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
- すべてまたは一部の冷却ユニットが
environmentの出力で故障しています。Cooling Unit installed element list: 1, 2, 3, 4, 5, 6, 7, 8; with error: 1, 2, 3, 4, 5, 6, 7, 8
- シェルフにディスクを使用するActive File Systemのアグリゲートがある場合、システムでマルチディスクパニックイベントが発生することがあります:
Dec 05 14:13:56 [NODE-01:mgr.boot.reason_abnormal:EMERGENCY]: System rebooted after a panic. PANIC : aggr aggr_name: raid volfsm, fatal multi-disk error..
。