シャーシが過熱したために AFF-A300 / FAS8200 ノードがシャットダウンされます
環境
- FAS8200
- AFF-A300
- ONTAP 9
問題
- シャーシの過熱が原因でノードがシャットダウンされましたが、摂氏88度しか報告されていません:
[?] Mon Jul 26 09:32:44 -0400 [Node_1: env_mgr: monitor.chassisTemperature.warm:alert]: Chassis temperature is too warm: CPU0 Temp Margin is critical high (88 C). [?] Mon Jul 26 09:32:44 -0400 [Node_1: env_mgr: monitor.shutdown.chassisOverTemp:EMERGENCY]: Chassis temperature is too hot: CPU0 Temp Margin is critical high. System will be shutdown in 2 minutes
- BMCからのすべてのコマンドであるイベントの出力からは、次の情報が表示されます。
Record 710: Mon Jul 26 13:32:40 2021 [IPMI.notice]: 6301 | 02 | EVT: 015758f5 | CPU0_Temp_Margin | Assertion Event, "Upper Non-critical going high" Record 711: Mon Jul 26 13:32:40 2021 [IPMI.notice]: 6401 | 02 | EVT: 015958ff | CPU0_Temp_Margin | Assertion Event, "Upper Critical going high" Record 712: Mon Jul 26 13:32:44 2021 [IPMI.notice]: 6501 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted" Record 713: Mon Jul 26 13:32:47 2021 [IPMI.notice]: 6601 | 02 | EVT: 8159c3ff | CPU0_Temp_Margin | Deassertion Event, "Upper Critical going high" Record 714: Mon Jul 26 13:32:48 2021 [IPMI.notice]: 6701 | 02 | EVT: 8157c3f5 | CPU0_Temp_Margin | Deassertion Event, "Upper Non-critical going high" Record 715: Mon Jul 26 13:33:02 2021 [IPMI.notice]: 6801 | 02 | EVT: 0300ffff | Attn_Sensor1 | Assertion Event, "State Deasserted" Record 716: Mon Jul 26 13:34:40 2021 [IPMI.emergency]: triggered OS halt: Temperature critical Record 717: Mon Jul 26 13:34:42 2021 [IPMI.notice]: 6901 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted" Record 718: Mon Jul 26 13:34:56 2021 [IPMI.notice]: 6a01 | 02 | EVT: 0300ffff | Attn_Sensor1 | Assertion Event, "State Deasserted" Record 719: Mon Jul 26 13:35:18 2021 [IPMI.notice]: 6b01 | 02 | EVT: 6f03ffff | Sensor 255 | Assertion Event, "Storage OS graceful shutdown"