Power Supply Status Critical due to a motherboard failure
Environment
- FAS8200
- AFF-A300
Issue
- A takeover occurs because no heartbeat was detected from the partner node.
[cf.fsm.takeover.noHeartbeat:ALERT]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.
[cf.fm.takeoverComplete:notice]: (EMS parameters: token="XXXXXXXXXXX_13:44:29_2024:12:14" partner_node_uuid="XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX")
EMS
PSU errors are reported and frequently recover automatically.
[node1: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Temperature is Unreadable
[node1: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU1.
[node1: power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU1.
[node1: monitor: monitor.globalStatus.critical:EMERGENCY]: Power Supply Status Critical: PSU1.
[node1: spsm_listener: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 20 seconds.
[node1: spsm_listener: sp.heartbeat.resumed:info]: Received IPMI heartbeat from the Service Processor (SP).
[node1: power_low_monitor: monitor.chassisPowerSupplies.ok:info]: Chassis power supplies OK.
[node1: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
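To see how often the PSU alert clears on its own, the EMS lines can be scanned for fault-then-recovery pairs. The following is a minimal sketch, not a NetApp tool. It assumes the log has been exported as plain text in the bracketed "[node: process: event:severity]: message" shape shown in the excerpt above. The helper names parse_ems and count_flaps are hypothetical; the two event names are taken from the excerpt.

```python
import re

# Assumed line shape, based only on the excerpt above:
#   [node: process: event.name:severity]: message
EMS_LINE = re.compile(
    r"^\[(?P<node>[^:\]]+): (?P<process>[^:\]]+): "
    r"(?P<event>[^:\]]+):(?P<severity>[^\]]+)\]: (?P<message>.*)$"
)


def parse_ems(line):
    """Return the fields of one EMS line as a dict, or None if it does not match."""
    match = EMS_LINE.match(line.strip())
    return match.groupdict() if match else None


def count_flaps(lines,
                fault_event="monitor.chassisPower.degraded",
                ok_event="monitor.chassisPowerSupplies.ok"):
    """Count how many times a fault event is later followed by a recovery event."""
    flaps = 0
    in_fault = False
    for line in lines:
        record = parse_ems(line)
        if record is None:
            continue  # skip lines that are not in the assumed format
        if record["event"] == fault_event:
            in_fault = True
        elif record["event"] == ok_event and in_fault:
            flaps += 1
            in_fault = False
    return flaps


# Lines copied from the excerpt above.
SAMPLE = [
    "[node1: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Temperature is Unreadable",
    "[node1: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU1.",
    "[node1: spsm_listener: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 20 seconds.",
    "[node1: spsm_listener: sp.heartbeat.resumed:info]: Received IPMI heartbeat from the Service Processor (SP).",
    "[node1: power_low_monitor: monitor.chassisPowerSupplies.ok:info]: Chassis power supplies OK.",
]

if __name__ == "__main__":
    print("fault/recovery cycles:", count_flaps(SAMPLE))
```

A high cycle count over a short period, together with the sp.heartbeat.stopped and sp.heartbeat.resumed messages, matches the "frequently recover automatically" pattern described above.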
SP-LATEST-IPMI
Multiple onboard sensors are shown with a status of not_available.
Fan Override NORMAL
PSU1 Present PRESENT
PSU1 Temp not_available -- C 0 C 5 C 50 C 60 C
PSU1 Curr not_available -- mA -- -- -- --
PSU1 Fan1 Speed not_available -- RPM 4500 RPM 4600 RPM -- --
PSU1 Fan1 Fault not_available --
PSU1 Fan2 Speed not_available -- RPM 4500 RPM 4600 RPM -- --
PSU1 Fan2 Fault not_available --
PSU1 Pwr In OK OK
PSU1 Pwr Out OK OK
PSU1 FAULT OK
PSU1 Input Type not_available --
PSU1 Over Temp not_available --
PSU1 Over Volt not_available --
PSU1 Over Curr not_available --
PSU1 Crest Factor not_available -- 1000 -- 1728 2000
PSU1 InPwr Monitor not_available -- mW -- -- -- --
PSU2 Present PRESENT
PSU2 Temp not_available -- C 0 C 5 C 50 C 60 C
PSU2 Curr not_available -- mA -- -- -- --
PSU2 Fan1 Speed not_available -- RPM 4500 RPM 4600 RPM -- --
PSU2 Fan1 Fault not_available --
PSU2 Fan2 Speed not_available -- RPM 4500 RPM 4600 RPM -- --
PSU2 Fan2 Fault not_available --
PSU2 Pwr In OK OK
PSU2 Pwr Out OK OK
PSU2 FAULT OK
PSU2 Input Type not_available --
PSU2 Over Temp not_available --
PSU2 Over Volt not_available --
PSU2 Over Curr not_available --
PSU2 Crest Factor not_available -- 1000 -- 1728 2000
PSU2 InPwr Monitor not_available -- mW -- -- -- --
Bat Present PRESENT
- The partner node does not have the same issue, and its PSU status is reported as normal.
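To compare the two nodes quickly, the sensors reporting not_available in the SP-LATEST-IPMI listing can be counted per unit. This is a rough sketch that assumes the listing is available as whitespace-separated text like the rows shown above. The function names unavailable_sensors and count_by_unit are hypothetical, and the sensor name is taken to be everything before the not_available token.

```python
from collections import Counter


def unavailable_sensors(text):
    """Return the names of sensors whose state column reads not_available."""
    names = []
    for line in text.splitlines():
        tokens = line.split()
        if "not_available" in tokens:
            # Assumption: the sensor name is everything before the state token.
            name = " ".join(tokens[:tokens.index("not_available")])
            if name:
                names.append(name)
    return names


def count_by_unit(names):
    """Group sensor names by their first word (for example PSU1 or PSU2)."""
    return Counter(name.split()[0] for name in names)


# A few rows copied from the listing above.
SAMPLE = """\
PSU1 Present PRESENT
PSU1 Temp not_available -- C 0 C 5 C 50 C 60 C
PSU1 Curr not_available -- mA -- -- -- --
PSU1 Pwr In OK OK
PSU2 Temp not_available -- C 0 C 5 C 50 C 60 C
"""

if __name__ == "__main__":
    for unit, count in count_by_unit(unavailable_sensors(SAMPLE)).items():
        print(unit, count)
```

Running the same count against the partner node's listing gives a direct comparison. In the case described here, the affected node shows not_available on both PSU1 and PSU2 sensors while the partner reports its PSU status as normal.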