システムでPSUFruFanBadAlertが報告される
環境
- ONTAP 9
- AFF A700s
- サービス プロセッサ(SP)
問題
- PSUがデグレードと報告され、両方のノードで次のイベントが発生しています。
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan is Critical Low (0 RPM)
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 InPower is Warning High (968 W)
[Node-01: cphmd: hm.alert.raised:alert]: Alert Id = PSUFruFanBadAlert , Alerting Resource = XXXXXXXXXXXXXX raised by monitor chassis
[Node-01: env_mgr: callhome.chassis.ps.degraded:error]: Call home for CHASSIS POWER SUPPLY DEGRADED: PS 1
[Node-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cphm: PSUFruFanBadAlert[XXXXXXXXXXXXXX].
system health alert show
コマンドの出力には次の情報が表示されます。
Cluster1::> system health alert show
Node: Node-01
Alert ID: PSUFruFanBadAlert
Resource: XXXXXXXXXXXXXX
Severity: Major
Indication Time: Tue Feb 28 20:16:33 2023
Suppress: false
Acknowledge: false
Probable Cause: Power Supply Unit PSU1 FRU has a major fan problem.
The nodes in this chassis are Node-01.
Possible Effect: The power supply unit (PSU) might stop functioning if
the temperature increases.
Corrective Actions: 1. Check PSU1 FRU and the fans associated with it.
2. Refer to the Hardware specification guide for more information on the
position of the power supply unit (PSU)
and ways to check or replace it.
3. Contact support personnel if the alert persists.
- 影響を受けたPSUのセンサーは
SP-LATEST-IPMI
、「AutoSupport:
Sensor Reading:
PSU1_VIN | 0.000 | Volts | cr | na | 90.000 | 94.000 | 260.000 | 264.000 | na
PSU1_IIN | 0.000 | Amps | cr | na | 0.000 | 0.000 | 14.960 | 16.000 | na
PSU1_PIN | 0.000 | Watts | cr | na | 4.000 | 4.000 | 960.000 | 1020.000 | na
PSU1_FAN | 0.000 | RPM | cr | na | 768.000 | 1248.000 | na | na | na
- 上記のアラートで報告されたPSUは
PLATFORM-SENSORS.XML
、AutoSupportログのセクションで不良とマークされています。 - 影響を受けるPSUを交換したあともアラートが表示される。