CHW-778:NVDIMMBadHealthAlert Reason:NVDIMM搭載システムでのファームウェアエラー
問題
AFF A800、AFF A320、AFF A400、FAS8300、またはFAS8700ストレージシステム
EMS:Sun Jun 06 17:17:15 -0700 [NetApp1: nphmd: hm.alert.raised:alert]: Alert Id = NVDIMMBadHealthAlert , Alerting Resource = /dev/nvdimm0:NetApp1 raised by monitor controller
environment statusコマンドの表示[NVDIMM0 Health bad]:>
system node run -node local environment status
センサー名 状態 現在の 重大 警告 重大警告
低低 高 高高
NVDIMM0正常性 不良
PVCCIN CPU0 正常 1773 mV 9 mV 19 mV 2459 mV 2469 mV
AutoSupportセクション: nvdimm-statusTotal NVDIMM on this platform is 2
--------------------------------------------------
DIMM(/dev/nvdimm0) Page:0
DIMM(/dev/nvdimm0):
--------------------------------------------------
Controller Ready: Yes
Controller Busy: No
Energy Policy managed by: HOST
Save_N Low During CSAVE: Yes
Save_N Enabled(ARMED): Yes
Data on the Flash: NotValidModule is Health: No
Module Status(0x0040): NVDIMM FIRMWARE error
Flash Lifetime: 94%
Flash Lifetime Status: Normal
正常性アラートの例
::::> system health alert showNode: node02
Alert ID: NVDIMMBadHealthAlert
Resource: /dev/nvdimm0
Severity: Major
Indication Time: Tue Dec 03 02:15:51 2024
Suppress: false
Acknowledge: false
Probable Cause: NVDIMM "NVDIMM-N 0 (DIMM-11)" on node "node02" is indicating a degraded status.
Reason: NVDIMM FIRMWARE error.
Possible Effect: Potential data loss as the NVDIMM becomes degraded.
Corrective Actions: Contact technical support for assistance with NVDIMM module replacement.