PANIC: NVRAM in slot x: uncorrectable memory error at address
Environment
- ONTAP 9
- AFF A700, FAS9000 (NVRAM10 module in slot 6)
- AFF A900, ASA A900, FAS9500 (NVRAM11 module in slot 6)
Issue
- The node panics and the following messages are reported in EMS:
- Example from ONTAP 9.1:
[nvram.hw.fail:CRITICAL]: NVRAM hardware failed: Uncorrectable errors detected in NV-DIMM0. Replace NV-DIMM0.
NV-DIMM fault LED has been turned on.
[nvram.hw.fail:CRITICAL]: NVRAM hardware failed: Uncorrectable errors detected in NV-DIMM1. Replace NV-DIMM1.
NV-DIMM fault LED has been turned on.
PANIC: NVRAM in slot 6: uncorrectable memory error at address 0x2dd33fe00 DIMM(0), Rank(0), Bank Group(0),
Bank(0x2), Row(0x16e99), Col(0x3f8) in process idle on release 9.1P11 (C) on Fri Mar 20 04:03:31 UTC 2020
version: 9.1P11: Thu Dec 21 02:32:26 PST 2017
compile flags: x86_64.optimize
HA: current time (in sk_msecs) 15241160060 (in sk_cycles) 58865735105269550
- Example from ONTAP 9.3 and later:
SP-LATEST-CONSOLE-LOGS:
[nvram.hw.fail:CRITICAL]: NVRAM hardware failed: Uncorrectable errors detected in NV-DIMM2.
Replace NV-DIMM2. NV-DIMM fault LED has been turned on.
PANIC: NVRAM in slot 6: uncorrectable memory error at address 0x1011de688 DIMM(1), Rank(0), Bank Group(2),
Bank(0x0), Row(0x1011d), Col(0x399) in process idle: cpu7 on release 9.7P16 (C) on Tue Jan 11 01:40:51 EST 2022
version: 9.7P16: Fri Sep 10 18:35:49 EDT 2021
*******************************************************
WARNING: The NVRAM DIMM1 has caused UECC in the past.
Please replace it as soon as possible!!!
*******************************************************
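
When triaging several console logs, it can help to pull the slot, address, and DIMM coordinates out of the PANIC string programmatically. The following is a minimal Python sketch, not a NetApp tool. The function name `parse_nvram_panic` and the regular expression are illustrative assumptions based only on the two example messages above. It assumes that the DIMM, Rank, and Bank Group values are decimal and that Bank, Row, and Col are hexadecimal, as they appear in those examples. Other ONTAP releases may format the message differently.

```python
import re

# Pattern derived from the example PANIC strings in this article (an assumption,
# not an official message specification).
PANIC_RE = re.compile(
    r"PANIC: NVRAM in slot (?P<slot>\d+): uncorrectable memory error "
    r"at address (?P<address>0x[0-9a-fA-F]+)\s+"
    r"DIMM\((?P<dimm>\d+)\),\s*Rank\((?P<rank>\d+)\),\s*"
    r"Bank Group\((?P<bank_group>\d+)\),\s*"
    r"Bank\((?P<bank>0x[0-9a-fA-F]+)\),\s*"
    r"Row\((?P<row>0x[0-9a-fA-F]+)\),\s*"
    r"Col\((?P<col>0x[0-9a-fA-F]+)\)"
    r".*?on release (?P<release>\S+)"
)

HEX_FIELDS = ("address", "bank", "row", "col")
DEC_FIELDS = ("slot", "dimm", "rank", "bank_group")


def parse_nvram_panic(console_text):
    """Return a dict of fields from the first NVRAM PANIC string, or None."""
    # Console logs often wrap the PANIC string across lines; collapse all
    # whitespace runs to single spaces before matching.
    flat = " ".join(console_text.split())
    match = PANIC_RE.search(flat)
    if match is None:
        return None
    fields = match.groupdict()
    result = {name: int(fields[name], 16) for name in HEX_FIELDS}
    result.update({name: int(fields[name]) for name in DEC_FIELDS})
    result["release"] = fields["release"]
    return result


if __name__ == "__main__":
    # PANIC string copied from the ONTAP 9.7P16 example above.
    sample = (
        "PANIC: NVRAM in slot 6: uncorrectable memory error at address "
        "0x1011de688 DIMM(1), Rank(0), Bank Group(2),\n"
        "Bank(0x0), Row(0x1011d), Col(0x399) in process idle: cpu7 "
        "on release 9.7P16 (C) on Tue Jan 11 01:40:51 EST 2022"
    )
    info = parse_nvram_panic(sample)
    print(info["slot"], hex(info["address"]), info["dimm"], info["release"])
```

The sketch only extracts the values printed in the PANIC string. It does not map the `DIMM(n)` index to the NV-DIMM number named in the `nvram.hw.fail` EMS message, so use the EMS message to identify the part to replace.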