「ADR timeout」イベントはL2ウォッチドッグリセット後に報告される
環境
- ONTAP 9
- FAS8200
- AFF A300
問題
- ノードが
events all
ログからの ADR(非同期 DRAM リフレッシュ)後のL2 watchdogタイムアウトによりダウンしました:
Record 392: Sat Feb 27 03:41:59 2021 [IPMI.critical]: ADR event ID: 0 at time: 1614397319
Record 393: Sat Feb 27 03:41:59 2021 [IPMI.critical]: ADR Battery destage cycles: 19
Record 394: Sat Feb 27 03:42:01 2021 [IPMI Event.critical]: NMI
Record 395: Sat Feb 27 03:42:01 2021 [IPMI.notice]: bd00 | 02 | EVT: 6fc824ff | System_Watchdog | Assertion Event, "Timer interrupt"
Record 396: Sat Feb 27 03:42:02 2021 [IPMI Event.critical]: L2 watchdog timeout hard reset
Record 397: Sat Feb 27 03:42:02 2021 [Trap Event.critical]: hwassist l2_watchdog_reset (29)
Record 398: Sat Feb 27 03:42:02 2021 [Trap Event.critical]: SNMP l2_watchdog_reset (29)
Record 399: Sat Feb 27 03:42:02 2021 [SysFW.notice]: ADR Timeout
Record 400: Sat Feb 27 03:42:02 2021 [SysFW.notice]: ADR Timeout
Record 401: Sat Feb 27 03:42:07 2021 [IPMI.notice]: be00 | 02 | EVT: 6fc104ff | System_Watchdog | Assertion Event, "Hard reset"
Record 402: Sat Feb 27 03:42:12 2021 [SP.critical]: Filer Reboots
または
Record 926: Sun Sep 29 22:50:42 2024 [IPMI.critical]: ADR event ID: 0 at time: 1727650242
Record 927: Sun Sep 29 22:50:42 2024 [IPMI.critical]: ADR Battery destage cycles: 18
Record 928: Sun Sep 29 22:50:43 2024 [IPMI Event.critical]: NMI
Record 929: Sun Sep 29 22:50:45 2024 [IPMI Event.critical]: L2 watchdog timeout hard reset
Record 930: Sun Sep 29 22:50:45 2024 [Trap Event.critical]: SNMP l2_watchdog_reset (29)
Record 931: Sun Sep 29 22:50:48 2024 [IPMI.notice]: 5301 | 02 | EVT: 6fc804ff | System_Watchdog | Assertion Event, "Timer interrupt"
Record 932: Sun Sep 29 22:50:48 2024 [IPMI.notice]: 5401 | 02 | EVT: 6fc104ff | System_Watchdog | Assertion Event, "Hard reset"
Record 933: Sun Sep 29 22:51:23 2024 [IPMI.notice]: 5501 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted"
Record 934: Sun Sep 29 22:52:15 2024 [IPMI Event.critical]: System reset
Record 935: Sun Sep 29 22:52:15 2024 [IPMI Event.critical]: L2 watchdog action completed
Record 936: Sun Sep 29 22:52:15 2024 [IPMI.notice]: L2 to L1 is 1(s) 142192(us)
Record 937: Sun Sep 29 22:52:32 2024 [SP.notice]: Delaying L2_WDOG ASUP email for 120 seconds
Record 938: Sun Sep 29 22:54:32 2024 [ASUP.notice]: First notification email | (REBOOT (watchdog reset)) CRITICAL | Send failed
Record 939: Sun Sep 29 23:02:49 2024 [SP.critical]: Heartbeat stopped
- コンソールログ
[nv2flash.ADR.timeout:ERROR]:ADR(Asynchronous DRAM self-refresh) failed at Sun Aug 24 21:03:26 2025.
[nv2flash.restage.progress:NOTICE]: ReStage is not needed because the flash has no data.
- ノードはL2 watchdogタイムアウト後のリブートループにあります。