Watchdog reset caused by uncorrectable ECC errors on AFF A400
Applies to
- AFF A400
- FAS 8300
- FAS 8700
Issue
- The node reboots and cannot complete POST
- From system log sel:
10d | 12/25/2021 | 07:10:44 | Memory #0x08 | Uncorrectable ECC | Asserted
10e | 12/25/2021 | 07:10:44 | Memory #0x08 | Uncorrectable ECC | Asserted
10f | 12/25/2021 | 07:10:46 | Watchdog 2 #0xb1 | Timer interrupt (NMI/SMS/OS) | Asserted
110 | 12/25/2021 | 07:10:46 | Critical Interrupt #0xb0 | NMI/Diag Interrupt | Asserted
- From system log console:
PANIC: watchdog nmi on cpu 8, hang cpu is 0 in process idle: cpu8 on release 9.7P12 (C) on Sat Dec 25 01:10:45 CST 2021
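
The SEL pattern above (an uncorrectable ECC event followed within seconds by a watchdog timer interrupt) can be checked for in a saved SEL dump. The following is a minimal sketch, not a NetApp tool: the function names `parse_sel_line` and `ecc_preceded_watchdog`, the pipe-delimited six-field layout, and the 10-second window are all assumptions made for illustration, based only on the excerpt shown in this article.

```python
from datetime import datetime, timedelta

# Sample input copied from the SEL excerpt in this article.
SEL_TEXT = """\
10d | 12/25/2021 | 07:10:44 | Memory #0x08 | Uncorrectable ECC | Asserted
10e | 12/25/2021 | 07:10:44 | Memory #0x08 | Uncorrectable ECC | Asserted
10f | 12/25/2021 | 07:10:46 | Watchdog 2 #0xb1 | Timer interrupt (NMI/SMS/OS) | Asserted
110 | 12/25/2021 | 07:10:46 | Critical Interrupt #0xb0 | NMI/Diag Interrupt | Asserted
"""


def parse_sel_line(line):
    """Parse one pipe-delimited SEL line; return None if it does not fit.

    Assumes six fields: record id, date (MM/DD/YYYY), time (HH:MM:SS),
    sensor, event description, and assertion state.
    """
    fields = [field.strip() for field in line.split("|")]
    if len(fields) != 6:
        return None
    record_id, date, time, sensor, event, state = fields
    # Tolerate stray spaces inside the time field (e.g. "07:10: 44").
    time = time.replace(" ", "")
    try:
        stamp = datetime.strptime(f"{date} {time}", "%m/%d/%Y %H:%M:%S")
    except ValueError:
        return None
    return {"id": record_id, "time": stamp, "sensor": sensor,
            "event": event, "state": state}


def ecc_preceded_watchdog(text, window_seconds=10):
    """Return True if an asserted Uncorrectable ECC event is followed by an
    asserted Watchdog event within window_seconds (an arbitrary threshold)."""
    events = [e for e in map(parse_sel_line, text.splitlines()) if e]
    ecc_times = [e["time"] for e in events
                 if "Uncorrectable ECC" in e["event"] and e["state"] == "Asserted"]
    watchdog_times = [e["time"] for e in events
                      if e["sensor"].startswith("Watchdog") and e["state"] == "Asserted"]
    window = timedelta(seconds=window_seconds)
    return any(timedelta(0) <= wd - ecc <= window
               for ecc in ecc_times for wd in watchdog_times)


if __name__ == "__main__":
    print(ecc_preceded_watchdog(SEL_TEXT))
```

A match only indicates that the log shows the same sequence of events as this article; it does not by itself identify which component is at fault.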