ノードがパニック文字列「User Initiated system core NMI on CPU」でリブートされました
環境
- FAS / AFF
- ONTAP 9
問題
- ノードが リブートされ、次の イベントとパニックメッセージが表示されます。
[node1: idle: cpu22: sk.panic:alert]:PanicString: User initiated "system core" nmi on cpu 22 in process idle: cpu22 on release 9.8P11 (C)
[node1: splog_main: mgr.stack.string:notice]:Panicstring: User initiated "system core" nmi on cpu 22 in process idle: cpu22 on release 9.8P11 (C)
[node1: splog_main: mgr.stack.at:notice]:Panicoccurred at: Mon Feb 13 21:45:31 2023
[node1: splog_main: mgr.stack.proc:notice]:Panicin process: idle: cpu22
[node1: splog_main: mgr.stack.frame:notice]: Stack frame 5: kernel::vpanic(0xffffffff80677220) + 0x7aa
[node1: splog_main: mgr.stack.frame:notice]: Stack frame 6: kernel::panic(0xffffffff806771a0) + 0x43
- SP イベントログは次の情報を示します。
Record 1047: Tue Feb 14 05:45:30.760000 2023 [BMC CLI.notice]: ansible "system core "
Record 1048: Tue Feb 14 05:45:31.570000 2023 [IPMI Event.critical]: NMI
Record 1049: Tue Feb 14 05:45:31.570000 2023 [IPMI.notice]: 03a6 | 02 | EVT: 6f00ffff | CriticalInt | Assertion Event, "NMI/Diag Interrupt"
Record 1050: Tue Feb 14 05:45:58.550000 2023 [ASUP.notice]: First notification email | (USER_TRIGGERED (system nmi)) NOTICE | Sent
Record 1051: Tue Feb 14 05:47:18.610000 2023 [BMC CLI.notice]: ansible "system console"
Record 1057: Tue Feb 14 05:57:41.150000 2023 [BMC.critical]: Heartbeat stopped
Record 1058: Tue Feb 14 06:00:44.810000 2023 [ASUP.notice]: Reminder email | (USER_TRIGGERED (system nmi)) NOTICE | Sent
Record 1059: Tue Feb 14 06:01:34.650000 2023 [IPMI.notice]: 03a7 | 02 | EVT: 6f03ffff | Sensor 255 | Assertion Event, "Storage OS graceful shutdown"
Record 1060: Tue Feb 14 05:45:31.000000 2023 [Controller.notice]: Appliance user command panic.
Record 1061: Tue Feb 14 06:01:35.690000 2023 [BMC.critical]: Filer Reboots