DMI障害によって引き起こされる修正不能なMachine Check Errorパニック
環境
- AFF-A800 / AFF-C800
- ONTAP 9
問題
パニック文字列は、Uncorrectable Machine Check Error中に複数のデバイスを指しています。
PANIC : Uncorrectable Machine Check Error at CPUx. SKL_IIO Error: STATUS<0xb380000000000e0b>(VALID,UC,EN,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))IIO Machine Check from device(s):Dv[2020](0,0,0): Link down, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controlleversion: 9.10.1P1: Fri May 19 21:58:49 EDT 2023
conf : x86_64.optimize