故障 LED が点灯し、 PCM ステータスが「 PC SHELF FAULT RQSTD 」と表示される
環境
- FAS 2500シリーズ
- FAS2600シリーズ
- FAS2700シリーズ
- AFF A200
問題
- コントローラの障害 LED が点灯している。
storage show fault -v
次の情報が表示されます。
Enclosure:
Element Status Status Bytes Status Descriptions
1: OK 01,00,02,00 FAIL
Vendor Unique Element 88-IOM6E: (PCM)
Element Status Status Bytes Status Descriptions
1: OK 01,01,00,00
2: OK 01,01,00,80 PC SHELF FAULT RQSTD
これには、次のものが含まれます。
-
EMS レポートでリブートが頻繁に発生し、複数のセンサーエラーが報告される
Sun Aug 08 01:48:43 0000 [Node1: dsa_worker3: ses.status.ACPError:alert]: DS4246 (S/N xxx shelf 0 on channel 0a ACP Processor error for SAS shelf ACP processor 1: critical status ; Alternate Control Path hardware failed This module is on the rear of the shelf at the top center, on shelf module A.
Sun Aug 08 01:48:52 0000 [Node1: statd: monitor.shelf.fault:alert]: Critical fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
Sun Aug 08 01:49:00 0000 [Node1: monitor: monitor.globalStatus.critical:EMERGENCY]: Disk shelf fault.
Sun Aug 08 02:00:00 0000 [Node1: statd: monitor.shelf.fault:alert]: Critical fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
Sun Aug 08 02:35:01 0000 [Node1: spsm_listener: sp.update.status:debug]: params: {'reason': 'sp_startup_notify_servprocd: SP startup handler has been called. '}
Sun Aug 08 02:35:04 0000 [Node1: splog_main: splog.running.normally:info]: Process splogd is operating normally.
Sun Aug 08 02:35:22 0000 [Node1: dsa_worker1: ses.status.ACPInfo:info]: DS4246 (S/N xxx) shelf 0 on channel 0a ACP Processor information for SAS shelf ACP processor 1: normal status.
- ACP 通信が失われたため、 SP をリブートしています
Record 2773:Sun Aug 8 02:33:15 2021 [SP.critical]: Rebooting SP due to loss of ACP comms
Record2774:Thu Jan 1 00:00:37 1970 [IPMI.notice]: 2902 | c0 | OEM: ffff70005100 | ManufId: 150300 | SP Reset Internally
Record 2775:Thu Jan 1 00:00:42 1970 [IPMI.notice]: 2a02 | 02 | EVT: 0301ffff | Power_Good | Assertion Event, "State Asserted"
Record 2776:Thu Jan 1 00:00:43 1970 [IPMI.notice]: 2b02 | 02 | EVT: 0301ffff | Power_Proc_OK | Assertion Event, "State Asserted"
Record 2777:Thu Jan 1 00:00:44 1970 [IPMI.notice]: 2c02 | 02 | EVT: 0301ffff | Controller_Fault | Assertion Event, "State Asserted"
Record 2778: Thu Jan 1 00:01:00 1970 [SP.notice]: Running primary version 2.11
Record 2779:Thu Jan 1 00:01:03 1970 [SP.normal]: Heartbeat started