Eシリーズで、コントローラの障害が原因で複数のドライブで読み取り不能セクターが検出されると報告される
環境
- Eシリーズ
- SANtricity OS
問題
- VDDエラー(VDDがエラーを記録した、VDD 修復が開始された、VDD修復が完了した)が複数のドライブで(解析されたMELから)表示される。
A:11/30/20 5:51:28 AM (05:51:28) 102043 201f VDD repair completed - Shelf 1, Bay A - SSID: 3, Devnum: 0x010017 LBA: 0x436886a0
----> Flags: 0x40202085 = READ: Read Operation, ERROR: IO Compl. w. Err, PARITY: Parity data, NOLOCK: Prevent lock during read err., PI: Error coding in effect, NOCACHE: CDB DPO cache lowest retention - Error: 0x844 = UA_MISCORRECTED_DATA_ERROR
A:11/30/20 5:51:28 AM (05:51:28) 102042 201e VDD repair started - Shelf 1, Bay A - SSID: 3, Devnum: 0x01000f
A:11/30/20 5:51:28 AM (05:51:28) 102041 201e VDD repair started - Shelf 1, Bay A - SSID: 3, Devnum: 0x010017
B:11/30/20 4:36:26 AM (04:36:26) 101853 2014 VDD logged an error - Shelf 1, Bay B - SSID: 0, Devnum: 0x01010b LBA: 0x0e19b600, Blocks: 0xb8 - Recovered
----> Flags: 0x200801 = READ: Read Operation, CURRENT: Read current data from cache, PI: Error coding in effect
----> Recovery: 0x2 = Reconstruction used, ASC: 0x1f = IOP_FAST_TIMEOUT_ERROR, Detection: 0xf80b0181
B:11/30/20 4:36:26 AM (04:36:26) 101852 2014 VDD logged an error - Shelf 1, Bay B - SSID: 3, Devnum: 0x010113 LBA: 0x30431628, Blocks: 0x8 - Recovered
----> Flags: 0x200801 = READ: Read Operation, CURRENT: Read current data from cache, PI: Error coding in effect
----> Recovery: 0x2 = Reconstruction used, ASC: 0x1f = IOP_FAST_TIMEOUT_ERROR, Detection: 0xf80b0181
- 読み取り不能セクターでエラーが検出されました。これに 伴い 、Data Assuranceの不一致が検出されました:
A:11/30/20 5:39:26 AM (05:39:26) 102031 6700 Unreadable sector(s) detected data loss occurred - Volume volume04 - LBA: 0x10da21aaa <--CRITICAL
----> Physical Drive in Tray 1 Slot 23, LBA: 0x28f443aa
A:11/30/20 5:39:26 AM (05:39:26) 102030 2061 Data assurance mismatch detected - probable cause is cached data - Volume volume04 - ioType: DST_OUT, hwPIStatus: GUARD_ERROR, swPIStatus: GUARD_ERROR, Host ID: 65535, LBA: 0x10da2bb00
----> Expected Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
----> Found Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
A:11/30/20 5:39:26 AM (05:39:26) 102029 2070 Data assurance mismatch detected -- cached data error on both controllers - Volume volume04
A:11/30/20 5:39:26 AM (05:39:26) 102028 2061 Data assurance mismatch detected - probable cause is cached data - Volume volume04 - ioType: DST_OUT, hwPIStatus: GUARD_ERROR, swPIStatus: GUARD_ERROR, Host ID: 65535, LBA: 0x10da21a00
----> Expected Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
----> Found Guard:0x0000, AppTag:0x0000, RefTag:0x00000000
- 読み取り不能セクターは 複数のドライブに分散されています。
Volume LUN Accessible By Date/Time Volume LBA Drive Location Drive LBA Failure Type
volume01 1 Host Cluster cluster1 11/30/20 12:04:40 PM 0x3c08c9ee Shelf 1 Bay 5 0x2f119ee DA Error
volume03 3 Host Cluster cluster1 11/30/20 6:22:09 PM 0x10241c1ed Shelf 1 Bay 4 0x261838ed DA Error
volume04 4 Host Cluster cluster1 11/30/20 5:39:28 AM 0x10da21aaa Shelf 1 Bay 23 0x28f443aa DA Error
volume04 4 Host Cluster cluster1 11/30/20 5:39:28 AM 0x10da2bbac Shelf 2 Bay 22 0x289457ac DA Error
volume04 4 Host Cluster cluster1 11/30/20 5:39:28 AM 0x10da2bbad Shelf 2 Bay 22 0x289457ad DA Error
volume05 5 Host Cluster cluster1 11/30/20 10:42:42 AM 0xa806930a Shelf 1 Bay 10 0x3ee0d20a DA Error