1 つのパスに障害が発生すると、 Oracle RAC DB がクラッシュします
環境
- ONTAP 9
- Oracle Linux / RHEL
- Oracle Automatic Storage Management (oracleasm)
- SAN
問題
- 1つのパスだけで障害が発生すると、メンテナンス/アップグレード処理中にOracle RAC DB(ASMを使用)がクラッシュする
- Oracle エラーの例:
ORA-15025: could not open disk "/dev/oracleasm/disks/NETAPPASMDISK10"
ORA-27041: unable to open file
Linux-x86_64 Error: 6: No such device or address
Additional information: 3
2021-05-20T16:17:39.327294+03:00
WARNING: Read Failed. group:3 disk:9 AU:202506 offset:2768896 size:8192
path:Unknown disk
incarnation:0xef03276f synchronous result:'I/O error'
subsys:Unknown library krq:0x7f65533e23a0 bufp:0x1851c2a000 osderr1:0x434c5344 osderr2:0x0
IO elapsed time: 0 usec Time waited on I/O: 0 usec
WARNING: failed to read mirror side 1 of virtual extent 7825 logical extent 0 of file 354 in group [3.3887094271] from disk NETAPPASMDISK10 allocation unit 202506 reason error; if possible, will try another mirror side
2021-05-20T16:17:39.347741+03:00
PMON (ospid: 22445): terminating the instance due to ORA error 12752
Cause - 'Instance is being terminated due to fatal process death (pid: 48, ospid: 27976, LG00)'
2021-05-20T16:17:39.449937+03:00
opiodr aborting process unknown ospid (41981) as a result of ORA-1092
2021-05-20T16:17:39.854629+03:00
opiodr aborting process unknown ospid (4201) as a result of ORA-1092
2021-05-20T16:17:40.666706+03:00
*****************************************************************
- ホストエラーの例:
May 20 16:08:55 oracle multipathd: checker failed path 65:240 in map netappdisk10
May 20 16:08:55 oracle multipathd: netappdisk10: remaining active paths: 3
May 20 16:10:31 oracle multipathd: netappdisk10: sdaf - tur checker reports path is up
May 20 16:10:31 oracle multipathd: netappdisk10: remaining active paths: 4
May 20 16:11:10 oracle multipathd: checker failed path 65:240 in map netappdisk10
May 20 16:11:10 oracle multipathd: netappdisk10: remaining active paths: 3
May 20 16:11:16 oracle multipathd: netappdisk10: sdaf - tur checker reports path is up
May 20 16:11:16 oracle multipathd: netappdisk10: remaining active paths: 4