メインコンテンツまでスキップ

DWDM メンテナンス後にリモートドライブで障害が発生する

Views:
10
Visibility:
Public
Votes:
0
Category:
metrocluster
Specialty:
metrocluster
Last Updated:

環境

  • MetroCluster IP
  • ONTAP 9

問題

MetroCluster IP スイッチの ISL ポートから DWDM に接続するケーブルを変更したら、次の手順を実行します。
  • EMS ログにエラーが記録されます
  • リモートドライブで障害が報告されている
  • SyncMirror プレックスで障害が発生したため、複数のディスクに AutoSupport がない
 
例:
  • EMS ログ:

Mon Nov 08 12:32:07 +0100 [ClusterA-02: kernel: iscsi.session.stateChanged:notice]: iSCSI session state is changed to Reconnecting for the target iqn.2016-07.com.netapp:  (type: dr_auxiliary, address: 0.0.0.0:65200). Reason: no ping reply after 5 seconds.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: intr: ctl.session.stateChanged:notice]: iSCSI CAM target layer's session state is changed to terminated for the initiator iqn.1994-09.org.freebsd:  (address: 0.0.0.0). Reason: no ping reply after 5 seconds.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: kernel: iscsi.session.stateChanged:notice]: iSCSI session state is changed to Reconnecting for the target iqn.2016-06.com.netapp:  (type: dr_auxiliary, address: 0.0.0.0:65200). Reason: no ping reply after 5 seconds.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: doneq0: scsi.mcc.adt.ioTransportError:error]: mcc_adt[1] - Transport error during execution of command: HA status 0x13: CAM transport status 0x1b : cdb 0x88:00000001bf178980:00000008.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: doneq0: scsi.mcc.adt.ioTransportError:error]: mcc_adt[1] - Transport error during execution of command: HA status 0x13: CAM transport status 0x1b : cdb 0x88:00000001bf178190:00000008.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: scsi_cmdblk_strthr_admin: scsi.cmd.abortedByHost:error]: Disk device 0v.i2.1L21: Command aborted by host adapter: HA status 0x13: cdb 0x88:00000001bf178980:00000008. 

  • sysconfig -r の出力に示された順序で指定されており、
 Plex /ClusterA-02/plex1 (offline, failed, inactive, pool1) RAID group /ClusterA-02/plex1/rg0 (partial, block checksums) RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks) --------- ------ ------------- ---- ---- ---- ----- -------------- -------------- dparity 0m.i2.3L13P1 0m 20 12 1 SSD N/A 1799343/3685054464 1799351/3685070848 (fast zeroed) parity 0m.i1.0L14P1 0m 20 13 1 SSD N/A 1799343/3685054464 1799351/3685070848 (fast zeroed) data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - Raid group is missing 5 disks.

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.