
ONTAP-79534: Takeover is disabled in MetroCluster IP because logs are unsynchronized due to network congestion

Visibility: Public
Category: ontap-9
Specialty: MetroCluster

Issue

  • HA interconnect errors are logged and takeover is disabled:

Sun Feb 02 03:45:57 -0500 [Node-02: nvmm_error: rdma.rlib.event.error:debug]: QP wafl event error: client disconnect.
Sun Feb 02 03:45:57 -0500 [Node-02: nvmm_error: nvmm.mirror.offlined:debug]: params: {'mirror': 'HA_PARTNER'}
Sun Feb 02 03:45:57 -0500 [Node-02: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of Node-02 by Node-01 disabled (unsynchronized log).
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_LAYOUT_SYNCING to NVMM_MIRROR_LAYOUT_SYNCED and took 5 msecs.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_LAYOUT_SYNCED to NVMM_MIRROR_SYNCING_START and took 0 msecs.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.aborting:debug]: mirror of sysid 1, partner_type HA Partner and mirror state NVMM_MIRROR_SYNCING_START is aborted because of reason NVMM_ERR_STREAM_MAP.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_error: nvmm.mirror.aborting:debug]: mirror of sysid 1, partner_type HA Partner and mirror state NVMM_MIRROR_OFFLINE is aborted because of reason NVMM_ABORT_SYNCING_MIRROR.
  • The HA interconnect is re-established a few seconds later and takeover is enabled again:

Sun Feb 02 03:46:00 -0500 [Node-02: iw_cm_wq: rdma.rlib.connected:debug]: wafl:HA:A QP is now connected.
Sun Feb 02 03:46:00 -0500 [Node-02: iw_cm_wq: rdma.rlib.connected:debug]: raid:HA:A QP is now connected.
Sun Feb 02 03:46:00 -0500 [Node-02: iw_cm_wq: rdma.rlib.connected:debug]: misc:HA:A QP is now connected.
Sun Feb 02 03:46:00 -0500 [Node-02: iw_cm_wq: rdma.rlib.connected:debug]: wafl:HA:A QP is now connected.
Sun Feb 02 03:46:00 -0500 [Node-02: iw_cm_wq: rdma.rlib.connected:debug]: raid:HA:A QP is now connected.
Sun Feb 02 03:46:00 -0500 [Node-02: iw_cm_wq: rdma.rlib.connected:debug]: misc:HA:A QP is now connected.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_LAYOUT_SYNCING to NVMM_MIRROR_LAYOUT_SYNCED and took 4 msecs.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_SYNCING_START to NVMM_MIRROR_CP1_START and took 26 msecs.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_CP1_START to NVMM_MIRROR_WAFL_INIT and took 270 msecs.
Sun Feb 02 03:46:00 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_WAFL_INIT to NVMM_MIRROR_CP2_FINISH and took 20 msecs.
Sun Feb 02 03:46:01 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_CP2_FINISH to NVMM_MIRROR_WAFL_HEADER and took 543 msecs.
Sun Feb 02 03:46:01 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_WAFL_HEADER to NVMM_MIRROR_SYNCING_OTHER and took 1 msecs.
Sun Feb 02 03:46:01 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.state.change:debug]: mirror of sysid 1, partner_type HA Partner, changed state from NVMM_MIRROR_SYNCING_OTHER to NVMM_MIRROR_ONLINE and took 169 msecs.
Sun Feb 02 03:46:01 -0500 [Node-02: nvmm_mirror_sync: nvmm.mirror.onlined:debug]: params: {'mirror': 'HA_PARTNER'}
Sun Feb 02 03:46:02 -0500 [Node-02: cf_main: cf.fsm.takeoverByPartnerEnabled:notice]: Failover monitor: takeover of Node-02 by Node-01 enabled
  • EMS reports network congestion events:

Mon Feb 03 13:03:36 -0500 [Node-01: mccip_mirror_congestion_mgr_p: mcc.network.congestion:notice]: Network congestion detected. Action taken: Increased ic_timeout to 2000 msec.
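
The three symptoms above can be checked against a saved EMS log with a short script. The following is a minimal sketch, not a NetApp tool: the function names (parse_ems, takeover_disabled_windows, congestion_timeouts) are hypothetical, the regular expression assumes the line layout seen in the excerpts above, and because the excerpts carry no year, the year parameter is an assumed placeholder that only matters for computing time differences.

```python
import re
from datetime import datetime

# Layout assumed from the excerpts above:
# "<timestamp> [<node>: <process>: <event>:<severity>]: <message>"
EMS_LINE = re.compile(
    r"^(?P<ts>\w{3} \w{3} \d{2} \d{2}:\d{2}:\d{2} [+-]\d{4}) "
    r"\[(?P<node>[^:\]]+): (?P<proc>[^:\]]+): (?P<event>[\w.]+):(?P<sev>\w+)\]: "
    r"(?P<msg>.*)$"
)
IC_TIMEOUT = re.compile(r"ic_timeout to (\d+) msec")


def parse_ems(lines, year=2000):
    """Parse EMS lines into dicts; lines that do not match are skipped.

    The excerpts contain no year, so `year` is an assumption supplied
    by the caller (the default is an arbitrary placeholder).
    """
    events = []
    for line in lines:
        m = EMS_LINE.match(line.strip())
        if not m:
            continue
        rec = m.groupdict()
        rec["ts"] = datetime.strptime(
            f"{year} {rec['ts']}", "%Y %a %b %d %H:%M:%S %z"
        )
        events.append(rec)
    return events


def takeover_disabled_windows(events):
    """Return (node, seconds) for each disabled -> enabled pair per node."""
    open_since = {}
    windows = []
    for ev in events:
        if ev["event"] == "cf.fsm.takeoverByPartnerDisabled":
            # Keep the earliest "disabled" timestamp until takeover is re-enabled.
            open_since.setdefault(ev["node"], ev["ts"])
        elif (ev["event"] == "cf.fsm.takeoverByPartnerEnabled"
              and ev["node"] in open_since):
            start = open_since.pop(ev["node"])
            windows.append((ev["node"], (ev["ts"] - start).total_seconds()))
    return windows


def congestion_timeouts(events):
    """Return (node, ic_timeout in msec) for each mcc.network.congestion event."""
    found = []
    for ev in events:
        if ev["event"] == "mcc.network.congestion":
            m = IC_TIMEOUT.search(ev["msg"])
            if m:
                found.append((ev["node"], int(m.group(1))))
    return found
```

Feeding the lines of a saved EMS log to parse_ems and passing the result to the other two functions gives the length of each takeover-disabled window and the ic_timeout values reported by the congestion events. Short windows that line up with congestion events would be consistent with the pattern described above, though the script itself does not establish cause.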

 

