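Internal ports on a FAS 8300 went down and a takeover occurred
Environment
- FAS 8300
- FAS 8700
- AFF A400
Issue
- If link down errors occur on the internal e0a/e0b/e0c/e0d ports of a FAS 8300, FAS 8700, or AFF A400 system and are followed by a node takeover, the following errors are displayed.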
[?] Fri Oct 08 00:37:27 -0400 [node-01: kernel: netif.linkDown:info]: Ethernet e0a: Link down, check cable.
[?] Fri Oct 08 00:37:27 -0400 [node-01: intr: rlib.ifconfig.linkEvent:notice]: params: {'eventType': 'DOWN', 'ifname': 'e0a'}
[?] Fri Oct 08 00:37:27 -0400 [node-01: kernel: netif.linkDown:info]: Ethernet e0b: Link down, check cable.
[?] Fri Oct 08 00:37:27 -0400 [node-01: intr: rlib.ifconfig.linkEvent:notice]: params: {'eventType': 'DOWN', 'ifname': 'e0b'}
[?] Fri Oct 08 00:37:27 -0400 [node-01: kernel: netif.linkDown:info]: Ethernet e0c: Link down, check cable.
[?] Fri Oct 08 00:37:27 -0400 [node-01: kernel: netif.linkDown:info]: Ethernet e0d: Link down, check cable.
[?] Fri Oct 08 00:37:28 -0400 [node-01: cf_main: cf.fsm.takeoverByPartnerDisabled:error]: Failover monitor: takeover of node-01 by node-02 disabled (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable. For further assistance, contact technical support).
[?] Fri Oct 08 00:37:28 -0400 [node-01: cf_firmware: cf.fm.partnerFwTransition:info]: params: {'progresscounter': '0', 'newstate': 'SF_UNKNOWN', 'prevstate': 'SF_UP'}
[?] Fri Oct 08 00:37:30 -0400 [node-01: nvmm_mirror_sync: nvmm.mirror.aborting:debug]: mirror of sysid 1, partner_type HA Partner and mirror state NVMM_MIRROR_LAYOUT_SYNCING is aborted because of reason NVPM_ERR_MSG_SEND_FAILED.
[?] Fri Oct 08 00:37:30 -0400 [node-01: vifmgr: vifmgr.portdown:notice]: A link down event was received on node node-01, port e0c.
[?] Fri Oct 08 00:37:30 -0400 [node-01: vifmgr: vifmgr.clus.linkdown:EMERGENCY]: The cluster port e0c on node node-01 has gone down unexpectedly.
[?] Fri Oct 08 00:37:32 -0400 [node-01: cf_main: cf.fsm.partnerNotResponding:notice]: Failover monitor: partner not responding
[?] Fri Oct 08 00:37:32 -0400 [node-01: cf_main: cf.fsm.takeoverCountdown:info]: Failover monitor: takeover scheduled in 10 seconds
[?] Fri Oct 08 00:37:42 -0400 [node-01: cf_main: cf.fsm.takeover.noHeartbeat:alert]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.
[?] Fri Oct 08 00:37:42 -0400 [node-01: cf_main: cf.fsm.stateTransit:info]: Failover monitor: UP --> TAKEOVER
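
The pattern in the excerpt above (link down events on the internal e0a to e0d ports, followed by a takeover initiated on missing heartbeat) can be checked for in a saved log with a short script. The following is a minimal sketch, not a NetApp tool: the function name `summarize_link_down_takeover` and the regular expressions are hypothetical, and the line layout is assumed to match the excerpt shown in this article.

```python
import re

# Assumed layout, taken from the excerpt in this article:
#   ... [node: process: event.name:severity]: message
EVENT_RE = re.compile(
    r"\[(?P<node>[^:\]]+): (?P<proc>[^:]+): "
    r"(?P<event>[\w.]+):(?P<severity>\w+)\]: (?P<message>.*)"
)
# Only the internal ports named in this article (e0a to e0d) are considered.
PORT_RE = re.compile(r"Ethernet (?P<port>e0[a-d]): Link down")


def summarize_link_down_takeover(lines):
    """Return (sorted internal ports that went down, takeover_followed).

    takeover_followed is True only if a cf.fsm.takeover.noHeartbeat event
    appears after at least one internal port link down event.
    """
    down_ports = set()
    takeover_followed = False
    for line in lines:
        match = EVENT_RE.search(line)
        if match is None:
            continue  # not an event line in the assumed layout
        if match["event"] == "netif.linkDown":
            port = PORT_RE.search(match["message"])
            if port is not None:
                down_ports.add(port["port"])
        elif match["event"] == "cf.fsm.takeover.noHeartbeat" and down_ports:
            takeover_followed = True
    return sorted(down_ports), takeover_followed
```

To use it, pass the lines of a saved log file, for example `summarize_link_down_takeover(open(path))` where `path` is a placeholder for wherever the log was exported. A result listing all four ports together with `True` corresponds to the symptom described in this article.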