クラスタネットワークスイッチ間のISL停止
環境
問題
- スイッチのISLリンクが停止と表示されている
- ifstatでクラスタポートがCRCエラーを受信していることが表示される
-- interface e0a (12 hours, 37 minutes, 43 seconds) --
RECEIVE
Total frames: 4566k | Frames/second: 100 | Total bytes: 731m
Bytes/second: 16099 | Total errors: 4718 | Errors/minute: 6
Total discards: 0 | Discards/minute: 0 | Multi/broadcast: 6206
Non-primary u/c: 0 | CRC errors: 4718 | Long frames: 0
Jabber: 0 | Length errors: 0 | No buffer: 0
Pause: 0 | Jumbo: 54 | Error symbol: 0
Bus overruns: 0 | LRO segments: 4301k | LRO bytes: 685m
LRO6 segments: 0 | LRO6 bytes: 0 | Bad UDP cksum: 0
-- interface e1a (12 hours, 37 minutes, 43 seconds) --
RECEIVE
Total frames: 4252k | Frames/second: 94 | Total bytes: 512m
Bytes/second: 11277 | Total errors: 1831 | Errors/minute: 2
Total discards: 0 | Discards/minute: 0 | Multi/broadcast: 6204
Non-primary u/c: 0 | CRC errors: 1831 | Long frames: 0
Jabber: 0 | Length errors: 0 | No buffer: 0
Pause: 0 | Jumbo: 0 | Error symbol: 0
Bus overruns: 0 | LRO segments: 4174k | LRO bytes: 488m
LRO6 segments: 0 | LRO6 bytes: 0 | Bad UDP cksum: 0
- EMSログで次のアラートを確認
[Node-01: nphmd: hm.alert.raised:alert]: Alert Id = NodeIfInErrorsWarnAlert , Alerting Resource = Node-01/e0a raised by monitor controller
[Node-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: UnsupportedSwitch_Alert[switch01(xxxxxxxxxxxx)].
[Node-01: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: Large MTU Packet Loss - Ping failures detected between Node-01_clus1 ( 169.254.210.133 ) on Node-01 and Node-02_clus2 ( 169.254.61.145 ) on Node-02