繰り返されるcf.hwassist.missedKeepAliveエラー
環境
- AFF A220 / AFF A150 / AFF C190 / FAS2750 / FAS2720
- クラスタ\ノード管理ネットワーク
問題
- 継続的なHWアシストのキープアライブエラー
- HAペアの両方のノードで数分後にクリアされます。
EMS.Log
例:
14:03:59 +0100 [node_name01: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name02).
14:11:29 +0100 [node_name01: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name02).
14:24:59 +0100 [node_name01: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name02).
14:32:29 +0100 [node_name01: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name02).
15:28:00 +0100 [node_name01: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name02).
15:35:30 +0100 [node_name01: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name02).
13:34:02 +0100 [node_name02: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name01).
13:41:32 +0100 [node_name02: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name01).
13:55:02 +0100 [node_name02: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name01).
14:02:32 +0100 [node_name02: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name01).
14:16:02 +0100 [node_name02: cf_hwassist: cf.hwassist.missedKeepAlive:error]: HW-assisted takeover missing keep-alive messages from HA partner (node_name01).
14:23:32 +0100 [node_name02: cf_hwassist: cf.hwassist.recvKeepAlive:info]: hw_assist: Received hw_assist KeepAlive alert from partner(node_name01).
- エラーはタイミングパターンに従います。
例:every 7:30 minutes.
- Command
storage failover hwassist show
の出力に次のエラーが表示されます。
cluster::> storage failover hwassist show
Node
-----------------
node-01
Partner: node-02
Hwassist Enabled: true
Hwassist IP: 10.XX.XX.X
Hwassist Port: 4444
Monitor Status: active
Inactive Reason: -
Corrective Action: -
Keep-Alive Status: Error: did not receive hwassist keep alive alerts from partner.
node-02
Partner: node-01
Hwassist Enabled: true
Hwassist IP: 10.XX.XX.X
Hwassist Port: 4444
Monitor Status: active
Inactive Reason: -
Corrective Action: -
Keep-Alive Status: Error: did not receive hwassist keep alive alerts from partner.
2 entries were displayed.
- コマンド
storage failover hwassist test
の出力にtimed outエラーが表示されます。
cluster::> storage failover hwassist test -node *
Info: No response from partner(node-01).Timed out.
Info: No response from partner(node-02).Timed out.