接続チェックの失敗が原因で、StorageGRID が運用停止タスクを開始できません
環境
問題
- StorageGRID がノードの運用停止タスクを開始できない
- [Start Decommission]を押すと、[Maintenance]>[Decommission]>[Decommission Nodes]にエラーが表示されます
500: Internal Server Error
This procedure cannot be started or configured while an existing decommission procedure is in progress.
- [Support] > [Grid topology] > [admin node] > [CMN]>[Grid]をクリックします [Tasks] > [Overview] に、運用停止タスクが [Pending]に表示されます
gdu-server.log
管理ノードで、特定のノードの接続チェックがタイムアウトまでに失敗することが示されています
INFO -- gdu-server: Decommissioning nodes: [<NODE>]
INFO -- gdu-server: Initializing GDU MakeGrid composite task for decommission procedure
INFO -- gdu-server: Starting GDU MakeGrid composite task
INFO -- gdu-server: Calling make-grid action: checkConnectivity
INFO -- gdu-server: Checking grid node connectivity
INFO -- gdu-server: Checking connectivity of <ADMIN_NODE>
INFO -- gdu-server: Executing command `test -e /etc/bycast-version.d/storage-grid-release` on <ADMIN_NODE>
INFO -- gdu-server: Checking connectivity of <STORAEG_NODE>
INFO -- gdu-server: Connectivity check for <STORAEG_NODE> timed out. Retrying one more time ...
ERROR -- gdu-server: execution expired (Timeout::Error)
ERROR -- gdu-server: /usr/lib/ruby/2.5.0/socket.rb:64:in `connect'
ERROR -- gdu-server: /usr/lib/ruby/2.5.0/socket.rb:64:in `connect_internal'
ERROR -- gdu-server: /usr/lib/ruby/2.5.0/socket.rb:137:in `connect'
ERROR -- gdu-server: Unable to connect to <STORAEG_NODE>: execution expired
ERROR -- gdu-server: Failure occurred during Decommission