EC rebalancing fails after a few minutes
Environment
StorageGRID 11.6.0.4
Issue
An EC rebalance job fails a few minutes after it is started.
root@mdebglgrid-admin:~ # rebalance-data status
==============================================================================
Job ID : 14639721269233092077
Site : mdebgl-rz3
State : Failure
Percentage : Unknown
Start Time : 2022-12-16 12:04:43 UTC
End Time : 2022-12-23 12:06:46 UTC
Check rebalance-data.log and bycast.log on the EC leader. bycast.log contains an error such as:
Dec 30 07:05:29 mdebglgrid-node10 ADE: |21664688 0784005611 ECJM CSRT 2022-12-30T07:05:29.685124| ERROR 1067 PROC: Exception: /build/src/modules/ErasureCoding/EC_JobManager_Module/SiteRebalanceJob.cc(346): Throw in function std::vector<VcsMoveInfo> erasurecoding::SiteRebalanceJob::getMoveRecommendations(byc::GroupID)#012Dynamic exception type: boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<std::runtime_error>>#012std::exception::what: ENFORCE failed: !"Exhausted retry limit or retry time for getting move recommendations"#012
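The log entry above is hard to read because syslog has escaped the embedded newlines as `#012` (octal 12). As a reading aid, here is a minimal Python sketch that decodes those escapes and pulls out the `std::exception::what` text. The helper names `decode_syslog_escapes` and `extract_what` are hypothetical and not part of StorageGRID, and the sample string is a shortened copy of the entry above. The sketch assumes every `#` followed by three octal digits is an escape, which may not hold for every log line.

```python
import re

# Shortened copy of the bycast.log entry shown above (illustrative only).
SAMPLE = (
    'ERROR 1067 PROC: Exception: Throw in function std::vector<VcsMoveInfo> '
    'erasurecoding::SiteRebalanceJob::getMoveRecommendations(byc::GroupID)'
    '#012Dynamic exception type: boost::exception_detail::clone_impl<...>'
    '#012std::exception::what: ENFORCE failed: '
    '!"Exhausted retry limit or retry time for getting move recommendations"#012'
)

WHAT_PREFIX = "std::exception::what:"


def decode_syslog_escapes(line: str) -> str:
    """Replace '#ooo' octal escapes (for example '#012' for newline) with the character."""
    return re.sub(r"#([0-7]{3})", lambda m: chr(int(m.group(1), 8)), line)


def extract_what(line: str):
    """Return the text after 'std::exception::what:', or None if it is absent."""
    for part in decode_syslog_escapes(line).splitlines():
        if part.startswith(WHAT_PREFIX):
            return part[len(WHAT_PREFIX):].strip()
    return None


if __name__ == "__main__":
    print(extract_what(SAMPLE))
```

Run against the sample, this prints only the `ENFORCE failed: ...` message, which is the part that identifies the failure: the job exhausted its retry limit or retry time while getting move recommendations.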