Node panics with "Unrecoverable metadata block" due to a WAFL inconsistency on the root volume
Applies to
Issue
- The node panics with the following string:
PANIC: Unrecoverable metadata block (file -1, block 1078672, fbn 820, level 0, file type 1) in volume vol0.
WAFL inconsistent. Contact NetApp technical support. in SK process wafl_exempt06 on release 9.11.1P3 (C) on Wed Nov 2 22:28:29 CST 2022
- If the panic occurs during an ONTAP upgrade, the automated nondisruptive upgrade (ANDU) might pause, and messages like the following are logged:
[Node-01: upgrademgr: upgrademgr.update.pausedErr:error]: The automated update of the cluster has been paused due to the following reason: Error: {Update timeout occured in State[Giveback]-- Tasks Pending in nodes Task Name: do-giveback-job}, Action: {Ensure that nodes are healthy and try resuming the update operation after some time}
[Node-01: notifyd: callhome.andu.pausederr:alert]: params: {'epoch': 'baXXXXX-XXXX-XXXX-aXXX-5dXXXXXXXXX', 'subject': 'AUTOMATED NDU PAUSED ON NODE: MUM'}
[Node-01: mgwd: mgmtgwd.jobmgr.private.jobcomplete.failure:info]: Private job "Balanced Placement Model Cache Update" [id 1665] (Balanced Placement model cache update for node Node-01.) completed unsuccessfully: Provisioning model cache update failed. Reason: Node "Node-01" on ring "Management" is offline. Check the health of the cluster using the "cluster show" command. For further assistance, contact technical support. (5440894).
[Node-01: cf_main: ha.takeoverImpVersion:alert]: Takeover of the partner node is impossible due to version mismatch.
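When triaging saved log text, the symptoms listed above can be matched programmatically. The following is a minimal sketch only, assuming the panic string and EMS event names appear exactly as quoted in this article. The function names `parse_panic` and `find_andu_events` are hypothetical helpers written for this example and are not part of ONTAP or any NetApp tool.

```python
import re

# Pattern modeled only on the panic string quoted in this article
# (assumption: other releases may format the message differently).
PANIC_RE = re.compile(
    r"PANIC: Unrecoverable metadata block \("
    r"file (?P<file>-?\d+), block (?P<block>\d+), fbn (?P<fbn>\d+), "
    r"level (?P<level>\d+), file type (?P<file_type>\d+)\) "
    r"in volume (?P<volume>\w+)\."
)

# EMS event names taken from the log excerpts in this article.
ANDU_EVENTS = (
    "upgrademgr.update.pausedErr",
    "callhome.andu.pausederr",
    "ha.takeoverImpVersion",
)


def parse_panic(line):
    """Return the fields of the panic string as a dict, or None if absent."""
    match = PANIC_RE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    volume = fields.pop("volume")
    result = {name: int(value) for name, value in fields.items()}
    result["volume"] = volume
    return result


def find_andu_events(lines):
    """Return the known event names that occur in any of the given lines."""
    return [event for event in ANDU_EVENTS
            if any(event in line for line in lines)]
```

A match on the panic pattern with the root volume name, together with one or more of the listed events, corresponds to the combination of symptoms described in this article. It does not by itself confirm the root cause.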