Unexpected node reboot alert triggered for an SGF6112 Storage Node
Environment
- NetApp StorageGRID
- StorageGRID appliance SGF6112
Issue
- In StorageGRID Grid Manager, an "Unexpected node reboot" alert is reported for one or more SGF6112 Storage Nodes.
- The reboot occurred on the first Sunday of the month.
- The Storage Node's base OS syslog shows that the monthly internal RAID scrubbing process (mdadm checkarray) was running before the node rebooted.
Aug 4 00:57:01 localhost CRON[2431869]: (root) CMD (if [ -x /usr/share/mdadm/checkarray ] && [ $(date +%d) -le 7 ]; then /usr/share/mdadm/checkarray --cron --all --idle --quiet; fi)
Aug 4 08:00:53 localhost kernel: [992624.118933] md: delaying data-check of md124 until md123 has finished (they share one or more physical units)
Aug 4 08:00:53 localhost kernel: [992624.118935] md: delaying data-check of md120 until md123 has finished (they share one or more physical units)
Aug 4 08:00:53 localhost kernel: [992624.118943] md: delaying data-check of md121 until md120 has finished (they share one or more physical units)
Aug 4 08:00:53 localhost kernel: [992624.118946] md: delaying data-check of md125 until md120 has finished (they share one or more physical units)
Aug 4 08:00:53 localhost kernel: [992624.118949] md: data-check of RAID array md123
.
Aug 4 08:53:06 localhost kernel: [995757.911186] md: delaying data-check of md127 until md115 has finished (they share one or more physical units)
Aug 4 08:53:06 localhost kernel: [995757.911195] md: delaying data-check of md118 until md115 has finished (they share one or more physical units)
Aug 4 08:53:06 localhost kernel: [995757.911204] md: delaying data-check of md122 until md115 has finished (they share one or more physical units)
Aug 4 08:53:06 localhost kernel: [995757.911232] md: data-check of RAID array md116
### REBOOT
Aug 4 10:16:19 localhost kernel: [ 0.000000] Linux version 5.10.0-26-amd64 (debian-kernel@lists.debian.org) (gcc-10 (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2) #1 SMP Debian 5.10.197-1+ntap1 (2023-10-30)
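The pattern in the excerpt above (a data-check starts on one or more md arrays, then a fresh kernel boot banner appears) can be checked for mechanically. The following Python sketch is illustrative only and is not a NetApp tool. The function names are hypothetical, the regular expressions are derived from the sample lines in this article, and the "data-check done" pattern is an assumption about the kernel's completion message, which does not appear in the excerpt. The Sunday test in is_first_sunday is also an assumption: the cron schedule fields are not visible in the syslog line, only the guard `[ $(date +%d) -le 7 ]`.

```python
import re
from datetime import date

# Patterns derived from the sample syslog lines in this article.
CHECK_START = re.compile(r"md: data-check of RAID array (md\d+)")
BOOT_BANNER = re.compile(r"kernel: \[\s*0\.000000\] Linux version")
# Assumption: completion message format, not shown in the article's excerpt.
CHECK_DONE = re.compile(r"md: (md\d+): data-check done")


def is_first_sunday(d: date) -> bool:
    """True if d is a Sunday within the first seven days of its month.

    Mirrors the cron guard `[ $(date +%d) -le 7 ]`; the Sunday part is an
    assumption about the cron schedule, which the log line does not show.
    """
    return d.weekday() == 6 and d.day <= 7


def find_reboots_during_scrub(lines):
    """Return (boot_line, arrays) for each boot seen while a data-check
    had started and no completion had been logged for it."""
    in_progress = set()
    findings = []
    for line in lines:
        if BOOT_BANNER.search(line):
            if in_progress:
                findings.append((line.strip(), sorted(in_progress)))
            in_progress.clear()  # a new boot resets all tracking
            continue
        done = CHECK_DONE.search(line)
        if done:
            in_progress.discard(done.group(1))
            continue
        started = CHECK_START.search(line)
        if started:
            in_progress.add(started.group(1))
    return findings


# Lines copied from the excerpt above; "delaying" lines are ignored
# because they do not match CHECK_START.
SAMPLE = [
    "Aug 4 08:00:53 localhost kernel: [992624.118949] md: data-check of RAID array md123",
    "Aug 4 08:53:06 localhost kernel: [995757.911204] md: delaying data-check of md122 "
    "until md115 has finished (they share one or more physical units)",
    "Aug 4 08:53:06 localhost kernel: [995757.911232] md: data-check of RAID array md116",
    "Aug 4 10:16:19 localhost kernel: [ 0.000000] Linux version 5.10.0-26-amd64 "
    "(debian-kernel@lists.debian.org)",
]

if __name__ == "__main__":
    for boot_line, arrays in find_reboots_during_scrub(SAMPLE):
        print("Reboot while data-check was in progress on:", ", ".join(arrays))
        print("  boot line:", boot_line)
```

To use it against a real node, the list of lines could be read from a copy of the base OS syslog, for example with `open(path).readlines()`, where the path depends on how the log was collected. The syslog timestamps carry no year, so any date passed to is_first_sunday has to be supplied from other context.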