緊急シャットダウン:環境上の理由シャットダウン(バッテリがない)マスタ-異常停止-スタート
のしんだ
環境
- AFF A800
- NVRAMバッテリ
問題
マザーボードがバッテリのセンサーを読み取れません。
- SP(BMC)ログでは、次の動作を確認できます。
Mar 12 07:26:05 BMC kernel: [2831352.560000] i2c bus 0x3: !!!!!! master-abnormal stop-start !!!!!!!!!
Mar 12 07:26:06 BMC kernel: [2831353.560000] i2c i2c-3: send_receive: Timed out on i2c data Operating
Mar 12 07:29:44 BMC hsam[1524]: HSAM OS(bmc):cmd(set) FLD(nvbatt-1):fault(Bat_Temp: Temperature Sensor 144 LNC)
Mar 12 07:29:44 BMC hsam[1524]: FRU /chassis-1 LED on
Mar 12 07:29:44 BMC hsam[1524]: FRU /chassis-1/controller-b LED on
Mar 12 07:29:44 BMC hsam[1524]: HSAM OS(bmc):cmd(set) FLD(nvbatt-1):fault(Bat_Temp: Temperature Sensor 144 LCR)
Mar 12 07:29:46 BMC hsam[1524]: HSAM OS(bmc):cmd(clr) FLD(nvbatt-1):fault(Bat_Temp: Temperature Sensor 144 LCR)
Mar 12 07:29:46 BMC hsam[1524]: HSAM OS(bmc):cmd(clr) FLD(nvbatt-1):fault(Bat_Temp: Temperature Sensor 144 LNC)
Mar 12 10:20:40 BMC kernel: [2841827.370000] i2c bus 0x3: !!!!!master-abort!!!!!!!!!
Mar 12 10:20:41 BMC kernel: [2841828.370000]
Mar 12 10:20:41 BMC kernel: [2841828.370000] i2c bus 0x3: !!!!!! master-abnormal stop-start !!!!!!!!!
Mar 12 10:20:42 BMC kernel: [2841829.370000] i2c i2c-3: send_bytes: Timed out sending data
Mar 12 10:20:42 BMC kernel: [2841829.380000]
Mar 12 10:20:42 BMC kernel: [2841829.380000] i2c bus 0x3: !!!!!! master-abnormal stop-start !!!!!!!!!
Mar 12 10:20:43 BMC kernel: [2841830.380000] i2c i2c-3: send_receive: Timed out on i2c data Operating
Mar 12 10:20:43 BMC kernel: [2841830.380000]
Mar 12 10:20:43 BMC kernel: [2841830.380000] i2c bus 0x3: !!!!!master-abort!!!!!!!!!
- 緊急のシャットダウンによりノードが停止しました。
Mar 12 18:08:56 [node_name:callhome.battery.low:ALERT]: Call home for BATTERY_LOW.
Thu Mar 12 18:10:11 IST 2020
SP-login: login: Mar 12 18:13:07 [node_name:monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Battery not present)
Terminated
.
Uptime: 9m25s
System powering down...