SP HBT MISSED イベントが報告され、自動的に回復されました
環境
- AFF-A300
- ONTAP 9.6P2
- Service Processor(SP)5.6P2
問題
- SP HBT MISSED(Service Processor Heartbeat Missed)イベントがノードnode-02で報告され、回復されました
- 次のログエントリが観察されました
例:
EMSFri Dec 19 16:12:50 [node-02: nphmd: hm.alert.cleared:notice]: Alert Id = SPNotConfiguredAlert , Alerting Resource = SP Config cleared by monitor controllerFri Dec 19 16:13:33 [node-02: spsm_listener: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 20 seconds.Fri Dec 19 16:14:17 [node-02: spsm_listener: callhome.sp.hbt.missed:notice]: Call home for SP HBT MISSEDFri Dec 19 16:20:34 [node-02: spsm_listener: sp.update.status:debug]: params: {'reason': 'sp_startup_notify_servprocd: SP startup handler has been called. '}Fri Dec 19 16:20:38 [node-02: spsm_listener: sp.heartbeat.resumed:info]: Received IPMI heartbeat from the Service Processor (SP).Fri Dec 19 16:21:06 [node-02: splog_main: splog.running.normally:info]: Process splogd is operating normally.
SP-MGMT-MLOG-TXT.GZ00000ae5.0086fbb9 74a3fe6d Fri Dec 19 2025 16:12:38 [kern_servprocd:info:5164] 0x80a1dcb00: 8503e8000169c8e5: NOTICE: Servprocd::CLI: get_spcs_port : spcs port value in sp_api_service mdb is 5000000000ae5.0086fbbb 74a3fe9a Fri Dec 19 2025 16:12:43 [kern_servprocd:info:5164] Thrift: Fri Dec 19 16:12:43 2025 TSocket::open() timed out <Host: 192.0.2.80 Port: 50000>00000ae5.0086fbbc 74a3fe9a Fri Dec 19 2025 16:12:43 [kern_servprocd:info:5164] 0x80a1dcb00: 8503e8000169c8e5: ERR: Servprocd::CLI: sp_get_sensor_info_worker : TTX ERROR: 1 (open() timed out)00000ae5.0086fbed 74a40282 Fri Dec 19 2025 16:14:24 [kern_servprocd:info:5164] 0x80a1dd000: 8503e8000169c8e5: NOTICE: Servprocd::CLI: get_spcs_port : spcs port value in sp_api_service mdb is 5000000000ae5.0086fbee 74a402b4 Fri Dec 19 2025 16:14:29 [kern_servprocd:info:5164] Thrift: Fri Dec 19 16:14:29 2025 TSocket::open() timed out <Host: 192.0.2.80 Port: 50000>00000ae5.0086fbef 74a402b4 Fri Dec 19 2025 16:14:29 [kern_servprocd:info:5164] 0x80a1dd000: 8503e8000169c8e5: ERR: Servprocd::CLI: sp_get_sensor_info_worker : TTX ERROR: 1 (open() timed out)00000ae5.0086fc17 74a41114 Fri Dec 19 2025 16:20:34 [kern_servprocd:info:5164] 0x80a1dc600: 0: NOTICE: Servprocd::SpUpdate: SpUpdateStateHandleEvent: e_sp_startup event received in SP_UPDATE_NOT_IN_PROGRESS state.00000ae5.0086fc1b 74a41114 Fri Dec 19 2025 16:20:34 [kern_servprocd:info:5164] 0x80a1dc600: 0: NOTICE: Servprocd::SpUpdate: sp_startup_notify_servprocd: SP startup handler has been called.00000ae5.0086fc2b 74a412e7 Fri Dec 19 2025 16:21:23 [kern_servprocd:info:5164] 0x80a1dda00: 0: NOTICE: Servprocd::SpUpdate: SpUpdateStateHandleEvent: e_sp_online event received in SP_UPDATE_NOT_IN_PROGRESS state.00000ae5.0086fc2c 74a412e7 Fri Dec 19 2025 16:21:23 [kern_servprocd:info:5164] 0x80a1dda00: 0: NOTICE: Servprocd::SpUpdate: sp_bootup_notify_servprocd: SP online handler has been called