メインコンテンツへスキップ

AFF A250またはFAS500fでシステムのシャットダウンSPハートビートが停止し、ブート時にKCSエラーが発生

Views:
25
Visibility:
Public
Votes:
0
Category:
aff-series
Specialty:
HW
Last Updated:

環境

  • AFF A250
  • FAS500f

問題

  • SP HBTが停止したためノードがシャットダウンする:
Sat Aug 19 03:46:24 -0400 [cluster-01: spmgrd: sp.heartbeat.stopped:debug]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.
Sat Aug 19 03:46:24 -0400 [cluster-01: spmgrd: callhome.sp.hbt.missed:debug]: Call home for SP HBT MISSED
Sat Aug 19 03:56:44 -0400 [cluster-01: spmgrd: callhome.sp.hbt.stopped:debug]: Call home for SP HBT STOPPED
Sat Aug 19 03:59:08 -0400 [cluster-01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.
Sat Aug 19 04:09:08 -0400 [cluster-01: env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the BMC)
  • パートナーがリブートしていることを確認するため、パートナーがテイクオーバーします。
Sat Aug 19 04:09:33 -0400 [cluster-02: cf_main: cf.fsm.takeover.on.reboot:debug]: Failover monitor: One node initiated automatic takeover after detecting that its partner node is rebooting.
  • ノードにコンソールが接続されている場合、ノードはLOADER状態です。SPに切り替えると、次のスパムが検出されます。
sh: can't create /sys/module/watchdog_hw/parameters/current_wdt_device: nonexistent directory
sh: can't create /sys/module/watchdog_hw/parameters/current_wdt_device: nonexistent directory
 
KCS cmd(NETFN 0x6, CMD 0x1) failed, ret -2
  • ノードの電源再投入に変更はありませんでした
    • ノードは引き続きブートせず、BMCが応答しませんでした。
    • LOADERから_ontapをブートしようとすると、ブート時に次のようになります。
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
Could not patch the required SMBIOS 1 field 1 with the FRU data.
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
Copyright(c) 2021 American Megatrends, Inc. 
��Copyright(c) 2021 American Megatrends, Inc. 
��ERROR: Class:0; Subclass:20000; Operation: 1002
 
Boot Loader version 6.5.8 
Copyright (C) 2000-2003 Broadcom Corporation.
Portions Copyright (C) 2002-2023 NetApp, Inc. All Rights Reserved.
 
KCS cmd(NETFN 0x6, CMD 0x1) failed, ret -2
Resetting BMC from backup FW...
Waiting 30 seconds for BMC to reboot...
KCS cmd(NETFN 0x6, CMD 0x1) failed, ret -2
Copyright(c) 2021 American Megatrends, Inc. 
��ERROR: Class:0; Subclass:20000; Operation: 1002

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.