メインコンテンツへスキップ

ONTAPのアップグレード後に重複するアグリゲートが表示される

Views:
1
Visibility:
Public
Votes:
0
Category:
fas-systems
Specialty:
hw
Last Updated:

環境

  • FAS2750
  • 自動無停止アップグレード(ANDU)
  • ディスクファームウェアのバックグラウンド更新(BDFU)
  • ONTAP 9.7P15から9.8P21経由で9.10.1P17にアップグレード

問題

  • 9.8P21から9.10.1P17へのONTAPのアップグレード中に、ディスク(0b.00.11 )がオフラインになり、 見つからないとマークされました。
    • ディスクファームウェアの更新により、ディスクがオフラインになりました。
    • アグリゲート aggr01  がデグレード状態であり、ディスクが不足しています。

node04のEMSログ:

 [?]  Thu Dec 12 21:17:46 +0900 [node04: cf_giveback: ha.giveback.sysCommit:info]: Subsystem qos_ll_sfo_giveback took 151 msecs to commit giveback of aggregate 'aggr01'.
 [?]  Thu Dec 12 21:17:46 +0900 [node04: config_thread: raid.disk.assign.offline_ref:debug]: aggregate /aggr01/plex0/rg0/0b.00.5 assigned as an offline reference storage for /aggr01/plex0/rg0/0b.00.11.
 [?]  Thu Dec 12 21:17:46 +0900 [node04: config_thread: raid.disk.assign.offline_ref:debug]: aggregate /aggr01/plex0/rg0/0a.01.3 assigned as an offline reference storage for /aggr01/plex0/rg0/0b.00.11.
 [?]  Thu Dec 12 21:17:46 +0900 [node04: config_thread: raid.rg.degraded:notice]: : Raid group /aggr01/plex0/rg0 is degraded
 [?]  Thu Dec 12 21:17:46 +0900 [node04: config_thread: raid.disk.offline:notice]: Marking Disk /aggr01/plex0/rg0/0b.00.11 Shelf 0 Bay 11 [NETAPP   X343_SSKBE1T8A10 NA02] S/N [WXXXXXXN] UID [5000C500:DE81263B:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000] offline.
 [?]  Thu Dec 12 21:17:46 +0900 [node04: bg_disk_fw_update_admin: bdfu.selected:info]: Disk 0b.00.11 [NETAPP   X343_SSKBE1T8A10 NA02] S/N [WXXXXXXN] selected for background disk firmware update.
 [?]  Thu Dec 12 21:17:46 +0900 [node04: config_thread: raid.disk.online:notice]: Onlining Disk /aggr01/plex0/rg0/0b.00.11 Shelf 0 Bay 11 [NETAPP   X343_SSKBE1T8A10 NA02] S/N [WXXXXXXN] UID [5000C500:DE81263B:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]

 

  • ギブバック後、スペアディスク0b.00.23 を使用して再構築されます。

node03 EMSログ:

 [?]  Thu Dec 12 21:17:47 +0900 [node03: config_thread: raid.rg.recons.missing:notice]: RAID group /aggr01/plex0/rg0 is missing 1 disk(s).
 [?]  Thu Dec 12 21:17:47 +0900 [node03: config_thread: raid.rg.recons.info:notice]: Spare disk 0b.00.23 will be used to reconstruct one missing disk in RAID group /aggr01/plex0/rg0.
 [?]  Thu Dec 12 21:17:47 +0900 [node03: config_thread: raid.rg.recons.start:notice]: Disk /aggr01/plex0/rg0/0b.00.23 Shelf 0 Bay 23 [NETAPP   X343_SSKBE1T8A10 NA02] S/N [WXXXXXXG] UID [5000C500:DE8204D7:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]: starting reconstruction, using disk 0b.00.23, disk block 5248.
 [?]  Thu Dec 12 21:17:47 +0900 [node03: config_thread: raid.vol.undestroy.info.missing:info]: params: {'disk_info': 'Disk /aggr01/plex0/rg0/0b.00.23 Shelf 0 Bay 23 [NETAPP   X343_SSKBE1T8A10 NA02] S/N [WXXXXXXG] UID [5000C500:DE8204D7:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]', 'shelf': '0', 'bay': '23', 'vendor': 'NETAPP  ', 'model': 'X343_SSKBE1T8A10', 'firmware_revision': 'NA02', 'serialno': 'WXXXXXXG', 'disk_type': '4', 'disk_rpm': '10000', 'carrier': '', 'site': 'Local'}

  • 障害が発生した別のディスクを交換すると、  node04 のフェイルオーバーステータスが部分的なギブバックに変わります。

::> storage failover show
                Takeover
Node       Partner     Possible State Description
-------------- -------------- -------- -------------------------------------
node03   node04   true    Connected to node04
node04    node03   true    Connected to node03, Partial giveback
2 entries were displayed.

  • 両方のHAノードでaggr01 が表示され、node04 では不明なディスクのみが表示され、他のディスクはFAILED とマークされます。

node04 sysconfig -r:

Aggregate aggr01 (failed, raid_dp, partial, fast zeroed) (block checksums) Plex /aggr01/plex0 (offline, failed, inactive) RAID group /aggr01/plex0/rg0 (partial, block checksums)

  RAID Disk Device      HA  SHELF BAY CHAN Pool Type  RPM  Used (MB/blks)   Phys (MB/blks)
  --------- ------      ------------- ---- ---- ---- ----- --------------   --------------
  dparity  FAILED         N/A             1713523/ -
  parity   FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    0b.00.11     0b   0   11  SA:B   0   SAS 10000 1713523/3509295616 1716957/3516328368 (fast zeroed)
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  data    FAILED         N/A             1713523/ -
  Raid group is missing 18 disks.

 

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.