ONTAP SAN LIFs fail fabric login during MCC switchover
Environment
- ONTAP 9.10
- Cisco MDS 9706 running NX-OS 8.4(2c)
- Cisco MDS 9396T running NX-OS 9.3(2)
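Issue
- After an ONTAP MCC switchover is performed during an update, some SAN FC LIFs do not log in to the fabric (FLOGI).
- In line with best practices, port overlap between switchover and switchback is avoided.
- Host initiators are not logged in, as shown by igroup show -v.
- Tuning of FLOGI scale optimization was attempted.
- The LIFs are not logged in to the Cisco fabric, that is, there are no NS registrations.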
::> network interface show -data-protocol fcp -fields status-admin,status-oper,status-extended
- ONTAP example:
scsitarget.fct.loginfailed:error FC port  NPIV port  failed fabric login with status of 3 and an extended status of x1a
- Link breaks occur at regular intervals:
scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1c.
 scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1a.
 scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1b.
 scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1d.
scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 1c with event status 1 , topology type 1, status1 0x0, status2 0x0.
 scsitarget.slifct.wqeError:debug]: FC port 1c has a WQE processing failure of type SLI_ELS_REQ, command FLOGI, subtype x0, with status of 3 and an extended status x1a for NPIV port 0.
 scsitarget.fct.loginfailed:error]: FC port 1c, NPIV port 0 failed fabric login with status of 3 and an extended status of x1a.
 scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 1a with event status 1 , topology type 1, status1 0x0, status2 0x0.
 scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 1b with event status 1 , topology type 1, status1 0x0, status2 0x0.
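When the EMS log is long, it can help to tally the failed fabric logins per FC port. The following is a minimal Python sketch, assuming the EMS lines have been exported to text and that the message wording matches the excerpt above. The function name count_failed_flogi and the sample list are hypothetical and not part of ONTAP.

```python
import re
from collections import Counter

# Pattern for the loginfailed event text, based only on the excerpt above
# (assumption: other ONTAP releases may word this message differently).
LOGIN_FAILED = re.compile(
    r"scsitarget\.fct\.loginfailed:error\]?:?\s+FC port (?P<port>\w+), "
    r"NPIV port (?P<npiv>\d+) failed fabric login with status of (?P<status>\w+) "
    r"and an extended status of (?P<ext>\w+)"
)


def count_failed_flogi(lines):
    """Count failed fabric logins keyed by (port, NPIV port, status, extended status)."""
    counts = Counter()
    for line in lines:
        match = LOGIN_FAILED.search(line)
        if match:
            key = (match["port"], int(match["npiv"]), match["status"], match["ext"])
            counts[key] += 1
    return counts


# Sample lines copied from the excerpt above.
sample = [
    "scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1c.",
    "scsitarget.fct.loginfailed:error]: FC port 1c, NPIV port 0 failed fabric login "
    "with status of 3 and an extended status of x1a.",
    "scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target "
    "HBA 1a with event status 1 , topology type 1, status1 0x0, status2 0x0.",
]

for key, count in count_failed_flogi(sample).items():
    print(key, count)
```

Ports that repeatedly appear with status 3 and extended status x1a are the ones to check against the switch-side logs.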
- Cisco example:
show flogi internal event-history errors
show fcns internal errors
show flogi internal info
 
 Fport server platform inf info:
 max_count_of_flogi_being_processed 75
 Flogi Timeout value 5200
 flogi_mts_q_weight 3
 flogi_fc2_q_weight 1
 process_prio_switch_upper_limit 200
 process_prio_switch_lower_limit 0
 Is process high priority 0
 
 Fport server global info.
 FCID interop mode auto, Loop monitor mode FALSE
 Stats : max_count_of_flogi_being_processed 75
 count_of_flogi_being_processed 0
 Stats: max_count_ever_reached 5
 Stats: num_of_times_hit_max_count 0
 Stats: num_of_config_timeouts 0
 Stats: num_of_preconfig_timeouts 0
 Stats: num_of_times_process_prio_changed 0
 Stats: num_total_flogi_received 143
 Stats: fs_flogi_pacer_enabled: 0
 Stats: fs_flogi_pacer_timerval: 1000
 Stats: fs_flogi_scale_enabled: 1
 Stats: fs_flogi_quiesce_timerval: 0
 Pending queue: len(0), max len(0) [Wed Oct 18 13:39:57 2023]
 Timer queue: len(0), max len(1) [Wed Oct 18 13:39:57 2023]
 Upgrade in progress FALSE 0; dpvm en[0] vmis en[0] npiv en[1] evfp en[0]
 notifications: npiv FALSE, evfp FALSE
 fc redirect confcheck 0
 E_D_TOV 2000 R_A_TOV 10000 MaxRxBuf 2112 FCClasses 0xc
 
 show logging log
 %PORT-5-IF_DOWN_OFFLINE: %$VSAN 3%$ Interface fc1/30 is down (Offline)
 %PORT-5-IF_DOWN_LINK_FAILURE: %$VSAN 3%$ Interface fc1/30 is down (Link failure loss of sync)
 %PORT-5-IF_DOWN_OFFLINE: %$VSAN 3%$ Interface fc1/31 is down (Offline)
 %PORT-5-IF_DOWN_LINK_FAILURE: %$VSAN 3%$ Interface fc1/31 is down (Link failure loss of sync)
 
 %PORT-5-IF_DOWN_ADMIN_DOWN: %$VSAN 3%$ Interface fc1/32 is down (Administratively down)
 %PORT-5-IF_UP: %$VSAN 3%$ Interface fc1/34 is up in mode F
 %PORT-5-IF_DOWN_ADMIN_DOWN: %$VSAN 3%$ Interface fc1/30 is down (Administratively down)
 %PORT-5-IF_UP: %$VSAN 3%$ Interface fc1/35 is up in mode F
 
 
 show hardware internal fcmac port 34 port-event
 
 -------------------- ----- -------------
 - - IPS_LINK_UP <<< Link is up >>>
 E_IPS_LINK_INIT_SUC IPS_LINK_UP (0)
 E_IPS_PM_DIS IPS_LINK_OFFLINE (0) <<< Link is hung in offline state; issue 'shutdown' / 'no shutdown' to recover >>>
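To see which switch ports are still down after a run of link events, the show logging log output can be reduced to the last reported state per interface. The following is a minimal Python sketch, assuming the syslog lines follow the %PORT-5-... format shown above. The names last_state_per_interface and interfaces_down are hypothetical helpers, not NX-OS features.

```python
import re

# Pattern for the %PORT-5-IF_* messages in the excerpt above
# (assumption: only fcX/Y interface names are of interest).
PORT_EVENT = re.compile(
    r"%PORT-5-(?P<event>IF_\w+): %\$VSAN (?P<vsan>\d+)%\$ "
    r"Interface (?P<intf>fc\d+/\d+) is (?P<state>up|down)"
    r"(?: \((?P<reason>[^)]*)\))?"
)


def last_state_per_interface(lines):
    """Map each interface to the (state, reason) of its most recent log line."""
    states = {}
    for line in lines:
        match = PORT_EVENT.search(line)
        if match:
            states[match["intf"]] = (match["state"], match["reason"])
    return states


def interfaces_down(states):
    """Return the sorted names of interfaces whose last reported state is down."""
    return sorted(intf for intf, (state, _) in states.items() if state == "down")


# Sample lines copied from the excerpt above, in log order.
sample = [
    "%PORT-5-IF_DOWN_OFFLINE: %$VSAN 3%$ Interface fc1/30 is down (Offline)",
    "%PORT-5-IF_DOWN_LINK_FAILURE: %$VSAN 3%$ Interface fc1/30 is down (Link failure loss of sync)",
    "%PORT-5-IF_UP: %$VSAN 3%$ Interface fc1/34 is up in mode F",
    "%PORT-5-IF_DOWN_ADMIN_DOWN: %$VSAN 3%$ Interface fc1/30 is down (Administratively down)",
]

states = last_state_per_interface(sample)
print(interfaces_down(states))
```

Note that a port reported as up in syslog can still be hung in the offline state at the fcmac level, as the port-event output above shows, so this summary is only a starting point for choosing which ports to inspect further.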