MCCスイッチオーバー中にONTAP SAN LIFがファブリックログインに失敗しました
環境
- ONTAP 9.10
- NX-OS 8.4 2cを搭載したCisco MDS 9706
- NX-OS 9.3搭載Cisco MDS 9396T(2)
問題
- 更新中にONTAP MCCスイッチオーバーを実行したあと、一部のSAN FC LIFがファブリックにログインしない(FLOGI)
- ベストプラクティスに従って、 スイッチオーバーとスイッチバックの間でポートの重複を回避しています。
- ホストイニシエータがログインしていません
igroup show -v
- FLOGIスケール最適化の調整 が試行されました
- LIFがCiscoファブリックにログインしていません。つまり、 NS登録がありません。
::> network interface show -data-protocol fcp -fields status-admin,status-oper,status-extended
- ONTAPの例:
scsitarget.fct.loginfailed:error FC port NPIV port failed fabric login with status of 3 and an extended status of x1a
- 一定の間隔でリンクが切断される:
scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1c.
scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1a.
scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1b.
scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 1d.
scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 1c with event status 1 , topology type 1, status1 0x0, status2 0x0.
scsitarget.slifct.wqeError:debug]: FC port 1c has a WQE processing failure of type SLI_ELS_REQ, command FLOGI, subtype x0, with status of 3 and an extended status x1a for NPIV port 0.
scsitarget.fct.loginfailed:error]: FC port 1c, NPIV port 0 failed fabric login with status of 3 and an extended status of x1a.
scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 1a with event status 1 , topology type 1, status1 0x0, status2 0x0.
scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 1b with event status 1 , topology type 1, status1 0x0, status2 0x0.
- シスコの例:
showflogi internal event-history errors
showfcns internal errors
show flogi internal info
Fport server platform inf info:
max_count_of_flogi_being_processed 75
Flogi Timeout value 5200
flogi_mts_q_weight 3
flogi_fc2_q_weight 1
process_prio_switch_upper_limit 200
process_prio_switch_lower_limit 0
Is process high priority 0
Fport server global info.
FCID interop mode auto, Loop monitor mode FALSE
Stats : max_count_of_flogi_being_processed 75
count_of_flogi_being_processed 0
Stats: max_count_ever_reached 5
Stats: num_of_times_hit_max_count 0
Stats: num_of_config_timeouts 0
Stats: num_of_preconfig_timeouts 0
Stats: num_of_times_process_prio_changed 0
Stats: num_total_flogi_received 143
Stats: fs_flogi_pacer_enabled: 0
Stats: fs_flogi_pacer_timerval: 1000
Stats: fs_flogi_scale_enabled: 1
Stats: fs_flogi_quiesce_timerval: 0
Pending queue: len(0), max len(0) [Wed Oct 18 13:39:57 2023]
Timer queue: len(0), max len(1) [Wed Oct 18 13:39:57 2023]
Upgrade in progress FALSE 0; dpvm en[0] vmis en[0] npiv en[1] evfp en[0]
notifications: npiv FALSE, evfp FALSE
fc redirect confcheck 0
E_D_TOV 2000 R_A_TOV 10000 MaxRxBuf 2112 FCClasses 0xc
show logging log
%PORT-5-IF_DOWN_OFFLINE: %$VSAN 3%$ Interface fc1/30 is down (Offline)
%PORT-5-IF_DOWN_LINK_FAILURE: %$VSAN 3%$ Interface fc1/30 is down (Link failure loss of sync)
%PORT-5-IF_DOWN_OFFLINE: %$VSAN 3%$ Interface fc1/31 is down (Offline)
%PORT-5-IF_DOWN_LINK_FAILURE: %$VSAN 3%$ Interface fc1/31 is down (Link failure loss of sync)
%PORT-5-IF_DOWN_ADMIN_DOWN: %$VSAN 3%$ Interface fc1/32 is down (Administratively down)
%PORT-5-IF_UP: %$VSAN 3%$ Interface fc1/34 is up in mode F
%PORT-5-IF_DOWN_ADMIN_DOWN: %$VSAN 3%$ Interface fc1/30 is down (Administratively down)
%PORT-5-IF_UP: %$VSAN 3%$ Interface fc1/35 is up in mode F
show hardware internal fcmac port 34 port-event
-------------------- ----- -------------
- - IPS_LINK_UP <<< Link is up >>>
E_IPS_LINK_INIT_SUC IPS_LINK_UP (0)
E_IPS_PM_DIS IPS_LINK_OFFLINE (0) <<< Link is hung in offline state issue 'shutdown / no shut to recover '>>>>