Brocade スイッチの C3 Tx 破棄とフレーム損失
環境
Brocadeスイッチ
問題
- errdump で C3 Tx 廃棄数が大量にあるスイッチポート。
frames enc crc crc too too bad enc disc link loss loss frjt fbsy c3timeout pcs uncor
tx rx in err g_eof shrt long eof out c3 fail sync sig tx rx err err
352: 40.0m 183.4m 0 0 0 0 0 0 0 639 0 0 0 0 0 605 0 0 0
portstatsshow
出力には、タイムアウトによって破棄された送信フレームが大量に表示されます。
portstatsshow 103
er_tx_c3_timeout 360 Class 3 transmit frames discarded due to timeout
-
上記のC3 Tx廃棄と併せて、errdumpで検出されたフレーム損失イベントは次のとおりです。
2022/02/08-11:30:43, [MAPS-1001], 2143, SLOT 1 | FID 128, CRITICAL, CDVL_X, slot11 port32, F-Port 11/32, Condition=ALL_OTHER_F_PORTS(C3TXTO/min>3), Current Value:[C3TXTO, 20 Timeouts], RuleName=defALL_OTHER_F_PORTSC3TXTO_3, Dashboard Category=Port Health.
2022/02/08-11:30:43, [MAPS-1001], 2144, SLOT 1 | FID 128, CRITICAL, CDVL_X, slot11 port32, F-Port 11/32, Condition=ALL_PORTS(DEV_LATENCY_IMPACT==IO_FRAME_LOSS), Current Value:[DEV_LATENCY_IMPACT, IO_FRAME_LOSS, (20 C3TX Timeouts) ], RuleName=defALL_PORTS_IO_FRAME_LOSS_UNQUAR, Dashboard Category=Fabric Performance Impact.
2022/02/08-11:31:43, [MAPS-1001], 2145, SLOT 1 | FID 128, CRITICAL, CDVL_X, slot11 port32, F-Port 11/32, Condition=ALL_OTHER_F_PORTS(C3TXTO/min>3), Current Value:[C3TXTO, 585 Timeouts], RuleName=defALL_OTHER_F_PORTSC3TXTO_3, Dashboard Category=Port Health.
2022/02/08-11:32:43, [MAPS-1004], 2146, SLOT 1 | FID 128, INFO, CDVL_X, slot11 port32, F-Port 11/32, Condition=ALL_PORTS(DEV_LATENCY_IMPACT==IO_LATENCY_CLEAR), Current Value:[DEV_LATENCY_IMPACT, IO_LATENCY_CLEAR], RuleName=defALL_PORTS_IO_LATENCY_CLEAR, Dashboard Category=Fabric Performance Impact
- フレームタイムアウトイベント。errdumpでSlow Drainとマークされたデバイスとともに発生します。
2022/05/27-08:42:32, [AN-1014], 22154, SLOT 2 | FID 128, INFO, FAB1, Frame timeout detected, tx port 4/7 rx port 4/27, sid 678801, did 676700, timestamp 2022-05-27 08:42:32 .
2022/05/27-08:42:32, [AN-1014], 22155, SLOT 2 | FID 128, INFO, FAB1, Frame timeout detected, tx port 4/7 rx port 4/45, sid 678801, did 676700, timestamp 2022-05-27 08:42:32 .
2022/05/27-08:42:32, [AN-1014], 22156, SLOT 2 | FID 128, INFO, FAB1, Frame timeout detected, tx port 4/7 rx port 4/20, sid 678801, did 676700, timestamp 2022-05-27 08:42:32 .
2022/05/27-08:42:35, [MAPS-2036], 22190, SLOT 2 | FID 128, CRITICAL, FAB1, slot4 port7, F-Port 4/7, Condition=ALL_OTHER_F_PORTS(C3TXTO/min>3), Current Value:[C3TXTO, 360 Timeouts], RuleName=defALL_OTHER_F_PORTSC3TXTO_3, Dashboard Category=Port Health.
2022/05/27-08:42:59, [MAPS-2068], 22191, SLOT 2 | FID 128, CRITICAL, FAB1, slot4 port7, F-Port 4/7, Condition=ALL_PORTS(DEV_LATENCY_IMPACT==IO_FRAME_LOSS), Current Value:[DEV_LATENCY_IMPACT, IO_FRAME_LOSS, (360 C3TX Timeouts) ], RuleName=defALL_PORTS_IO_FRAME_LOSS_UNQUAR, Dashboard Category=Fabric Performance Impact.
2022/05/27-08:42:59, [NS-1026], 22192, SLOT 2 | FID 128, INFO, FAB1, Local domain 103, port index 103: A zoned device 0x676700 has been quarantined.
2022/05/27-08:42:59, [MAPS-1022], 22193, SLOT 2 | FID 128, WARNING, FAB1, Port 4/7 (Port index 103) has been marked as Slow Drain Device.