
System panics with PCI Error NMI from device(s): Link down

Category:
fas-systems, 2008755950
Specialty:
hw

Applies to

  • FAS80X0
  • AFF A80X0
  • AFF A300
  • AFF A700

Issue

  • The storage controller triggers a PCI NMI error and the system reboots.
  • The SP system log records an ERROR entry with a panic string such as one of the following:
    • PANIC: PCI Error NMI from device(s):ErrSrcID(CorrSrc(0x5f20),UCorrSrc(0)), RPT(0,3,0):PLX PCIE 8764 switch in slot 6 on Controller, Br[8764](95,4,0) in slot 6: Link down. in process idle: cpu25

    • PANIC: PCI Error NMI from device(s):ErrSrcID(CorrSrc(0),UCorrSrc(0x680)), PLX PCIE 8748 switch on Controller, Br[8748](6,16,0): Link down. in process idle on release 8.2.4
      • The sysconfig -PCI command output shows that the link between the motherboard and the card in slot 2 was down:
        • Br[8748](6,16,0): PLX PCIE 8748 switch on Controller
            LinkCap(MaxLkSp(3),MaxLkWd(8),ASPM(3),L0(5),L1(0),Port(0))
            LinkStatus(LkSp(2),LkWd(4),DLAct),  
                  Dv[150e](9,0,0) in slot 2: Intel 1G NIC in slot 2 on Controller
                    LinkCap(MaxLkSp(2),MaxLkWd(4),ASPM(3),L0(6),L1(1),Port(0))
                    LinkStatus(LkSp(2),LkWd(4),SClk),  
                  Dv[150e](9,0,1) in slot 2: Intel 1G NIC in slot 2 on Controller
                    LinkCap(MaxLkSp(2),MaxLkWd(4),ASPM(3),L0(6),L1(1),Port(0))
                    LinkStatus(LkSp(2),LkWd(4),SClk),  
                  Dv[150e](9,0,2) in slot 2: Intel 1G NIC in slot 2 on Controller
                    LinkCap(MaxLkSp(2),MaxLkWd(4),ASPM(3),L0(6),L1(1),Port(0))
                    LinkStatus(LkSp(2),LkWd(4),SClk),  
                  Dv[150e](9,0,3) in slot 2: Intel 1G NIC in slot 2 on Controller
                    LinkCap(MaxLkSp(2),MaxLkWd(4),ASPM(3),L0(6),L1(1),Port(0))
                    LinkStatus(LkSp(2),LkWd(4),SClk),
      • sysconfig -ac shows the details of the card in slot 2:
        • sysconfig: slot 2 OK: X1049C: PCI-E Quad 10/100/1000 Ethernet 82580 (v3.29 or later)
    • PANIC: PCI Error NMI from device(s):DMI(0,0,0),Br[8c10](0,28,0): Link down.  in process idle on release 9.1P11
        • The PCI-HIERARCHY.XML output shows that the link was down on the motherboard NIC ports:
          • Br[8c10](0,28,0): PCI Device 8086:8c10 on Controller
              LinkCap(MaxLkSp(2),MaxLkWd(4),ASPM(3),L0(3),L1(2),DLAct,Port(1))
              LinkStatus(LkSp(2),LkWd(4),SClk,DLAct),
                Dv[1563](16,0,0): Intel Dual 10G NIC on Controller
                  LinkCap(MaxLkSp(2),MaxLkWd(8),ASPM(2),L0(5),L1(4),Port(0))
                  LinkStatus(LkSp(2),LkWd(4),SClk),
                Dv[1563](16,0,1): Intel Dual 10G NIC on Controller
                  LinkCap(MaxLkSp(2),MaxLkWd(8),ASPM(2),L0(5),L1(4),Port(0))
                  LinkStatus(LkSp(2),LkWd(4),SClk),
    • PANIC: PCI Error NMI from device(s):ErrSrcID(CorrSrc(0),UCorrSrc(0x8010)), RPT(128,2,0):Br[3c04](128,2,0): Link down, ErrSrcID(CorrSrc(0),UCorrSrc(0x8018)), RPT(128,3,0):Br[3c08](128,3,0): Link down. in process idle on release 8.2.5P5 on Wed Jun  1 18:40:17 KST 2022
        • The PCI-HIERARCHY.XML output shows that the link between the motherboard and the IOXM was down:
          •     Br[3c08](128,3,0): PCI Device 8086:3c08 on Controller
                  LinkCap(MaxLkSp(3),MaxLkWd(16),ASPM(0),L0(7),L1(0),Port(0))
                  LinkStatus(LkSp(3),LkWd(16),SClk,DLAct),
                    Br[8732](145,0,0): PLX PCIE 8732 switch on IO Expansion
                     LinkCap(MaxLkSp(3),MaxLkWd(16),ASPM(2),L0(6),L1(0),Port(0))
                     LinkStatus(LkSp(3),LkWd(16)),
                        Br[8732](146,8,0): PLX PCIE 8732 switch on IO Expansion
                         LinkCap(MaxLkSp(3),MaxLkWd(16),ASPM(3),L0(6),L1(0),Port(0))
                         LinkStatus(LkSp(3),LkWd(16),DLAct),
                            Br[8748](147,0,0): PLX PCIE 8748 switch on IO Expansion
                             LinkCap(MaxLkSp(3),MaxLkWd(16),ASPM(2),L0(6),L1(0),Port(0))
                             LinkStatus(LkSp(3),LkWd(16)),
                                Br[8748](148,8,0): PLX PCIE 8748 switch on IO Expansion
                                 LinkCap(MaxLkSp(3),MaxLkWd(8),ASPM(3),L0(5),L1(0),Port(0))
                                 LinkStatus(LkSp(2),LkWd(8),DLAct),
                                    Dv[10fb](149,0,0) in slot 9: Intel 10G NIC in slot 9 on IO Expansion
                                     LinkCap(MaxLkSp(2),MaxLkWd(8),ASPM(1),L0(4),L1(1),Port(0))
                                     LinkStatus(LkSp(2),LkWd(8),SClk),
                                    Dv[10fb](149,0,1) in slot 9: Intel 10G NIC in slot 9 on IO Expansion
                                     LinkCap(MaxLkSp(2),MaxLkWd(8),ASPM(1),L0(4),L1(1),Port(0))
                                     LinkStatus(LkSp(2),LkWd(8),SClk),
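
When triaging several of these panic strings, it can help to map each non-zero ErrSrcID value to the bridge named later in the same message. The Python sketch below is illustrative only and is not a NetApp tool. It assumes that the CorrSrc/UCorrSrc values use the standard PCIe requester ID layout (8-bit bus, 5-bit device, 3-bit function); that assumption is consistent with the samples above (for example, 0x5f20 decodes to 95,4,0, matching Br[8764](95,4,0)) but is not confirmed by this article. The function names decode_source_id, error_sources, and named_devices are hypothetical.

```python
import re

def decode_source_id(source_id):
    """Split a 16-bit ID into (bus, device, function).

    Assumption: the value follows the standard PCIe requester ID layout.
    """
    bus = (source_id >> 8) & 0xFF
    device = (source_id >> 3) & 0x1F
    function = source_id & 0x07
    return bus, device, function

# Matches fields such as "CorrSrc(0x5f20)" or "UCorrSrc(0)".
_SRC_RE = re.compile(r"\b(U?CorrSrc)\((0x[0-9a-fA-F]+|\d+)\)")
# Matches tokens such as "RPT(0,3,0)", "DMI(0,0,0)", or "Br[8764](95,4,0)".
_BDF_RE = re.compile(r"\b(RPT|DMI|Br\[\w+\]|Dv\[\w+\])\((\d+),(\d+),(\d+)\)")

def error_sources(panic):
    """Return (field, (bus, dev, fn)) for every non-zero ErrSrcID field."""
    found = []
    for field, raw in _SRC_RE.findall(panic):
        value = int(raw, 16) if raw.lower().startswith("0x") else int(raw)
        if value:  # a value of 0 means no source was recorded in that field
            found.append((field, decode_source_id(value)))
    return found

def named_devices(panic):
    """Return (name, (bus, dev, fn)) for each device token in the string."""
    return [(name, (int(b), int(d), int(f)))
            for name, b, d, f in _BDF_RE.findall(panic)]

if __name__ == "__main__":
    sample = ("PANIC: PCI Error NMI from device(s):"
              "ErrSrcID(CorrSrc(0x5f20),UCorrSrc(0)), RPT(0,3,0):"
              "PLX PCIE 8764 switch in slot 6 on Controller, "
              "Br[8764](95,4,0) in slot 6: Link down. in process idle: cpu25")
    print(error_sources(sample))
    print(named_devices(sample))
```

Comparing the decoded tuple with the (bus,device,function) triples returned by named_devices shows which bridge reported the error. The slot number and card type still have to be read from the sysconfig or PCI-HIERARCHY.XML output as described above.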

 


NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.