Fedora 40: PCI bridge from AMD errors : PCIe Bus Error: severity=Correctable, type=Data Link Layer, (Receiver ID)

I found I have non stop warnings from kernel

sudo journalctl -k | grep pcieport

un 23 17:32:27 fedora kernel: pcieport 0000:00:02.1: AER: Correctable error message received from 0000:00:02.1
Jun 23 17:32:27 fedora kernel: pcieport 0000:00:02.1: PCIe Bus Error: severity=Correctable, type=Data Link Layer, (Receiver ID)
Jun 23 17:32:27 fedora kernel: pcieport 0000:00:02.1:   device [1022:14ee] error status/mask=00000040/00006000
Jun 23 17:32:27 fedora kernel: pcieport 0000:00:02.1:    [ 6] BadTLP 

My hardware
minisforum-um780-xtx

udo lspci -vv -s 0000:00:02.1
[sudo] password for michal: 
00:02.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 14ee (prog-if 00 [Normal decode])
        Subsystem: Device 1f4c:b016
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0, Cache Line Size: 64 bytes
        Interrupt: pin ? routed to IRQ 30
        IOMMU group: 3
        Bus: primary=00, secondary=02, subordinate=02, sec-latency=0
        I/O behind bridge: f000-ffff [size=4K] [16-bit]
        Memory behind bridge: dcc00000-dccfffff [size=1M] [32-bit]
        Prefetchable memory behind bridge: [disabled] [64-bit]
        Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR-
        BridgeCtl: Parity- SERR+ NoISA- VGA- VGA16+ MAbort- >Reset- FastB2B-
                PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
        Capabilities: [50] Power Management version 3
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [58] Express (v2) Root Port (Slot+), IntMsgNum 0
                DevCap: MaxPayload 256 bytes, PhantFunc 0
                        ExtTag+ RBE+ TEE-IO-
                DevCtl: CorrErr+ NonFatalErr+ FatalErr+ UnsupReq+
                        RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop+
                        MaxPayload 256 bytes, MaxReadReq 512 bytes
                DevSta: CorrErr- NonFatalErr- FatalErr- UnsupReq- AuxPwr- TransPend-
                LnkCap: Port #2, Speed 16GT/s, Width x1, ASPM not supported
                        ClockPM- Surprise- LLActRep+ BwNot+ ASPMOptComp+
                LnkCtl: ASPM Disabled; RCB 64 bytes, LnkDisable- CommClk+
                        ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                LnkSta: Speed 5GT/s, Width x1
                        TrErr- Train- SlotClk+ DLActive+ BWMgmt- ABWMgmt+
                SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surprise-
                        Slot #0, PowerLimit 75W; Interlock- NoCompl+
                SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg-
                        Control: AttnInd Unknown, PwrInd Unknown, Power- Interlock-
                SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet+ Interlock-
                        Changed: MRL- PresDet- LinkState-
                RootCap: CRSVisible+
                RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna+ CRSVisible+
                RootSta: PME ReqID 0000, PMEStatus- PMEPending-
                DevCap2: Completion Timeout: Range ABCD, TimeoutDis+ NROPrPrP- LTR+
                         10BitTagComp+ 10BitTagReq+ OBFF Not Supported, ExtFmt+ EETLPPrefix+, MaxEETLPPrefixes 1
                         EmergencyPowerReduction Not Supported, EmergencyPowerReductionInit-
                         FRS- LN System CLS Not Supported, TPHComp+ ExtTPHComp- ARIFwd+
                         AtomicOpsCap: Routing+ 32bit+ 64bit+ 128bitCAS-
                DevCtl2: Completion Timeout: 65ms to 210ms, TimeoutDis- ARIFwd-
                         AtomicOpsCtl: ReqEn- EgressBlck-
                         IDOReq- IDOCompl- LTR+ EmergencyPowerReductionReq-
                         10BitTagReq- OBFF Disabled, EETLPPrefixBlk-
                LnkCap2: Supported Link Speeds: 2.5-16GT/s, Crosslink- Retimer+ 2Retimers+ DRS-
                LnkCtl2: Target Link Speed: 5GT/s, EnterCompliance- SpeedDis-
                         Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
                         Compliance Preset/De-emphasis: -6dB de-emphasis, 0dB preshoot
                LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete- EqualizationPhase1-
                         EqualizationPhase2- EqualizationPhase3- LinkEqualizationRequest-
                         Retimer- 2Retimers- CrosslinkRes: unsupported
        Capabilities: [a0] MSI: Enable+ Count=1/1 Maskable- 64bit+
                Address: 00000000fee00000  Data: 0000
        Capabilities: [c0] Subsystem: Device 1f4c:b016
        Capabilities: [c8] HyperTransport: MSI Mapping Enable+ Fixed+
        Capabilities: [100 v1] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
        Capabilities: [150 v2] Advanced Error Reporting
                UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
                UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
                UESvrt: DLP- SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP- ECRC- UnsupReq- ACSViol-
                CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr-
                CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+
                AERCap: First Error Pointer: 00, ECRCGenCap+ ECRCGenEn- ECRCChkCap+ ECRCChkEn-
                        MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
                HeaderLog: 00000000 00000000 00000000 00000000
                RootCmd: CERptEn+ NFERptEn+ FERptEn+
                RootSta: CERcvd- MultCERcvd- UERcvd- MultUERcvd-
                         FirstFatal- NonFatalMsg- FatalMsg- IntMsgNum 0
                ErrorSrc: ERR_COR: 0011 ERR_FATAL/NONFATAL: 0000
        Capabilities: [270 v1] Secondary PCI Express
                LnkCtl3: LnkEquIntrruptEn- PerformEqu-
                LaneErrStat: LaneErr at lane: 0
        Capabilities: [2a0 v1] Access Control Services
                ACSCap: SrcValid+ TransBlk+ ReqRedir+ CmpltRedir+ UpstreamFwd+ EgressCtrl- DirectTrans+
                ACSCtl: SrcValid+ TransBlk- ReqRedir+ CmpltRedir+ UpstreamFwd+ EgressCtrl- DirectTrans-
        Capabilities: [370 v1] L1 PM Substates
                L1SubCap: PCI-PM_L1.2- PCI-PM_L1.1+ ASPM_L1.2- ASPM_L1.1- L1_PM_Substates+
                L1SubCtl1: PCI-PM_L1.2- PCI-PM_L1.1- ASPM_L1.2- ASPM_L1.1-
                L1SubCtl2:
        Capabilities: [400 v1] Data Link Feature <?>
        Capabilities: [410 v1] Physical Layer 16.0 GT/s <?>
        Capabilities: [440 v1] Lane Margining at the Receiver
                PortCap: Uses Driver-
                PortSta: MargReady- MargSoftReady-
        Kernel driver in use: pcieport

❯ inxi --basic
System:
  Host: fedora Kernel: 6.9.5-200.fc40.x86_64 arch: x86_64 bits: 64
  Desktop: KDE Plasma v: 6.1.0 Distro: Fedora Linux 40 (KDE Plasma)
Machine:
  Type: Mini-pc System: Micro (HK) Tech product: Venus series v: 1.0
    serial: <superuser required>
  Mobo: Shenzhen Meigao Equipment model: F7BSD v: 1.0
    serial: <superuser required> UEFI: American Megatrends LLC. v: 1.04
    date: 11/15/2023
Battery:
  ID-1: hidpp_battery_0 charge: 89% condition: N/A
CPU:
  Info: 8-core AMD Ryzen 7 7840HS w/ Radeon 780M Graphics [MT MCP]
    speed (MHz): avg: 1073 min/max: 400/5137
Graphics:
  Device-1: AMD Phoenix1 driver: amdgpu v: kernel
  Display: wayland server: Xwayland v: 24.1.0 compositor: kwin_wayland
    driver: N/A resolution: 3413x1440
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.1.2 renderer: AMD
    Radeon 780M (radeonsi gfx1103_r1 LLVM 18.1.6 DRM 3.57
    6.9.5-200.fc40.x86_64)
Network:
  Device-1: Realtek RTL8125 2.5GbE driver: r8169
  Device-2: Realtek RTL8125 2.5GbE driver: r8169
  Device-3: Intel Wi-Fi 6E AX210/AX1675 2x2 [Typhoon Peak] driver: iwlwifi
  Device-4: Realtek RTL8153 Gigabit Ethernet Adapter driver: r8152 type: USB
Drives:
  Local Storage: total: 2.27 TiB used: 299.35 GiB (12.9%)
Info:
  Memory: total: 60 GiB note: est. available: 60.54 GiB used: 8.83 GiB (14.6%)
  Processes: 483 Uptime: 5h 39m Shell: Bash inxi: 3.3.34

Is this something I should worry about?
But I see it not stop putting these errors …I assume it can consume a lot of disk space.
It can be that some pcie device is causing this error?
If so how to find this device ?

There appear to be some answers about that error code here: hardware - What causes this? pcieport 0000:00:03.0: PCIe Bus Error: AER / Bad TLP - Unix & Linux Stack Exchange

It appears the software is detecting occasional errors at the hardware (“data link”) layer/level. It is hard to say what might be causing the error. For example, it could be caused by something like a power line running too close to one of your motherboard’s chips in your PC case, or the interference could even be coming from something external like a microwave oven or that giant unshielded nuclear reactor floating over our heads. :sun_with_face:

Probably not since it reports “severity=Correctable”. However, it might be degrading your system’s performance.

There are limits set for how much disk space the logs can consume. When the limit is exceeded, the oldest logs are deleted. It would contribute to the “wear” on your disk, however, so I’d probably try to silence the messages with the pci=noaer setting that was mentioned in one of the comments in that StackExchange link.

1 Like

Yes setting this option removed warnings :sunny:
Thank you very much