07-21-2019 09:21 PM
"I inserted a new A9K-24X10GE-TR line card today, which prompted the following errors:
RP/0/RSP1/CPU0:Jul 21 22:26:39.997 : pfm_node_rp[357]: %PLATFORM-CROSSBAR-1-SERDES_ERROR_LNK1 : Set|fab_xbar[213105]|0x1017007|XBAR_0_Slot_3 RP/0/RSP1/CPU0:Jul 21 22:26:50.015 : pfm_node_rp[357]: %PLATFORM-CROSSBAR-1-SERDES_ERROR_LNK1 : Clear|fab_xbar[213105]|0x1017007|XBAR_0_Slot_3
This error is repeated several times per hour, each time with ~10 seconds between the "Set" and "Clear" messages.
Immediately after "Set" message appears:
PORT Remote Slot Remote Inst Logical ID Status ====================================================== 00 0/3/CPU0 02 1 Up 01 0/3/CPU0 01 1 Up 02 0/3/CPU0 01 0 Up 03 0/3/CPU0 00 0 Up 04 0/3/CPU0 00 1 Up 05 0/3/CPU0 03 1 Up 07 0/RSP1/CPU0 00 1 Up 08 0/3/CPU0 03 0 Up 09 0/RSP0/CPU0 00 1 Down 11 0/RSP1/CPU0 00 0 Up 12 0/RSP0/CPU0 00 0 Up 14 0/RSP0/CPU0 01 1 Up 15 0/RSP1/CPU0 01 1 Up 16 0/RSP0/CPU0 01 0 Up 17 0/RSP1/CPU0 01 0 Up 24 0/3/CPU0 02 0 Up
Immediately after the message clears:
PORT Remote Slot Remote Inst Logical ID Status ====================================================== 00 0/3/CPU0 02 1 Up 01 0/3/CPU0 01 1 Up 02 0/3/CPU0 01 0 Up 03 0/3/CPU0 00 0 Up 04 0/3/CPU0 00 1 Up 05 0/3/CPU0 03 1 Up 07 0/RSP1/CPU0 00 1 Up 08 0/3/CPU0 03 0 Up 09 0/RSP0/CPU0 00 1 Up 11 0/RSP1/CPU0 00 0 Up 12 0/RSP0/CPU0 00 0 Up 14 0/RSP0/CPU0 01 1 Up 15 0/RSP1/CPU0 01 1 Up 16 0/RSP0/CPU0 01 0 Up 17 0/RSP1/CPU0 01 0 Up 24 0/3/CPU0 02 0 Up
The line card also appears to be dropping traffic (noticeable drop for users). "show drops" reveals that "Egress Uc dq pkt-len-crc/RO-seq/len error drp" is increasing rapidly.
The line card was reseated, rebooted, as well as tested in another slot slot 0 and slot 3). Same issues happen with both slots immediately after the line card boots.
I'd assume that this is a bad line card that needs to be RMA'd, but would just like to confirm as that is a last-resort option. Is there any possibility that this error stemmed from an issue with the software, chassis or some underlying fault in one or more RSP's? I've had several (different) other issues with the fabric crossbar / FIA's on trident LC's before, so I'm unsure whether it's just bad luck or if another component in my system is damaged.
IOS XR v5.3.4 base, ASR9010 with 2x RSP440-SE. FPD on LC is latest (for this release of IOS XR).
07-22-2019 04:40 AM
07-22-2019 04:59 AM
Hi,
I currently am only running the base version of XR 5.3.4 with no SMU's or SP's installed. I don't want to install any SMU's or SP's unless there is even a remote possibility that it will help.
Is there any way to verify (ie. looking for a specific keyword in a SMU or SP) that a SMU/SP has an "update" specifically for link-training with my applicable line cards? Doing most SMU/SP upgrades requires a router reboot with additional downtime, which I'd like to keep to a minimum at this time.
Thanks!
07-22-2019 06:04 AM
07-22-2019 01:05 PM - edited 07-22-2019 01:06 PM
Hi,
For debugging purposes, IOS XR was upgraded to 6.4.2 (latest supported on RSP440) with SP3. However, this error is still present. I assume this means bad hardware.
Is there any way to verify if this error is 100% due to a bad linecard only? What are the possibilities (or debugging options) I can check to ensure that my RSP's or chassis is not bad as well?
Thanks!
07-24-2019 04:06 AM
The usual HW troubleshooting should help. Visual inspection of connectors on the LC back-end and inside the slot should be carried out to see whether there's any observable physical damage. If none is observed, insert another LC into the slot and see whether the problem persist. Based on what you wrote so far, I expect only the LC to be faulty.
12-18-2020 01:56 AM
HI Lyphiard,
Good day, Have you resloved this issue? I am facing the same issue now. Could you tell me how to fixed it if you have fixed this issue.
Thx
12-18-2020 08:21 PM
Ended up being a hardware issue. We had to RMA the line card.
12-18-2020 11:46 PM
Thanks for your reply. So,it looks like a HW issue.Maybe i also need to relplace the bad part LC.
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide