12-29-2021 10:52 AM
We have upgraded the A9K-RSP880-LT-SE processors with the IOS XR 7.3.2
They seem to boot fine in our ASR-9006-AC-V2 chassis, and we do not see any errors.
Also, all of the FPDs are upgraded to the required current IOS version.
However, the red FAIL light is ON on the front bezel.
Is there anything that we need to check or upgrade, so that this FAIL light is not on?
Solved! Go to Solution.
01-19-2022 04:22 AM
I also had two RSPs showing the same System Scoped Alarm error.
"Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error"
I note the numbers are the exact same and match yours as well.
Reseating the affected RSPs did not resolve the issues.
Replacing one with a spare on hand did clear it.
Through a case with lab confirmation, this has been determined to be an FPD bug, cosmetic and non-service affecting. I do not have the bug ID yet.
12-29-2021 11:27 AM
Can we check power modules?
"show environment power" it may not have enough power for N+1 power redundancy
thanks
12-29-2021 11:43 AM
We are testing this router without any line cards and it has 4 x PWR-3KW-AC-V2 units installed, so there is plenty of power.
While booting, we do not see any significant error messages except for redundancy until both RSPs get synced.
Here is the output showing power capacity
===================================================
sysadmin-vm:0_RSP0# sh environment power
Wed Dec 29 19:31:32.256 UTC+00:00
================================================================================
CHASSIS LEVEL POWER INFO: 0
================================================================================
Total output power capacity (N + 1) : 9000W + 3000W
Total output power required : 1180W
Total power input : 948W
Total power output : 478W
Power Shelf 0:
================================================================================
Power Supply ------Input---- ------Output--- Status
Module Type Volts Amps Volts Amps
================================================================================
0/PT0-PM0 3kW-AC 237.8 1.2 55.7 3.1 OK
0/PT0-PM1 3kW-AC 237.2 0.8 55.6 1.6 OK
0/PT0-PM2 3kW-AC 237.0 1.2 55.5 2.7 OK
0/PT0-PM3 3kW-AC 236.1 0.8 55.5 1.2 OK
Total of Power Shelf 0: 948W/ 4.0A 478W/ 8.6A
================================================================================
Location Card Type Power Power Status
Allocated Used
Watts Watts
================================================================================
0/0 - 10 - RESERVED
0/1 - 10 - RESERVED
0/2 - 10 - RESERVED
0/3 - 10 - RESERVED
0/RSP0 A9K-RSP880-LT-SE 295 168 ON
0/RSP1 A9K-RSP880-LT-SE 295 174 ON
0/FT0 ASR-9006-FAN-V2 275 - ON
0/FT1 ASR-9006-FAN-V2 275 - ON
sysadmin-vm:0_RSP0#
12-29-2021 12:16 PM
great,
what about "show alarms brief" anything alarming there?
thanks
12-29-2021 12:30 PM
Looks like we may need to do a "forced" FPD upgrade. Here is the output showing all alarms.
============================================================================
RP/0/RSP0/CPU0:ios#sh alarms brief
Wed Dec 29 20:20:12.209 UTC
--------------------------------------------------------------------------------
Active Alarms for 0/RSP0
--------------------------------------------------------------------------------
No entries.
--------------------------------------------------------------------------------
History Alarms for 0/RSP0
--------------------------------------------------------------------------------
Location Severity Group Set Time Description
Clear Time
--------------------------------------------------------------------------------
0/RSP0/CPU0 Major Software 12/29/2021 18:22:31 UTC Node Not Ready
12/29/2021 18:25:30 UTC
--------------------------------------------------------------------------------
Suppressed Alarms for 0/RSP0
--------------------------------------------------------------------------------
No entries.
--------------------------------------------------------------------------------
Conditions for 0/RSP0
--------------------------------------------------------------------------------
No entries.
--------------------------------------------------------------------------------
System Scoped Active Alarms
--------------------------------------------------------------------------------
Location Severity Group Set Time Description
--------------------------------------------------------------------------------
0/RSP0 Major FPD_Infra 12/29/2021 18:20:25 UTC IPU-FPGA: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP0 Major FPD_Infra 12/29/2021 18:20:25 UTC IPU-FSBL: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP0 Major FPD_Infra 12/29/2021 18:20:25 UTC IPU-Linux: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP1 Major FPD_Infra 12/29/2021 18:20:27 UTC IPU-FPGA: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP1 Major FPD_Infra 12/29/2021 18:20:27 UTC IPU-FSBL: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP1 Major FPD_Infra 12/29/2021 18:20:27 UTC IPU-Linux: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP0 Major Cpu 12/29/2021 18:21:39 UTC Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error
--------------------------------------------------------------------------------
System Scoped History Alarms
--------------------------------------------------------------------------------
Location Severity Group Set Time Description
Clear Time
--------------------------------------------------------------------------------
0/RSP0/CPU0 Critical Software 12/28/2021 23:39:37 UTC Redundancy Partner Not Present
12/28/2021 23:46:04 UTC
0/RSP0/CPU0 Major Software 12/28/2021 23:46:04 UTC Node Not Ready
12/28/2021 23:47:53 UTC
0/RSP0/CPU0 Major Software 12/29/2021 18:22:31 UTC Node Not Ready
12/29/2021 18:25:30 UTC
0/PT0-PM2 Major Environ 12/29/2021 18:27:52 UTC Power Module Output Disabled (PM_OUTPUT_EN_PIN_HI).
12/29/2021 18:27:52 UTC
0/PT0-PM2 Major Environ 12/29/2021 18:28:53 UTC Power Module Output Disabled (PM_OUTPUT_EN_PIN_HI).
12/29/2021 18:28:53 UTC
0/PT0-PM3 Major Environ 12/29/2021 18:29:03 UTC Power Module Output Disabled (PM_OUTPUT_EN_PIN_HI).
12/29/2021 18:29:03 UTC
------------------------------------------------------------------------------------
Active Alarms (brief) for EDT
------------------------------------------------------------------------------------
Location Severity Group Set Time Description
------------------------------------------------------------------------------------
0/RSP0 Major FPD_Infra 12/29/2021 18:20:25 UTC IPU-FPGA: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP0 Major FPD_Infra 12/29/2021 18:20:25 UTC IPU-FSBL: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP0 Major FPD_Infra 12/29/2021 18:20:25 UTC IPU-Linux: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP1 Major FPD_Infra 12/29/2021 18:20:27 UTC IPU-FPGA: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP1 Major FPD_Infra 12/29/2021 18:20:27 UTC IPU-FSBL: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP1 Major FPD_Infra 12/29/2021 18:20:27 UTC IPU-Linux: Golden FPGA is below minimum version, Perform force fpd upgrade
0/RSP0 Major Cpu 12/29/2021 18:21:39 UTC Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error
RP/0/RSP0/CPU0:ios#
12-30-2021 03:13 PM
We have done the forced FPD upgrades for these
IPU-FPGA: Golden FPGA is below minimum version, Perform force fpd
IPU-FSBL: Golden FPGA is below minimum version, Perform force fpd
IPU-Linux: Golden FPGA is below minimum version, Perform force fpd
After doing the upgrade the front bezel lights went off, but reappeared again after we did a system reboot
The following alarms are still present:
====================================
Active Alarms (brief) for EDT
------------------------------------------------------------------------------------
Location Severity Group Set Time Description
------------------------------------------------------------------------------------
0/RSP1 Major Cpu 12/30/2021 05:17:59 UTC Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error
0/RSP0 Major Cpu 12/30/2021 05:18:02 UTC Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error
=======================================================
Do you have any recommendation about getting rid of these " Pcie uncorrectable surprise down error " ??
Thank you.
01-18-2022 04:12 AM
I've no answer but just chiming in that after upgrading two 9906s to 7.3.2, I have RSPs showing the same System Scoped Alarm error.
"Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error"
I note the numbers are the exact same on two different disparate routers and match yours as well.
Reseating the affected RSPs did not resolve the issues.
Replacing one with a spare on hand did clear it.
There appears no functional/traffic issue for us, just the flagged alarm.
I've opened a case and will see where it leads.
01-19-2022 04:22 AM
I also had two RSPs showing the same System Scoped Alarm error.
"Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error"
I note the numbers are the exact same and match yours as well.
Reseating the affected RSPs did not resolve the issues.
Replacing one with a spare on hand did clear it.
Through a case with lab confirmation, this has been determined to be an FPD bug, cosmetic and non-service affecting. I do not have the bug ID yet.
01-26-2022 10:18 AM
We have noticed that this alarm below is intermittent
"Error in device/link [10b5:8749 09:05.0]: Pcie uncorrectable surprise down error"
Sometimes the system boots just fine without this alarm and then after the system is powered down and re-booted, this alarm appears again.
This does not seem to affect the operation of the system.
Hopefully, this cosmetic bug will be resolved in some future release.
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide