cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
2631
Views
0
Helpful
5
Replies

Tomahawk (A9K-8X100GE & A9K-4X100GE) Must have SMU for 5.3.x

Eddie Chami
Cisco Employee
Cisco Employee

Hi,

We’ve seen instances where after router reload intermittently 100G LCs may stay down and not come up to IOS XR RUN state, the LC may be in BRINGDOWN state, or the following syslog maybe seen in the syslog after failed boot up.

envmon[209]: %PLATFORM-ENVMON-3-ALARM_RAISED : Raise alarm from CBC in slot 0/10/CPU0, alarm code CBC_ALRM_PWR_SEQ_FAILED Reason: OUTPUT VOLTAGE FAULT Bus:0x2 Addresss:0xbc Chan:0x4 STATUS_VOUT 0x10

This impacts 100GEx4, 100GEx8 LC’s. 

An OIR at times can fix the problem, though the more prudent method to avoid this, is install SMU: CSCuz52900. 

SMUs are available for the following releases:

5.3.1, 5.3.2 & 5.3.3.

It's mandatory that the SMU is installed if 100GE Linecards are present in the chassis. 

In the chance that the issue is experienced, the following command must be tried to recover the LC if an OIR is not successful:

“run pwrseq program 0/X/CPU0 0 0x2” 


Regards

Eddie.

5 Replies 5

smailmilak
Level 4
Level 4

Hi,

we have a very similar problem with a Trident LC A9K-8T-L.

We did an upgrade from 5.3.3 to 5.3.4 and this LC could not boot and it's now in BRINGDOWN state.

Reinsert did not help and we can't reload it with "admin reload location" and "hw-module power disable" etc.

I have tried your command: "run pwrseq program 0/0/CPU0 0 0x2" but it says 

Error: command is not available for this card!

It's probably for TH LC only.

I found this log:

envmon[209]: %PLATFORM-ENVMON-3-ALARM_INDICATION : Raise alarm from CBC in slot 0/0/CPU0, alarm code CBC_ALRM_PWR_SEQ_FAILED

RP/0/RSP0/CPU0:Nov 17 06:55:11.572 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-MIXED_MEMORY_POWERED : Node 0/0/CPU0 is powered, system running mixed line card memory mode
RP/0/RSP0/CPU0:Nov 17 06:55:11.572 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-MIXED_MEMORY_ALARM : System enters mixed line card memory mode
RP/0/RSP0/CPU0:Nov 17 06:56:28.037 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-SHUTDOWN_RESET : Node 0/0/CPU0 is reset due to admin shut/no shut request
RP/0/RSP0/CPU0:Nov 17 06:57:02.170 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-SHUTDOWN_RESET : Node 0/0/CPU0 is reset due to admin shut/no shut request
RP/0/RSP0/CPU0:Nov 17 06:57:10.198 : envmon[209]: %PLATFORM-ENVMON-3-ALARM_INDICATION : Raise alarm from CBC in slot 0/0/CPU0, alarm code CBC_ALRM_PWR_SEQ_FAILED
RP/0/RSP0/CPU0:Nov 17 07:02:15.832 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-SHUTDOWN_RESET : Node 0/0/CPU0 is reset due to admin shut/no shut request
RP/0/RSP0/CPU0:Nov 17 07:04:07.620 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-MIXED_MEMORY_POWERED : Node 0/0/CPU0 is powered, system running mixed line card memory mode
RP/0/RSP0/CPU0:Nov 17 07:04:07.620 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-MIXED_MEMORY_ALARM : System enters mixed line card memory mode
RP/0/RSP0/CPU0:Nov 17 07:04:45.623 : shelfmgr[409]: %PLATFORM-SHELFMGR-3-SHUTDOWN_RESET : Node 0/0/CPU0 is reset due to admin shut/no shut request
RP/0/RSP0/CPU0:Nov 17 07:05:56.743 : envmon[209]: %PLATFORM-ENVMON-3-ALARM_INDICATION : Raise alarm from CBC in slot 0/0/CPU0, alarm code CBC_ALRM_PWR_SEQ_FAILED

We have same bug with A9K-8T-L, do you have any solution?

On Trident this would be a HW fault. The board has to be replaced. CSCuz52900 is specific to Tomahawk line card generation.

It's very disapointing because that card worked only about 5 years...

Im fighting the smae issue  Can anyone tell me what the fix is?