ASR 9006 Standby RSP rebooting again and again

Abinash.k
Level 1

Hello All,

We have an ASR 9006 whose A9K-RSP-4G is not stable in standby mode: it reboots again and again. If I remove the active RSP, it comes up in the active state and remains stable. Logs from both RSPs are given below.

 

Thanks in advance.

 

RP/0/RSP1/CPU0:ios(admin)#show RP/0/RSP1/CPU0:Jan 5 22:55:51.047 : licmgr[315]: %LICENSE-LICMGR-4-PACKAGE_LICENSE_INVALID : Package requesting A9K-LI-LIC license is activated on Rack0 and node node0_RSP1_CPU0 without a valid license/ valid configuration


RP/0/RSP1/CPU0:ios(admin)#show RP/0/RSP1/CPU0:Jan 5 22:54:23.107 : wdsysmon[469]: %HA-HA_WD-4-DISK_ALARM : A monitored device alarm set by /disk0:

RP/0/RSP0/CPU0:Jan 5 22:54:23.108 : wdsysmon[469]: %HA-HA_WD-4-DISK_WARN : A monitored device /disk0: is above 80% utilization. Current utilization = 94. Please remove unwanted user files and configuration rollback points.

RP/0/RSP0/CPU0:Jan 5 22:54:23.108 : wdsysmon[469]: %HA-HA_WD-4-DISK_WARN : A monitored device /disk1: is above 80% utilization. Current utilization = 94. Please remove unwanted user files and configuration rollback points.

 

:Jan 5 22:58:11.106 : licmgr[315]: %LICENSE-LICMGR-4-PACKAGE_LICENSE_INVALID : Package requesting A9K-LI-LIC license is activated on Rack0 and node node0_RSP1_CPU0 without a valid license/ valid configuration

pm
:Jan 5 22:58:18.290 : FABMGR[220]Writing crashinfo
Active processes:
pkg/bin/redfs_svr Thread ID 1 on cpu 0

Active processes:
pkg/bin/pfm_node_rp Thread ID 0 on cpu 1

[0xfc069a720] Record Reboot History, reboot cause = 0x2c00001b, descr = Cause: pfm_dev_sm_perform_recovery_action, Card reset requested by: Process ID: 331880 (fabmgr), Fault Sev: 0, Target node: 0/RSP1/CPU0, CompId: 0x15, Device Handle: 0x1033000, CondI[0xfc2b895c2] Record crashinfo
[0xfc306930a] Record Syslog
2000-01-05 22:58:18.336
NOTE: This is NOT a Kernel Crash. This crash was triggered
by the process 'pfm_node_rp', by calling reboot API.

Crash Reason: Cause: pfm_dev_sm_perform_recovery_action, Card reset requested by: Process ID: 331880 (fabmgr), Fault Sev: 0, Target node: 0/RSP1/CPU0, CompId: 0x15, Device Handle: 0x1033000, CondID: 8705, Fault Reason: Fabmgr encountered fatal fault. Switchover. Proces (Cause Code: 0x2c00001b)

Exception at 0x4a29b3b4 signal 5 c=1 f=3

Active process(s):
pkg/bin/redfs_svr Thread ID 1 on cpu 0
pkg/bin/pfm_node_rp Thread ID 0 on cpu 1

REGISTER INFO
r0 r1 r2 r3
R0 4a29b3b0 e7ffdb90 50013e30 00000003
r4 r5 r6 r7
R4 2800001b e7ffe205 e7ffdb68 00000000
r8 r9 r10 r11
R8 af9e7800 00000000 13e1c08b e7ffdb90
r12 r13 r14 r15
R12 4a2d6878 50013e30 e7fffb10 00000001
r16 r17 r18 r19
R16 e7fffb24 e7ffe6b0 00000000 00000000
r20 r21 r22 r23
R20 00000000 ec3cb164 e7ffe205 e7ffdb98
r24 r25 r26 r27
R24 e7ffdef3 e7ffe205 e7ffdef6 2800001b
r28 r29 r30 r31
R28 44002082 00000000 ec01f9e0 e7ffdb90
cnt lr msr pc
R32 4a203254 4a29b3b0 0002d932 4a29b3b4
cnd xer
R36 44002084 20000000

SUPERVISOR REGISTERS

 

Memory Management Registers

Instruction BAT Registers
Index # Value
IBAT0U # 0x1ffe
IBAT0L # 0x12
IBAT1U # 0
IBAT1L # 0
IBAT2U # 0
IBAT2L # 0
IBAT3U # 0xfffc0003
IBAT3L # 0x60011
IBAT4U # 0x4a0003ff
IBAT4L # 0x70000011
IBAT5U # 0x4c0003ff
IBAT5L # 0x72000011
IBAT6U # 0x4e0003ff
IBAT6L # 0x74000011
IBAT7U # 0
IBAT7L # 0

Data BAT Registers
Index # Value
DBAT0U # 0x1ffe
DBAT0L # 0x12
DBAT1U # 0x34000002
DBAT1L # 0xdc00002a
DBAT2U # 0x3000001e
DBAT2L # 0xc800002a
DBAT3U # 0xfffc0003
DBAT3L # 0x60011
DBAT4U # 0x4a0003ff
DBAT4L # 0x70000011
DBAT5U # 0x4c0003ff
DBAT5L # 0x72000011
DBAT6U # 0x4e0003ff
DBAT6L # 0x74000011
DBAT7U # 0
DBAT7L # 0


Exception Handling Registers
Data Addr Reg # DSISR
0 # 0
SPRG0 # SPRG1 # SPRG2 # SPRG3
0xe7ffdb90 # 0xec01f9e0 # 0 # 0x1
SaveNRestore SRR0 # SaveNRestore SRR1
0x4a29b3b0 # 0x2d932


Miscellaneous Registers
Processor Id Reg # 0
HID0 # 0x8493c1bc
HID1 # 0x2cc80

MSSCR0 # 0
MSSSR0 # 0

STACK TRACE
#0 0x4a29b3b0
[0xfe589af7a] Initializing harddisk file system
[0xfe9ede87d] Record TSEC information
!!!
Writing TSEC done
!!
Writing crashinfo done!

Examine crashinfo file for reboot reason

Writing ppc kernel core file
kernel core device: /kernel_core.by.pfm_node_rp.Z
[0xfec054d30] Kernel core dump start...
fill phdr vaddr=0x8f94000, offset=0x191a3d4, size=0x706c000
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
Core dump success. Total_size 144204756
[0x10312ba137] Successfully dumped Kernel core
[0x103196e451] Record PCDS information

Writing PCDS done
Dump Directory
KD: RSP1.000105-225818.tsec, start = 1000, size = 7024, crc = 0
KD: RSP1.000105-225818.crashinfo.by.pfm_node_rp, start = 9000, size = ae7a, crc = 0
KD: RSP1.000105-225818.kernel_core.by.pfm_node_rp.Z, start = 14000, size = 2c21b08, crc = 57577412
KD: RSP1.000105-225818.pcds, start = 2c36000, size = ff000, crc = faca6283

Writing kernel core file done!
rebooting


Selecting ROMMON Image... B

DDR in Interleaved mode

POST 1 : PASSED : code 0 : DDR2 Memory Quick Test


CPU Reset Reason = 0x000d

POST 2 : PASSED : code 0 : FPGA Flash Image CRC Checks


Loading Field Programmable Devices:

FPGA 0-B PROGRAMMED : image: 0xff500028 - 0xff576cca, et: 117ms

FPGA 1-B PROGRAMMED : image: 0xff400028 - 0xff4d1034, et: 206ms

FPGA 2-B PROGRAMMED : image: 0xff100028 - 0xff276358, et: 369ms

FPGA 3-B PROGRAMMED : image: 0xff000028 - 0xff0454a8, et: 69ms

 

System Bootstrap, Version 1.06(20120210:003513) [ASR9K ROMMON],

Copyright (c) 1994-2012 by Cisco Systems, Inc.

Compiled Thu 09-Feb-12 16:35 by saurabja


CPUCtrl: 1.18 [00000001/00000012]

ClkCtrl: 1.23 [00000001/00000017]

IntCtrl: 1.15 [00000001/0000000f]

Punt: 1.5 [00000001/00000005]

CBC: 1.3

BID: 0x0006

 

PPC 8641D (partnum 0x8004), Revision 03.00, (Core Version 02.02)

M8641 CLKIN: 66 Mhz

Core Clock: 1333 Mhz

MPX Clock: 533 Mhz

LBC Clock: 33 Mhz


POST 3 : PASSED : code 0 : Slot ID/Board Type Validity

PCI-E1: Ready as Root Complex

PCI-E2: Ready as Root Complex

 

set_chassis_type: chassis_type=0xef02fb found=TRUE

ASR9K (8641D PPC) platform with 4096 Mb of main memory


program load complete, entry point: 0x100000, size: 0x2ac20

program load complete, entry point: 0x100000, size: 0x2ac20

MBI Candidate = disk0:asr9k-os-mbi-5.2.2/0x100000/mbiasr9k-rp.vm


CARD_SLOT_NUMBER: 1

CPU_INSTANCE: 1

MBI Validation starts ...


Mgt LAN 0 interface is selected

tsec_init_hw: configuring FE (port 2) for: Auto Speed, Auto Duplex


tsec_init_interface: hardware initialization completed

Interface link changed state to UP.

Interface link state up.


MBI validation sending request.

HIT CTRL-C to abort

..........

No MBI confirmation received from dSC


AUTOBOOT: Boot string = disk0:asr9k-os-mbi-5.2.2/0x100000/mbiasr9k-rp.vm,1;

AUTOBOOT: autobootstate=0, autobootcount=0, cmd=boot disk0:asr9k-os-mbi-5.2.2/0x100000/mbiasr9k-rp.vm

program load complete, entry point: 0x100000, size: 0x2ac20


MBI size from header = 20163132,Bootflash resident MBI filesize = 20163132

.............................................................................

program load complete, entry point: 0x203d78, size: 0x1339b3c

Attempting to start second CPU
Config = SMP, Running = SMP
Board type: 0x00100302
Card Capability = 0xffffffff
########################################################################################################
BSP: Board type : RO-RSP2
tracelogger: starting tracing in background ring mode
tracelogger running with args: -startring -F 1 -F 2
Restricted Rights Legend

Use, duplication, or disclosure by the Government is
subject to restrictions as set forth in subparagraph
(c) of the Commercial Computer Software - Restricted
Rights clause at FAR sec. 52.227-19 and subparagraph
(c) (1) (ii) of the Rights in Technical Data and Computer
Software clause at DFARS sec. 252.227-7013.

cisco Systems, Inc.
170 West Tasman Drive
San Jose, California 95134-1706

 

Cisco IOS XR Software for the Cisco XR ASR9K, Version 5.2.2
Copyright (c) 2014 by Cisco Systems, Inc.
File RSP1.000105-225818.pcds content has been changed during previous dump process. There could be some residual HW activities. It can be ignored for now.
Jan 05 23:00:06.266: Install Setup: Booting with committed software
Failed to rename debug file, 18, src: /nvram:/sysmgr.log.timeout.Z, target: /nvram:/prev.sysmgr.log.timeout.Z
Jan 05 23:04:25.498 : SYSMGR_LITE: Saving init logs in /nvram:/sysmgr.log.timeout.Z ...
FPD ltrace_file_name => fpd-agent/fiarsp

SAM detects CA certificate(Code Signing Server Certificate Authority,O=Cisco,C=US) has expired. The validity period is Oct 17, 2000 01:46:24 UTC - Oct 17, 2015 01:51:47 UTC. Continue at risk? (Y/N) [Default: N w/in 10]: Jan 05 23:11:29.618: Install Setup: Cleaning packages not in sync list
Jan 05 23:11:29.743: Install Setup: Complete

 

TAS1 con0/RSP1/CPU0 is in standby

 

 


FPD ltrace_file_name => fpd-agent/longbeach
FPD ltrace_file_name => fpd-agent/tempo
Jan 05 23:12:21.927: Install Setup: Syncing meta-data:
Jan 05 23:12:23.388: Install Setup: Complete
SRESET Exception on Core 1....

Writing crashinfo
Active processes:
proc/boot/procnto-booke-smp-instr Thread ID 0 on cpu 0

Active processes:
pkg/bin/redfs_svr Thread ID 1 on cpu 1

[0x1927967392] Record Reboot History, reboot cause = 0x28000125, descr = SRESET Exception
[0x19286bc0dc] Record crashinfo
[0x1928bbbd4d] Record Syslog
2000-01-05 23:12:40.862
Crash Reason: SRESET Exception (Cause Code: 0x28000125)

Exception at 0x4000043c signal 5 c=1 f=3

Active process(s):
proc/boot/procnto-booke-smp-instr Thread ID 0 on cpu 0
pkg/bin/redfs_svr Thread ID 1 on cpu 1

REGISTER INFO
r0 r1 r2 r3
R0 40000438 0ff8ef90 500083b0 5001a4d4
r4 r5 r6 r7
R4 e7fffc3b 00000014 30004500 00000005
r8 r9 r10 r11
R8 00009036 5001a000 00000000 0ff8ef90
r12 r13 r14 r15
R12 5000007c 500083b0 00000000 0fffe790
r16 r17 r18 r19
R16 901b285f 50000000 50016b94 00000000
r20 r21 r22 r23
R20 0bfa1554 00000000 00000001 00000000
r24 r25 r26 r27
R24 e7f1bfc0 0ffeb560 00060000 0fffe790
r28 r29 r30 r31
R28 00b4b214 50000824 0efdfa44 0ff8ef90
cnt lr msr pc
R32 00000000 40000438 00029036 4000043c
cnd xer
R36 44002082 20000000

SUPERVISOR REGISTERS

 

Memory Management Registers

Instruction BAT Registers
Index # Value
IBAT0U # 0x1ffe
IBAT0L # 0x12
IBAT1U # 0
IBAT1L # 0
IBAT2U # 0
IBAT2L # 0
IBAT3U # 0xfffc0003
IBAT3L # 0x60011
IBAT4U # 0x4a0003ff
IBAT4L # 0x70000011
IBAT5U # 0x4c0003ff
IBAT5L # 0x72000011
IBAT6U # 0x4e0003ff
IBAT6L # 0x74000011
IBAT7U # 0
IBAT7L # 0

Data BAT Registers
Index # Value
DBAT0U # 0x1ffe
DBAT0L # 0x12
DBAT1U # 0x34000002
DBAT1L # 0xdc00002a
DBAT2U # 0x3000001e
DBAT2L # 0xc800002a
DBAT3U # 0xfffc0003
DBAT3L # 0x60011
DBAT4U # 0x4a0003ff
DBAT4L # 0x70000011
DBAT5U # 0x4c0003ff
DBAT5L # 0x72000011
DBAT6U # 0x4e0003ff
DBAT6L # 0x74000011
DBAT7U # 0
DBAT7L # 0


Exception Handling Registers
Data Addr Reg # DSISR
0 # 0
SPRG0 # SPRG1 # SPRG2 # SPRG3
0xff8ef90 # 0xefdfa44 # 0x50000824 # 0x1
SaveNRestore SRR0 # SaveNRestore SRR1
0x40000438 # 0x29036


Miscellaneous Registers
Processor Id Reg # 0
HID0 # 0x8493c1bc
HID1 # 0x2cc80

MSSCR0 # 0
MSSSR0 # 0

STACK TRACE
#0 0x40000438
[0x19481af89f] Initializing harddisk file system
[0x194c814a0c] Record TSEC information
!!!
Writing TSEC done
!!
Writing crashinfo done!

Examine crashinfo file for reboot reason

Writing ppc kernel core file
kernel core device: /kernel_core.by.kernel.Z
[0x194e8d064a] Kernel core dump start...
fill phdr vaddr=0xfc7b000, offset=0x5ca4514, size=0x385000
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
Core dump success. Total_size 100832532
[0x197704625c] Successfully dumped Kernel core
[0x19776fa5b9] Record PCDS information
!
Writing PCDS done
Dump Directory
KD: RSP1.000105-231240.tsec, start = 1000, size = 7024, crc = 0
KD: RSP1.000105-231240.crashinfo.by.kernel, start = 9000, size = ad23, crc = 0
KD: RSP1.000105-231240.kernel_core.by.kernel.Z, start = 14000, size = 162cf0d, crc = 2cca17c5
KD: RSP1.000105-231240.pcds, start = 1641000, size = ff000, crc = 18ff3d46

Writing kernel core file done!
rebooting


Selecting ROMMON Image... B

DDR in Interleaved mode

POST 1 : PASSED : code 0 : DDR2 Memory Quick Test


CPU Reset Reason = 0x000b

POST 2 : PASSED : code 0 : FPGA Flash Image CRC Checks


Loading Field Programmable Devices:

FPGA 0-B PROGRAMMED : image: 0xff500028 - 0xff576cca, et: 117ms

FPGA 1-B PROGRAMMED : image: 0xff400028 - 0xff4d1034, et: 206ms

FPGA 2-B PROGRAMMED : image: 0xff100028 - 0xff276358, et: 369ms

FPGA 3-B PROGRAMMED : image: 0xff000028 - 0xff0454a8, et: 69ms

 

System Bootstrap, Version 1.06(20120210:003513) [ASR9K ROMMON],

Copyright (c) 1994-2012 by Cisco Systems, Inc.

Compiled Thu 09-Feb-12 16:35 by saurabja


CPUCtrl: 1.18 [00000001/00000012]

ClkCtrl: 1.23 [00000001/00000017]

IntCtrl: 1.15 [00000001/0000000f]

Punt: 1.5 [00000001/00000005]

CBC: 1.3

BID: 0x0006

 

PPC 8641D (partnum 0x8004), Revision 03.00, (Core Version 02.02)

M8641 CLKIN: 66 Mhz

Core Clock: 1333 Mhz

MPX Clock: 533 Mhz

LBC Clock: 33 Mhz


POST 3 : PASSED : code 0 : Slot ID/Board Type Validity

PCI-E1: Ready as Root Complex

PCI-E2: Ready as Root Complex

 

set_chassis_type: chassis_type=0xef02fb found=TRUE

ASR9K (8641D PPC) platform with 4096 Mb of main memory


program load complete, entry point: 0x100000, size: 0x2ac20

program load complete, entry point: 0x100000, size: 0x2ac20

MBI Candidate = disk0:asr9k-os-mbi-5.2.2/0x100000/mbiasr9k-rp.vm


CARD_SLOT_NUMBER: 1

CPU_INSTANCE: 1

MBI Validation starts ...


Mgt LAN 0 interface is selected

tsec_init_hw: configuring FE (port 2) for: Auto Speed, Auto Duplex


tsec_init_interface: hardware initialization completed

Interface link changed state to UP.

Interface link state up.


MBI validation sending request.

HIT CTRL-C to abort


mbi_val_process_packet: received repsonse (rack 0)

Local image to boot : disk0:asr9k-os-mbi-5.2.2/0x100000/mbiasr9k-rp.vm

program load complete, entry point: 0x100000, size: 0x2ac20


MBI size from header = 20163132,Bootflash resident MBI filesize = 20163132

.............................................................................

program load complete, entry point: 0x203d78, size: 0x1339b3c

Attempting to start second CPU
Config = SMP, Running = SMP
Board type: 0x00100302
Card Capability = 0xffffffff
########################################################################################################
BSP: Board type : RO-RSP2
tracelogger: starting tracing in background ring mode
tracelogger running with args: -startring -F 1 -F 2
Restricted Rights Legend

Use, duplication, or disclosure by the Government is
subject to restrictions as set forth in subparagraph
(c) of the Commercial Computer Software - Restricted
Rights clause at FAR sec. 52.227-19 and subparagraph
(c) (1) (ii) of the Rights in Technical Data and Computer
Software clause at DFARS sec. 252.227-7013.

cisco Systems, Inc.
170 West Tasman Drive
San Jose, California 95134-1706

 

Cisco IOS XR Software for the Cisco XR ASR9K, Version 5.2.2
Copyright (c) 2014 by Cisco Systems, Inc.
Jan 05 23:13:51.539: Install Setup: Using install device 'disk0:'
File RSP1.000105-231240.pcds content has been changed during previous dump process. There could be some residual HW activities. It can be ignored for now.
Jan 05 23:14:01.580: Install Setup: Using MBI device 'bootflash:'
Jan 05 23:14:01.644: Install Setup: Preparing devices:
Jan 05 23:14:01.657: Install Setup: Complete
Jan 05 23:14:13.067: Install Setup: Starting package and meta-data sync
Jan 05 23:14:13.086: Install Setup: Cleaning packages not in sync list
Jan 05 23:14:13.093: Install Setup: Complete
Jan 05 23:14:20.325: Install Setup: Syncing meta-data:
Jan 05 23:14:21.802: Install Setup: Complete
Jan 05 23:14:21.802: Install Setup: Completed sync of all packages and meta-data
Jan 05 23:14:21.802: Install Setup: Starting MBI sync
Jan 05 23:14:37.771: Install Setup: Completed sync of MBIs

 

Thanks & Regards,

Abinash Kumar

Aleksandar Vidakovic
Cisco Employee

Lack of a SW license and/or lack of space on the disk are not causing the standby reload. Nevertheless, please take care of those two items.
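For keeping an eye on the disk space item, the utilization figures can be pulled out of the wdsysmon warnings in a saved log. The following is a minimal Python sketch, assuming the `%HA-HA_WD-4-DISK_WARN` message format shown in the log above; the function name `disk_warnings` is hypothetical and not part of any Cisco tool.

```python
import re

# Pattern assumed from the %HA-HA_WD-4-DISK_WARN messages in the posted log.
DISK_WARN_RE = re.compile(
    r"DISK_WARN : A monitored device (?P<device>\S+) is above "
    r"(?P<threshold>\d+)% utilization\. Current utilization = (?P<current>\d+)"
)


def disk_warnings(log_text):
    """Return {device: current utilization in percent} for each DISK_WARN line."""
    usage = {}
    for match in DISK_WARN_RE.finditer(log_text):
        # A later warning for the same device overwrites the earlier value.
        usage[match.group("device")] = int(match.group("current"))
    return usage
```

Feeding it the saved console log returns one entry per monitored device that raised a warning.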

 

Also see https://www.cisco.com/c/en/us/support/docs/field-notices/639/fn63979.html because the router needs the new SW signing method.

 

The reason for the standby RSP reset has to do with the fabric connection:

 

Crash Reason: Cause: pfm_dev_sm_perform_recovery_action, Card reset requested by: Process ID: 331880 (fabmgr),...

 

It would be good to upgrade the router to 5.3.4 plus latest Service Pack to eliminate any SW issue.
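
When the standby reloads repeatedly, it can help to compare the crash reason of each reload side by side. The following is a minimal Python sketch, assuming the "Crash Reason" line format quoted above; the function name `parse_crash_reason` is hypothetical and not a Cisco utility.

```python
import re

# Field patterns assumed from the "Crash Reason" lines in the posted log.
CAUSE_RE = re.compile(r"Cause:\s*(?P<cause>[^,]+)")
PROC_RE = re.compile(r"Process ID:\s*(?P<pid>\d+)\s*\((?P<name>[^)]+)\)")
CODE_RE = re.compile(r"Cause Code:\s*(?P<code>0x[0-9a-fA-F]+)")


def parse_crash_reason(line):
    """Return the cause, requesting process and cause code found in one line.

    Fields that are absent from the line are simply left out of the result.
    """
    result = {}
    match = CAUSE_RE.search(line)
    if match:
        result["cause"] = match.group("cause").strip()
    match = PROC_RE.search(line)
    if match:
        result["pid"] = int(match.group("pid"))
        result["process"] = match.group("name")
    match = CODE_RE.search(line)
    if match:
        result["cause_code"] = match.group("code")
    return result
```

Running it over every "Crash Reason" line in a captured console session shows whether the same process keeps requesting the reset.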

Thanks a lot Aleksandar Vidakovic.

 

As per your suggestion, I will try to upgrade to 5.3.4 to eliminate any SW issues.

Thanks & Regards,

Abinash Kumar

Hi Abinash,

That will be very good indeed. Please don't take just the base 5.3.4; install the latest Service Pack as well. If the router is already running 5.1.3 or later, you can activate 5.3.4+SP in one go.

/Aleksandar

Hi Aleksandar,
I have the same problem, but in my case it is an ASR 9010 whose standby RSP is rebooting again and again. The log is below:
Cisco IOS XR Software for the Cisco XR ASR9K, Version 5.2.4
Copyright (c) 2015 by Cisco Systems, Inc.
Media storage device /harddisk: was repaired. Check fsck log at /harddisk:/chkfs_repair.log
Mar 22 14:17:35.456: Install Setup: Using install device 'disk0:'
Mar 22 14:17:35.490: Install Setup: Preparing devices:
Mar 22 14:17:35.492: Install Setup: Removing all packages and MBIs from install device (disk0:)
Mar 22 14:17:35.493: Install Setup: Complete
Mar 22 14:17:35.493: Install Setup: Complete
Mar 22 14:17:35.494: Install Setup: Starting package and meta-data sync
Mar 22 14:17:35.498: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0':
Mar 22 14:17:36.137: Install Setup: Complete
Mar 22 14:17:36.137: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCuz46500-1.0.0':
Mar 22 14:17:36.695: Install Setup: Complete
Mar 22 14:17:36.696: Install Setup: Syncing package 'disk0/asr9k-optics-supp-5.2.4':
Mar 22 14:17:36.770: Install Setup: Complete
Mar 22 14:17:36.771: Install Setup: Syncing package 'disk0/iosxr-mcast-5.2.4.CSCuw01943-1.0.0':
Mar 22 14:17:40.165: Install Setup: Complete
Mar 22 14:17:40.167: Install Setup: Syncing package 'disk0/asr9k-service-supp-5.2.4':
Mar 22 14:17:40.814: Install Setup: Complete
Mar 22 14:17:40.814: Install Setup: Syncing package 'disk0/iosxr-fwding-5.2.4.CSCuv71988-1.0.0':
Mar 22 14:17:47.221: Install Setup: Complete
Mar 22 14:17:47.222: Install Setup: Syncing package 'disk0/iosxr-service-5.2.4':
Mar 22 14:18:07.112: Install Setup: Complete
Mar 22 14:18:07.113: Install Setup: Syncing package 'disk0/iosxr-routing-5.2.4.CSCux83178-1.0.0':
Mar 22 14:18:09.388: Install Setup: Complete
Mar 22 14:18:09.389: Install Setup: Syncing package 'disk0/iosxr-fwding-5.2.4':
Mar 22 14:18:58.592: Install Setup: Failed: Unable to sync all of the contents of '/disk0/iosxr-fwding-5.2.4'
Mar 22 14:18:58.645: Install Setup: Failed to sync some packages or meta-data
Mar 22 14:18:58.645: Install Setup: Install Setup (pid 114730) has failed to prepare this node successfully and will now exit: (2949233664) 'Subsystem(8083)' detected the 'fatal' condition 'Code(30)'
/pkg/bin/idiags_persist_store.sh[257]: ã % 5: unexpected `%'
Mar 22 14:19:00.499: Install Setup: Using install device 'disk0:'
Mar 22 14:19:00.529: Install Setup: Preparing devices:
Mar 22 14:19:00.530: Install Setup: Removing all packages and MBIs from install device (disk0:)
Mar 22 14:19:03.436: Install Setup: Complete
Mar 22 14:19:03.436: Install Setup: Complete
Mar 22 14:19:03.437: Install Setup: Starting package and meta-data sync
Mar 22 14:19:03.439: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0':
Mar 22 14:19:46.318: Install Setup: Failed: Unable to sync all of the contents of '/disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0'
Mar 22 14:19:46.318: Install Setup: Failed to sync some packages or meta-data
Mar 22 14:19:46.318: Install Setup: Install Setup (pid 208937) has failed to prepare this node successfully and will now exit: (2949233664) 'Subsystem(8083)' detected the 'fatal' condition 'Code(30)'
/pkg/bin/idiags_persist_store.sh[257]: ã % 5: unexpected `%'
Mar 22 14:19:50.047: Install Setup: Using install device 'disk0:'
Mar 22 14:19:50.077: Install Setup: Preparing devices:
Mar 22 14:19:50.078: Install Setup: Removing all packages and MBIs from install device (disk0:)
Mar 22 14:19:50.078: Install Setup: Complete
Mar 22 14:19:50.079: Install Setup: Complete
Mar 22 14:19:50.080: Install Setup: Starting package and meta-data sync
Mar 22 14:19:50.083: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0':
Mar 22 14:19:55.137: Install Setup: Complete
Mar 22 14:19:55.138: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCuz46500-1.0.0':
Mar 22 14:20:05.837: Install Setup: Complete
Mar 22 14:20:05.838: Install Setup: Syncing package 'disk0/asr9k-optics-supp-5.2.4':
Mar 22 14:20:05.916: Install Setup: Complete
Mar 22 14:20:05.916: Install Setup: Syncing package 'disk0/iosxr-mcast-5.2.4.CSCuw01943-1.0.0':
Mar 22 14:20:47.776: Install Setup: Failed: Unable to sync all of the contents of '/disk0/iosxr-mcast-5.2.4.CSCuw01943-1.0.0'
Mar 22 14:20:47.777: Install Setup: Failed to sync some packages or meta-data
Mar 22 14:20:47.777: Install Setup: Install Setup (pid 266281) has failed to prepare this node successfully and will now exit: (2949233664) 'Subsystem(8083)' detected the 'fatal' condition 'Code(30)'
/pkg/bin/idiags_persist_store.sh[257]: ã % 5: unexpected `%'
Mar 22 14:20:48.413: Install Setup: Using install device 'disk0:'
Mar 22 14:20:48.443: Install Setup: Preparing devices:
Mar 22 14:20:48.444: Install Setup: Removing all packages and MBIs from install device (disk0:)
Mar 22 14:20:48.811: Install Setup: Complete
Mar 22 14:20:48.812: Install Setup: Complete
Mar 22 14:20:48.812: Install Setup: Starting package and meta-data sync
Mar 22 14:20:48.815: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0':
Mar 22 14:21:26.264: Install Setup: Failed: Unable to sync all of the contents of '/disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0'
Mar 22 14:21:26.264: Install Setup: Failed to sync some packages or meta-data
Mar 22 14:21:26.265: Install Setup: Install Setup (pid 323625) has failed to prepare this node successfully and will now exit: (2949233664) 'Subsystem(8083)' detected the 'fatal' condition 'Code(30)'
/pkg/bin/idiags_persist_store.sh[257]: ã % 5: unexpected `%'
Mar 22 14:21:26.728: Install Setup: Using install device 'disk0:'
Mar 22 14:21:26.758: Install Setup: Preparing devices:
Mar 22 14:21:26.759: Install Setup: Removing all packages and MBIs from install device (disk0:)
Mar 22 14:21:26.760: Install Setup: Complete
Mar 22 14:21:26.760: Install Setup: Complete
Mar 22 14:21:26.761: Install Setup: Starting package and meta-data sync
Mar 22 14:21:26.765: Install Setup: Syncing package 'disk0/asr9k-fwding-5.2.4.CSCux76794-1.0.0':
Failed to rename debug file, 18, src: /nvram:/sysmgr.log.timeout.Z, target: /nvram:/prev.sysmgr.log.timeout.Z
Mar 22 14:22:22.054 : SYSMGR_LITE: Saving init logs in /nvram:/sysmgr.log.timeout.Z ...
Mar 22 14:22:30.942: Install SetFailed to rename debug file, 18, src: /nvram:/sysmgr.log.timeout.Z, target: /nvram:/prev.sysmgr.log.timeout.Z
Mar 22 14:22:31.109 : SYSMGR_LITE: Saving init logs in /nvram:/sysmgr.log.timeout.Z ...
up: Failed: UnMar 22 14:22:31.229 : SYSMGR_LITE: INIT: respawn 'instsetup' disabled, exit_code 256, INIT_MAX_SPAWN reached
able to sync all
Reboot on ASR9010-V2 RSP3 (0x100306) in slot 1
By init via REBOOT_CAUSE_SYSMGR (2c000007)
Current time: 2018-03-22 14:22:31.366, Up time: 5m 12s
A kernel core file was explicitly requested by process init
Reboot Reason: Cause code 0x2c000007 Cause: INIT: respawn 'instsetup' disabled, exit_code 256, INIT_MAX_SPAWN reached Process: init Traceback: 8092148 80929ed 809284b 4207c55 800f050 0


Active process(s):
proc/boot/procnto-smp-instr pid 1 tid 1 on cpu 0, pri 0
proc: fdfe4010, utime = 1230407 ms, stime = 1842 ms
thread: fdfc4010, thread sutime = 307493 ms, pc = fe6f096a

x86/bin/init pid 8196 tid 2 on cpu 1, pri 10
proc: fdfe4748, utime = 63 ms, stime = 8 ms
thread: fdd23680, thread sutime = 1 ms, pc = 8092b99
eax = 28000007, ebx = 28000007, ecx = 80d1f44, edx = 81ba4d8
edi = 417ef94, esi = 0, ebp = 417ef94, exx = fdd23998
cs = f3, efl = 1212, esp = 417eb20, ss = fb

pkg/bin/devb-ahci pid 36888 tid 9 on cpu 2, pri 21
proc: fdfe7c18, utime = 3305 ms, stime = 471 ms
thread: fdd2f010, thread sutime = 1233 ms, pc = 421d2fe
eax = 10180009, ebx = 101eef68, ecx = 10180009, edx = 424fd60
edi = 101574fc, esi = 101f101c, ebp = 4160eec, exx = fdd2f328
cs = f3, efl = 3206, esp = 4160ed4, ss = fb

proc/boot/procnto-smp-instr pid 1 tid 4 on cpu 3, pri 0
proc: fdfe4010, utime = 1230407 ms, stime = 1842 ms
thread: fdfc49b8, thread sutime = 302784 ms, pc = fe6f096a


Normal reboot
Writing crashinfo
Crash Reason: Cause code 0x2c000007 Cause: INIT: respawn 'instsetup' disabled, exit_code 256, INIT_MAX_SPAWN reached Process: init Traceback: 8092148 80929ed 809284b 4207c55 800f050 0

Exception at 0x8092b99 signal 5 c=2 f=0

Active process(s):
proc/boot/procnto-smp-instr Thread ID 0 on cpu 0
x86/bin/init Thread ID 1 on cpu 1
pkg/bin/devb-ahci Thread ID 8 on cpu 2
proc/boot/procnto-smp-instr Thread ID 3 on cpu 3

Reboot reason: Cause: INIT: respawn 'instsetup' disabled, exit_code 256, INIT_MAX_SPAWN reached Process: init Traceback: 8092148 80929ed 809284b 4207c55 800f050 0

REGISTER INFO
EDI ESI EBP EXX
R0 0417ef94 00000000 0417ef94 fdd23998
EBX EDX ECX EAX
R4 28000007 081ba4d8 080d1f44 28000007
EIP CS EFL ESP
R8 08092b99 000000f3 00001212 0417eb20
SS
R12 000000fb
CR0 # 0x8004003b
CR2 # 0x417cfc0
CR3 # 0xfe3cfc0
CR4 # 0x2f0
DR0 # 0
DR1 # 0
DR2 # 0
DR3 # 0
DR6 # 0xffff0ff0
DR7 # 0x400


GDTR limit # 0x13f
GDTR base address # 0xfed29820
LDTR # 0xa0
IDTR limit # 0x7ff
IDTR base address # 0xfed26960
TR # 0x108
Release mastership on RSP3
!
Dumping local syslog messages
RP/0/RSP1/CPU0:Mar 22 11:16:26.498 : init[65540]: %OS-INIT-7-MBI_STARTED : total time 9.751 seconds
RP/0/RSP1/CPU0:Mar 22 11:16:28.543 : ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 8 (Peer_RSP) is up
RP/0/RSP1/CPU0:Mar 22 11:16:28.941 : ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 12 (SFP+_00_10GE) is up
RP/0/RSP1/CPU0:Mar 22 11:16:29.058 : ce_switch_srv[54]: %PLATFORM-CE_SWITCH-6-UPDN : Interface 13 (SFP+_01_10GE) is up
RP/0/RSP1/CPU0:Mar 22 11:16:29.070 : ce_switch_srv[54]:
Dumping kernel printf messages
!Kernel Dumper Boot Init
*** Welcome to QNX neutrino!
*** ASR9K RSP3.
tracelogger: starting tracing in background ring mode
tracelogger running with args: -startring -F 1 -F 2
Use private TLB mappings
Failed to rename debug file, 18, src: /nvram:/sysmgr.log.timeout.Z, target: /nvram:/prev.sysmgr.log.timeout.Z
Mar 22 14:22:22.054 : SYSMGR_LITE: Saving init logs in /nvram:/sysmgr.log.timeout.Z ...
Failed to rename debug file, 18, src: /nvram:/sysmgr.log.timeout.Z, target: /nvram:/prev.sysmgr.log.timeo

Writing crashinfo done!

Writing x86 kernel core file
ENTER get_kernel_end
Getting kernel end success: 0xb0c000 rc = 0
Heap start = 0xe0000000 Heap End = 0xfe000000
Checking heap num_seg = 0
Total number of segments found 24
Writing heap elf header
Done writing heap elf headers

Aleksandar, could you help, please?

Sounds like it might be CSCus81167

 

You could try removing the inactive packages ("admin install remove inactive").

It's possible that the installation on active got somehow corrupted, because it complains about one specific package that couldn't be synced (iosxr-fwding). Try running "admin install verify packages" and "admin install verify packages repair" on the active.

In any case, it would be good to upgrade to 5.3.4 plus latest Service Pack. That is in case this router has any Trident line card. If all cards are Typhoon, our recommendation is 6.2.3.
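
To see whether the sync keeps failing on the same package across attempts, the failure messages in a saved console log can be tallied. The following is a minimal Python sketch, assuming the "Install Setup: Failed: Unable to sync all of the contents of ..." message format from the log above; the function name `failed_sync_packages` is hypothetical.

```python
import re
from collections import Counter

# Pattern assumed from the "Install Setup: Failed: ..." lines in the posted log.
SYNC_FAIL_RE = re.compile(
    r"Install Setup: Failed: Unable to sync all of the contents of '(?P<pkg>[^']+)'"
)


def failed_sync_packages(log_text):
    """Count how often each package path appears in a sync failure message."""
    return Counter(match.group("pkg") for match in SYNC_FAIL_RE.finditer(log_text))
```

A single package with a high count points at that package, while failures spread over several packages suggest looking elsewhere.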

/Aleksandar

This is the standard software package in my company, so an upgrade to 5.3.4 is not an option in the short term. The output from the commands is below, and it all seems fine. Any other ideas? Do you think it is better to reinstall only the corrupted package (iosxr-fwding) or to do a turboboot on the active RSP?
RP/0/RSP0/CPU0:(admin)#install verify packages
Thu Mar 22 16:02:50.781 UTC
Install operation 48 '(admin) install verify packages' started by user 'root'
via CLI at 16:02:50 UTC Thu Mar 22 2018.
The install operation will continue asynchronously.
RP/0/RSP0/CPU0:FTLZBZAN9K002(admin)#Info: This operation can take up to 2 minutes per package being verified.
Info: Please be patient.
Info: 0/RSP0/CPU0 [RP] [SDR: Owner]
Info: meta-data: [SUCCESS] Verification Successful.
Info: /install/asr9k-fwding-5.2.4.CSCuz46500-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-mcast-5.2.4.CSCuw01943-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuv71988-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-routing-5.2.4.CSCux83178-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuq64356-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-base-5.2.4.CSCux46204-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-fwding-5.2.4.CSCux76794-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-mpls-5.2.4.CSCus83607-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-infra-5.2.4.CSCuw35524-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCux13305-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuu70514-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-fwding-5.2.4.CSCuu70514-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-fwding-5.2.4.CSCux65585-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-fwding-5.2.4.CSCus02689-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-9000v-nV-supp-5.2.4.CSCuu49049-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-base-5.2.4.CSCux43034-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-fwding-5.2.4.CSCuw77529-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-routing-5.2.4.CSCuv58845-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuw24345-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-mgbl-5.2.4.CSCuv25059-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-ce-5.2.4.CSCuv96165-1.0.0: [SUCCESS] Verification
Info: Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuv40032-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-routing-5.2.4.CSCuv72695-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-infra-5.2.4.CSCuv15754-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-infra-5.2.4.CSCut59034-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuv39265-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-infra-5.2.4.CSCuv39255-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-infra-5.2.4.CSCut46951-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCut57910-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-fwding-5.2.4.CSCuu20681-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/iosxr-infra-5.2.4.CSCut52232-1.0.0: [SUCCESS]
Info: Verification Successful.
Info: /install/asr9k-k9sec-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/iosxr-security-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-asr903-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/asr9k-mgbl-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/iosxr-mgbl-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-mcast-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/iosxr-mcast-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-ce-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-cpp-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-scfclient-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/asr9k-diags-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/asr9k-fwding-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-base-5.2.4: [SUCCESS] Verification Successful.
Info: /install/iosxr-ce-5.2.4: [SUCCESS] Verification Successful.
Info: /install/iosxr-diags-5.2.4: [SUCCESS] Verification Successful.
Info: /install/iosxr-routing-5.2.4: [SUCCESS] Verification Successful.
Info: /install/iosxr-fwding-5.2.4: [SUCCESS] Verification Successful.
Info: /install/iosxr-infra-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-fpd-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-9000v-nV-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/asr9k-optics-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/iosxr-mpls-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-service-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/iosxr-service-5.2.4: [SUCCESS] Verification Successful.
Info: /install/asr9k-asr901-supp-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: /install/asr9k-services-infra-5.2.4: [SUCCESS] Verification
Info: Successful.
Info: Verification Summary:
Info: 0/RSP0/CPU0: SUCCESSFUL. No anomalies found.
Info: The system needs no repair.
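
With this many packages, the verification output is easier to check by script than by eye. The following is a minimal Python sketch, assuming the `Info: /install/<package>: [STATUS]` line shape shown above; the function names are hypothetical, and since only `[SUCCESS]` appears in this output, any other tag is simply treated as "not successful" rather than matched against a known list.

```python
import re

# Line shape assumed from the "install verify packages" output above.
VERIFY_RE = re.compile(
    r"Info:\s+(?P<pkg>/install/[^\s:]+):\s+\[(?P<status>[A-Z]+)\]"
)


def verify_results(output):
    """Map each /install/... package path to the status tag printed for it."""
    return {m.group("pkg"): m.group("status") for m in VERIFY_RE.finditer(output)}


def not_successful(output):
    """Return the package paths whose status tag is anything other than SUCCESS."""
    return [pkg for pkg, status in verify_results(output).items() if status != "SUCCESS"]
```

An empty list from `not_successful` agrees with the "No anomalies found" summary at the end of the output.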

Hi Aleksandar Vidakovic,

 

I have upgraded the RSP as you instructed, but it is behaving the same way. There might be a hardware issue.

 

thanks for your support.

Best regards,

Abinash kumar. 

Yes, the RSP should be replaced.