01-09-2012 06:41 AM - edited 03-07-2019 04:14 AM
Greetings!
Last weekend I installed a new redundant supervisor (VS-S720-10G) to our WS-C6509-E chassis. Everything went smoothly, the new sup booted into IOS and redundancy was established in RPR mode (due to different IOS versions). We want to run SSO mode, so I copied the 12.2(33)SXH4 IOS image from sup-bootdisk to the same location on slavesup-bootdisk, set and verified bootvars, waited a while to make sure the configs are synced, and reset the new supervisor.
I don't know if it's relevant that the IOS image I copied over to the redundant module (SXH4) is older than the version it shipped with (SXH5), but anyways this is where the problems started.
Booting up, the IOS image decompressed successfully, I got to the "running startup...." message in the console, and then nothing. All LEDs on the supervisor were red, and show module reported status Other. Thinking maybe something went wrong copying the SXH4 image, I reset the module again, interrupted the boot sequence, and booted the SXH5 image from ROMMon. Again, stuck after decompressing the image (this time the Status LED was green and the others dark, but module status still Other). Next I wondered if my startup-configuration could somehow be corrupt and prevent IOS from booting, so I tried to boot from ROMMon with "ignore system config info" enabled, but still no success using either image.
Now I'm stumped, and I would really appreciate it if anyone could offer some inputs about what to try next. Thanks.
Below is a console capture of a couple of show commands on the main supervisor, and of trying to boot the SHX4 image with startup-config ignored and diagnostic mode enabled. Nothing happens after "running startup...." is displayed in the console no matter how long I wait, module LEDs glow red, and status remains Other.
Show commands on main sup:
XXX#sh mod 5
Mod Ports Card Type Model Serial No.
--- ----- -------------------------------------- ------------------ -----------
5 5 Supervisor Engine 720 10GE (Active) VS-S720-10G <omitted>
Mod MAC addresses Hw Fw Sw Status
--- ---------------------------------- ------ ------------ ------------ -------
5 001e.4af8.6f00 to 001e.4af8.6f07 2.0 8.5(2) 12.2(33)SXH4 Ok
Mod Sub-Module Model Serial Hw Status
---- --------------------------- ------------------ ----------- ------- -------
5 Policy Feature Card 3 VS-F6K-PFC3C <omitted> 1.0 Ok
5 MSFC3 Daughterboard VS-F6K-MSFC3 <omitted> 1.0 Ok
Mod Online Diag Status
---- -------------------
5 Pass
XXX#sh mod 6
Mod Ports Card Type Model Serial No.
--- ----- -------------------------------------- ------------------ -----------
6 5 Supervisor Engine 720 10GE (Other) VS-S720-10G <omitted>
Mod MAC addresses Hw Fw Sw Status
--- ---------------------------------- ------ ------------ ------------ -------
6 001e.4af8.6f00 to 001e.4af8.6f07 2.0 8.5(2) 12.2(33)SXH5 Other
Mod Online Diag Status
---- -------------------
6 Not Applicable
XXX#sh red sta
my state = 13 -ACTIVE
peer state = 1 -DISABLED
Mode = Simplex
Unit = Primary
Unit ID = 5
Redundancy Mode (Operational) = sso
Redundancy Mode (Configured) = sso
Redundancy State = Non Redundant
Maintenance Mode = Disabled
Communications = Down Reason: Simplex mode
client count = 77
client_notification_TMR = 30000 milliseconds
keep_alive TMR = 9000 milliseconds
keep_alive count = 0
keep_alive threshold = 18
RF debug mask = 0x0
XXX#
Bootup attempt on new redundant sup:
System Bootstrap, Version 8.5(2)
Copyright (c) 1994-2007 by cisco Systems, Inc.
Testing lower main memory - data equals address
Testing lower main memory - checkerboard
Testing lower main memory - inverse checkerboard
Clearing lower memory for cache initialization
Clearing bss
Clearing autoboot state machine
melody_present_reg: 1st read w/ 0x5555
melody_present_reg: 2nd read w/ 0xaaaa, reversed: 0x5555
Bootdisk adapter is detected, enabling bootdisk access...
Reprogramming CS1 w/ Melody value...
Reading monitor variables from NVRAM
Reset reason for CPU board 0xffff , BaseBoard 0x240ffff, display 0x20000System Reset by Software.
Enabling interrupts
Initializing TLB
Initializing cache
Initializing required TLB entries
Initializing main memory
Sizing NVRAM
Initializing PCMCIA controller
Initializing USB2.0 controller
Exiting init
Cat6k-Sup720/SP processor with 1048576 Kbytes of main memory
Autoboot executing command: "boot bootdisk:/newsys/s72033/base/s72033-ipservicesk9_wan-vm"
Loading image, please wait ...
Initializing ATA monitor library...
reg_v0: 0x00000001
reg_k0: 0x30409001
reg_k1: 0x30410000
tar_start: 0x00000000
tar_size: 0x00000000
Tar image address is : 0x30409001
tar_size in czip : 0x00000000
Stack pointer : 0x8FFFFF80
monstack : 0x80100FC0
monra : 0xBFC59764
edata : 0x80106EA0
magic : 0xFEEDFACE
memsize : 0x10000000
uncomp_size : 0x0350FD3C
comp_size : 0x0119B94C
comp_checksum : 0x8B65EDE7
uncomp_checksum : 0x0AEF011C
Compressed IOS src copy is : 0x80106EB4
compressed IOS src is : 0x8EE5BEB4
IOS uncompressed dest copy is : 0x8012F4AC
Self decompressing the image : ############################################################################################################################################################################################################################################################################################################ [OK]
e_shoff :
0x00000054
e_flags : 0x00000000
e_phnum :
0x00000001
Source elf_hdr->e_shnum = 0x00000003
Setting up to copy ELF section 0x00000001
to image_info section 0x00000000
sh_name = 0x00000000
sh_type = 0x00000001
sh_flags = 0x00000007
sh_addr = 0x80100F00
sh_offset = 0x00000F00
sh_size = 0x0350EE3C
sh_link = 0x00000000
sh_info = 0x00000000
sh_addralign = 0x00000F00
sh_entsize = 0x00000000
cpu type : 0x00000019
uncomp_size : 0x0350FD3C
monstack : 0x80100FC0
image_info.entry_point = 0x80101000
image_info.section_count = 0x00000001
image_info.monstack = 0x80100FC0
image_info.monra = 0xBFC59764
image_info.param0 = 0x00000002
image_info.param1 = 0x00000000
image_info.param2 = 0x800066B0
image_info.param3 = 0x80101000
image_info.reg_k0 = 0x30409001
image_info.reg_k1 = 0x00000000
Section Index = 0x00000000
source = 0x801303AC
dest = 0x80100F00
bytes = 0x0350EE3C
running startup....
01-09-2012 06:52 AM
Hi Tukka,
the red LED is not a good sign.
Can you try to pull that sup out and install it on a spare chassis (if you have one)?
I would then try to break into ROMMON and check which images you have on the flash. I would manually boot SXH4 (provided is there and accessible).
Try use this document as help
If the red LED condition shows up again I think you just need to RMA that sup.
Riccardo
01-09-2012 07:54 AM
Thanks for the quick reply Riccardo!
Unfortunately I don't have a spare chassis. The images I have on the flash are SXH4 and SXH5, and neither of them will boot, with or without startup-config. With SXH4 the LEDs stay red, and with SXH5 the status LED is green and others dark.
It's hard to believe that the sup hardware would be broken, because when I first inserted it, it booted up into SXH5 and RPR just fine. The problems only began after trying to downgrade IOS on the new sup. Note that we are using native mode.
Can I use "tftpdnld" on the sup to transfer over a fresh image?
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide