cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
1610
Views
0
Helpful
2
Replies

Redundant supervisor trouble with WS-C6509-E

Greetings!

Last weekend I installed a new redundant supervisor (VS-S720-10G) to our WS-C6509-E chassis. Everything went smoothly, the new sup booted into IOS and redundancy was established in RPR mode (due to different IOS versions). We want to run SSO mode, so I copied the 12.2(33)SXH4 IOS image from sup-bootdisk to the same location on slavesup-bootdisk, set and verified bootvars, waited a while to make sure the configs are synced, and reset the new supervisor.

I don't know if it's relevant that the IOS image I copied over to the redundant module (SXH4) is older than the version it shipped with (SXH5), but anyways this is where the problems started.

Booting up, the IOS image decompressed successfully, I got to the "running startup...." message in the console, and then nothing. All LEDs on the supervisor were red, and show module reported status Other. Thinking maybe something went wrong copying the SXH4 image, I reset the module again, interrupted the boot sequence, and booted the SXH5 image from ROMMon. Again, stuck after decompressing the image (this time the Status LED was green and the others dark, but module status still Other). Next I wondered if my startup-configuration could somehow be corrupt and prevent IOS from booting, so I tried to boot from ROMMon with "ignore system config info" enabled, but still no success using either image.

Now I'm stumped, and I would really appreciate it if anyone could offer some inputs about what to try next. Thanks.

Below is a console capture of a couple of show commands on the main supervisor, and of trying to boot the SHX4 image with startup-config ignored and diagnostic mode enabled. Nothing happens after "running startup...." is displayed in the console no matter how long I wait, module LEDs glow red, and status remains Other.

Show commands on main sup:

XXX#sh mod 5

Mod Ports Card Type                              Model              Serial No.

--- ----- -------------------------------------- ------------------ -----------

  5    5  Supervisor Engine 720 10GE (Active)    VS-S720-10G        <omitted>

Mod MAC addresses                       Hw    Fw           Sw           Status

--- ---------------------------------- ------ ------------ ------------ -------

  5  001e.4af8.6f00 to 001e.4af8.6f07   2.0   8.5(2)       12.2(33)SXH4 Ok

Mod  Sub-Module                  Model              Serial       Hw     Status

---- --------------------------- ------------------ ----------- ------- -------

  5  Policy Feature Card 3       VS-F6K-PFC3C       <omitted>  1.0    Ok

  5  MSFC3 Daughterboard         VS-F6K-MSFC3       <omitted>  1.0    Ok

Mod  Online Diag Status

---- -------------------

  5  Pass

XXX#sh mod 6

Mod Ports Card Type                              Model              Serial No.

--- ----- -------------------------------------- ------------------ -----------

  6    5  Supervisor Engine 720 10GE (Other)     VS-S720-10G        <omitted>

Mod MAC addresses                       Hw    Fw           Sw           Status

--- ---------------------------------- ------ ------------ ------------ -------

  6  001e.4af8.6f00 to 001e.4af8.6f07   2.0   8.5(2)       12.2(33)SXH5 Other

Mod  Online Diag Status

---- -------------------

  6  Not Applicable

XXX#sh red sta

       my state = 13 -ACTIVE

     peer state = 1  -DISABLED

           Mode = Simplex

           Unit = Primary

        Unit ID = 5

Redundancy Mode (Operational) = sso

Redundancy Mode (Configured)  = sso

Redundancy State              = Non Redundant

     Maintenance Mode = Disabled

Communications = Down      Reason: Simplex mode

   client count = 77

client_notification_TMR = 30000 milliseconds

          keep_alive TMR = 9000 milliseconds

        keep_alive count = 0

    keep_alive threshold = 18

           RF debug mask = 0x0

XXX#

Bootup attempt on new redundant sup:

System Bootstrap, Version 8.5(2)

Copyright (c) 1994-2007 by cisco Systems, Inc.

Testing lower main memory - data equals address

Testing lower main memory - checkerboard

Testing lower main memory - inverse checkerboard

Clearing lower memory for cache initialization

Clearing bss

Clearing autoboot state machine

melody_present_reg: 1st read w/ 0x5555

melody_present_reg: 2nd read w/ 0xaaaa, reversed: 0x5555

Bootdisk adapter is detected, enabling bootdisk access...

Reprogramming CS1 w/ Melody value...

Reading monitor variables from NVRAM

Reset reason for CPU board 0xffff , BaseBoard 0x240ffff, display 0x20000System Reset by Software.

Enabling interrupts

Initializing TLB

Initializing cache

Initializing required TLB entries

Initializing main memory

Sizing NVRAM

Initializing PCMCIA controller

Initializing USB2.0 controller

Exiting init

Cat6k-Sup720/SP processor with 1048576 Kbytes of main memory

Autoboot executing command: "boot bootdisk:/newsys/s72033/base/s72033-ipservicesk9_wan-vm"

Loading image, please wait ...

Initializing ATA monitor library...

reg_v0: 0x00000001

reg_k0: 0x30409001

reg_k1: 0x30410000

tar_start: 0x00000000

tar_size: 0x00000000

Tar image address is             : 0x30409001

tar_size  in czip         : 0x00000000

Stack pointer       : 0x8FFFFF80

monstack            : 0x80100FC0

monra               : 0xBFC59764

edata : 0x80106EA0

magic : 0xFEEDFACE

memsize             : 0x10000000

uncomp_size         : 0x0350FD3C

comp_size           : 0x0119B94C

comp_checksum       : 0x8B65EDE7

uncomp_checksum     : 0x0AEF011C

Compressed IOS src copy is     : 0x80106EB4

compressed IOS  src is     : 0x8EE5BEB4

IOS uncompressed dest copy is     : 0x8012F4AC

Self decompressing the image : ############################################################################################################################################################################################################################################################################################################ [OK]

e_shoff :

0x00000054

e_flags : 0x00000000

e_phnum :

0x00000001

Source elf_hdr->e_shnum = 0x00000003

Setting up to copy ELF section 0x00000001

to image_info section 0x00000000

sh_name = 0x00000000

sh_type = 0x00000001

sh_flags = 0x00000007

sh_addr = 0x80100F00

sh_offset = 0x00000F00

sh_size = 0x0350EE3C

sh_link = 0x00000000

sh_info = 0x00000000

sh_addralign = 0x00000F00

sh_entsize = 0x00000000

cpu type                   : 0x00000019

uncomp_size                : 0x0350FD3C

monstack                   : 0x80100FC0

image_info.entry_point   = 0x80101000

image_info.section_count = 0x00000001

image_info.monstack      = 0x80100FC0

image_info.monra         = 0xBFC59764

image_info.param0        = 0x00000002

image_info.param1        = 0x00000000

image_info.param2        = 0x800066B0

image_info.param3        = 0x80101000

image_info.reg_k0        = 0x30409001

image_info.reg_k1        = 0x00000000

Section Index = 0x00000000

    source    = 0x801303AC

    dest      = 0x80100F00

    bytes     = 0x0350EE3C

running startup....

2 Replies 2

rsimoni
Cisco Employee
Cisco Employee

Hi Tukka,

the red LED is not a good sign.

Can you try to pull that sup out and install it on a spare chassis (if you have one)?

I would then try to break into ROMMON and check which images you have on the flash. I would manually boot SXH4 (provided is there and accessible).

Try use this document as help

http://www.cisco.com/en/US/products/hw/switches/ps700/products_tech_note09186a008015bfa1.shtml#Sup720

If the red LED condition shows up again I think you just need to RMA that sup.

Riccardo

Thanks for the quick reply Riccardo!

Unfortunately I don't have a spare chassis. The images I have on the flash are SXH4 and SXH5, and neither of them will boot, with or without startup-config. With SXH4 the LEDs stay red, and with SXH5 the status LED is green and others dark.

It's hard to believe that the sup hardware would be broken, because when I first inserted it, it booted up into SXH5 and RPR just fine. The problems only began after trying to downgrade IOS on the new sup. Note that we are using native mode.

Can I use "tftpdnld" on the sup to transfer over a fresh image?

Review Cisco Networking products for a $25 gift card