
StackWise Virtual on C9500, VSL interfaces with QSFP LR4-S: active member removed

ROGER LE VERGE
Level 1

Hello

We have a curious bug on two C9500-48Y4C switches connected by 4 VSL links using QSFP 100G LR4 optics. There are no errors on the VSL links and no errors on the DAD link. Everything works well for 3 hours or a day, then suddenly the active member becomes, and stays, removed after a redundancy switchover, and the standby member becomes active.

The IOS version is 17.12.4 fc3. We tested 17.9.5: same result. We tested the C9500s on the same site: same result. We tested with the other member as active: same result. A Cisco TAC case is open (Cisco Poland), with no response yet after uploading the trace logs. Does anyone have an idea?

 

*Nov 25 2024 16:28:21.341 France: %PLATFORM-6-RF_PROG_SUCCESS: RF state STANDBY HOT
*Nov 26 2024 02:56:29.768 France: %PLATFORM-6-HASTATUS: RP switchover, received chassis event to become active
*Nov 26 2024 02:56:29.827 France: %REDUNDANCY-3-SWITCHOVER: RP switchover (PEER_NOT_PRESENT)
*Nov 26 2024 02:56:29.828 France: %REDUNDANCY-3-SWITCHOVER: RP switchover (PEER_DOWN)
*Nov 26 2024 02:56:29.828 France: %REDUNDANCY-3-SWITCHOVER: RP switchover (PEER_REDUNDANCY_STATE_CHANGE)
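
Since the failure is intermittent, it may help to measure how long the stack stayed healthy before each switchover. The following is a minimal Python sketch, not a Cisco tool: it assumes the log lines keep exactly the format pasted above, and the names `parse_line` and `summarize` are hypothetical helpers introduced only for illustration.

```python
import re
from datetime import datetime

# Sample taken from the log excerpt in this post.
SAMPLE_LOG = """\
*Nov 25 2024 16:28:21.341 France: %PLATFORM-6-RF_PROG_SUCCESS: RF state STANDBY HOT
*Nov 26 2024 02:56:29.768 France: %PLATFORM-6-HASTATUS: RP switchover, received chassis event to become active
*Nov 26 2024 02:56:29.827 France: %REDUNDANCY-3-SWITCHOVER: RP switchover (PEER_NOT_PRESENT)
*Nov 26 2024 02:56:29.828 France: %REDUNDANCY-3-SWITCHOVER: RP switchover (PEER_DOWN)
*Nov 26 2024 02:56:29.828 France: %REDUNDANCY-3-SWITCHOVER: RP switchover (PEER_REDUNDANCY_STATE_CHANGE)
"""

# Assumed layout: *<Mon DD YYYY HH:MM:SS.mmm> <zone>: %<FACILITY>-<sev>-<MNEMONIC>: <text>
LOG_RE = re.compile(
    r"^\*(?P<ts>\w{3} +\d{1,2} \d{4} \d{2}:\d{2}:\d{2}\.\d{3}) \S+: "
    r"%(?P<facility>[A-Z_]+)-(?P<sev>\d)-(?P<mnemonic>[A-Z_]+): (?P<msg>.*)$"
)


def parse_line(line):
    """Return (timestamp, facility, mnemonic, message) or None if the line does not match."""
    m = LOG_RE.match(line.strip())
    if m is None:
        return None
    ts = datetime.strptime(m.group("ts"), "%b %d %Y %H:%M:%S.%f")
    return ts, m.group("facility"), m.group("mnemonic"), m.group("msg")


def summarize(lines):
    """Return (time from last STANDBY HOT to first switchover, list of switchover reasons)."""
    standby_hot = None
    first_switchover = None
    reasons = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        ts, facility, mnemonic, msg = parsed
        if mnemonic == "RF_PROG_SUCCESS" and "STANDBY HOT" in msg:
            if first_switchover is None:
                standby_hot = ts
        elif facility == "REDUNDANCY" and mnemonic == "SWITCHOVER":
            if first_switchover is None:
                first_switchover = ts
            reason = re.search(r"\((\w+)\)", msg)
            if reason:
                reasons.append(reason.group(1))
    uptime = None
    if standby_hot is not None and first_switchover is not None:
        uptime = first_switchover - standby_hot
    return uptime, reasons


if __name__ == "__main__":
    uptime, reasons = summarize(SAMPLE_LOG.splitlines())
    print("Healthy for:", uptime)
    print("Switchover reasons:", ", ".join(reasons))
```

Run against saved logs from several incidents, this would show whether the healthy period is as irregular as it looks, which could be useful context for the TAC case.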

1 Accepted Solution


Response from Cisco TAC today: the bug is caused by QSFP-100G-LR4-S transceivers that are not genuine Cisco parts, only Cisco-compatible.

These can work fine for 3 or 4 days, but then the errors described in this conversation appear, with no errors on the interfaces.


8 Replies

marce1000
Hall of Fame

 

  - You may find this useful : https://www.cisco.com/c/en/us/support/docs/switches/catalyst-9500-series-switches/216537-troubleshoot-svl-on-catalyst-9000-switch.html

 M.



-- Each morning when I wake up and look into the mirror I always say ' Why am I so brilliant ? '
    The mirror then always responds to me with ' The only thing that exceeds your brilliance is your beauty! '

Thanks for the link. I have already checked all the link states: all links are up (DAD and VSL), with no CRC errors.

It seems the DAD decision is wrong at some moment: it sees the VSL links as down, but they are up... Nothing in the log events says the VSL links are down. The show tech-support stackwise-virtual output doesn't show anything bad.

 

 

ROGER LE VERGE
Level 1

sw001-C9500#sh stackwise-virtual link
Stackwise Virtual Link(SVL) Information:
----------------------------------------
Flags:
------
Link Status
-----------
U-Up D-Down
Protocol Status
---------------
S-Suspended P-Pending E-Error T-Timeout R-Ready
-----------------------------------------------
Switch  SVL  Ports               Link-Status  Protocol-Status
------  ---  -----               -----------  ---------------
1       1    HundredGigE1/0/51   U            R
             HundredGigE1/0/52   U            R
2       1    HundredGigE2/0/51   U            R
             HundredGigE2/0/52   U            R

 

sw001-C9500#sh stackwise-virtual dual-active-detection
In dual-active recovery mode: No
Recovery Reload: Enabled

Dual-Active-Detection Configuration:
-------------------------------------
Switch  Dad port              Status
------  ------------          ---------
1       TwentyFiveGigE1/0/48  up
2       TwentyFiveGigE2/0/48  up

 

sw001-C9500#sh redun
sw001-C9500#sh redundancy state
my state = 13 -ACTIVE
peer state = 8 -STANDBY HOT
Mode = Duplex
Unit = Primary
Unit ID = 2

Redundancy Mode (Operational) = sso
Redundancy Mode (Configured) = sso
Redundancy State = sso
Maintenance Mode = Disabled
Manual Swact = enabled
Communications = Up

client count = 118
client_notification_TMR = 30000 milliseconds
RF debug mask = 0x0
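
Because everything looks healthy whenever the commands are run by hand, one option is to capture the `show stackwise-virtual link` output periodically and flag any port that is not `U`/`R`. The sketch below covers only the parsing step and is an illustration, not a Cisco tool: it assumes the output keeps the column layout pasted above, `unhealthy_links` is a hypothetical helper name, and how the output is collected (SSH, EEM, etc.) is left out.

```python
import re

# Healthy sample taken from the output pasted above.
SAMPLE_OUTPUT = """\
Switch SVL Ports Link-Status Protocol-Status
------ --- ----- ----------- ---------------
1 1 HundredGigE1/0/51 U R
HundredGigE1/0/52 U R
2 1 HundredGigE2/0/51 U R
HundredGigE2/0/52 U R
"""

# A row ends with: <interface name> <link flag U|D> <protocol flag S|P|E|T|R>
ROW_RE = re.compile(
    r"(?P<port>[A-Za-z]+\d+/\d+/\d+)\s+(?P<link>[UD])\s+(?P<proto>[SPETR])\s*$"
)


def unhealthy_links(output):
    """Return (port, link_status, protocol_status) for every SVL port that is not U/R."""
    bad = []
    for line in output.splitlines():
        m = ROW_RE.search(line)
        if m and (m.group("link"), m.group("proto")) != ("U", "R"):
            bad.append((m.group("port"), m.group("link"), m.group("proto")))
    return bad


if __name__ == "__main__":
    print(unhealthy_links(SAMPLE_OUTPUT))  # healthy sample from this thread
    # Hypothetical failure state (not observed in this thread), for illustration only.
    degraded = SAMPLE_OUTPUT.replace("HundredGigE2/0/52 U R", "HundredGigE2/0/52 D T")
    print(unhealthy_links(degraded))
```

Logging the result with a timestamp on each poll might catch a transient protocol state that never shows up in the interface error counters.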

As of today the bug is not resolved, and there is no solution from Cisco TAC. There are no errors on the VSL links or the DAD link. The troubleshooting guide says all is OK.

I tried the command svl l2bum optimization today.

The bug is still present: the active member still becomes removed. So I configured the C9500 with dual-active recovery-reload-disable; maybe we will see more events.

 


Just for information: the virtual stack worked well for about 6 weeks, but after that the VSL links randomly went down again without errors, and the active switch became removed. After analysing the trace logs, Cisco TAC initiated an RMA for sw001-1.