10-24-2021 11:11 PM - edited 10-24-2021 11:28 PM
Hello Team,
I have an HA pair of C9800-40-K9 controllers in SSO. The units are connected to the primary and secondary core switches, each on a 40G fiber uplink. When I test failover, it takes forever to switch from active to standby. As a workaround, we were able to fix the problem by shutting down one 10G interface of the four fiber uplinks. Then failover happened instantly. The controller is now connected at 30G instead of 40G. The core switches are WS-C3850-24S models.
I have a few more sites with WS-C3850-24S core switches. Are there any known bugs with the same issue that I need to take into consideration? Please advise if there is a workaround to fix the issue.
Thanks,
10-24-2021 11:26 PM - edited 10-24-2021 11:33 PM
Hi,
Please explain how you ran the failover test. Was it a device power-off or some other method?
Use the guide below to verify that everything is designed correctly.
Please rate this and mark it as the answer if it resolved your concern.
10-24-2021 11:47 PM
The SSO redundancy setup has no problem. However, the failover from the primary to the secondary unit is taking forever. FYI, the secondary unit is connected to the secondary L3 switch. The only temporary fix is either bouncing the secondary L3 switch uplink interface or shutting down one of the 10G ports in the 40G bundle on the L3 switch.
10-26-2021 12:39 PM
The default gateway check feature is not available on 9800 WLCs in releases prior to 17.X.X. So if you are running any 16.X.X release, it is expected that you have to bounce one of the upstream ports to induce the failover immediately.
If you are running 17.X.X on the 9800, it is possible that you did not configure the RMI interface. The RMI interface performs the default gateway check. Also check the peer timeout values and retries. Even with the default gateway check enabled, failover may take up to 10 seconds when the gateway is unreachable.
Please refer to the document below for more info.
Also post a rough diagram of how your devices are connected to the upstream switches, along with how you conducted the failover test. Knowing this would allow us to give you accurate suggestions.
Not relevant, but the WS-C3850-24S has 1G ports in the fixed chassis. Are you using a 10G NIM module to connect your WLC?
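To make the two cases above easier to check against a list of sites, here is a minimal Python sketch of the same reasoning. It only restates what I wrote above and is not an official Cisco decision table. The function name `expected_failover_behavior` is hypothetical, and it assumes the release is given as a dotted string such as "17.3.4".

```python
def expected_failover_behavior(release: str, rmi_configured: bool) -> str:
    """Restate the reply above as a lookup (illustrative only).

    release: dotted IOS-XE release string, e.g. "16.12.4" (assumed format).
    rmi_configured: whether the RMI interface is configured on the 9800.
    """
    # Only the major train matters for this rule of thumb: 16.x vs 17.x.
    major = int(release.strip().split(".")[0])

    if major < 17:
        # Per the reply: no default gateway check before 17.X.X.
        return "no gateway check: bounce an upstream port to force failover"
    if not rmi_configured:
        # Per the reply: the RMI interface performs the gateway check.
        return "gateway check needs RMI: configure the RMI interface"
    # Per the reply: even then, failover may take up to 10 seconds.
    return "gateway check active: failover may still take up to 10 seconds"


if __name__ == "__main__":
    for release, rmi in [("16.12.4", False), ("17.3.4", False), ("17.3.4", True)]:
        print(release, rmi, "->", expected_failover_behavior(release, rmi))
```

The release strings in the loop are just sample inputs, not a statement about what the controller in this thread is running.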
10-24-2021 11:30 PM
What firmware is the switch running on?
10-24-2021 11:37 PM
03.06.06E
10-24-2021 11:43 PM - edited 10-24-2021 11:44 PM
I've heard of that bug, but I don't remember the details.
Read the 3.6.6 Release Notes. It is listed there.
(Plus it takes one to two hours to upgrade the IOS-XE of the 3850-12XS/24XS, CSCvq53573.)