10-27-2022 12:23 AM - edited 10-28-2022 07:26 AM
I would like to share my experience with wireless issues of 9120 APs in Fabric mode with 9800 WLC.
We had a bad wifi experience and users disconnects & unable to join, etc.
After a long tshooting, I couldn't figure out something but i noticed that sometimes some APs lose their CAPWAP connection to the WLC for a couple of minutes. I saw this shouldn't cause the issue we had because we had kind of outage for 1 or more hours.... so, APs still can forward data traffic, and control tunnel loss of 2 minutes is not a big deal in this situation for some APs.
I decided to upgrade from 17.3.3 to 17.6.4 (which is the recommended version at this time). After the upgrade, things got better, but still we had issues in one building where clients couldn't join to the wifi at all.
On DNAC, i noticed this MAC flapping logs (too many of them for mutiple hosts, switches & ports):
Layer 2 loop symptoms: MAC_FLAPPING
SW_MATM:MACFLAP_NOTIF
28057: switch1: 032846: Oct 26 14:08:23.717: Host xxxx.xxxx.xxxx in vlan 1058 is flapping between port Ac23 and port Gi2/0/7
I checked that and i realized that this port is an AP port, and the AccessTunnel Ac23 belongs to the same AP:
Then checking the MAC addresses on this port, i found many MACs for clients are coming of the AP port, which i found illogical as they should be coming out of the Ac23 tunnel... and this this makes sense with MAC flapping message.
I also noticed that some APs don't build the vxlan tunnel and some lose it and build again.
I did some research about that, and then i found this ugly bug CSCwb68720 with high hit count (Cisco says that it is already resolved in the version that i run now... which is not true... no surprise!)
I then downgraded to 17.3.5b (which is also another MD recommended version at this time), and so far i don't see these MAC flapping logs, and i expect that these issues are gone.
If you find this useful, like and rate!
10-27-2022 12:50 AM
- - Review the 9800 WLC configuration with the CLI command : show tech wireless , have the output analyzed by https://cway.cisco.com/
M.
10-27-2022 12:55 AM
Thanks!... indeed that was one of the initail tshooting steps i did before the upgrade... but it didn't highlight major things related to the issue in my case. But it is definitely useful and important
10-27-2022 02:54 AM
How did you conclude that the AP is causing the disconnection? I would say check the AP logs from WLC side. You may run a Radioactive Trace for AP MAC or IP to get more visibility.
I would suggest that you open 2 cases, one with wireless team for AP's and another with DNAC team for the overlay connectivity. Let TAC collaborate and troubleshoot this issue,
10-27-2022 03:01 AM
If you read the CSCwb68720 you will know that this is a bug happening from the wireless side (AP/WLC) not from the switch side (DNAC). As i mentioned, upgrading the WLC to 17.3.5b fixed the issue, and i don't see any more MAC flapping logs on the DNAC related to AccessTunnel vs AP Port. So, this is indeed the bug. Issue is resolved, and fingers crossed!
Also offcourse i checked the logs... i didn't want to make the post too long mentioning every single detail... just to be inclusive and informative as possible.
If you find is useful... like and rate
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide