- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-07-2025 05:56 AM
Hi!
We are a small science institut having like 35-40 Cisco access points that are connected to the near-by University and their Cisco Wireless Controller (9800-40-A?). However, some of those APs do loose their connection to the WLC or cannot find the WLC reliably. Sometimes they are connected and have clients on them, but the next morning the AP is flashing green/red. Sometimes are power cycle helps, more often it does not. It's not always the same set of APs having these issues.
Previous network setup was a VLAN 300 that was connected directly via the switch and dark fiber to the University. Than the University forced us to remove the VLAN 300, because they wanted to get rid off of that VLAN. So we set up a new VLAN 30 which is now behind our pfSense firewall (yeah, we are low on budget).
We have these types of APs:
- 9120
- 9162
- 9176
We have no access to the controller and the guys at the University are sometimes unresponsive. There are times when 50% of the APs are offline, causing great frustration of the users about the unstable (or: not working) Wifi.
The Firewall allows the necessary ports like 5246-5248, NTP, DNS, mDNS.
I'm running out of ideas and my debugging options are limited as I have no access to the controller. Currently 9 APs are "offline":
Please find the attached log file from console output of a CW9176I AP failing to find the controller. Any help/suggestions are appreciated!
Thanks!
Ingo
Ingo Jürgensmann
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2025 01:16 AM
Nope, old stuff:
Cisco WS-C2960S-48LPS-L and Extreme X440G2-24p-10G4
Ingo Jürgensmann
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2025 12:28 AM
When you had vlan 300 did it work without any issues ? was it after creating the new vlan 30. Can you check the DHCP pool to see enough scope is configured ?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2025 02:43 AM
Regarding prior use of VLAN 300: here the feedback differs whom you'll ask. Manager says that it worked fine before the migration, some users say that it didn't work back then as well. As the VLAN 300 to VLAN 30 migration was at my job start I can't comment much how well it was working before the migration.
Meanwhile I have an estimate from our Account Manager and let me state it this way: having me walking around the house twice a week for 30 mins each to reset the APs is more expensive than ordering a virtual controller.
But until then I still have to deal with the APs and unresponsive admin of the controller... *sigh*
Ingo Jürgensmann
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-09-2025 01:02 AM
There is enough space in the pool. We have like 40 APs and the pool is configured to use the whole /24...
VLAN 30 was created after VLAN 300, but then again: out of 38 APs in that VLAN 30, as of now 29 APs are actually working. 9 APs do not.
Ingo Jürgensmann
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-09-2025 05:19 AM
> There is enough space in the pool. We have like 40 APs and the pool is configured to use the whole /24...
Well it depends what DHCP server you are using ... There is a well known issue with Microsoft DHCP server that the APs will cause the server to mark addresses as "bad" and then the server runs out of usable IPs even though there should be plenty free. Cisco provided a fix for it on AireOS but have decided not to fix it on 9800. See the details at:
https://bst.cisco.com/bugsearch/bug/CSCvj14517
Although this is only known to happen with MS DHCP server it's possible that it could affect other products too.
So rather than assuming the addresses have not run out you should make sure <wink>
Please click Helpful if this post helped you and Select as Solution (drop down menu at top right of this reply) if this answered your query.
------------------------------
TAC recommended codes for AireOS WLC's and TAC recommended codes for 9800 WLC's
Best Practices for AireOS WLC's, Best Practices for 9800 WLC's and Cisco Wireless compatibility matrix
Check your 9800 WLC config with Wireless Config Analyzer using "show tech wireless" output or "config paging disable" then "show run-config" output on AireOS and use Wireless Debug Analyzer to analyze your WLC client debugs
Field Notice: FN63942 APs and WLCs Fail to Create CAPWAP Connections Due to Certificate Expiration
Field Notice: FN72424 Later Versions of WiFi 6 APs Fail to Join WLC - Software Upgrade Required
Field Notice: FN72524 IOS APs stuck in downloading state after 4 Dec 2022 due to Certificate Expired
- Fixed in 8.10.196.0, latest 9800 releases, 8.5.182.12 (8.5.182.13 for 3504) and 8.5.182.109 (IRCM, 8.5.182.111 for 3504)
Field Notice: FN70479 AP Fails to Join or Joins with 1 Radio due to Country Mismatch, RMA needed
How to avoid boot loop due to corrupted image on Wave 2 and Catalyst 11ax Access Points (CSCvx32806)
Field Notice: FN74035 - Wave2 APs DFS May Not Detect Radar After Channel Availability Check Time
Leo's list of bugs affecting 2800/3800/4800/1560 APs
Default AP console baud rate from 17.12.x is 115200 - introduced by CSCwe88390
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2025 02:05 PM
Well your troubleshooting options are limited without access to the controller!
From the log it's clear that the AP is not getting any response to the discovery requests - presume those 4 WLC IPs it's trying are correct?
So either:
- the discovery requests do not reach the WLC
- the WLC does not respond to the requests
- the requests or responses are getting dropped by the network
Ideally you'd be starting with a packet capture on the WLC to make sure the requests are received and the WLC replies then you'd at least know whether the problem is the WLC or network.
What is the connectivity between you and the WLC? We did once have a problem with a customer using a 3rd party network service which had integrated IDS/IPS (which we only found out about after they had a similar complaint). It turned out the IDS/IPS was detecting the CAPWAP (UDP 5246/5247) as an "attack" and dropping the packets occasionally. They had to ask the network provider to exclude the CAPWAP traffic on the IDS/IPS - so check whether your "VLAN 300" has anything like that. If you see ping to the WLC working while CAPWAP isn't that would also point to something like this.
Otherwise you're really going to need the WLC owner to work with you ...
Please click Helpful if this post helped you and Select as Solution (drop down menu at top right of this reply) if this answered your query.
------------------------------
TAC recommended codes for AireOS WLC's and TAC recommended codes for 9800 WLC's
Best Practices for AireOS WLC's, Best Practices for 9800 WLC's and Cisco Wireless compatibility matrix
Check your 9800 WLC config with Wireless Config Analyzer using "show tech wireless" output or "config paging disable" then "show run-config" output on AireOS and use Wireless Debug Analyzer to analyze your WLC client debugs
Field Notice: FN63942 APs and WLCs Fail to Create CAPWAP Connections Due to Certificate Expiration
Field Notice: FN72424 Later Versions of WiFi 6 APs Fail to Join WLC - Software Upgrade Required
Field Notice: FN72524 IOS APs stuck in downloading state after 4 Dec 2022 due to Certificate Expired
- Fixed in 8.10.196.0, latest 9800 releases, 8.5.182.12 (8.5.182.13 for 3504) and 8.5.182.109 (IRCM, 8.5.182.111 for 3504)
Field Notice: FN70479 AP Fails to Join or Joins with 1 Radio due to Country Mismatch, RMA needed
How to avoid boot loop due to corrupted image on Wave 2 and Catalyst 11ax Access Points (CSCvx32806)
Field Notice: FN74035 - Wave2 APs DFS May Not Detect Radar After Channel Availability Check Time
Leo's list of bugs affecting 2800/3800/4800/1560 APs
Default AP console baud rate from 17.12.x is 115200 - introduced by CSCwe88390
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-09-2025 07:26 AM
Hi Rich!
Well, 2 of the 4 IPs are not valid and are actively blocked in our firewall. According to the admin of the controller, these are IPs for another site location of theirs and not reachable from out network, but he can't get some configured out for us. I can only accept this as a given fact.
According the IPS/IDS idea: yeah, we had this idea as well. Or more exactly the idea that something is blocking or discarding packets, because the APs are behind NAT and something like WLC or another external Firewall is blocking or rate limiting our connections. But regarding our IDS/IPS: neither the IPs nor the ports can be found in the IDS logs...
In the meanwhile the WLC admin wrote an email today that he yesterday updated the second controller to a current version. The controllers were first on an older version that rejected the 9176 APs and later one controller got updated and now the second one as well.
We'll see if this update brings improvements...
Ingo Jürgensmann
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-18-2025 04:34 PM
"he can't get some configured out for us." = doesn't know how to or is too lazy!
But doesn't matter - the AP will always try all the WLCs it knows about so as long as it has valid addresses that's ok.
Definitely best to have all the WLCs on the right code version!
Please click Helpful if this post helped you and Select as Solution (drop down menu at top right of this reply) if this answered your query.
------------------------------
TAC recommended codes for AireOS WLC's and TAC recommended codes for 9800 WLC's
Best Practices for AireOS WLC's, Best Practices for 9800 WLC's and Cisco Wireless compatibility matrix
Check your 9800 WLC config with Wireless Config Analyzer using "show tech wireless" output or "config paging disable" then "show run-config" output on AireOS and use Wireless Debug Analyzer to analyze your WLC client debugs
Field Notice: FN63942 APs and WLCs Fail to Create CAPWAP Connections Due to Certificate Expiration
Field Notice: FN72424 Later Versions of WiFi 6 APs Fail to Join WLC - Software Upgrade Required
Field Notice: FN72524 IOS APs stuck in downloading state after 4 Dec 2022 due to Certificate Expired
- Fixed in 8.10.196.0, latest 9800 releases, 8.5.182.12 (8.5.182.13 for 3504) and 8.5.182.109 (IRCM, 8.5.182.111 for 3504)
Field Notice: FN70479 AP Fails to Join or Joins with 1 Radio due to Country Mismatch, RMA needed
How to avoid boot loop due to corrupted image on Wave 2 and Catalyst 11ax Access Points (CSCvx32806)
Field Notice: FN74035 - Wave2 APs DFS May Not Detect Radar After Channel Availability Check Time
Leo's list of bugs affecting 2800/3800/4800/1560 APs
Default AP console baud rate from 17.12.x is 115200 - introduced by CSCwe88390
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-18-2025 06:30 PM
What firmware is the controller on?
Please check if the disconnected APs would still have valid IP addresses or not.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-21-2025 05:09 AM
Can't really tell, but one of the controller has a firmware on that supports the 9176 APs, the other has an older version that gets rejected by the APs then.
Ingo Jürgensmann
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-21-2025 05:14 AM
FYI, it seems that we solved our issue with the APs not being able to join the controller.
Not really sure what did the trick but we saw some "Single:No_Traffic" states in pfsense Firewall. This leads to the assumption that NAT is not correctly working and that one way is working (to the controller), but the other way is not working (from controller to AP).
I've then added another NAT rule in front of our general rule of doing NAT for all 10.0.0.0/8 network to non-RFC1918 networks (i.e. the Internet), but when adding that other rule for 10.10.30.0/24 to the specific destination network, the AP in question suddenly worked.
Why this did the trick for the missing 4 APs while it worked without that additional rule for all the other 34 APs: absolutely no idea! Maybe the Elders of the Internet will know, though...
Anyway, thanks to all who replied and tried to help! Very much appreciated!
Ingo Jürgensmann

- « Previous
-
- 1
- 2
- Next »