cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
2258
Views
5
Helpful
4
Replies

Replacing a failed WLC

mumbles202
Level 7
Level 7

Had a vWLC running that had about 25 APs connected to it.  Had a failure of a datastore and didn't have a backup of the unit or the configuration.  I was able to get a new device deployed and configured and able to get APs joined to the controller if I factory reset them, but otherwise I see them in the AP Join as trying to join but they never do.  The new WLC has the same ip as the old and APs and WLC are on the same L2 segment so I thought L2 Discovery would handle this, but I added an option 43 for good measure as well.  If i do a debug on the console I see a ton of DTLS errors; will the new APs try to join another controller if they're unable to join the old controller and but learn controller ip via option 43? 

 

I can see why it would be locked out for security reasons and would force you to go the route of migrating to a new controller ip and changing the Primary WLC for an AP on the old controller; just in a pinch when that controller is offline what is the approach?  telnet/ssh weren't enabled on the APs on the old WLC so it seems like ladder and a reset is the only solution?

4 Replies 4

johnd2310
Level 11
Level 11

Hi,

 

What errors are you seeing on the WLC? is the new controller same software version as the old?

 

Thanks

John

**Please rate posts you find helpful**

Rich R
VIP
VIP

This is clearly documented in the vWLC deployment guide: https://www.cisco.com/c/en/us/td/docs/wireless/technology/mesh/8-2/b_Virtual_Wireless_LAN_Controller_Deployment_Guide_8-2.html?referring_site=RE&pos=1&page=https://www.cisco.com/c/en/us/support/wireless/virtual-wireless-controller/series.html#concept...

Note: When an AP moves from one vWLC to another, it may refuse to join the second vWLC. It occurs when the server hardware fails, or a new instance of vWLCs are created. It is recommended to implement server mirroring scheme at the VMware level such as vMotion or some orchestrator. It is highly recommended to retain a snapshot of the VM instance, one from the mobility domain to which access points have joined previously. Then use the snapshot to start the vWLC instance. Access points then join the vWLC. This method can be also be used for priming access points instead of a physical controller.

More info: https://bst.cloudapps.cisco.com/bugsearch/bug/CSCva69352
https://www.cisco.com/c/en/us/td/docs/wireless/controller/8-4/config-guide/b_cg84/high_availability.html#vwlc-and-nplus1-ha

 

So you should probably get your ladder out ...

------------------------------
Please click Helpful if this post helped you and Accept as Solution (drop down menu at top right of this reply) if this answered your query.
------------------------------
TAC recommended codes for AireOS WLC's   and   TAC recommended codes for 9800 WLC's
Best Practices for AireOS WLC's,   Best Practices for 9800 WLC's   and   Cisco Wireless compatibility matrix
Check your 9800 WLC config with Wireless Config Analyzer using "show tech wireless" output or "config paging disable" then "show run-config" output on AireOS and use Wireless Debug Analyzer to analyze your WLC client debugs
Field Notice: FN63942 APs and WLCs Fail to Create CAPWAP Connections Due to Certificate Expiration
Field Notice: FN72424 Later Versions of WiFi 6 APs Fail to Join WLC - Software Upgrade Required
Field Notice: FN72524 IOS APs stuck in downloading state after 4 Dec 2022 due to Certificate Expired
- Fixed in 8.10.196.0, latest 9800 releases, 8.5.182.12 (8.5.182.13 for 3504) and 8.5.182.109 (IRCM, 8.5.182.111 for 3504)
Field Notice: FN70479 AP Fails to Join or Joins with 1 Radio due to Country Mismatch, RMA needed
Field Notice: FN74383 APs Running 17.12.4/5/6/6a May Run Out of Flash Space Preventing Upgrades
How to avoid boot loop due to corrupted image on Wave 2 and Catalyst 11ax Access Points (CSCvx32806)
Field Notice: FN74035 - Wave2 APs DFS May Not Detect Radar After Channel Availability Check Time
Leo's list of bugs affecting 2800/3800/4800/1560 APs
Default AP console baud rate from 17.12.x is 115200 - introduced by CSCwe88390

Scott Fella
Hall of Fame
Hall of Fame

Just like what rrudling mentioned... now at least you know what you need to do in the future.  Backups, snapshots and enable ssh.  With the newer 9800-CL, you don't have that issue, it is one of the reasons I never like using the vWLC and would rather prefer the appliance.  Heck, you probably can find a 3504 for cheap to replace the vWLC until you refresh your access points.  just keep that in mind.

-Scott
*** Please rate helpful posts ***

jagan.chowdam
VIP Alumni
VIP Alumni

Can you get console access to one of your APs and post the console output once you power cycle the AP.

 

 

 

CJ

 

Review Cisco Networking for a $25 gift card