cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
1501
Views
1
Helpful
5
Replies

CSCvz55484 AAA server Down

We had the problem from 16 of mars on wlc 9800-40 17.12.4 and after upgrade to 17.12.5

Could it be continue of the issue and not really fixed?

Platform State from WNCD (0) : current DEAD, duration 35s, previous duration 70s

     Platform State from WNCD (1) : current DEAD, duration 6s, previous duration 146s

     Platform State from WNCD (2) : current UP, duration 36s, previous duration 120s

     Platform State from WNCD (3) : current DEAD, duration 93s, previous duration 83s

     Platform State from WNCD (4) : current UP, duration 205s, previous duration 120s

     Platform State from WNCD (5) : current UP, duration 317128s, previous duration 0s

     Platform State from WNCD (6) : current UP, duration 317128s, previous duration 0s

     Platform State from WNCD (7) : current UP, duration 317128s, previous duration 0s

     WNCD Platform Dead: total time 544734137s, count 4681UP

5 Replies 5

Mark Elsen
Hall of Fame
Hall of Fame

 

 - Upgrade to one of the Known Fixed Releases mentioned in the bug report. If done already go a 'step higher'. If not feasible or possible contact TAC,

 M.



-- Let everything happen to you  
       Beauty and terror
      Just keep going    
       No feeling is final
Reiner Maria Rilke (1899)

Saikat Nandy
Cisco Employee
Cisco Employee

This is not a straightforward thing to troubleshoot but you can narrow it down to some extent. You need to check - 
1. During the problem state are you seeing all the WNCDs going 'DEAD' ? From your logs doesn't looks like can you can validate this behaviour few more times.
2. Lets consider you have 2 servers, ISE1 (For SSID 1)& ISE2(For SSID 2). During problem state, are you seeing WNCDs going down for both ISE1 and ISE2 or just a specific ISE? 'show aaa servers' will give you this data.
3. Do you have any dead-criteria and deadtime configured?

Quite unlikely you are hitting CSCvz55484, because pretty much the entire 17.12.x train has the fix of this defect. Also if you look into the 'How to Identify:' section in the defect, number 4 might be your scenario. If you have accounting enabled, you can try disabling that and give it a shot.

In reality this issue comes when your WLC is not getting any response back from the AAA server...it could be somewhere in between WLC & AAA server the traffic is getting dropped. Also seen this issue with ISE if there is significant amount of load on it. Pcap on both WLC and AAA side in sync will give you a picture about the communication between these two devices.

Do you mean disable accounting-interim? og accounting fra AAA?

Accounting as a whole.

Rich R
VIP
VIP

Agreed with @Saikat Nandy I don't think you're hitting CSCvz55484 - I have not seen it at all on 17.12.

But there is a common problem with the radius timeouts which is already covered in the Best Practice guidance - have you read and followed that @orlando-g-suarez ?
https://www.cisco.com/c/en/us/td/docs/wireless/controller/9800/technical-reference/c9800-best-practices.html#RADIUSServerTimeout

The BU are planning to change the config defaults in a future release to address this.

------------------------------
Please click Helpful if this post helped you and Accept as Solution (drop down menu at top right of this reply) if this answered your query.
------------------------------
TAC recommended codes for AireOS WLC's   and   TAC recommended codes for 9800 WLC's
Best Practices for AireOS WLC's,   Best Practices for 9800 WLC's   and   Cisco Wireless compatibility matrix
Check your 9800 WLC config with Wireless Config Analyzer using "show tech wireless" output or "config paging disable" then "show run-config" output on AireOS and use Wireless Debug Analyzer to analyze your WLC client debugs
Field Notice: FN63942 APs and WLCs Fail to Create CAPWAP Connections Due to Certificate Expiration
Field Notice: FN72424 Later Versions of WiFi 6 APs Fail to Join WLC - Software Upgrade Required
Field Notice: FN72524 IOS APs stuck in downloading state after 4 Dec 2022 due to Certificate Expired
- Fixed in 8.10.196.0, latest 9800 releases, 8.5.182.12 (8.5.182.13 for 3504) and 8.5.182.109 (IRCM, 8.5.182.111 for 3504)
Field Notice: FN70479 AP Fails to Join or Joins with 1 Radio due to Country Mismatch, RMA needed
Field Notice: FN74383 APs Running 17.12.4/5/6/6a May Run Out of Flash Space Preventing Upgrades
How to avoid boot loop due to corrupted image on Wave 2 and Catalyst 11ax Access Points (CSCvx32806)
Field Notice: FN74035 - Wave2 APs DFS May Not Detect Radar After Channel Availability Check Time
Leo's list of bugs affecting 2800/3800/4800/1560 APs
Default AP console baud rate from 17.12.x is 115200 - introduced by CSCwe88390
Review Cisco Networking for a $25 gift card