
vWLC 9800 - AP 3800/3700 flaps between WLC in HA

Hi,

There are a few topics similar to this one, but none of the solutions worked for me. I have two WLC 9800s in Azure with HA between them (without SSO). APs on one of them flap randomly and, importantly, not all APs at a site flap at the same time. The second WLC is configured the same way (policies, tags, etc.) and everything is fine there. From the AP logs it looks like the WLC stops responding; the AP then drops CAPWAP, connects to the secondary configured for HA, sees in its configuration that a primary is configured, and connects back.

I checked Wireless Config Analyzer, and for CAPWAP it reports this issue: CAPWAP: Invalid configured controller IP. Action: The IP address configured in the AP controller list does not match the address of the current controller. Possible invalid configuration that should be corrected. It is not related to HA or SSO. I checked the N+1 config and it is the same on both WLCs (the secondary works perfectly fine, no drops).

Log from AP:

Aug 2 11:32:29 kernel: [*08/02/2023 11:31:01.3164] Re-Tx Count=1, Max Re-Tx Value=3, SendSeqNum=71, NumofPendingMsgs=2
Aug 2 13:34:27 kernel: [*08/02/2023 13:34:27.6955]
Aug 2 13:34:31 kernel: [*08/02/2023 13:34:31.4967] Re-Tx Count=2, Max Re-Tx Value=3, SendSeqNum=72, NumofPendingMsgs=3
Aug 2 13:34:31 kernel: [*08/02/2023 13:34:31.4967]
Aug 2 13:34:35 kernel: [*08/02/2023 13:34:35.2978] Re-Tx Count=3, Max Re-Tx Value=3, SendSeqNum=72, NumofPendingMsgs=3
Aug 2 13:34:35 kernel: [*08/02/2023 13:34:35.2978]
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0990] Max retransmission count exceeded, going back to DISCOVER mode.
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0990] Dropping msg CAPWAP_WTP_EVENT_REQUEST, type = 34, len = 487, eleLen = 495, sendSeqNum = 73
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0990] ...Vendor SubType: AP_CDP_CACHE_PAYLOAD(24) len: 483 vendId 409600
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0990] Dropping msg CAPWAP_WTP_EVENT_REQUEST, type = 21, len = 16, eleLen = 24, sendSeqNum = 73
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0991] ...Vendor SubType: HA_FAST_HEARTBEAT_REQUEST_PAYLOAD(60) len: 12 vendId 409600
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0991] Dropping msg CAPWAP_WTP_EVENT_REQUEST, type = 21, len = 16, eleLen = 24, sendSeqNum = 73
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0991] ...Vendor SubType: HA_FAST_HEARTBEAT_REQUEST_PAYLOAD(60) len: 12 vendId 409600
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0991] Dropping msg CAPWAP_WTP_EVENT_REQUEST, type = 21, len = 16, eleLen = 24, sendSeqNum = 73
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0991] ...Vendor SubType: HA_FAST_HEARTBEAT_REQUEST_PAYLOAD(60) len: 12 vendId 409600
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0998] Flexconnect Switching to Standalone Mode!
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.1730] GOING BACK TO DISCOVER MODE
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.1959] OOBImageDnld: OOBImageDownloadTimer expired for image download..
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.1960] OOBImageDnld: Do common error handler for OOB image download..
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.2281]
Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.2281] CAPWAP State: DTLS Teardown
Aug 2 13:34:40 NCI: CLEANAIR: Slot 1 CAPWAP down
Aug 2 13:34:40 NCI: I1: shutdownNci
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.2771] OOBImageDnld: Do common error handler for OOB image download..
Aug 2 13:34:40 upgrade: Script called with args:[CANCEL]
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.3722] status 'upgrade.sh: Script called with args:[CANCEL]'
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.4304] do CANCEL, part2 is active part
Aug 2 13:34:40 upgrade: Cleanup tmp files ...
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.4463] status 'upgrade.sh: Cleanup tmp files ...'
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.4818] Dropping dtls packet since session is not established. Peer 10.208.136.196-5246, Local 10.216.4.27-5256, conn (nil)
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.4819] Invalid event 59 & state 4 combination.
Aug 2 13:34:40 kernel: [*08/02/2023 13:34:40.4819] Failed to handle timer message.
Aug 2 13:34:45 kernel: [*08/02/2023 13:34:45.0138] OOBImageDnld: OOBImageDownloadTimer expired for image download..
Aug 2 13:34:45 kernel: [*08/02/2023 13:34:45.0138] OOBImageDnld: Do common error handler for OOB image download..
Aug 2 13:34:45 kernel: [*08/02/2023 13:34:45.0351] dtls_queue_first: Nothing to extract!
Aug 2 13:34:45 kernel: [*08/02/2023 13:34:45.0351]
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.0532] systemd[1]: Starting dhcpv6 client watcher...
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.0933] systemd[1]: Stopping DHCPv6 client...
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.1103] systemd[1]: Starting DHCPv6 client...
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.1614] Discovery Response from 10.208.136.196
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.1637] systemd[1]: Started DHCPv6 client.
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.1989] systemd[1]: Started dhcpv6 client watcher.
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2681] Discovery Response from 10.192.136.196
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2853] Started wait dtls timer (60 sec)
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2952]
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2952] CAPWAP State: DTLS Setup
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2968] Invalid event 2 & state 3 combination.
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2968] CAPWAP SM handler: Failed to process message type 2 state 3.
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2968] Failed to handle capwap control message from controller - status 1
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2968] Failed to process unencrypted capwap packet 0x55944000 from 10.192.136.196
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.2969] Failed to send capwap message 0 to the state machine. Packet already freed.
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.6632] First connect to vWLC, accept vWLC by default
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.6632]
Aug 2 13:34:55 kernel: [*08/02/2023 13:34:55.6791] dtls_verify_server_cert: vWLC is using SSC, returning 1
Aug 2 13:34:56 kernel: [*08/02/2023 13:34:56.5188]
Aug 2 13:34:56 kernel: [*08/02/2023 13:34:56.5189] CAPWAP State: Join
Aug 2 13:34:56 kernel: [*08/02/2023 13:34:56.6208] OOBImageDnld: OOB Image Download in ap_cap_bitmask(2)
Aug 2 13:34:56 kernel: [*08/02/2023 13:34:56.6210] Sending Join request to 10.192.136.196 through port 5256, packet size 1376
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.3538] OOBImageDnld: OOB Image Download in ap_cap_bitmask(2)
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.3540] Sending Join request to 10.192.136.196 through port 5256, packet size 1376
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.5396] Join Response from 10.192.136.196, packet size 1397
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.5396] AC accepted previous sent request with result code: 0
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.5397] Received wlcType 0, timer 30
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.6857]
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.6857] CAPWAP State: Image Data
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.6863] AP image version 17.9.3.50 backup 8.10.183.0, Controller 17.9.3.50
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.6864] Version is the same, do not need update.
Aug 2 13:35:01 upgrade: Script called with args:[NO_UPGRADE]
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.7328] status 'upgrade.sh: Script called with args:[NO_UPGRADE]'
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.7933] do NO_UPGRADE, part2 is active part
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.8090]
Aug 2 13:35:01 kernel: [*08/02/2023 13:35:01.8090] CAPWAP State: Configure
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.1389]
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.1389] CAPWAP State: Run
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.1847] AP has joined controller AZJPE1VWLC01
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.1855] Flexconnect Switching to Connected Mode!
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.2341] wifi2 no private ioctls.
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.2342]
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.2573] wifi2 no private ioctls.
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.2573]
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.2790] wifi2 no private ioctls.
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.2791]
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.3009] wifi2 no private ioctls.
Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.3010]
Aug 2 13:35:06 syslog: Check lagloadbalance setting flex_mode 1 cfg 0 linkstate 0 ap_type 52
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8168] Previous AP mode is 2, change to 2
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8578] Current session mode: ssh, Configured: Telnet-No, SSH-No, Console-No
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8578]
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8579] Current session mode: telnet, Configured: Telnet-No, SSH-No, Console-No
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8579]
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8580] Current session mode: console, Configured: Telnet-No, SSH-No, Console-No
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8580]
Aug 2 13:35:06 syslog: password for user changed
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.8954] chpasswd: password for user changed
Aug 2 13:35:06 syslog: password for user changed
Aug 2 13:35:06 kernel: [*08/02/2023 13:35:06.9432] chpasswd: password for user changed
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.0286] systemd[1]: Starting Cisco syslogd watcher...
Aug 2 13:35:07 syslogd exiting
Aug 2 13:35:07 syslogd started: BusyBox v1.32.1
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.1137] systemd[1]: Started Cisco syslog service.
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.1318] systemd[1]: Started Cisco syslogd watcher.
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.2087]
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.2087] Same LSC mode, no action needed
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.5912] systemd[1]: Starting ntp file watcher...
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.5917] CLSM[00:00:00:00:00:00]: U3 Client RSSI Stats feature is deprecated; can no longer be enabled
Aug 2 13:35:07 syslog: NTP: Wed Aug 2 13:35:07 2023 :Can not create ntp process log file.
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.6217] Update NTP source to WLC
Aug 2 13:35:07 kernel: [*08/02/2023 13:35:07.6439] systemd[1]: Started ntp file watcher.
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.0634] wifi2 no private ioctls.
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.0634]
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.0864] wifi2 no private ioctls.
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.0864]
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.1109] wifi2 no private ioctls.
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.1109]
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.1354] wifi2 no private ioctls.
Aug 2 13:35:09 kernel: [*08/02/2023 13:35:09.1354]
Aug 2 13:35:10 kernel: [*08/02/2023 13:35:10.2598] systemd[1]: Starting Lighttpd Watcher...
Aug 2 13:35:10 kernel: [*08/02/2023 13:35:10.3155] systemd[1]: Started Lighttpd Watcher.
Aug 2 13:35:10 kernel: [*08/02/2023 13:35:10.7824] Got WSA Server config TLVs
Aug 2 13:35:11 kernel: [*08/02/2023 13:35:11.5182] SD AVC only supports 802.11ax AP
Aug 2 13:35:12 kernel: [*08/02/2023 13:35:12.3390] systemd[1]: Starting nbar watcher...
Aug 2 13:35:12 kernel: [*08/02/2023 13:35:12.4047] systemd[1]: Started nbar watcher.
Aug 2 13:35:12 kernel: [*08/02/2023 13:35:12.5223] systemd[1]: Starting nbar watcher...
Aug 2 13:35:12 kernel: [*08/02/2023 13:35:12.5584] systemd[1]: Started nbar watcher.
Aug 2 13:35:13 kernel: [*08/02/2023 13:35:13.7052] systemd[1]: Starting nbar watcher...
Aug 2 13:35:13 kernel: [*08/02/2023 13:35:13.7484] systemd[1]: Started nbar watcher.
Aug 2 13:35:13 kernel: [*08/02/2023 13:35:13.7581] systemd[1]: Starting nbar watcher...
Aug 2 13:35:13 kernel: [*08/02/2023 13:35:13.8057] systemd[1]: Started nbar watcher.
Aug 2 13:35:14 kernel: [*08/02/2023 13:35:14.6330] systemd[1]: Starting nbar watcher...
Aug 2 13:35:14 kernel: [*08/02/2023 13:35:14.6646] systemd[1]: Started nbar watcher.
Aug 2 13:35:14 kernel: [*08/02/2023 13:35:14.8121] systemd[1]: Starting nbar watcher...
Aug 2 13:35:14 kernel: [*08/02/2023 13:35:14.8419] systemd[1]: Started nbar watcher.
Aug 2 13:35:16 kernel: [*08/02/2023 13:35:16.5429] AP tag change to Perth-DIA
Aug 2 13:35:19 kernel: [*08/02/2023 13:35:19.4265] In write handler 'add_cache_param_via_tlv' for 'pmk_tracker :: PMKTracker':
Aug 2 13:35:19 kernel: [*08/02/2023 13:35:19.4266] not a valid cache_type in add_cache_param_via_tlv
Aug 2 13:35:33 kernel: [*08/02/2023 13:35:33.3082] set cleanair [slot0][band0] enabled
Aug 2 13:35:33 kernel: [*08/02/2023 13:35:33.4861] set cleanair [slot1][band1] enabled
Aug 2 13:35:33 mrvlfwd: radio 1: cfg loopback only mode 0
Aug 2 13:35:35 NCI: I1: SensorApp=824d9257
Aug 2 13:35:35 NCI: I1: SensorHdw=1.2.300
Aug 2 13:35:35 NCI: I1: Hardware Radio Band = [4890, 5935] MHz, BW=150625, band=1
Aug 2 13:35:35 NCI: slot=1 mode=2 chanCnt=2 cw=3
Aug 2 13:35:35 NCI: Squashed Channel List:
Aug 2 13:35:35 NCI: chans: 108 112
Aug 2 13:35:35 NCI: cf(MHz): 5540 5560
Aug 2 13:35:35 NCI: I1: channel map channels: in=2 cloned=2
Aug 2 13:35:35 NCI: I1: Requesting MonBand [5550, 5550] bw=40MHz/0 ant=0xbc
Aug 2 13:35:35 NCI: I1: Monitoring (cf=5550, span=40), RadioUsage=3%
Aug 2 13:35:35 NCI: I1: dwell=20000us, update=1000ms, resBW=156249
Aug 2 13:35:35 NCI: CLEANAIR: Slot 1 enabled
Aug 2 13:35:35 kernel: [*08/02/2023 13:35:35.7712] systemd[1]: Starting Lighttpd Watcher...
Aug 2 13:35:35 kernel: [*08/02/2023 13:35:35.8171] systemd[1]: Started Lighttpd Watcher.
Aug 2 13:36:55 kernel: [*08/02/2023 13:36:55.6613] FOUND CONFIGURED WLC (Primary) REDISCOVER TO CONNECT WITH THAT.
Aug 2 13:36:55 kernel: [*08/02/2023 13:36:55.6620] Flexconnect Switching to Standalone Mode!
Aug 2 13:36:55 kernel: [*08/02/2023 13:36:55.7648] OOBImageDnld: OOBImageDownloadTimer expired for image download..
Aug 2 13:36:55 kernel: [*08/02/2023 13:36:55.7648] OOBImageDnld: Do common error handler for OOB image download..
Aug 2 13:36:55 kernel: [*08/02/2023 13:36:55.7988]
Aug 2 13:36:55 kernel: [*08/02/2023 13:36:55.7988] CAPWAP State: DTLS Teardown
Aug 2 13:36:56 NCI: CLEANAIR: Slot 1 CAPWAP down
Aug 2 13:36:56 NCI: I1: shutdownNci
Aug 2 13:36:56 kernel: [*08/02/2023 13:36:56.8499] OOBImageDnld: Do common error handler for OOB image download..
Aug 2 13:36:56 upgrade: Script called with args:[CANCEL]
Aug 2 13:36:56 kernel: [*08/02/2023 13:36:56.9448] status 'upgrade.sh: Script called with args:[CANCEL]'
Aug 2 13:36:57 kernel: [*08/02/2023 13:36:57.0069] do CANCEL, part2 is active part
Aug 2 13:36:57 upgrade: Cleanup tmp files ...
Aug 2 13:36:57 kernel: [*08/02/2023 13:36:57.0238] status 'upgrade.sh: Cleanup tmp files ...'
Aug 2 13:36:57 kernel: [*08/02/2023 13:36:57.0618] Dropping dtls packet since session is not established. Peer 10.192.136.196-5246, Local 10.216.4.27-5256, conn (nil)
Aug 2 13:36:57 kernel: [*08/02/2023 13:36:57.0619] Invalid event 59 & state 4 combination.
Aug 2 13:36:57 kernel: [*08/02/2023 13:36:57.0619] Failed to handle timer message.
Aug 2 13:37:01 kernel: [*08/02/2023 13:37:01.4872] OOBImageDnld: OOBImageDownloadTimer expired for image download..
Aug 2 13:37:01 kernel: [*08/02/2023 13:37:01.4872] OOBImageDnld: Do common error handler for OOB image download..
Aug 2 13:37:01 kernel: [*08/02/2023 13:37:01.5100] dtls_queue_first: Nothing to extract!
Aug 2 13:37:01 kernel: [*08/02/2023 13:37:01.5100]
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.5267] systemd[1]: Starting dhcpv6 client watcher...
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.5648] systemd[1]: Stopping DHCPv6 client...
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.5861] systemd[1]: Starting DHCPv6 client...
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.6109] Discovery Response from 10.208.136.196
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.6141] Discovery Response from 10.208.136.196
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.6420] systemd[1]: Started DHCPv6 client.
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.6794] systemd[1]: Started dhcpv6 client watcher.
Aug 2 13:37:11 kernel: [*08/02/2023 13:37:11.6820] Discovery Response from 10.192.136.196
Aug 2 13:37:20 kernel: [*08/02/2023 13:37:20.8604] Started wait dtls timer (60 sec)
Aug 2 13:37:20 kernel: [*08/02/2023 13:37:20.8702]
Aug 2 13:37:20 kernel: [*08/02/2023 13:37:20.8703] CAPWAP State: DTLS Setup
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.0120] First connect to vWLC, accept vWLC by default
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.0120]
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.0146] dtls_verify_server_cert: vWLC is using SSC, returning 1
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.7251]
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.7252] CAPWAP State: Join
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.8266] OOBImageDnld: OOB Image Download in ap_cap_bitmask(2)
Aug 2 13:37:21 kernel: [*08/02/2023 13:37:21.8269] Sending Join request to 10.208.136.196 through port 5256, packet size 1376
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.5722] OOBImageDnld: OOB Image Download in ap_cap_bitmask(2)
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.5724] Sending Join request to 10.208.136.196 through port 5256, packet size 1376
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.6385] Join Response from 10.208.136.196, packet size 1397
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.6386] AC accepted previous sent request with result code: 0
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.6386] Received wlcType 0, timer 30
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.7889]
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.7889] CAPWAP State: Image Data
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.8115] AP image version 17.9.3.50 backup 8.10.183.0, Controller 17.9.3.50
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.8116] Version is the same, do not need update.
Aug 2 13:37:26 upgrade: Script called with args:[NO_UPGRADE]
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.8553] status 'upgrade.sh: Script called with args:[NO_UPGRADE]'
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.9141] do NO_UPGRADE, part2 is active part
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.9298]
Aug 2 13:37:26 kernel: [*08/02/2023 13:37:26.9298] CAPWAP State: Configure
Aug 2 13:37:27 kernel: [*08/02/2023 13:37:27.2401] systemd[1]: Starting Lighttpd Watcher...
Aug 2 13:37:27 kernel: [*08/02/2023 13:37:27.3024] systemd[1]: Started Lighttpd Watcher.
Aug 2 13:37:28 kernel: [*08/02/2023 13:37:28.9595]
Aug 2 13:37:28 kernel: [*08/02/2023 13:37:28.9595] CAPWAP State: Run
Aug 2 13:37:29 kernel: [*08/02/2023 13:37:29.0126] AP has joined controller AZAUE1VWLC01
Aug 2 13:37:29 kernel: [*08/02/2023 13:37:29.0133] Flexconnect Switching to Connected Mode!
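
For anyone comparing several APs, the log above can be reduced to a short timeline of retransmissions, give-ups, CAPWAP state changes, and joins. The following is a minimal sketch, assuming the line format shown in this post; the function names `parse_ap_log` and `retx_gaps` are made up for illustration and are not part of any Cisco tool.

```python
import re
from datetime import datetime

# Bracketed AP timestamp followed by the message, e.g.
# "[*08/02/2023 13:34:31.4967] Re-Tx Count=2, Max Re-Tx Value=3, ..."
LINE_RE = re.compile(r"\[\*(\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2}\.\d+)\]\s*(.*)")
RETX_RE = re.compile(r"Re-Tx Count=(\d+), Max Re-Tx Value=(\d+)")
STATE_RE = re.compile(r"CAPWAP State: (.+)")
JOIN_RE = re.compile(r"AP has joined controller (\S+)")


def parse_ap_log(lines):
    """Return a list of (timestamp, kind, detail) tuples for the interesting lines."""
    events = []
    for line in lines:
        m = LINE_RE.search(line)
        if not m:
            continue  # lines without the bracketed timestamp are skipped
        ts = datetime.strptime(m.group(1), "%m/%d/%Y %H:%M:%S.%f")
        msg = m.group(2).strip()
        if (r := RETX_RE.search(msg)):
            events.append((ts, "retx", (int(r.group(1)), int(r.group(2)))))
        elif "Max retransmission count exceeded" in msg:
            events.append((ts, "giveup", None))
        elif (s := STATE_RE.search(msg)):
            events.append((ts, "state", s.group(1).strip()))
        elif (j := JOIN_RE.search(msg)):
            events.append((ts, "joined", j.group(1)))
    return events


def retx_gaps(events):
    """Seconds between consecutive retransmission events."""
    times = [ts for ts, kind, _ in events if kind == "retx"]
    return [(b - a).total_seconds() for a, b in zip(times, times[1:])]


if __name__ == "__main__":
    sample = [
        "Aug 2 13:34:31 kernel: [*08/02/2023 13:34:31.4967] Re-Tx Count=2, Max Re-Tx Value=3, SendSeqNum=72, NumofPendingMsgs=3",
        "Aug 2 13:34:35 kernel: [*08/02/2023 13:34:35.2978] Re-Tx Count=3, Max Re-Tx Value=3, SendSeqNum=72, NumofPendingMsgs=3",
        "Aug 2 13:34:39 kernel: [*08/02/2023 13:34:39.0990] Max retransmission count exceeded, going back to DISCOVER mode.",
        "Aug 2 13:35:04 kernel: [*08/02/2023 13:35:04.1847] AP has joined controller AZJPE1VWLC01",
    ]
    for ts, kind, detail in parse_ap_log(sample):
        print(ts.time(), kind, detail)
    print("gaps between retransmits (s):", retx_gaps(sample and parse_ap_log(sample)))
```

Running this over logs from a flapping AP and a stable AP should make it easier to see whether the "Max Re-Tx Value" the AP reports matches what is configured in the join profile, and how far apart the retransmissions actually are.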

 

Accepted Solution

I've changed retransmission count to max (8) and interval to max (5) - all good now. Still trying to find out the cause.


Hi @przemyslaw.wieczorek 

I think a good question here is why the AP drops from the primary WLC. The flap is a consequence of this drop.

Considering this log "Max retransmission count exceeded, going back to DISCOVER mode." I would try to increase the CAPWAP timers.

https://www.cisco.com/c/dam/en/us/td/docs/wireless/controller/9800/17-4/deployment-guide/c9800-n-plus-1-high-availability-wp.pdf

 

I have already done that. What I don't fully get is why the AP tries only 3 times with a 3-second gap, since I have configured the retransmit timers under HA / CAPWAP in the Join Profile to Count 5 and Interval 3. I have now set the maximum values (count 8 / interval 5). I will probably know tomorrow how this works, but as I wrote, shouldn't the APs try 5 times and not 3?
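
To put the three settings mentioned here side by side, this is a rough back-of-the-envelope sketch. It assumes the AP gives up after roughly count × interval seconds without a response; that formula is an assumption for illustration, not documented behaviour, and the log above actually shows about 3.8 s between retransmissions rather than exactly 3 s. The helper name `silence_budget` is hypothetical.

```python
def silence_budget(count: int, interval_s: float) -> float:
    """Approximate seconds of controller silence tolerated before the AP
    returns to DISCOVER.

    ASSUMPTION: the budget is simply count * interval. The real AP timing
    may differ (the log in this thread shows ~3.8 s gaps, not 3 s).
    """
    return count * interval_s


settings = [
    ("seen in AP log (Max Re-Tx Value=3)", 3, 3),
    ("configured in join profile", 5, 3),
    ("maximum values", 8, 5),
]

for label, count, interval in settings:
    print(f"{label}: count={count}, interval={interval}s -> ~{silence_budget(count, interval):.0f}s")
```

Under that assumption, the maximum values let the AP ride out a much longer gap in controller responses, which would hide the drops without explaining why the controller stops answering.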

Sure. That's what I would expect too.

I've changed retransmission count to max (8) and interval to max (5) - all good now. Still trying to find out the cause.

It is probably packet loss or delay, more likely delay.

I would check CPU and run iperf

I checked it - all fine.

Rich R

What version of software?
Have you checked CPU loading?
How many APs on the WLC?

I added the WLC to DNAC and Prime, so I checked the CPU there: all good. There are 150 APs and the version is 17.9.3 (to support 3700/3800 APs). So far so good, no AP has flapped since.

The APs are flapping again. Not as many as before, but the problem still exists.

I had a TAC case for this. There is no best practice for retransmissions, and according to TAC I should leave it as it is. We couldn't find the root cause, so for now the case is closed.
