12-12-2011 06:57 AM - edited 03-07-2019 03:51 AM
Hi,
We have a failover pair of loadbalancers (non-Cisco) which are connected to each other via Catalyst 6509Es with Sup720 supervisor cards. Failover is achieved by the newly active loadbalancer GARPing all its service IP addresses with the relevant MAC address in order to update nearby ARP tables (failover GARPs are fired out by the loadbalancers at a rate of 200 per second). Failing over services between these loadbalancers has been found to be problematic, with numerous services not failing over in a timely manner.
Some of the loadbalanced networks involved are routed on the Sup720s, the rest are routed on FWSM modules in the same chassis. Problems occur only with VLANs routed on the Sup720s; all VLANs routed on the FWSMs failover without issue.
Investigation has shown that this is due to a proportion of the ARP table entries not being updated in the Sup720 and (with the default 4 hour ARP table timeout) subsequently requiring a manual flush of the "wrong" ARP entries.
Testing by throwing GARPs at both the FWSM and the Sup720 has revealed the following:
We're not rate-limiting ARP anywhere in the Sup720 - hardware or otherwise - and the FWSM handles the rate of GARP without issue. Is there any built-in restriction on the Sup720 we're not aware of that would cause the observed behaviour, is it configurable, or can anyone suggest what else could be causing the observed behaviour?
Thanks
12-14-2011 12:09 PM
Hi,
Will you please post outputs of these commands:
"show policy-map control-plane",
"show mls qos protocol",
"show mls rate-limit".
Thanks & Regards,
Antonin
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide