01-28-2018 01:52 AM - edited 03-05-2019 09:50 AM
01-28-2018 05:32 AM
Hello Richard,
Without knowing your configuration: what happens if you disable ARP caching on the overlay interfaces (no otv suppress arp-nd)?
01-28-2018 12:14 PM - edited 01-28-2018 12:43 PM
Hi Georg
I have now attached the three CSR configs to the original post. I have tried with and without ARP caching enabled, not to fix this issue but just to see ARP caching in action. My understanding is that ARP caching simply caches the remote ARP responses locally to reduce ARP traffic across the overlay. Is that correct?
Can you see anything missing from the configs that would prevent the documented TCN from being generated?
I have pasted an extract from Cisco on the TCN generation.
Source
If multihoming is used, it is highly recommended to enable spanning tree on the OTV routers. Doing so enables the OTV router to send out a topology change notification (TCN), which will cause the adjacent Layer 2 switch device (along with other switches in the spanning tree) to reduce their aging timer from the default setting to 15 seconds. This will greatly speed convergence when there is a failure or recovery between the multihomed pair.
Thanks
Rick
01-28-2018 01:18 PM
Hello,
The configs look OK as far as I can tell (except for the MTU of 1600; I guess you have a specific reason for that?).
Can you verify which switch is the root for VLAN 99 and VLAN 100 (show spanning-tree vlan 99 and show spanning-tree vlan 100)?
01-28-2018 02:14 PM - edited 01-28-2018 02:15 PM
I increased the WAN MTU because of the encapsulation overhead this feature adds. I could have used a lower value, but 1600 covers it. The spanning tree root for VLANs 99-100 is the 'customer' switch connected to CSR1 at Site 1.
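For anyone checking the MTU arithmetic, here is a minimal sketch. It assumes the commonly cited figure of 42 bytes of OTV encapsulation overhead over IPv4 transport and a 1500-byte MTU on the extended VLANs. Neither value comes from the configs in this thread, and the function names are only illustrative.

```python
# Assumed values, not taken from the configs attached to this thread.
OTV_OVERHEAD_BYTES = 42   # commonly cited OTV encapsulation overhead (IPv4 transport)
SITE_MTU_BYTES = 1500     # assumed MTU on the extended VLANs


def min_transport_mtu(site_mtu=SITE_MTU_BYTES, overhead=OTV_OVERHEAD_BYTES):
    """Smallest transport (WAN) MTU that carries a full-size site frame unfragmented."""
    return site_mtu + overhead


def headroom(configured_mtu, site_mtu=SITE_MTU_BYTES, overhead=OTV_OVERHEAD_BYTES):
    """Bytes left over on the transport after the encapsulation overhead is added."""
    return configured_mtu - min_transport_mtu(site_mtu, overhead)


if __name__ == "__main__":
    print("minimum transport MTU:", min_transport_mtu())
    print("headroom with MTU 1600:", headroom(1600))
```

Under those assumptions, a WAN MTU of 1600 leaves some margin over the minimum, which matches the "could have used a lower value" comment above.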
01-29-2018 12:08 AM - edited 01-29-2018 05:33 AM
01-29-2018 12:33 AM
Hi Guys
I think I have found the reason for the behavior in my lab. I have the 'silent host' issue, which happens in labs but generally doesn't happen in live networks. For my host devices I used Cisco routers with an IP address on a single interface, and all these devices were doing was a ping and an ARP. In a production network these hosts would be workstations and servers and would be a lot more chatty, generating broadcast traffic. When I drop the CSR1 Site 1 WAN overlay, the remote Cisco host does not generate any new broadcast traffic. New broadcast traffic would flood from the CSR1 at Site 2 across the overlay and eventually into the 'customer' Layer 2 at Site 1, which would update the MAC address tables there.
So in summary, in a production network the hosts would generate enough broadcast traffic to keep failover connectivity issues to a minimum. In a lab with silent hosts, you will have to wait 5 minutes for the 'customer' Layer 2 MAC address table to age out before connectivity is restored.
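The timing difference can be put into a rough model. This is a simplified sketch only: it uses the 5-minute default aging time and the 15-second post-TCN aging time mentioned in this thread, and it assumes that a stale MAC entry is corrected either when it ages out or when the remote host next sends a broadcast, whichever comes first. The function name and parameters are made up for illustration.

```python
DEFAULT_MAC_AGING_S = 300  # 5-minute default aging described above
TCN_MAC_AGING_S = 15       # shortened aging after a TCN, per the Cisco extract


def worst_case_outage_s(tcn_received, host_broadcast_interval_s=None):
    """Rough upper bound on how long a stale MAC entry blackholes traffic.

    host_broadcast_interval_s is None for a silent host (the lab case).
    """
    aging = TCN_MAC_AGING_S if tcn_received else DEFAULT_MAC_AGING_S
    if host_broadcast_interval_s is None:
        # Silent host: nothing relearns the MAC, so the entry must age out.
        return aging
    # Chatty host: the next broadcast relearns the MAC if it arrives first.
    return min(aging, host_broadcast_interval_s)


if __name__ == "__main__":
    print("silent host, no TCN:", worst_case_outage_s(False))
    print("silent host, TCN sent:", worst_case_outage_s(True))
    print("host broadcasting every 30 s, no TCN:", worst_case_outage_s(False, 30))
```

With a silent host and no TCN the model gives the full 5 minutes seen in the lab, while either a TCN or a chatty host brings the bound down to seconds.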
I still don't fully understand why the OTV host doesn't generate a TCN as documented, so if anyone could get an answer on that, it would be great.
For now I am happy to design OTV into my customer solution.
Thanks
Rick
01-13-2021 03:25 PM
Richard, just wondering which CSR image version you used in EVE-NG. I've tried the same configuration and cannot get the adjacency up, even though the site is up at each site. Both CSRs display "No, overlay DIS not elected".
Thank you