Solved: Help to Prefer MPLS route over backup wireless bridge

Daniel Hanson · ‎10-22-2015

We have 2 sites (A and B) that each have their own MPLS connection through BGP. These are redistributed to OSPF, which then connects to the core switch at each respective site. We have recently installed a wireless bridge connecting site A and B that we would like to use as a "backup" connection, should the MPLS at one site fail.

Since we have no additional ports available on our routers, we have to connect the wireless systems to the core switch. Also, the idea is to establish redundancy, so terminating to the same endpoint (the routers) would create a single point of failure. We used a unique SVI (because one of the bridges terminates in a remote switch, Site A - SA1) to route traffic up to the core for each bridge connection. This also jives with our core redundancy at site A, which we'll eventually be utilizing for S2S VPN redundancy through a firewall/internet link, but that's a future problem.

We run OSPF currently between the cores (on the diagram as "SC#") and the routers (RW#). Site A uses OSPF area 0, and site B uses OSPF area 36. We have BGP to the MPLS, with both sites in the same AS.

Here's the rub: We want the primary connection to be the MPLS, and the wireless bridge to act as a backup. Since the bridge's network will currently be shared through OSPF as an inter-area route, it is still preferred over the E2 route learned from MPLS. How do we manipulate the routing to prefer the E2 route?

Things I've looking into, and why I'm concerned about them not really resolving the issue:

1. OSPF virtual-links. The problem here is that there is no intermediary area or router that the sites need to communicate through. Also, this doesn't necessarily resolve our problem, as the route (as I understand it) will still be preferred over the E2 routes from MPLS.

2. OSPF sham-links. My understanding of this functionality is extremely limited, but my belief is that this doesn't apply to our situation since we do BGP to the MPLS, not OSPF. If my understanding of this is incorrect, this might be exactly what we're looking for, I just need clarification.

3. Creating an additional OSPF process to redistribute between the existing OSPF instances, and manipulate the routes as they redistribute. I'm simply not comfortable with this (wouldn't these routes be inter-area routes anyway?) so any clarification on how we could use this would be appreciated. I've yet to find a clear example with any similarity to our issue.

4. Bring BGP down to the core switches and modify AS-PATH or local preference to manipulate routing. This causes additional routing calculations to occur on the core switches, which is not optimal. This would solve the problem, though, with the risk of over-tasking the core CPU's.

Please let me know if I'm overlooking something, or if there is a recommended method for this.

Jon Marshall · ‎10-23-2015

Dan

Okay, did a quick test.

Just so we are using the same setup -

1) I assumed you were not redistributing OSPF into BGP only BGP into OSPF otherwise you would get some very funny results and you would need route filtering

2) the SVI connection - both ends needs to be in the same area for the adjacency to form so I put them both in area 0.

I used loopbacks on the switches to emulate local vlans/IP subnets and I only used one L3 switch at each site.

So I removed the network statement from the OSPF process for the SVI subnet and created a new OSPF process on each switch and then added the SVI subnet to that process.

Then I simply redistributed one way from the current OSPF process to the new one.

It worked fine and actually it preferred the MPLS E2's without any modification of metrics etc.

I then shut the interface of one of the CE routers connecting to the MPLS network and once BGP worked itself out it failed over to the backup link.

Then brought it back up again and it failed back, again reliant on BGP more than anything else.

With each test I pinged between loopbacks on the switches and they worked.

One thing to note was that because I only did one way redistribution the CE routers never get the routes via the backup links which I didn't think would be a problem because it is the switches the traffic hits first and it should be routed from there.

Finally all of this was done using an emulator not real kit but the routing aspects of the lab are pretty reliable.

I can't guarantee the solution and you may need to play around with metrics even though I didn't have to and your setup is probably more complicated than what I used but initial results look promising.

I can save the lab so if there is anything else you want me to test or try other things only too happy to try them out.

Edit - my tests were rather basic so if there are other failover scenarios you want me to test just let me know.

Jon

View solution in original post

Jon Marshall · ‎10-23-2015

Okay, that complicates it but doesn't necessarily mean it won't work.

So when you redistribute into BGP you only redistribute the local subnets making sure any subnets received from the backup link are not then advertised back to the MPLS cloud.

That shouldn't be a problem because I was only redistributing one way ie from the main OSPF process to the new one.

In fact that means that your CE router never sees any of the routes via the backup link (which it does currently) because the CE router is only in the current OSPF process and not the new one.

Not that I am recommending removing the filtering.

You definitely need to filter when you redistribute from the current OSPF process to the new one but it is relatively easy to do.

I'm not sure of the additional overhead of another process compared to extending BGP to the core and I'm not sure at the moment which fits better with the tertiary connection you mentioned.

What I may do over the weekend is get the lab up again and check exactly why it is preferring one path over the other and make sure you can influence it otherwise like I say it is not really a solution for you.

And obviously bear in mind I literally had a switch and router per site and that was it in addition to an MPLS network between the sites so you probably have more complexity in your network.

Basically it's just another option for you and it does seem to work but like I say I was just shutting down an MPLS connection and it was all quite basic so no guarantees.

The emulator is not GNS3 unfortunately which I used to have a while back.

I get access to a training site Cisco have due to my participation on here and it has an emulator you can use.

Jon

View solution in original post

Jon Marshall · ‎10-22-2015

This may not be a solution for you but sometimes the simplest answers are the best.

What about a floating static route on each core switch pointing to the wireless connection.

The static could be a default route or if that is in use you could use a summary route assuming each site can be summarised.

If the MPLS connection goes down the BGP routes are no longer received and redistributed into OSPF so the floating static takes over.

And if the link comes back up the OSPF routes take over again.

I am not a fan of statics everywhere but sometimes they can be quite useful.

Jon

Daniel Hanson · ‎10-23-2015

Hi Jon,

Thank you for your input. I hadn't considered static routes, mostly due to the fact that I have two cores at one of the sites, and we're planning a tertiary backup connection, which would further complicate the static configuration. I'm against this option, but I will add it to the possibilities.

Very Respectfully,

Dan Hanson

Jon Marshall · ‎10-23-2015

Dan

No problem, just a suggestion.

One other thought that I suspect you won't be a fan of :-) is with option 3) you seem to be suggesting you would be happier to run an additional OSPF process on each switch rather than extend BGP to the core.

I don't know whether option 3 would work, would need to lab it up and try out a few things but if the devices are all Cisco you could run EIGRP on the backup link.

If you did a redistribute connected on the core switches then the EIGRP routes would be external ie. AD 170 and so your OSPF routes would be preferred.

If you then added a tertiary backup connection you could run EIGRP on there as well and use delay to manipulate which of the backup links took preference.

Again I should stress I am not a fan of overcomplicating things and it would mean running another routing protocol (if supported) but just thought I'd mention it as another option.

Jon

Daniel Hanson · ‎10-23-2015

Another good option...but I can't deploy EIGRP purely because we want to stay open to additional hardware options. I know EIGRP is now an open standard, but only in a stub configuration. If we end up implementing something with the Cisco proprietary commands, it locks us in to their hardware. That's the reason we run OSPF as our IGP.

Believe me, if it was strictly my choice, I'd go EIGRP (way easier to manipulate than OSPF) but there's a pretty solid reason not to.

Thanks again for another really solid idea, but sadly I can't go that route either.

Jon Marshall · ‎10-23-2015

Again no problem.

With option 3) what were you thinking of ie. creating a new OSPF process on the core switches and redistributing the existing OSPF process into it ?

If you redistribute they should be external routes but I'm not totally convinced it would work.

I do have an MPLS lab scenario setup so I could test this if you provided more details.

As I understand it the wireless connection is simply terminated on each core switch with an SVI in the same IP subnet.

Is that how it is setup ?

So in effect it is just like having a direct L3 P2P connection direct between your core switches ?

Jon

Daniel Hanson · ‎10-23-2015

So here's the theory:

1. Yes, we have SVI's on each core in the same subnet, and the wireless bridge exists in that VLAN.

2. The network between them is not shared by any of the summary-routes that are being used to advertise each particular site, so this network can be isolated into it's own range. for example: if Site A is 10.10.0.0/16 and B is 10.20.0.0/16, the SVI is 192.168.30.0/29.

3. With an additional OSPF instance, my understanding is that it would consider all routes redistributed into this process as external routes, and we could manipulate them accordingly. With an appropriate cost manipulation, you might be able to make the route less preferred over the E2 route learned via the MPLS path. I'm not 100% sure this would work, but it's simply an idea.

I unfortunately don't have a lab to test this, so if you have the time/patience to play with this, your results would be greatly appreciated.

If you need any additional details, please let me know what kind of information you need, and I'll supply anything I can.

Jon Marshall · ‎10-23-2015