Traffic Shaping design and advice

Hello Experts @Leo Laohoo  @Rob Ingram  @balaji.bandi 

I have SD-WAN implemented and am working on designing a traffic shaping policy.

There are two parts. The first is the WAN policy from the underlay for Internet access, which covers applications like Teams, Zoom, SIP/VoIP, and other UDP-based traffic. Do we need any policy for Outlook or anything else?

The second part is the overlay for internal applications, such as phones accessing internal phone servers, particular applications accessing internal servers, etc.

Please provide suggestions on how to design and implement this, and the key things to include.

How to leverage new/more applications in the future without disturbing the existing setup.

Thanks

 

 

5 Replies

Joseph W. Doherty
Hall of Fame

As you didn't mention me as an expert (even if I am a QoS expert legend in my own mind - laugh), I hope you don't mind if I inject my 2 bits' worth.

Firstly, it's unclear what/how you see yourself using traffic shaping. If you have something in mind, such as shaping some traffic to preserve bandwidth for other traffic, that can be done, but as shaping limits bandwidth all the time, I prefer prioritizing bandwidth based on application needs.

The only time I usually shape traffic is when I'm dealing with some QoS control point that has more immediate (e.g. port) bandwidth than I know is available further along the path (e.g. a CIR less than the port bandwidth). Such a shaper creates an artificial bandwidth restriction where I can apply QoS effectively for the path's bandwidth.
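For example, a minimal sketch of that kind of parent shaper with a child queuing policy (the 50 Mbps rate, interface, and policy names here are purely illustrative assumptions, not anything from your setup):

policy-map ShapeToCIR
 class class-default
  ! shape to the known downstream (e.g. CIR) rate, in bps
  shape average 50000000
  ! apply the actual queuing policy within the shaped rate
  service-policy GenericModel
!
interface GigabitEthernet0/0
 service-policy output ShapeToCIR

(GenericModel being a queuing policy such as the one further below.)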

You mention Internet access.  Of course, the Internet doesn't generally honor/support QoS bandwidth management.

Downstream QoS bandwidth management (like receiving traffic from other Internet sites you have no control over) can be somewhat effective, but generally requires a "special" device like the (now defunct?) Packeteer appliances.

Site-to-site QoS bandwidth management across the Internet can be (almost) as effective as having a "private" WAN.  However, much like a private WAN, you need to be able to fully manage traffic between your Internet connected sites.

For example, between two of your sites, connected via Internet, you should be (mostly) able to deliver necessary performance for Zoom, VoIP, etc.

Between one of your Internet-connected sites and another Internet-connected site over which you have no bandwidth management control, service levels are often practically impossible to provide without a "special" appliance, and even with a "special" appliance you cannot guarantee service levels at the same level as when you control both Internet ends.  (Again, as the Internet doesn't guarantee any service level, you cannot reach the same guarantee level as with a private WAN.)

"How to leverage new/more applications in future without disturbing the existing setup."

That's best accomplished, I believe, by using a very generic QoS model, where you only need to correctly direct traffic to the "right" class and ensure you have the bandwidth actually needed to support your classes.

My generic model is:

policy-map GenericModel
 class real-time
  ! e.g. VoIP bearer, video conferencing (apps with "known" bandwidth usage limits)
  priority percent 35
 class Hi-priority
  ! e.g. VoIP signaling (apps with "known" bandwidth usage limits)
  bandwidth remaining percent 81
  fair-queue
 class Low-priority
  ! e.g. scavenger, bulk data replication (e.g. database copies, email [server-to-server] transfers)
  bandwidth remaining percent 1
  fair-queue
 class class-default
  ! e.g. most all traffic, i.e. none of the above
  bandwidth remaining percent 9
  fair-queue
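To actually use such a model, you also need class-maps that direct traffic into those classes, plus the policy attached to an interface. A minimal sketch (the DSCP markings and interface name are illustrative assumptions; you might instead match on ACLs, NBAR protocols, etc.):

class-map match-any real-time
 match dscp ef
class-map match-any Hi-priority
 match dscp cs3
class-map match-any Low-priority
 match dscp cs1
!
interface GigabitEthernet0/0
 service-policy output GenericModel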

I've posted the above model many times over the years. What I may not have mentioned is that, when I used such a model, I would adopt techniques to try to determine what level traffic should be in while disregarding the "classical" method, i.e. put this kind of traffic here and that kind of traffic there.

For example, suppose you have both HTTP and FTP traffic. "Classically", as HTTP is likely interactive, it's "better" than FTP traffic for bandwidth prioritization. But consider: if an HTTP session is downloading some 100 MB app update file vs. an FTP session downloading a readme.txt file containing 100 characters, which is really "better", especially if the FTP session is being used interactively?

Or, consider two users using telnet, both interacting with a Cisco router connected to the Internet. One user is doing "typical" console interaction; the other has just turned off screen pagination and is listing the whole Internet routing table (while logging their telnet session output). Should these two users get exactly the same service level guarantees? (Personally, I believe they should not.)

So, I believe a dynamically driven service policy, based on bandwidth usage, avoids many problems of how to treat different applications, both now, and into the future.  (One of the reasons I really, really like FQ/WFQ.)

It's also somewhat nice when you start dealing with crypto traffic.  I cannot see "what" your traffic is, but I can "see" how demanding you are for bandwidth.

To be clear, being dynamic doesn't totally eliminate using application type and/or ToS tags, but it does, I've found, simplify policy management. E.g., if desired, knowing that one app is telnet and the other FTP, and also knowing that the telnet flow is moving lots and lots of data while the FTP flow is moving very little, I might prioritize this particular FTP flow over the telnet flow.

For me you are the best in QoS.  

Well thank you @MHM Cisco World.  That's very kind.

Actually, it's unlikely I'm the "best" in QoS, but having worked QoS extensively in a good-sized international WAN, I believe I figured out a few things which don't seem to be well presented in typical QoS materials.

Also, actually, much of my (personal) underlying QoS "theory" is plagiarized, not from QoS materials, but from computer resource management, especially sharing CPUs and/or disk drives. In some ways, apps sharing bandwidth is like apps sharing a CPU. Yes, often some apps are prioritized over other apps, but how do (more-or-less) equal-priority apps share a CPU?

Way, way back, FIFO was used for some CPU sharing, but you got very unpredictable results for any particular app instance (much like the "typical" FIFO sharing of network bandwidth still used today).

For years now (decades, actually), CPU sharing has often been much more involved than simple FIFO and/or simple PQ, yet, again, what's the default queuing for most interfaces? FIFO. Or, yeah, we can PQ VoIP, but beyond that, we now have 12-class models and/or optionally 3 priority drop levels; but if your "pipe" has more than 70% utilization, how do your users perceive network performance?

Something I don't recall posting on these forums is an argument I would make for QoS: when someone works with a 1 KB file across a network versus working with a 50 MB file across the network, they expect the former to take less time than the latter. And it does, assuming two users aren't doing both things concurrently. But if two users are doing both things concurrently, sometimes the 50 MB user's interaction will "crush" the 1 KB user's interaction. This is the kind of situation QoS can preclude.
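To put rough (purely illustrative) numbers on that, assuming a 10 Mbps link with 1 MB of the bulk transfer already sitting in a FIFO queue: 1 MB is 8 Mb, so the 1 KB user's packet waits about 0.8 seconds just for the queue ahead of it to drain. With per-flow fair queuing, the small flow's packets are interleaved with the bulk flow's, so the 1 KB (8 Kb) transfer completes in a few milliseconds even while the 50 MB transfer continues.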

BTW, years ago, there were studies where predictability and/or consistency were very important to users using a system.  I.e. a consistent 5 second response was better than a 1 second response most of the time, but with occasional 30 second responses.  I.e. if you had to, you would artificially slow the 1 second response to 5 seconds so as to guarantee no 30 second responses.

I can see the logic of this, but personally, I would question why there are 30 second responses and work to mitigate them. To me, if most responses could still be 1 second, with an occasional 3 second response, I think that's better than all responses being 5 seconds.

Laugh, but I also believe and say computer systems should wait on people, people should not wait on computer systems.

Something I believe I've posted before: in the business where I had implemented QoS, the WAN lead considered QoS just voodoo. Well, until the day one WAN core router rebooted itself overnight and dropped its QoS configuration. The WAN lead later said that, the morning he came in, his telephone was lit up like a Christmas tree with remote sites complaining: what's wrong with the network?

He found the WAN router that had rebooted, discovered its missing QoS statements, reapplied them, and the telephone calls stopped! (BTW, this was before we were running VoIP or video conferencing apps - we did add them a few years later, no big deal.)

I also believe I've posted this before: this international company was split into 3 semi-autonomous regions. Our region (the only one with QoS) didn't have remote users constantly complaining about network performance. Further, the two other regions were constantly upgrading WAN bandwidths, which "surprisingly" (to them) didn't seem to stop the continuous network performance complaints within those two regions.

I did try to convince the two other regions to try QoS, but they felt it wasn't needed, as (again) at that time we weren't supporting any real-time traffic like VoIP, and "we know" you only really need to consider QoS when supporting VoIP, etc.

(Oh, yeah, and when we did start to support VoIP and video conferencing, those other two regions did implement QoS. Using what I had worked up over 5 years? No, let's copy examples from Cisco [?], that should be what we need. [Cisco, at that time, I believe was still using their Olympic model. Well, LLQ/PQ for real-time traffic was good, although I believe their users still complained about their other network app performance, but it wasn't any/much worse, so not really a problem.] They did allow our region to continue to use our own QoS policy.)

Sorry, just realized I'm on my QoS soapbox.  Why stop now . . .

For years, "we need" FE, gig, 5gig, 10gig, 25gig, 40gig, 100gig, and ???, but do we need QoS?

"Well maybe for VoIP and/or video conferencing".

Oh, and if you do use QoS, here's our 12-class model where you provide 11% bandwidth for this class, and 2% bandwidth for that class, and use RED on this class beginning drops at 112 packets with 100% drops at 838 packets if in the 2nd drop probability class, but for this class . . .  (Laugh, and I wonder why QoS is considered complex and/or voodoo.)
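For readers who haven't seen that kind of tuning, here's a sketch of what such per-class WRED looks like (the class name and thresholds simply mirror the numbers above and are illustrative only, not a recommendation):

policy-map TwelveClassExample
 class Bulk-Data
  bandwidth percent 11
  random-detect
  ! precedence 2 traffic: random drops begin at a queue depth of 112 packets; all packets drop above 838
  random-detect precedence 2 112 838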

Anyway, possibly no surprise, but I believe QoS is very much underappreciated for what it can do (and also much undersupported by current technology).

Hello Joseph 


@Joseph W. Doherty wrote:

which don't seem to be well presented in typical QoS materials.

Oh, and if you do use QoS, here's our 12-class model where you provide 11% bandwidth for this class, and 2% bandwidth for that class, and use RED on this class beginning drops at 112 packets with 100% drops at 838 packets if in the 2nd drop probability class, but for this class . .



So true... I was reading a QoS tech note last night and I nearly fell asleep, it was so ambiguous.

That same query I posted not so long back, which you assisted with, I also sent to our Cisco TAC - the first ever time for a QoS query - and unbelievably, to my surprise, they weren't very helpful. It was as if it were a subject they didn't want to get mithered with, but I knew if I came on here there would be a high probability I'd at least receive some good advice I could work with.

So I agree with @MHM Cisco World - your knowledge of the subject seems second to none, and we are glad you support these forums with your expertise, mate.


Please rate and mark as an accepted solution if you have found any of the information provided useful.
This then could assist others on these forums to find a valuable answer and broadens the community’s global network.

Kind Regards
Paul

@paul driver, thank you too for your kind words.

Yeah, come to reflect on it, I've usually found TAC ineffective in providing help or information about QoS. I suspect they just don't have either the information or the knowledge at hand about QoS. (BTW, I don't doubt that, somewhere in the bowels of Cisco, there are some real QoS experts; just not in the TAC.)

Much of my knowledge about the actual workings of Cisco QoS has come from "black box" observations, or from QoS nuggets buried in Tech Notes, whitepapers, Cisco Live presentations, or sometimes info about new release features documenting why the "new" is better than the "old".

For example, early on, while working out actually useful Cisco QoS, I wasn't getting the results I expected/desired. I found a Tech Note concerning tuning tx-ring-limit on ATM interfaces (this one, I recall: Understanding and Tuning the tx-ring-limit Value) (NB: I wasn't using ATM interfaces, though). The note described how you might need to adjust (down) the tx-ring-limit size, since the tx-ring is an interface-level FIFO queue sitting after your software QoS queues. I reduced my interfaces' tx-ring sizes, and QoS started to deliver the expected/desired results!
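For reference, the adjustment itself is tiny; a sketch in line with that Tech Note (the interface, PVC, and value are illustrative, and where the command is accepted - under the interface or, for ATM, under the PVC - varies by platform):

interface ATM0/0
 pvc 1/32
  ! shrink the interface's hardware FIFO so the software QoS queues do the sorting
  tx-ring-limit 3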

(Why was this documented for ATM? My guess: ATM did have a design aim of carrying things like voice, i.e. service-level management was possibly more of a real concern there.)

I could go on and on, but likely have sidetracked this thread already too much.

Apologies to the OP, and, again, if you can further express how you see yourself using traffic shaping, I might be able to provide more specific recommendations.
