cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
577
Views
5
Helpful
4
Replies

Output drops over wireless bridge

danletkeman
Level 1
Level 1

Hello,

I'm seeing a high number of output drops when we are running backups from site to site over a wireless bridge. Our setup is a follows:

backup server - 3560G - 3560 - 70mbit wireless bridge - 3560G - san

When the backups are running and sending data to the san at the remote site, the output drops show up on the port connected to the wireless bridge and don't stop until the backups are complete or fail. From what I have read this is because the wireless bridge is slower than the 100mbit or 1gbit ports on the 3560's and becomes a bottleneck. That is fine, but is there a way to decrease or eliminate the output drops using qos?

Thanks,

Dan.

4 Replies 4

Giuseppe Larosa
Hall of Fame
Hall of Fame

Hello Dan,

generally speaking QoS has features like traffic shaping that can help in your scenario.

What can be done depends from QoS features of used devices.

C3560 as C3750 uses a queueing scheme called SRR that comes in two flavors: shaped and shared.

Shaped SRR should work for you because it allows you put a shaper.

shape to a rate lower then wireless bridge effective rate.

see

http://www.cisco.com/en/US/docs/switches/lan/catalyst3560/software/release/12.2_44_se/configuration/guide/swqos.html#wp1200681

Hope to help

Giuseppe

Ok, so from what I have read so far I would need to apply a shape command on the interface towards the wireless bridge. eg:

srr-queue bandwidth shape 10 2 20 20

The wireless bridge is capable of 70mbit/second actual throughput.

From what i understand the 4 numbers represent the available queue's and the fraction of the interface bandwidth

So with 10 2 20 20 on a 100mbit interface queue 1 would get 10mbit, queue 2 would get 50mbit, queue 3 would get 5 mbit and queue 4 would get 5 mbit.

From the tests I have done it seems to use queue 2 first, and so when i run a bandwidth test i only get 50mbit now. At what point will it use the other queues?

Dan.

Hello Dan,

traffic is placed on the different queues depending on markings (DSCP byte).

if in your tests you send traffic with no markings or with the same DSCP value you are using only one queue.

in shaped mode the following applies:

>> In shaped mode, the egress queues are guaranteed a percentage of the bandwidth, and they are rate-limited to that amount. Shaped traffic does not use more than the allocated bandwidth even if the link is idle. Shaping provides a more even flow of traffic over time and reduces the peaks and valleys of bursty traffic. With shaping, the absolute value of each weight is used to compute the bandwidth available for the queues.

so you can get 50 Mbps on that queue and no more.

I may be wrong but I think this is what is needed in your scenario, may be using different ratios to go nearer to 70 Mbps.

Hope to help

Giuseppe

Joseph W. Doherty
Hall of Fame
Hall of Fame

From what you describe, you likely have several possible issues.

As you note, the wireless bridge is 70 Mbps, so assuming the 3560 connection to it is 100 Mbps, what happens to excess bandwidth utilizition? There's a good chance it's dropped on the bridge, where you might not see the drops. This is the likely the smallest bottleneck, but you may have another. (Does the bridge have any stats where it can show drops?)

Since you show 3560Gs on both ends, and mention gig, where gig slows to 100 Mbps you likely have drops. (Are these the drops you're noticing?)

Besides the bottlenecks, i.e. 70 and 100 Mbps vs. gig, what protocol does the SAN device use for its backups? If it uses TCP, TCP, usually by design, detects available bandwidth by ramping up bandwidth utilization until there's drops. I.e., Some drops, for a high bandwidth demanding TCP flow, is quite normal and can't be easily eliminated unless you have a device that can spoof RWIN between devices.

If the SAN device is using some "special" protocol for its backups, difficult to predict how the protocol will work unless the vendor documents it.

With the above background, what might you do?

First, assuming the wireless bridge lacks any effective QoS, we want to keep from sending it more traffic than it can handle. This so we can manage congestion on the 3560s, again assuming it's not possible on the bridge. This could be accomplished to setting the 3560s interfaces connected to the bridge not to exceed 70 Mbps. (E.g. on a 100 Mbps configured interface, "srr-queue bandwidth limit 70". NB: These values are not exact because the hardware adjusts the line rate in increments of six. I.e., you might need to set slower than 70 to insure 70 not exceeded.)

Second, you'll want to configure QoS, such that the backup traffic is identified (hopefully this can be done vs. "normal" SAN traffic). Using two (of the four) egress queues, place all routine traffic in one queue and and backups in another. Use SRR in shared mode and provide the backup queue minimal or little bandwidth and other traffic maximum or most bandwidth. (This will allow backups to use as much as possible available bandwidth, but cause little adverse impact to other traffic.)

With most bandwidth reserved for non-backups, that traffic, assuming there's sufficient bandwidth, should see almost not drops, however backups likely will.

Buffers should be sized for BDP. If they are, there's not much more that can be done on a 3560 to reduce drops for bandwidth probing traffic.

Review Cisco Networking for a $25 gift card