03-20-2015 05:48 PM - edited 03-07-2019 11:11 PM
Okay, so yesterday we noticed a lot of churn on the network in regards to STP. There were mass TCN's for multiple VLAN's since Brocade also uses PVST. After reviewing the effect of TC's on the network, I realize it doesn't necessarily mean a total STP re-convergence event, but changes STP only for affected direct/indirect links participating in the spanning tree. However, I am focusing on insignificant topology changes which are clients either powering off or powering on their machines. Now, we should be running RSTP exclusively in our network, but I've come across multiple devices either configured for 802.1d or for no STP at all. This mix probably changes the behavior of TC and makes it less predictable to know what is going on.
Solved! Go to Solution.
03-22-2015 01:30 PM
Mike
And if I am understanding your response correctly, then because the non-edge ports are constantly being flushed, it is even flushing entries for the root bridge?
Potentially yes, as all non edge ports, except the one the TCN was received on are flushed.
RSTP cannot maintain any entries for non edge ports because it can't know the topology hasn't changed and the entries it has could now be pointing the wrong way ie. to a path that has failed.
The loss of BPDUs from other switches could be more to do with the flooding but you won't know that until you at least get portfast configured.
TCNs are not in themselves a bad thing but an excessive amount together with all entries, including those for end devices because portfast has not been configured, being flushed can result in a lot of flooded traffic.
You may well have other issues in your network and by all means use this forum for help but like I say if you can eliminate unnecessary TCNs and flooding if you do still have an issue it will be easier to narrow down.
Jon
03-21-2015 07:38 AM
Mike
With RSTP portfast is a double hit.
Not only does an end device generate a TCN when the port goes up or down but as you say a switch running RSTP then flushes it's mac address table.
But it only flushes the entries for non edge ports. But without portfast it doesn't know the end device ports are edge ports.
So the first thing you need to do is enable portfast on all end devices, or "portfast trunk" if any end device is trunking.
Without it, if enough clients are connecting and disconnecting, your network can be in an almost permanent state of topology change and this causes a lot of flooding.
I would do that as a priority and then see what other symptoms you are still seeing.
Jon
03-21-2015 04:38 PM
Jon,
I appreciate the response. I'll definitely start marking client ports as edge ports to cut down on the topology changes. And if I am understanding your response correctly, then because the non-edge ports are constantly being flushed, it is even flushing entries for the root bridge? I thought RSTP would maintain active entries in the CAM even during a TCN event. Would that include bridge to bridge communication? If it is a complete flush, then that would explain why a switch looses its root bridge and what log messages I can capture, the events coincide (TCN/Root Bridge lost). And it does find its RBID within a second or so. Yeah, that might be the case. I'll be back in the office tomorrow and able to see what is going on and getting a better grasp of what is/is not configured correctly. Thanks again for your help!
v/r
Mike
03-22-2015 01:30 PM
Mike
And if I am understanding your response correctly, then because the non-edge ports are constantly being flushed, it is even flushing entries for the root bridge?
Potentially yes, as all non edge ports, except the one the TCN was received on are flushed.
RSTP cannot maintain any entries for non edge ports because it can't know the topology hasn't changed and the entries it has could now be pointing the wrong way ie. to a path that has failed.
The loss of BPDUs from other switches could be more to do with the flooding but you won't know that until you at least get portfast configured.
TCNs are not in themselves a bad thing but an excessive amount together with all entries, including those for end devices because portfast has not been configured, being flushed can result in a lot of flooded traffic.
You may well have other issues in your network and by all means use this forum for help but like I say if you can eliminate unnecessary TCNs and flooding if you do still have an issue it will be easier to narrow down.
Jon
03-22-2015 02:05 PM
Jon,
I really appreciate you clarifying the underlying mechanisms for me. It's one thing to read about it and another thing to see it in the wild. Between STP best practices and the STP toolkit I am sure I can resolve the numerous issues we're having and if I have anymore questions about the technology at work I'll be sure to hit the forum. Thanks again.
v/r
Mike
03-22-2015 03:12 PM
Mike
No problem and please do come back if needed.
One thing I should have answered from your questions but didn't directly was the question of the mac address of the root switch.
The mac address that is important in the root switch election is the one contained in the BPDU not the source mac address of the BPDU. The source mac address is simply that of the port that transmitted the BDPU.
If a switch flushes it's mac address table it would remove that mac address but that would make no difference as to whether the switch believed it had lost it's path to root or not.
In terms of switch to switch communication BPDUs are sent with a multicast destination mac address so removing that mac address has no effect on BPDUs being exchanged.
So the fact that you are seeing the switch reporting it has lost it's path to root is not a direct consequence of the mac address being flushed because it doesn't need that to send and receive BPDUs.
However with all the flooding of end to end devices because of the flushing an indirect consequence may be that BPDUs are getting lost.
Apologies for not making that clearer.
Jon
03-22-2015 08:40 PM
Jon,
This really does help me out a lot, at least being able to understand the processes that are going on behind the scenes. I think, like you suggested, if I begin fixing small issues and moving STP to a stable and predictable topology a lot of these issues will disappear. Thanks again, and if something makes my head scratch I'll be sure to come back!
v/r
Mike
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide