01-20-2016 08:43 PM - edited 03-08-2019 03:29 AM
hello,
Our Cisco 7609, which uses a WS-SUP32-GE-3B (IOS 12.2(33)SRD4), detected a crash and automatically reloaded while I was running 'no address-family' and 'no ip vrf'. I opened a case, and Cisco TAC answered that I am hitting bug 'CSCtg94250'. The bug description says:
Symptoms: Removing address-family ipv4 vrf (in router BGP) followed by no ip vrf (where "vrf" is the same) could result in a crash.
Conditions: The symptom is observed in a large VPNv4 scale setup, when applying the following commands to the same VRF back-to-back:
1. no address-family ipv4 vrf
2. no ip vrf
3. ip vrf
The trigger of the BGP crash is a result of a racing condition between event 1 and event 2.
and the workaround is :
Workaround:
Since this is a racing condition, the workarounds are: 1. Not applying (1) before (2). 2. Give sufficient time for (1) to complete before applying (2).
(1) no address-family ipv4 vrf (2) no ip vrf (3) ip vrf
Known Fixed Releases:
12.2(33)SRE3 or later
However, I am wondering how long that "sufficient time" for (1) to complete actually is, and how I can be sure that it is safe to continue to step (2).
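To make the "wait between (1) and (2)" idea concrete, here is a minimal Python sketch of a polling helper. Everything in it is an assumption for illustration: the function name `wait_until_settled`, the `is_quiet` callback (whatever check you decide to trust), and all default timings are hypothetical, and nothing in the bug description says that any particular check proves step (1) has finished.

```python
import time
from typing import Callable


def wait_until_settled(
    is_quiet: Callable[[], bool],
    required_consecutive: int = 5,
    interval_s: float = 10.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll is_quiet() until it has been True for several polls in a row.

    Returns True if the check held for `required_consecutive` consecutive
    polls, or False if `timeout_s` elapsed first. All defaults are
    hypothetical values, not figures from Cisco.
    """
    streak = 0      # consecutive "quiet" polls seen so far
    waited = 0.0    # total time spent sleeping between polls
    while True:
        # A single noisy sample resets the streak to zero.
        streak = streak + 1 if is_quiet() else 0
        if streak >= required_consecutive:
            return True
        if waited >= timeout_s:
            return False
        sleep(interval_s)
        waited += interval_s
```

Requiring several consecutive quiet samples, instead of a single one, avoids moving on to step (2) during a short pause in the cleanup. If the helper returns False, the conservative choice is not to run step (2) at all.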
I collected some more info about the crash and I found that the last four Processes were :
and then it detected a Crash -> created a Crashinfo file -> reloaded.
I think I can use the 'show process cpu' command to verify whether the BGP processes are stable before I continue to the next step...
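As a rough sketch of that idea, the Python below pulls the 5-second CPU figure for the BGP processes out of saved 'show processes cpu' text and checks it against a threshold. The function names, the 1% threshold, and the sample numbers are all made up for illustration, and it is only my assumption that low BGP CPU means step (1) has finished; nothing from TAC confirms that.

```python
import re
from typing import Dict

# One process row: PID, Runtime(ms), Invoked, uSecs, 5Sec, 1Min, 5Min, TTY, Process
ROW = re.compile(
    r"^\s*(\d+)\s+\d+\s+\d+\s+\d+\s+([\d.]+)%\s+([\d.]+)%\s+([\d.]+)%\s+\d+\s+(.+?)\s*$"
)


def bgp_cpu_5sec(output: str) -> Dict[str, float]:
    """Map each process whose name starts with 'BGP' to its 5-second CPU %."""
    usage: Dict[str, float] = {}
    for line in output.splitlines():
        match = ROW.match(line)
        if match and match.group(5).startswith("BGP"):
            usage[match.group(5)] = float(match.group(2))
    return usage


def bgp_is_quiet(output: str, threshold_pct: float = 1.0) -> bool:
    """True only if BGP rows were found and all are at or below the threshold.

    The threshold is a hypothetical value; tune it against your own baseline.
    """
    usage = bgp_cpu_5sec(output)
    return bool(usage) and all(v <= threshold_pct for v in usage.values())


# Illustrative sample only: the numbers are invented, not taken from the crash.
SAMPLE = """\
CPU utilization for five seconds: 12%/1%; one minute: 9%; five minutes: 7%
 PID Runtime(ms)   Invoked      uSecs   5Sec   1Min   5Min TTY Process
   1           0         1          0  0.00%  0.00%  0.00%   0 Chunk Manager
 201      123456    654321        188  8.50%  4.20%  2.10%   0 BGP Router
 202       45678    123456        370  0.15%  0.30%  0.25%   0 BGP Scanner
"""

if __name__ == "__main__":
    print(bgp_cpu_5sec(SAMPLE))
    print(bgp_is_quiet(SAMPLE))
```

Comparing against a baseline captured before step (1) is probably more meaningful than a fixed threshold, because normal BGP CPU load differs from site to site.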
I would like to hear your opinion. Do you have any ideas? Do you know the exact time, or any other command to verify that it is safe to proceed?
thank you and happy new year!!
01-20-2016 10:40 PM
This bug has been fixed for quite some time.
https://tools.cisco.com/bugsearch/bug/CSCtg94250
Rather than wondering how long to wait in case it crashes, wouldn't it be better to apply software that has the bug fixed?
The "gold star" release for this platform looks like it is 15.3(3)S6. I would change to running a gold star release.
01-21-2016 04:02 PM
Hi, thank you for your kindness.
I agree 100% that upgrading would resolve the whole problem.
However, my boss wants to get to the bottom of this, and we have reasons why we cannot upgrade the IOS.
I have been thinking about it, and I don't think it is possible to know the exact time, because it depends on the size of the site and the configuration. So I need to know a command to check the processes, or some other way to be sure that the processes are stable.
01-21-2016 07:31 PM
Two easy solutions come to mind:
01-21-2016 07:46 PM
Can you please explain why I should delete the VRFs the next day?
I don't believe there are any references or docs about this issue, but I'd like to hear your opinion. I can then investigate further based on it.
01-21-2016 08:12 PM
You were asking for the "exact time" you had to wait. A safer option is to just defer the deletion till the next day.