Recently one D9036 encoder (v1.10.30) was hit by a Software Error alarm which was related to AlarmCollector process crash. In this article we will discuss about this alarm, its behavior and the way out to resolve this alarm.
While investigating the alarm file, we saw too many “Software Error / mvi01-vbi:AlarmCollector” alarms in the alarm history. It was looking like the AlarmCollector process was crashing multiple times and coming back again. Once came back the alarm was cleared and again crashed.
It was confirmed from customer that there was no such service degradation (in terms of input and output functions) due to this frequent alarm.
We were worried about too many crashes of the AlarmCollector process; however Cisco investigated it and came with below observations:
-> There is a MON process that tracks running processes and reports an alarm.
-> Usually if a process crashes, it should restart automatically. However AlarmCollector process did not restart on crash as it is not standalone type of process i.e. it has dependencies on other processes that would need to be handled if process is restarted.
-> MON process has a bug where one of its processing paths makes incorrect determination of existence of the process when one or more of processes is/are down, resulting in incorrect clear of the alarm and alarm asserted again in the next MON query.
-> As confirmed by Cisco Engineering, they are looking to correct the MON bug in release v2.3
-> The cause of the AlarmCollector crash was not determined since there was no information available on the logs related to the crash (the information’s were overwritten by another alarms) and also Engineering was unable to reproduce it. However after a scheduled reboot of the encoder the alarm cleared and didn’t come back again.
-> Below the link to have more clarification about memory leak which in turn cause of various software error and we should suggest customer to upgrade encoder version incase they don't want frequent reboot.
HiBased on information on cisco website, there is an "A9K-IVRF-LIC" also per LC we have this license, for example for 8x100GE we have "A9K-800G-IVRF". Question here is do we need both? or per LC is enough?Thank you
Hi As I checked Cisco Live! power points, Tomahawk NPU supports 64K pppoe session. Some Questions here:1- A99-48x10GE-1GE-SR has two NPUs, does this mean I will have 128K pppoe session on this card?2- Also it was mentioned the TCAM capacity for ipv4 ...
HiI want to test BNG features of ASR9K in The lab and I want to use XRv9 for this purpose. I want to ask does this image support full features of BNG(RADIUS protocol,PPPoE termination on dot1q interface ,Session/Service Accounting, Per session QoS,...)If ...
Hello Expert, I have to replace ASR1000-SIP10 with ASR1000-6TGE.The SPA on the ASR1000-SIP10 are alredy in use.I am thinking to replace ASR1000-SIP10 with ASR1000-6TGE and copy the configuration that were ...