10-25-2018 03:54 PM
Hello
My customer wants to monitor their ISE deployment more closely and we have recommended enabling more ISE Logging Categories, especially for issues that could cause business impact. Recently one of their ISE appliances stopped processing Radius because of a disk full issue. It was just one SYSLOG in a sea of millions and was not spotten in time - even if it had been spotted, the 1st line guys/gals may not understand the impact or who to inform next. The goal is to focus on the top 45 critical ones (in my opinion) and to create some logic for their Manager of Managers.
I was looking for the canonical list of ISE SYSLOG messages and came across an ancient Excel for ISE 2.0. Is there anything newer than this Excel (for ISE 2.4 ?) because I have logged quite a few SYSLOG event messages that are not listed, or have no Message Code in that Excel.
At the moment I am reverse engineering ISE to compile my own list of text strings that I can give to Operations Team for their SYSLOG application filtering. The final goal is to group and classify these alarms and create some automated rules about escallation paths etc. - it's very tedious though. I would appreciate if anyone else has done a similar exercise - or if not, does anyone have access to their SYSLOG server and can perform a quick grep of any "CISE_Alarm CRITICAL" and send those over to me? You might be surprised about what's in there ;)
I am spending time torturing ISE to provoke all these error conditions - but I wish I didn't have to.
Solved! Go to Solution.
10-24-2019 02:14 PM
Reviving a not so old thread (as I'm in the exact same situation as some people that have come across this), would this work for our purposes?
I know it's for ISE 2.6 but perhaps most of it (if not all) would apply for 2.4 as well.
I haven't checked my syslog files against this, but the page came in when I searched Google for a very specific message in the log files, so that could be it.
10-25-2018 05:07 PM - edited 10-25-2018 05:18 PM
10-25-2018 05:20 PM
Hi @hslai - thanks for this - would you be okay sharing the .XML file as an attachment on this forum? It might help others too.
10-25-2018 05:21 PM
I edited my previous response and attached the file there.
10-25-2018 06:07 PM
I can't find some of the rather important SYSLOG messages in that file. Just some random examples
CISE_Alarm CRITICAL: High Disk I/O Utilization: Server=ise01
CISE_Alarm CRITICAL: High Disk Utilization: Server=ise01
CISE_Alarm CRITICAL: High Memory usage: Server=ise01
Where is the list that contains this stuff?
10-25-2018 06:24 PM
Currently, ISE alarms are in somewhat free format and not using message catalog.
10-25-2018 06:35 PM
That means I am back to square one again. Reverse engineering based on torture testing.
I have around 1/3 of my list complete.
Sounds to me like there is some work to be done to get error codes assigned to all of these events and to index them into this XML file. And then update the CCO document (been neglected since ISE 2.0)
10-25-2018 06:50 PM
As they are in free format, hard to document them.
CSCvi40720 is an existing enhancement Nidhi raised on this area.
10-26-2018 05:20 AM
Man, I was excited when I saw this discussion come through... Much less excited after reading through the thread. Sounds like a significant miss on the vendor's part.
Thank you for your efforts, and happy to contribute if I can find the time. Hopefully others on the community have done some of this leg work!
10-27-2018 05:05 AM
I am just catching up on this thread and have a few questions. I haven't spend much time on the log parsing side of ISE, but there are two types of logs that can be sent in Syslogs from what I understand:
So are you saying that for #1 where all the work has been done to create the Categories, Classes and Message Codes that that information doesn't make it into the actual Syslog Message? Or are you perhaps seeing the Alarms turned into Syslogs and those are just the raw alarm messages sent via Syslog?
10-28-2018 02:41 PM
@paul - yip it's the Alarms that don't have numbers assigned to them - that part I don't mind so much - it would still be okay if we had a list from Cisco that contained all the possible alarm strings and which category/setting is ISE triggers it. There are alarms that I don't think I could ever simulate (unless I had root access and knew exactly what to do). But I have managed to capture around 1/3 of the ones I am interested in.
I'd be keen to have some more from folks out there who can grep their SYSLOG files for the "CRITICAL" alarms. :-)
10-29-2018 06:06 AM
Hmm, seems to me it would be easy for someone at Cisco *cough cough* *wink wink* to pull the actual ISE application syslog config. With root access, this shouldn't be very difficult. Just do a recursive 'find' for 'CISE_Alarm' under the directory where the app is running.
Does this make sense? Or am I just blowing smoke? Maybe it's not that simple.
10-28-2018 05:48 PM
I don't see any SYSLOGs being sent from an ISE node, when it loses connection to the Primary PAN. This would be a scenario where the PAN loses connectivity to another ISE node. I would have expected something ... anything. e.g. replication warning or heartbeat warning.
Maybe I did something wrong. But in my case the remaining node is still sending SYSLOGs (e.g. regular System_Statistics etc.) - but it doesn't complain about the loss of comms to the PAN. Weird.
10-29-2018 06:10 AM
10-29-2018 02:55 PM
Hi @paul
I have not looked into all of the SYSLOG messages so I won't comment on whether they are correct or not. @hslai has uploaded the latest .XML file containing all the SYSLOG messages that are sent via the proper way. The remaining SYSLOGS are just scattered around in the code if and when needed (and those don't have ID numbers).
I took the interesting Alarm categories from ISE and then compiled an Excel like below. The column SYSLOG example is based on my own lab output. I doubt I will be able to fill the entire sheet - and that is why I was hoping someone out there had done something similar.
A customer can't spend all day looking at their SYSLOG waiting for something to trigger on - we need to be able to supply them with the strings that they can intelligently match on.
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide