cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
674
Views
0
Helpful
2
Replies

Job Monitoring / Alerting

rachaelhall80
Level 1
Level 1

Aside from using job events/email actions to alert on job failures, what other tools are being used to monitor jobs and their critical path flows?

If you are using a tool outside of the Tidal console, what is the tool or tools you are using and what benefits are you getting from each of them?

2 Replies 2

We only use job events email, and Nagios triggered shells scripts that directly query the Tidal DB for specific things we want.  I think you can use SNMP traps with Tidal, we just don't do it.

We use an external event handler (CorreLog)  to handle complex actions. From TES we create a job event, that sends a message to CorreLog including all necessay parameters to impose actions.

Many of these actions are sacmd commands, SMS (text messages) to operators etc.

This way we can achieve a much higher level of automation than we can with TES events alone.