How do you monitor and react when some scheduled job fails? - general question

Posted by Dzida on Server Fault See other posts from Server Fault or by Dzida
Published on 2010-06-10T09:42:24Z Indexed on 2010/06/10 9:53 UTC
Read the original article Hit count: 168

Hi,

In many projects my team faced problems with 'silent fails' of some important components. There are lot of tasks executed behind the scenes and if somethings fails (either by errors in logic or hardware problems) in most cases responsible person is not notified (or not notified instantly).

I know about heavy-weight monitoring tools that could solve some of that problems but there over-complicated and too expensive for our team.

I am interested what are your solutions for such problems.

© Server Fault or respective owner

Related posts about monitoring

Related posts about cron