Quantcast
Channel: THWACK: Message List
Viewing all articles
Browse latest Browse all 20396

Is there a way to have the poller repoll within a given time frame if a component fails?

$
0
0

I have several templates that only poll every 60 minutes (some are even 720 minutes).   This poses a challenge in our monitoring as these are critical components but they can't be polled more often on a due to certain restrictions placed on us.

 

In our Nagios/Centreon setup we are migrating away from, I can specify a normal check interval, and a retry check interval.  So basically, if I'm polling every 60 minutes but I get a failure, it will retry in 3 minutes (or whatever I specify).  This will either clear the alarm if the problem resolves, or set the status to a hard down state if it fails the specified number of times.

 

I see in the component configuration (http check) that I can set it to fail after a certain number of consecutive failures, which is good, but there's not retry time, so it would be 60 minutes before it polls again.  I'd like for it to repoll within 5 minutes or so to get the status again.  I can't afford to wait 120 minutes for it to alert, so obviously I can't set it to fail two consecutive checks.

 

Clear as mud?  Anyone know if this is possible and I'm just missing it?


Viewing all articles
Browse latest Browse all 20396

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>