I have several templates that only poll every 60 minutes (some as rarely as every 720 minutes). This is a challenge for our monitoring: these are critical components, but they can't be polled more often due to certain restrictions placed on us.
In the Nagios/Centreon setup we are migrating away from, I can specify both a normal check interval and a retry check interval. So if I'm polling every 60 minutes and get a failure, it will retry in 3 minutes (or whatever I specify). The retries either clear the alarm if the problem resolves, or set the status to a hard down state once the check has failed the specified number of times.
I see in the component configuration (HTTP check) that I can set it to fail after a certain number of consecutive failures, which is good, but there's no retry time, so it would be 60 minutes before it polls again. I'd like it to repoll within 5 minutes or so to get the status again. I can't afford to wait 120 minutes for it to alert, so obviously I can't set it to fail only after two consecutive failed checks.
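To put rough numbers on the difference, here is a small Python sketch of the worst-case delay between a fault occurring and the alert firing. It is only an illustration of the arithmetic, not how either product schedules polls internally. The function name and parameters are made up for this example, and it assumes the fault starts right after a successful poll and that the first failed poll counts as failure number one.

```python
def worst_case_minutes_to_alert(normal_interval, retry_interval, failures_needed):
    """Worst-case minutes between a fault occurring and the alert firing.

    Assumption: the fault starts just after a successful poll, so the first
    failed poll lands one full normal interval later, and every further
    confirming poll runs at the retry interval.
    """
    if failures_needed < 1:
        raise ValueError("failures_needed must be at least 1")
    return normal_interval + (failures_needed - 1) * retry_interval


# No separate retry interval: the repoll waits the normal 60 minutes.
print(worst_case_minutes_to_alert(60, 60, 2))  # 120

# Nagios-style retry interval of 5 minutes.
print(worst_case_minutes_to_alert(60, 5, 2))   # 65
```

With a 720-minute template the gap is much larger: 1440 minutes without a retry interval versus 725 with a 5-minute retry.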
Clear as mud? Anyone know if this is possible and I'm just missing it?