Increase number of cycles for felix bridges
Increase timeout check cycles for default-bridge-felix-1 and default-bridge-felix-2 as they have been generating too many alerts.
Merge request reports
Activity
After looking at the change, I'd be surprised if this solved the issue. The way I understand the syntax used here is that monit will check the connection once per cycle (2 minutes). With how it was written before, if it failed 3 times within the span of 5 cycles, it would send an alert. With this change, if it fails 3 times in the span of 8 cycles it will trigger an alert. So isn't this strictly worse than before?
@cohosh true, I mixed up the number of times and cycles. Here is how I tested it:
$ cat monit-test.conf
check host testing with address 127.0.0.1 if failed port 34567 for 6 times within 6 cycles then exec "/usr/bin/netcat -l 127.0.0.1 34567"
$ monit -d 1 -I -c monit-test.conf
Starting Monit 5.27.2 daemon 'host' Monit 5.27.2 started 'testing' failed protocol test [DEFAULT] at [127.0.0.1]:34567 [TCP/IP] -- Connection refused 'testing' failed protocol test [DEFAULT] at [127.0.0.1]:34567 [TCP/IP] -- Connection refused 'testing' failed protocol test [DEFAULT] at [127.0.0.1]:34567 [TCP/IP] -- Connection refused 'testing' failed protocol test [DEFAULT] at [127.0.0.1]:34567 [TCP/IP] -- Connection refused 'testing' failed protocol test [DEFAULT] at [127.0.0.1]:34567 [TCP/IP] -- Connection refused 'testing' failed protocol test [DEFAULT] at [127.0.0.1]:34567 [TCP/IP] -- Connection refused 'testing' exec: '/usr/bin/netcat -l 127.0.0.1 34567' 'testing' connection succeeded to [127.0.0.1]:34567 [TCP/IP]
Edited by anadahzOkay thanks for the update. This looks reasonable to me. I'll ping @phw about deploying it