Bug 11764

Summary: Monitor mail queue on mail01.ipfire.org
Product: Infrastructure Reporter: Peter Müller <peter.mueller>
Component: MonitoringAssignee: Michael Tremer <michael.tremer>
Status: CLOSED FIXED QA Contact: Peter Müller <peter.mueller>
Severity: - Unknown -    
Priority: - Unknown - CC: morlix
Version: unspecified   
Hardware: unspecified   
OS: Unspecified   
See Also: https://bugzilla.ipfire.org/show_bug.cgi?id=11765
Bug Depends on:    
Bug Blocks: 11768    

Description Peter Müller 2018-06-11 18:21:49 UTC
Currently, we see a massive amount of mails in the queue of IPFire's mailserver. Since the situation needs some further investigation, it would be helpful if the number of mails currently in the queue could be monitored and alerts to Michael and me are eventually triggered.
Comment 1 Michael Tremer 2018-06-11 19:26:46 UTC
What would the thresholds be and potential actions?
Comment 2 Peter Müller 2018-06-12 06:20:25 UTC
Treshold: 100 (warning), 250 (critical)

Actions: Depends on the situation. If we just have a lot of traffic to a bogus destination (DTAG, for example), we might just have to wait. Most important aspect here is to get notified.
Comment 3 Michael Tremer 2018-06-12 18:08:27 UTC
Apologies for being a bit too busy to google right now, but I will have to write
something that allows us to extract that data via SNMP. I don't want to
distribute SSH keys for running monitoring checks if I can avoid it.
Comment 4 Peter Müller 2018-06-12 18:50:31 UTC
(In reply to Michael Tremer from comment #3)
> Apologies for being a bit too busy to google right now, but I will have to
> write
> something that allows us to extract that data via SNMP. I don't want to
> distribute SSH keys for running monitoring checks if I can avoid it.
Let me google that for you... :-)

Maybe this is helpful: https://conshell.net/wiki/Using_Nagios_with_SNMP
(Basically there is a Nagios plugin for monitoring the mail queue, "/usr/local/nagios/libexec/check_mailq", which can be added to SNMP). Could you set this up?
Comment 5 Michael Tremer 2018-06-13 12:21:29 UTC
Done.