additional prometheus/grafana exporters/dashboards
our munin replacement is not entirely complete, as there are key parts of the infrastructure that are not monitored. here's a short inventory of what I found in legacy/trac#29681:

**email servers monitoring (eugeni, etc? postfix)**

 * [postfix exporter](https://github.com/kumina/postfix_exporter), in debian, [possible dashboard](https://github.com/kumina/postfix_exporter/issues/21)
 * another approach: [email delivery tests](https://github.com/cherti/mailexporter)

**mailman monitoring**

no known exporter or dashboard

**databases**

 * [postgres exporter](https://github.com/wrouesnel/postgres_exporter/), in debian, [no official dashboard](https://github.com/wrouesnel/postgres_exporter/issues/218), but [many possible dashboards](https://grafana.com/dashboards?dataSource=prometheus&search=postgres)
 * [mysqld exporter](https://github.com/prometheus/mysqld_exporter), in debian - [possible dashboard](https://grafana.com/dashboards/625), [another from percona](https://github.com/percona/grafana-dashboards), [not officially documented](https://github.com/prometheus/mysqld_exporter/issues/286)
 * [generic sql exporter](https://github.com/free/sql_exporter), in debian - [credativ config](https://github.com/credativ/elephant-shed/tree/master/sql-exporter) and [dashboard](https://github.com/credativ/elephant-shed/tree/master/grafana)

**DNS / bind**

 * [bind exporter](https://github.com/digitalocean/bind_exporter/), in debian, [official dashboard](https://grafana.com/dashboards/1666)

**GitLab**

there is [builtin support for prometheus](https://docs.gitlab.com/ee/administration/monitoring/prometheus/) that has to be [configured](https://docs.gitlab.com/ee/administration/monitoring/prometheus/gitlab_metrics.html)

these are the other missing things I found during the audit performed while removing Munin:

 * **spamassassin**: ham/spam/total counts, the munin plugin looks for `spamd: ((processing|checking) message|identified spam|clean message)` in mail.log, could be replaced with [mtail](https://github.com/google/mtail)
 * **postgres-wal-traffic_**: should be covered by the postgres_exporter mentioned above, otherwise hook `psql -p "$port" --no-align --command 'SELECT * FROM pg_current_xlog_insert_location()' --tuples-only --quiet | tr -d /,` into the node_exporter
 * **ksm stats**: extra memory statistics, might not be very important
 * **haproxy**: https://github.com/prometheus/haproxy_exporter
 * **per VM disk usage**: see legacy/trac#29816
 * **vsftpd**: would need a custom mtail plugin, no known exporter or dashboard

See the full review in legacy/trac#29682 for details on those.

There were also requests from other teams for monitoring, see legacy/trac#29863 and legacy/trac#30006 for now.
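
to make the spamassassin item more concrete, here is a minimal sketch of what the munin plugin's log matching could look like as a script feeding the node_exporter textfile collector. only the regular expression comes from the inventory above; the metric name `spamd_messages`, the `kind` label and the mapping of matches to ham/spam/total are assumptions made for illustration, and mtail would likely be the better long-term tool since it tails the log instead of rescanning it.

```python
import re
from collections import Counter

# Pattern copied from the spamassassin item in the inventory.
SPAMD_RE = re.compile(
    r"spamd: ((processing|checking) message|identified spam|clean message)"
)

# Assumed mapping from matched text to a counter name (not from the source).
KIND_FOR_MATCH = {
    "processing message": "total",
    "checking message": "total",
    "identified spam": "spam",
    "clean message": "ham",
}


def count_spamd(lines):
    """Count total/spam/ham spamd events in an iterable of log lines."""
    counts = Counter({"total": 0, "spam": 0, "ham": 0})
    for line in lines:
        match = SPAMD_RE.search(line)
        if match:
            counts[KIND_FOR_MATCH[match.group(1)]] += 1
    return counts


def render(counts, metric="spamd_messages"):
    """Render counts in the Prometheus text exposition format.

    The metric name is hypothetical. It is exposed as a gauge because the
    value is recomputed from the current log file on every run and drops
    when the log is rotated.
    """
    out = [f"# TYPE {metric} gauge"]
    for kind in ("total", "spam", "ham"):
        out.append(f'{metric}{{kind="{kind}"}} {counts[kind]}')
    return "\n".join(out) + "\n"
```

a cron job could call `render(count_spamd(open(path)))` on the mail log and write the result to a `.prom` file in the directory given to node_exporter with `--collector.textfile.directory`; the exact log path and output file name depend on the host and are left out here.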
issue