diff --git a/howto/gitlab.md b/howto/gitlab.md
index 81b97caae273ea2ea1be2b16346f95a0cde2f8a7..2643cf1f1e58acf10ea49639173189bcc40d4041 100644
--- a/howto/gitlab.md
+++ b/howto/gitlab.md
@@ -840,7 +840,17 @@ is *not* configured through our Puppet like other Prometheus servers. It has
 still been (manually) integrated in our Prometheus setup and Grafana
 dashboards (see [pager playbook](#pager-playbook)) have been deployed.
 
-More work is underway to improve monitoring in [issue 33921](https://gitlab.torproject.org/tpo/tpa/services/-/issues/33921).
+More work is underway to improve monitoring in [this issue](https://gitlab.torproject.org/tpo/tpa/team/-/issues/40077) (not
+hardcoding exporters). We could also use the following tools:
+
+ * [moosh3/gitlab-alerts](https://github.com/moosh3/gitlab-alerts): autogenerate issues based from Prometheus
+   Alert Manager (with the webhook)
+ * [FUSAKLA/prometheus-gitlab-notifier](https://github.com/FUSAKLA/prometheus-gitlab-notifier): similar
+ * 11.5 shipped [a bunch of alerts](https://gitlab.com/gitlab-org/gitlab-foss/-/issues/45740) which we might want to use
+   directly
+ * the "Incident management" support has various [integrations](https://docs.gitlab.com/ee/operations/incident_management/integrations.html)
+   including Prometheus (starting from 13.1) and Pagerduty (which is
+   supported by Prometheus)
 
 ## Logs and metrics
 