@@ -840,7 +840,17 @@ is *not* configured through our Puppet like other Prometheus
 servers. It has still been (manually) integrated in our Prometheus
 setup and Grafana dashboards (see [pager playbook](#pager-playbook)) have been deployed.

-More work is underway to improve monitoring in [issue 33921](https://gitlab.torproject.org/tpo/tpa/services/-/issues/33921).
+More work is underway to improve monitoring in [this issue](https://gitlab.torproject.org/tpo/tpa/team/-/issues/40077) (not
+hardcoding exporters). We could also use the following tools:
+
+* [moosh3/gitlab-alerts](https://github.com/moosh3/gitlab-alerts): autogenerate issues from Prometheus
+Alertmanager (with the webhook)
+* [FUSAKLA/prometheus-gitlab-notifier](https://github.com/FUSAKLA/prometheus-gitlab-notifier): similar
+* 11.5 shipped [a bunch of alerts](https://gitlab.com/gitlab-org/gitlab-foss/-/issues/45740) which we might want to use
+directly
+* the "Incident management" support has various [integrations](https://docs.gitlab.com/ee/operations/incident_management/integrations.html)
+including Prometheus (starting from 13.1) and PagerDuty (which is
+supported by Prometheus)
+

 ## Logs and metrics
