Upcoming
Milestone
Jan 1, 2025–Dec 31, 2025
TPA-RFC-33-C: Prometheus high availability, long term metrics, other exporters
Quote from TPA-RFC-33:
At this point, the vast majority of checks has been converted into Prometheus and we have reached feature parity. We are looking for "nice to have" improvements.
- prometheus3 server built for high availability (team#41643)
- autonomous delivery (team#41644)
- GitLab alert integration (team#41645)
- long term metrics: high retention, lower scrape interval on secondary server (team#40330)
- additional proxy setup as data source for Grafana (promxy or Thanos) c.f. above
- faster dashboard deployments (systemd timer instead of Puppet pulling) (team#41647)
- convert dashboards to Grafanalib (see team#41312)
- development Grafana server setup (team#41646)
- Matrix notifications (team#40216)
This work can wait for a while, probably starting and hopefully ending in 2025.
Follows %TPA-RFC-33-B: Prometheus server merge, more exporters
See also the kanban board for this milestone