Review and improve Prometheus rules and tests

Description

Improvements to the "Onionprobe not responding" alert were proposed in tpo/tpa/prometheus-alerts#27 (closed). This ticket is about evaluating, integrating, and improving those suggestions.

Tasks

  • Add a new, simpler alert for when Onionprobe is down.
  • Test the solution proposed in tpo/tpa/prometheus-alerts#27 (closed), especially for the case where there are no samples or no series.
  • Decide whether this new test should be integrated, and improve it if needed.
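
As a starting point for discussion, a simpler "Onionprobe is down" rule could look roughly like the sketch below. It is only an illustration, not the rule from tpo/tpa/prometheus-alerts#27: the group name, the alert names, the `job="onionprobe"` label, the `for` durations, and the severity values are all assumptions. The second rule uses PromQL's `absent()` to cover the no samples/no series case, which a plain `up == 0` comparison cannot catch because it returns nothing when the series does not exist.

```yaml
# Hypothetical sketch: names, labels, and durations are assumptions,
# not taken from the existing prometheus-alerts repository.
groups:
  - name: onionprobe
    rules:
      # Fires when the scrape target is known but scraping it fails.
      - alert: OnionprobeDown
        expr: up{job="onionprobe"} == 0
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "Onionprobe is down on {{ $labels.instance }}"

      # Fires when no `up` series exists at all for the job,
      # which is the no samples/no series case.
      - alert: OnionprobeAbsent
        expr: absent(up{job="onionprobe"})
        for: 5m
        labels:
          severity: warning
        annotations:
          summary: "No Onionprobe series found"
```

Both cases can be exercised with `promtool test rules`: a test with an `up` series set to 0 should trigger the first alert, and a test with an empty `input_series` should trigger the second.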

Time estimation

  • Complexity: very small (0.5 day)
  • Uncertainty: low (x1.1)
  • Reference (adapted)
Edited by Silvio Rhatto