Prometheus is a monitoring system that is designed to process a large
number of metrics, centralize them on one (or multiple) servers and
serve them with a well-defined API. That API is queried through a
domain-specific language (DSL) called "PromQL" or "Prometheus Query
Language". Prometheus also supports basic graphing capabilities
although those are limited enough that we use a separate graphing
layer on top (Grafana).
The Prometheus web interface is available at:
<https://prometheus.torproject.org>
A simple query you can try is to pick any metric in the list and click
`Execute`. For example, [this link](https://prometheus1.torproject.org/graph?g0.range_input=2w&g0.expr=node_load5&g0.tab=0) will show the 5-minute load
over the last two weeks for the known servers.
All machines configured through Puppet are scraped by the central
server every 15 seconds.
Munin expatriates
-----------------
Here's a quick cheat sheet from people used to Munin and switching to
Prometheus:
| What | Munin | Prometheus |
| --- | ----- | ---------- |
| Scraper | munin-update | prometheus |
| Agent | munin-node | prometheus node-exporter and others |
| Graphing | munin-graph | prometheus or grafana |