
Ensure we always have enough spare resources

There are a number of critical services (the website, whisperback, our schleuder lists, etc.) for which we should always have spare infrastructure, so that we can roll out a recovery from backup in case of emergency.

We should:

  • make an inventory of which services we deem critical
  • calculate how many resources each of these takes, and on which nodes
  • make a monitoring check that verifies enough spare resources are available on other nodes
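
As a starting point for discussion, the three steps above could be sketched roughly as follows. This is only a minimal illustration: the node names, the resource figures, and the `Resources` / `services_without_spare` names are all made up for the example, and a real check would need to pull the inventory and the free capacity from wherever we actually track them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resources:
    """A bundle of resources (hypothetical units: CPU cores, GB of RAM, GB of disk)."""
    cpus: int
    ram_gb: int
    disk_gb: int

    def fits_in(self, spare: "Resources") -> bool:
        # A service fits only if every single resource is available.
        return (
            self.cpus <= spare.cpus
            and self.ram_gb <= spare.ram_gb
            and self.disk_gb <= spare.disk_gb
        )


def services_without_spare(services, spare_by_node):
    """Return the sorted names of services that no *other* node could host.

    services:      service name -> (node it runs on, Resources it needs)
    spare_by_node: node name -> Resources currently unused on that node
    """
    missing = []
    for name, (home, needed) in services.items():
        candidates = (
            spare for node, spare in spare_by_node.items() if node != home
        )
        if not any(needed.fits_in(spare) for spare in candidates):
            missing.append(name)
    return sorted(missing)


# Made-up inventory, for illustration only.
SPARE = {
    "node-a": Resources(cpus=2, ram_gb=4, disk_gb=50),
    "node-b": Resources(cpus=4, ram_gb=8, disk_gb=200),
}
SERVICES = {
    "website": ("node-a", Resources(cpus=2, ram_gb=4, disk_gb=100)),
    "whisperback": ("node-b", Resources(cpus=1, ram_gb=2, disk_gb=20)),
    "schleuder": ("node-b", Resources(cpus=4, ram_gb=8, disk_gb=100)),
}

if __name__ == "__main__":
    problems = services_without_spare(SERVICES, SPARE)
    if problems:
        print("CRITICAL: no spare capacity for: " + ", ".join(problems))
        raise SystemExit(2)  # conventional "critical" exit code for monitoring plugins
    print("OK: every critical service can be recovered on another node")
```

Note that this sketch looks at each service on its own. It does not verify that all critical services of a failed node could be recovered at the same time, which is probably what we want in the end.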