- Sep 08, 2022
-
- Sep 07, 2022
-
- Aug 30, 2022
-
-
Jérôme Charaoui authored
This enables a capability for which support was added in Ganeti 3.0.0: postcopy-ram migration. With this flag enabled, Ganeti automatically switches to postcopy-ram when a precopy-ram migration is not completing fast enough, allowing us to migrate very busy machines much more rapidly and without human intervention (e.g. no need to SIGSTOP GitLab processes when migrating gitlab-02).
https://github.com/ganeti/ganeti/blob/master/NEWS#automatic-postcopy-migration-handling-for-kvm-guests
https://wiki.qemu.org/Features/PostCopyLiveMigration
-
Jérôme Charaoui authored
-
- Aug 29, 2022
-
-
Jérôme Charaoui authored
-
- Aug 25, 2022
-
- Aug 24, 2022
-
- Aug 18, 2022
-
- Aug 16, 2022
-
-
kez authored
Closes team#40853
- Aug 09, 2022
-
-
Jérôme Charaoui authored
-
Jérôme Charaoui authored
-
Jérôme Charaoui authored
-
Jérôme Charaoui authored
-
- Aug 08, 2022
-
-
Jérôme Charaoui authored
-
- Aug 04, 2022
-
-
Jérôme Charaoui authored
-
- Aug 02, 2022
- Aug 01, 2022
-
-
Jérôme Charaoui authored
This mainly adds an upgrade howto to make future upgrades of this service easier.
-
- Jul 30, 2022
-
-
micah authored
-
- Jul 27, 2022
-
-
anarcat authored
This is a test of issue templates to see if the default templates work and if issue templates work at all. Unless there are project changes (which I won't do now), this should only affect the wiki replica... See team#29398.
-
micah authored
-
anarcat authored
An "alert" is not necessarily delivered through bots and email and everything: it *could* be just a message on a dashboard, which is actually a scenario I am considering. In particular, I am considering the idea that we explicitly do *not* want email notifications for TPA services, because those create a lot of noise and are not necessarily useful for figuring out what's going on (as opposed to a dashboard). In any case, it seems important to decouple those two notions, which were kind of mixed up before. This should be clearer now.
- Jul 26, 2022
-
-
Jérôme Charaoui authored
I added the "trigger service check update" here because some checks are expensive (e.g. anything to do with apt or needrestart) and we might not be running those every minute. So it should be possible to handle an alert generated by such components and quickly trigger a check for confirmation, instead of having to wait a certain amount of time. As for event history, I think it's also an important item to have in this list. When handling an incident, it's not ideal to have to rely on the IRC or email notifications generated by the system, because each operator may use different clients for those, with different retention policies, so establishing a procedure around them would be impractical if not impossible.
-