Skip to content
GitLab
  • Menu
Projects Groups Snippets
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
  • TPA team TPA team
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Issues 175
    • Issues 175
    • List
    • Boards
    • Service Desk
    • Milestones
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
  • Wiki
    • Wiki
  • Activity
  • Create a new issue
  • Issue Boards
Collapse sidebar
  • The Tor Project
  • TPA
  • TPA teamTPA team
  • Issues
  • #33810
Closed
Open
Created Apr 03, 2020 by anarcat@anarcatOwner

ganeti monitoring

we're migrating everything into ganeti, but maybe there's some extra monitoring we could think about, as ganeti is way more knowledgeable about its own internals than libvirt was. or at least that's the feeling I get.

some ideas:

  • we could have a nagios plugin that checks for N+1. riseup has something like this
  • we could have a grafana dashboard that shows us the state of the cluster. we already have the main dashboard which we can set to show only the ganeti cluster

The current memory view goes about like this:

snap-2020.04.03-17.33.23.png,700

I'm not sure how we could improve this, but it seems to me having global (and/or per node?) memory, CPU, network and disk usage would be a great improvement as well.

To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Assignee
Assign to
Time tracking