@@ -550,11 +551,11 @@ procedures specific to that database.
If a job is behaving strangely, you can inspect its job log to see
what's going on. For example, today Nagios warned about the backups
being too old on colchicifolium:
being too old on `colchicifolium`:
10:02:58 <nsa> tor-nagios: [colchicifolium] backup - bacula - last full backup is WARNING: WARN: Last backup of colchicifolium.torproject.org/F is 45.16 days old.
Looking at the bacula director status, it says this:
Looking at the Bacula director status, it says this:
Console connected using TLS at 10-Jan-20 18:19
JobId Type Level Files Bytes Name Status
@@ -564,8 +565,8 @@ Looking at the bacula director status, it says this:
120468 Back Diff 30,694 3.353 G gitlab-01.torproject.org is running
====
Which is strange because those JobId numbers are very low compared to
(say) the gitlab backup job. To inspect the job log, you use the
Which is strange because those `JobId` numbers are very low compared to
(say) the GitLab backup job. To inspect the job log, you use the
* the `ssl_cert_file` and `ssl_key_file` point to valid SSL certs
11. Once you have the postgres database cluster restored, start the
11. Once you have the PostgreSQL database cluster restored, start the
director:
systemctl start bacula-director
@@ -974,13 +975,13 @@ director.
killall sleep
16. switch the nagios checks over the new director: grep for the old
director name in the nagios configuration and fix up some of the
16. switch the Nagios checks over the new director: grep for the old
director name in the Nagios configuration and fix up some of the
checks
git -C tor-nagios grep dictyotum
17. you will also need to restore the password file for the nagios
17. you will also need to restore the password file for the Nagios
check in `/etc/nagios/bacula-database`
18. switch the director in `/etc/dsa/bacula-reader-database` or
@@ -994,7 +995,7 @@ comfortable the new setup is working.
TODO: `15:19:55 <weasel> and once that's up and running, it'd probably be smart to upgrade it to 11. pg_upgradecluster -m upgrade --link`
TODO: some psql users still refer to host-specific usernames like
`bacula-dictyotum-reader`, maybe they should refer to role-specif
`bacula-dictyotum-reader`, maybe they should refer to role-specific
names instead?
### Troubleshooting
@@ -1013,7 +1014,7 @@ because the `/etc/dsa/bacula-reader-database` refers to a non-existent
port=5433
to that file, or specify the `dbname` and `port` manually in the
config file.
configuration file.
If the scheduler is sending you an email every three minutes with this
error:
@@ -1055,7 +1056,7 @@ Backups are *pulled* from machines to the backup server, which means a
compromise on a machine shouldn't allow an attacker to delete backups
from the backup server.
Bacula splits the different responsabilities of the backup system
Bacula splits the different responsibilities of the backup system
among multiple components, namely:
* storage daemon (`bacula::storage` in Puppet, currently `bungei`)
@@ -1080,15 +1081,15 @@ make it harder to find volumes. Instead, our catalog is backed up in
the compressed database dump, which is sufficient to re-bootstrap the
director.
See the [introductio to Bacula](https://www.bacula.org/9.4.x-manuals/en/main/What_is_Bacula.html#SECTION00220000000000000000) for more information on those
See the [introduction to Bacula](https://www.bacula.org/9.4.x-manuals/en/main/What_is_Bacula.html#SECTION00220000000000000000) for more information on those
distinctions.
### PostgreSQL backup system
Database backups are handled specially. We use PostgreSQL (postgres)
everywhere apart from a few rare exceptions (currently only CiviCRM)
and therefore use postgres-specific configurations to do backups of
all our servers.
Database backups are handled specially. We use PostgreSQL everywhere
apart from a few rare exceptions (currently only CiviCRM) and
therefore use postgres-specific configurations to do backups of all
our servers.
See [howto/postgresql](howto/postgresql) for that server's specific backup/restore