Diffstat (limited to 'os/config/README.md')
-rw-r--r-- os/config/README.md 11
1 files changed, 7 insertions, 4 deletions
diff --git a/os/config/README.md b/os/config/README.md
index a1078df..81fe9c9 100644
--- a/os/config/README.md
+++ b/os/config/README.md
@@ -4,9 +4,12 @@
For each machine, **one by one** do:
- Check that cluster is healthy
- - Check gluster
- - `sudo gluster peer status`
- - `sudo gluster volume status all` (check Online Col, only `Y` must appear)
+ - Check garage
+ - check that all nodes are online `docker exec -ti xxx /garage status`
+ - check that tables are in sync `docker exec -ti 63a4d7ecd795 /garage repair --yes tables`
+ - check garage logs
+ - no unknown errors or resync should be in progress
+ - the following line must appear `INFO garage_util::background > Worker exited: Repair worker`
- Check that Nomad is healthy
- `nomad server members`
- `nomad node status`
@@ -17,5 +20,5 @@ For each machine, **one by one** do:
- Run `nomad node drain -enable -force -self`
- Reboot
- Run `nomad node drain -self -disable`
- - Check that cluster is healthy
+ - Check that cluster is healthy (basically the whole first point)
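
The health check described in the updated README could be scripted roughly as follows. This is a minimal sketch and not part of the commit: the function names `repair_worker_exited` and `check_cluster` are hypothetical, and the container is passed in as an argument because the README leaves it as `xxx`. The `-ti` flags are dropped on the assumption that the script runs without a terminal. The log check is naive: it does not wait for the repair to finish and could match a line left over from an earlier repair, so the logs should still be read by hand for unknown errors or an ongoing resync.

```shell
#!/bin/sh

# Hypothetical helper: succeeds if the log text on stdin contains the
# completion line quoted in the README.
repair_worker_exited() {
    grep -q 'Worker exited: Repair worker'
}

# Hypothetical wrapper around the README's checks.
# $1 is the Garage container name or ID (an assumption; the README elides it).
check_cluster() {
    container="$1"

    # Garage: node status, then launch the table repair.
    docker exec "$container" /garage status || return 1
    docker exec "$container" /garage repair --yes tables || return 1

    # Garage: look for the repair completion line in the container logs.
    docker logs "$container" 2>&1 | repair_worker_exited || return 1

    # Nomad: list servers and nodes; the output still needs a human look.
    nomad server members || return 1
    nomad node status || return 1
}
```

After sourcing the file, `check_cluster <container>` would run the checks in order and stop at the first command that fails.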