From a1c6c33d735761f4897dd64d2145f21913ad84a6 Mon Sep 17 00:00:00 2001
From: Quentin Dufour
Date: Wed, 9 Mar 2022 16:54:19 +0100
Subject: Maintenance of 2022-03-09

---
 os/config/README.md | 11 +++++++----
 1 file changed, 7 insertions(+), 4 deletions(-)

(limited to 'os/config/README.md')

diff --git a/os/config/README.md b/os/config/README.md
index a1078df..81fe9c9 100644
--- a/os/config/README.md
+++ b/os/config/README.md
@@ -4,9 +4,12 @@ For each machine, **one by one** do:
   - Check that cluster is healthy
-    - Check gluster
-      - `sudo gluster peer status`
-      - `sudo gluster volume status all` (check Online Col, only `Y` must appear)
+    - Check garage
+      - check that all nodes are online `docker exec -ti xxx /garage status`
+      - check that tables are in sync `docker exec -ti 63a4d7ecd795 /garage repair --yes tables`
+      - check garage logs
+        - no unknown errors or resync should be in progress
+        - the following line must appear `INFO garage_util::background > Worker exited: Repair worker`
     - Check that Nomad is healthy
       - `nomad server members`
       - `nomad node status`
@@ -17,5 +20,5 @@ For each machine, **one by one** do:
   - Run `nomad node drain -enable -force -self`
   - Reboot
   - Run `nomad node drain -self -disable`
-  - Check that cluster is healthy
+  - Check that cluster is healthy (basically the whole first point)
-- 
cgit v1.2.3
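
The "check garage logs" step added by this patch asks for one specific line to appear after the table repair. A minimal sketch of how that check could be scripted is shown here; the function name `repair_finished` is hypothetical and not part of the patch, and the only thing taken from the README is the log text it searches for.

```shell
#!/bin/sh
# Hypothetical helper (not part of the patch): reads a Garage log on stdin
# and succeeds only if the repair worker's exit line quoted in the README
# is present. -F matches the text literally, -q suppresses output.
repair_finished() {
    grep -qF 'Worker exited: Repair worker'
}
```

Assuming the Garage container writes its log to the Docker log stream, this could be used as `docker logs xxx 2>&1 | repair_finished && echo "repair done"`, where `xxx` stands for the container ID as in the README. It covers only the "line must appear" condition; checking for unknown errors or an ongoing resync still requires reading the log.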