author | Alex Auvolat <alex@adnab.me> | 2022-11-16 13:40:49 +0100 |
---|---|---|
committer | Alex Auvolat <alex@adnab.me> | 2022-11-16 13:40:49 +0100 |
commit | 555a54ec4063f253e2afdc54a07c43d6d3cb582d (patch) | |
tree | 1449beb2982cdd10b336adb36eac97622bc48a54 /doc/book/cookbook/monitoring.md | |
parent | fc8f795bba1e23c8932af322959e0e3db2ef1233 (diff) | |
download | garage-555a54ec4063f253e2afdc54a07c43d6d3cb582d.tar.gz garage-555a54ec4063f253e2afdc54a07c43d6d3cb582d.zip |
doc precisions and fixes [doc-0.8]
Diffstat (limited to 'doc/book/cookbook/monitoring.md')
-rw-r--r-- | doc/book/cookbook/monitoring.md | 5 |
1 files changed, 3 insertions, 2 deletions
diff --git a/doc/book/cookbook/monitoring.md b/doc/book/cookbook/monitoring.md
index fd014c4e..8206f645 100644
--- a/doc/book/cookbook/monitoring.md
+++ b/doc/book/cookbook/monitoring.md
@@ -193,6 +193,7 @@ block_resync_queue_length 0
 
 The number of block hashes that we were unable to resync last time we tried.
 **THIS SHOULD BE ZERO, OR FALL BACK TO ZERO RAPIDLY, IN A HEALTHY CLUSTER.**
+Persistent nonzero values indicate that some data is likely to be lost.
 
 ```
 block_resync_errored_blocks 0
@@ -211,7 +212,7 @@ rpc_request_counter{from="<this node>",rpc_endpoint="garage_block/manager.rs/Rpc
 
 #### `rpc_netapp_error_counter` (counter)
 
-Number of communication errors (errors in the Netapp library)
+Number of communication errors (errors in the Netapp library, generally due to disconnected nodes)
 
 ```
 rpc_netapp_error_counter{from="<this node>",rpc_endpoint="garage_block/manager.rs/Rpc",to="<remote node>"} 354
@@ -219,7 +220,7 @@ rpc_netapp_error_counter{from="<this node>",rpc_endpoint="garage_block/manager.r
 
 #### `rpc_timeout_counter` (counter)
 
-Number of RPC timeouts
+Number of RPC timeouts, should be close to zero in a healthy cluster.
 
 ```
 rpc_timeout_counter{from="<this node>",rpc_endpoint="garage_rpc/membership.rs/SystemRpc",to="<remote node>"} 1
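The health hints added in this diff ("persistent nonzero values", "close to zero in a healthy cluster") could be checked mechanically against successive scrapes of the metrics text. The following is only a minimal sketch, not part of Garage: the function names are hypothetical, the parser assumes simple `series value` lines without timestamps, and the second scrape contains made-up values for illustration.

```python
# Two successive scrapes of the metrics text. The lines in SCRAPE_1 are copied
# from the documentation excerpt; the values in SCRAPE_2 are invented.
SCRAPE_1 = """\
block_resync_errored_blocks 0
rpc_timeout_counter{from="<this node>",rpc_endpoint="garage_rpc/membership.rs/SystemRpc",to="<remote node>"} 1
"""
SCRAPE_2 = """\
block_resync_errored_blocks 0
rpc_timeout_counter{from="<this node>",rpc_endpoint="garage_rpc/membership.rs/SystemRpc",to="<remote node>"} 4
"""


def parse_samples(text):
    """Map each series (metric name plus labels) to its float value.

    Assumes one 'series value' pair per line, with no trailing timestamp.
    """
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        series, _, raw_value = line.rpartition(" ")
        samples[series] = float(raw_value)
    return samples


def series_name(series):
    """Return the metric name, i.e. the part before any '{labels}'."""
    return series.split("{", 1)[0]


def persistent_nonzero(scrapes, name):
    """True if the gauge `name` is above zero in every scrape given."""
    return bool(scrapes) and all(
        any(series_name(s) == name and v > 0 for s, v in parse_samples(t).items())
        for t in scrapes
    )


def counter_increase(scrapes, name):
    """Total increase of counter `name` between the first and last scrape.

    Negative differences (e.g. after a restart) are clamped to zero.
    """
    first, last = parse_samples(scrapes[0]), parse_samples(scrapes[-1])
    return sum(
        max(0.0, last[s] - first[s])
        for s in last
        if s in first and series_name(s) == name
    )


if __name__ == "__main__":
    scrapes = [SCRAPE_1, SCRAPE_2]
    print("errored blocks persist:",
          persistent_nonzero(scrapes, "block_resync_errored_blocks"))
    print("new RPC timeouts:", counter_increase(scrapes, "rpc_timeout_counter"))
```

The gauge is checked across all scrapes because the documentation treats a brief nonzero value as acceptable, while the counters are compared between scrapes because their absolute value only ever grows. In a real deployment this kind of rule would more likely live in the monitoring system's own alerting configuration.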