aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorRune Henriksen <contact@henriksen.dk>2022-03-20 22:58:49 +0100
committerGitea <gitea@fake.local>2022-03-21 12:03:57 +0100
commitaea8b41728422de4920b67c791b13beb948a3c88 (patch)
tree1fef2ed0e9b50fb01278a662e324bd3f153b37fe
parent71e6645e09e9506099ab9058498b26e1802020e7 (diff)
downloadgarage-aea8b41728422de4920b67c791b13beb948a3c88.tar.gz
garage-aea8b41728422de4920b67c791b13beb948a3c88.zip
document request routing logic
-rw-r--r--doc/book/reference-manual/routing.md45
1 files changed, 45 insertions, 0 deletions
diff --git a/doc/book/reference-manual/routing.md b/doc/book/reference-manual/routing.md
new file mode 100644
index 00000000..aec637cc
--- /dev/null
+++ b/doc/book/reference-manual/routing.md
@@ -0,0 +1,45 @@
++++
+title = "Request routing logic"
+weight = 10
++++
+
+Data retrieval requests to Garage endpoints (S3 API and websites) are resolved
+to an individual object in a bucket. Since objects are replicated to multiple nodes
+Garage must ensure consistency before answering the request.
+
+## Using quorum to ensure consistency
+
+Garage ensures consistency by attempting to establish a quorum with the
+data nodes responsible for the object. When a majority of the data nodes
+have provided metadata on a object Garage can then answer the request.
+
+When a request arrives Garage will, assuming the recommended 3 replicas, perform the following actions:
+
+- Make a request to the two preferred nodes for object metadata
+- Try the third node if one of the two initial requests fail
+- Check that the metadata from at least 2 nodes match
+- Check that the object hasn't been marked deleted
+- Answer the request with inline data from metadata if object is small enough
+- Or get data blocks from the preferred nodes and answer using the assembled object
+
+Garage dynamically determines which nodes to query based on health, preference, and
+which nodes actually host a given data. Garage has no concept of "primary" so any
+healthy node with the data can be used as long as a quorum is reached for the metadata.
+
+## Node health
+
+Garage keeps a TCP session open to each node in the cluster and periodically pings them. If a connection
+cannot be established, or a node fails to answer a number of pings, the target node is marked as failed.
+Failed nodes are not used for quorum or other internal requests.
+
+## Node preference
+
+Garage prioritizes which nodes to query according to a few criteria:
+
+- A node always prefers itself if it can answer the request
+- Then the node prioritizes nodes in the same zone
+- Finally the nodes with the lowest latency are prioritized
+
+
+For further reading on the cluster structure look at the [gateway](@/documentation/cookbook/gateways.md)
+and [cluster layout management](@/documentation/reference-manual/layout.md) pages. \ No newline at end of file