Relay client should limit max concurrency #1140
base: master
Conversation
Signed-off-by: Cody Littley <cody@eigenlabs.org>
relay/cmd/flags/flags.go
@@ -199,7 +199,7 @@ var (
 	Usage: "Max number of concurrent GetChunk operations per client",
 	Required: false,
 	EnvVar: common.PrefixEnvVar(envVarPrefix, "MAX_CONCURRENT_GET_CHUNK_OPS_CLIENT"),
-	Value: 1,
+	Value: 2, // default value should stay in sync with the default value of node.Config.RelayConcurrency
Can we increase this further, like to 8 or 16? (Obviously this is just the default value and the real value will be configured in the cluster, but I'm wondering what the most accommodating value would be for clients calling the relay.)
increased to 8
api/clients/v2/relay_client.go
@@ -95,7 +111,17 @@ func (c *relayClient) GetBlob(ctx context.Context, relayKey corev2.RelayKey, blo
 		return nil, err
 	}

-	res, err := client.GetBlob(ctx, &relaygrpc.GetBlobRequest{
+	select {
Why don't we return the error immediately (in the case of rate limits at the relay, it will return a ResourceExhausted error) instead of blocking and waiting? This would allow the user of this client to handle the error however it wants.
Currently we don't have retry logic when fetching from a relay. The idea here was that it is better to slightly delay a request to a relay than to fail to fetch data (and potentially not sign a batch).
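(For reference, the blocking approach described above is commonly implemented in Go with a buffered channel used as a counting semaphore. The sketch below is illustrative only, not the PR's actual code: limitedClient, newLimitedClient, and getBlob are hypothetical names.)

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// limitedClient caps the number of concurrent outbound calls using a
// buffered channel as a counting semaphore: channel capacity == limit.
type limitedClient struct {
	sem chan struct{}
}

func newLimitedClient(maxConcurrent int) *limitedClient {
	return &limitedClient{sem: make(chan struct{}, maxConcurrent)}
}

// getBlob blocks until a slot is free (or the context is cancelled),
// mirroring the "delay rather than fail" behavior described above.
func (c *limitedClient) getBlob(ctx context.Context, key string) (string, error) {
	select {
	case c.sem <- struct{}{}: // acquire a slot
	case <-ctx.Done():
		return "", ctx.Err()
	}
	defer func() { <-c.sem }() // release the slot when the call returns

	time.Sleep(10 * time.Millisecond) // placeholder for the actual RPC
	return "blob:" + key, nil
}

func main() {
	c := newLimitedClient(8) // 8 matches the default discussed above
	res, err := c.getBlob(context.Background(), "example")
	fmt.Println(res, err)
}
```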
I think this choice should be made not inside the client but by the users of the client, with the client being a thin wrapper for interacting with the relay. If we really do want this kind of limiting logic on the client side, should we add it on the node side instead?
I'm fine with not putting this limiting mechanism into the client at all, in favor of the node potentially retrying with backoff in the case of a ResourceExhausted error. Because the users of this relay client don't know the rate limiting configuration set in the relay, it's highly likely that the client-side accounting will not be correct.
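(A minimal sketch of the retry-with-backoff alternative suggested here, assuming standard gRPC status codes; retryOnExhausted and its signature are hypothetical, not part of the actual client API.)

```go
package relayretry

import (
	"context"
	"time"

	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

// retryOnExhausted retries a call with exponential backoff whenever the
// relay reports ResourceExhausted, and returns immediately on success or
// on any other error so the caller can handle it as it sees fit.
func retryOnExhausted(ctx context.Context, call func(context.Context) error) error {
	backoff := 50 * time.Millisecond
	const maxAttempts = 5
	for attempt := 1; ; attempt++ {
		err := call(ctx)
		// Return on success, on a non-retryable error, or after the last attempt.
		if status.Code(err) != codes.ResourceExhausted || attempt == maxAttempts {
			return err
		}
		select {
		case <-time.After(backoff):
			backoff *= 2 // double the wait between attempts
		case <-ctx.Done():
			return ctx.Err()
		}
	}
}
```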
Signed-off-by: Cody Littley <cody@eigenlabs.org>
Why are these changes needed?
In preprod, we observe relays rate limiting concurrent GetChunks() requests. This is not ideal in an environment where the relays are not under intentional attack or heavy load. The following screenshot is a histogram of the logs reporting concurrency throttling.
This PR introduces two primary changes: