Consider token as good when node doesn't support trace calls #3002

fleupold · 2024-09-18T08:48:53Z

Description

Not all of our fallback nodes support the trace_call API. While we can configure the tracing URL separately, in case of outage the API may not be able to classify any tokens as good and thus effectively prevent trading.

Changes

If trace call many fails due to a TransportError (which is what happens if you call an unsupported method), consider the token as good.
Add tracing spans with token address context for any logs that happen during detection

How to test

cargo test --package shared --lib -- bad_token::trace_call::tests::mainnet_tokens --exact --show-output 
--ignored

both with a Node that does and does not support the trace call API. Observe that in the case of non-support all tokens are of quality good.

MartinquaXD

Change makes sense to me. I'm mostly worried about the seemingly duplicate .instrument() calls as I think the instrumentation should happen inside the token detector itself and not in its callers.

MartinquaXD · 2024-09-18T09:00:38Z

crates/shared/src/order_validation.rs

@@ -434,6 +435,10 @@ impl OrderValidating for OrderValidator {
            if let TokenQuality::Bad { reason } = self
                .bad_token_detector
                .detect(token)
+                .instrument(tracing::info_span!(


I'm a bit surprised by all these instrumented futures and think these could lead to duplicated tracing spans. Wouldn't it be sufficient to instrument only the inner.detect() calls inside the CachingDetector?

This would assume we always use a CachingDetector in our estimating chain (which is true now but might change when and we don't want the observability to change in that case). I'm fine with this, but I thought the callsite that creates the outmost

Actually, I found InstrumentedBadTokenDetector which lends itself perfectly for this.

MartinquaXD · 2024-09-18T09:04:42Z

crates/shared/src/trace_many.rs

@@ -20,17 +21,15 @@ pub async fn trace_many(requests: Vec<CallRequest>, web3: &Web3) -> Result<Vec<B
                serde_json::to_value(vec![TraceType::Trace])?,
            ])
        })
-        .collect::<Result<Vec<_>>>()?;
+        .collect::<Result<Vec<_>>>()
+        .map_err(|e| Error::Decoder(e.to_string()))?;


Looks like we can only hit this .map_err() if our JSON serialization fails so using the Error::Decoder variant seems odd. OTOH web3 doesn't offer a more suitable error variant anyway.

sunce86 · 2024-09-18T09:34:40Z

crates/shared/src/bad_token/trace_call.rs

-            .context("trace_many")?;
+        let traces = match trace_many::trace_many(request, &self.web3).await {
+            Ok(result) => result,
+            Err(web3::Error::Transport(e)) => {


How do we know we want to catch this type of Error? For example, I would rather expect RPCError with ErrorCode::MethodNotFound (-32601)

If you received Transport Error in practice instead, this might mean the specific node provider has unexpected handling of this error, so this code change would be effective only for that node provider and not for the rest.
In that case I would suggest also adding Rpc variant and handle it similarly as Err(web3::Error::Transport(e))

Very good point. Seems to be an Alchemy specific behavior (Nodereal returns with the response code you expected). Will add this case.

m-lord-renkse

LGTM, I am also concern about duplicate spans as @MartinquaXD mentioned.

Consider token as good when node doesn't support trace calls

60e5f7a

fleupold requested a review from a team as a code owner September 18, 2024 08:48

MartinquaXD approved these changes Sep 18, 2024

View reviewed changes

sunce86 reviewed Sep 18, 2024

View reviewed changes

m-lord-renkse approved these changes Sep 19, 2024

View reviewed changes

fleupold added 4 commits September 19, 2024 18:21

filter out method errors as well

f72c8fb

remove instrumentation redundancy

2a597a0

clippy

3b77579

Merge branch 'main' into bad_token_detection_non_tracing_node

a3eff2d

fleupold merged commit 185586d into main Sep 20, 2024
11 checks passed

fleupold deleted the bad_token_detection_non_tracing_node branch September 20, 2024 15:17

github-actions bot locked and limited conversation to collaborators Sep 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider token as good when node doesn't support trace calls #3002

Consider token as good when node doesn't support trace calls #3002

fleupold commented Sep 18, 2024

MartinquaXD left a comment

MartinquaXD Sep 18, 2024

fleupold Sep 19, 2024

fleupold Sep 19, 2024

MartinquaXD Sep 18, 2024

sunce86 Sep 18, 2024 •

edited

Loading

sunce86 Sep 18, 2024 •

edited

Loading

fleupold Sep 19, 2024

m-lord-renkse left a comment

Consider token as good when node doesn't support trace calls #3002

Consider token as good when node doesn't support trace calls #3002

Conversation

fleupold commented Sep 18, 2024

Description

Changes

How to test

MartinquaXD left a comment

Choose a reason for hiding this comment

MartinquaXD Sep 18, 2024

Choose a reason for hiding this comment

fleupold Sep 19, 2024

Choose a reason for hiding this comment

fleupold Sep 19, 2024

Choose a reason for hiding this comment

MartinquaXD Sep 18, 2024

Choose a reason for hiding this comment

sunce86 Sep 18, 2024 • edited Loading

Choose a reason for hiding this comment

sunce86 Sep 18, 2024 • edited Loading

Choose a reason for hiding this comment

fleupold Sep 19, 2024

Choose a reason for hiding this comment

m-lord-renkse left a comment

Choose a reason for hiding this comment

sunce86 Sep 18, 2024 •

edited

Loading

sunce86 Sep 18, 2024 •

edited

Loading