ShrEx observed issues #1983

Closed
renaynay opened this issue Mar 29, 2023 · 3 comments
Labels
area:shares Shares and samples

Comments

@renaynay
Member

Documenting issues we've observed so far as a note:

  • DASer doesn’t max out the concurrency limit for full nodes: even when my catchup head is ~4k headers past my head of sampled chain, I only spawn two workers with about 50 headers each
  • Full node lags significantly behind the network head for sampling (even when it was already synced up to the network head) when the chain is maxed out (every block at 8 MB)
  • Re-DASing of headers
  • False-alarm blacklisting of good peers
renaynay added the area:shares label and removed the needs:triage label on Mar 29, 2023
@walldiss
Member

DASer doesn’t max out the concurrency limit for full nodes: even when my catchup head is ~4k headers past my head of sampled chain, I only spawn two workers with about 50 headers each

This is not an issue. The DASer does not make decisions based on the head of the sampled chain. If the catchup head is the same as the network head, it means all headers have already been dispatched to workers. A two-worker situation can occur when those two workers are processing header ranges that are significantly slower to sample: all headers after those ranges have already been processed by the DASer, but they are not yet reflected in the head of the sampled chain because the chain still has gaps.
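
For illustration, here is a minimal Go sketch (names and structure are hypothetical, not the actual DASer code) of why the head of the sampled chain can trail far behind already-processed headers: the head only advances over a contiguous, gap-free prefix, so one slow range holds it back.

```go
package main

import "fmt"

// sampledHead returns the highest height such that every height from
// `from`+1 up to it has been sampled, i.e. the head advances only while
// there are no gaps below it.
func sampledHead(done map[uint64]bool, from uint64) uint64 {
	head := from
	for done[head+1] {
		head++
	}
	return head
}

func main() {
	done := map[uint64]bool{}
	// Heights 101-150 are still with a slow worker; 151-4000 already finished.
	for h := uint64(151); h <= 4000; h++ {
		done[h] = true
	}
	// The sampled head stays at 100 despite thousands of finished headers above the gap.
	fmt.Println(sampledHead(done, 100)) // prints 100
}
```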

Full node lags significantly behind network head

Do we see an increase in "request failed" logs with DEBUG logging enabled for "share/getters"?
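
For reference, and assuming the "share/getters" subsystem logger is wired through ipfs/go-log (an assumption on my part), its level could be raised to DEBUG with something like:

```go
package main

import logging "github.com/ipfs/go-log/v2"

func main() {
	// Assumption: the "share/getters" logger is registered with ipfs/go-log,
	// so raising its level surfaces the "request failed" debug entries.
	if err := logging.SetLogLevel("share/getters", "debug"); err != nil {
		panic(err)
	}
}
```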

Re-DASing of headers

Would need logs showing the issue to identify the cause.

False alarm blacklisting of good peers

Merging the peer-manager metrics PR and plugging it into Grafana would help identify whether the blacklisting happens inside the peer manager, and the reason for it.

@renaynay
Member Author

That's a fair point re 1.

Re 2: we need to run metrics on a robusta full node for this.

Re 3: Will report it next time it's observed.

Re 4: Agree. @walldiss can you make an issue to track blacklisting stability?

@renaynay
Member Author

renaynay commented Apr 4, 2023

Closing this issue in favour of #2016

renaynay closed this as not planned on Apr 4, 2023