-
Notifications
You must be signed in to change notification settings - Fork 487
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Epic: resolve the pageserver backpressure problems #2028
Comments
So my understanding of the current status is the following:
There is an alternative to backpressure which can reduce select queries latency - more precisely calculate last written LSN for the particular page (relation/chunk). I have PR for it. So based on the results of test_wal_backpressure test I have made the following conclusions:
As far as changing default |
Oh, can we close #1793 then? |
To summarize what we have done with the current backpressure approach,
In short, the current status is quite "messy": we did have a patch to mitigate some of the backpressure issues, but we cannot update [1]: I'm not too sure about the status of the failed tests. Are they because of flaky tests unrelated to the new changes or because of the new changes? Maybe Konstantin can provide more insights on this. |
Some more information from my side:
Concerning the idea to have time-based backpressure I do not think that it can some radically reduce latencies comparing with current implementation:
|
I did disable
|
Yes, you are right. In this particular case ( |
@kelvich mentioned that we may be ok with tweaking the backpressure settings 10MB or 15MB & without the immediate changes in the backpressure logic |
There are two bp settings max_replication_write_lag and max_replication_flush_lag, set to 500MB and 10GB now. Are we going to set both to 10MB? |
we've done everything that we wanted for now, most of the issues should be gone, the remaining we're leaving for later (backlog) |
Just set max_replication_write_lag to 15 MB on prod. My new and old computes started well |
@kelvich is this something that we need to put more effort into ? |
@shanyp to rescope this, there's still unfinished work, but we're in a different world now :) |
Stale. |
The current plan is:
Ortiginally we had this plan:
We have quite a lot of issues about backpressure and several attempts to fix it. ISTM that there are two most actionable things that we can do:
List of backpressure-related tasks, if we can resolve a half, we'll be happy!:
The text was updated successfully, but these errors were encountered: