[FEA] Update GpuWindowExec to use OOM retry framework #7254
A first crack at this adds the retry logic within … A follow-on issue needs to be defined for split and retry, and running window seems like the first candidate. Running window already has logic to handle fixup around batch boundaries to carry the window aggregation through, and the split/merge logic would plug into this. But again, this likely should be split out from this issue when we start looking at it.
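As a sketch of that split/merge idea for running window, here is the mechanism reduced to a running sum over plain arrays: split the input into chunks, process each chunk, and carry the aggregation state across chunk boundaries the way the existing batch-boundary fixup does. Everything below is illustrative; none of these names exist in the plugin.

```scala
// Hypothetical illustration: a running sum computed chunk by chunk, with the
// carried value playing the role of the batch-boundary fixup state.
def runningSumInChunks(values: Array[Long], chunkRows: Int): Array[Long] = {
  var carry = 0L
  values.grouped(chunkRows).flatMap { piece =>
    val out = piece.scanLeft(carry)(_ + _).drop(1) // running sum within this chunk
    carry = out.last                               // fixup state carried to the next chunk
    out
  }.toArray
}

// runningSumInChunks(Array(1L, 2L, 3L, 4L, 5L), 2) == Array(1L, 3L, 6L, 10L, 15L)
```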
I have something working here, with some tests that touch row, range, and row-optimized windows (just going off what I see in …).
Is your feature request related to a problem? Please describe.
Once #7253 is complete, we should update GpuWindowExec to use it.
Describe the solution you'd like
There are a few parts to this that we need to get right, and they may need to be done in multiple separate steps or PRs.
We need to combine the preceding GpuCoalesceBatches with the GpuWindowExec. This is needed because the current design of the GpuMemoryLeaseManager does not allow batches larger than the default lease size to be passed between nodes in a SparkPlan. That is partly because of limitations in Spark itself, but combining the nodes should also help with debugging and with avoiding leaked leases.
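As a rough sketch of what the combination could look like at the plan level, the rewrite below folds a coalesce into the window node that follows it. The plan types here are simplified stand-ins, not the real SparkPlan nodes:

```scala
// Hypothetical, simplified plan nodes; the real rule would pattern-match on
// GpuWindowExec over GpuCoalesceBatches in the plugin's plan rewrites.
sealed trait Plan
case class Scan() extends Plan
case class CoalesceBatches(child: Plan) extends Plan
case class WindowExec(child: Plan) extends Plan
case class WindowWithCoalesce(child: Plan) extends Plan // the combined operator

def combine(p: Plan): Plan = p match {
  case WindowExec(CoalesceBatches(c)) => WindowWithCoalesce(combine(c)) // fold the pair
  case WindowExec(c)                  => WindowExec(combine(c))
  case CoalesceBatches(c)             => CoalesceBatches(combine(c))
  case WindowWithCoalesce(c)          => WindowWithCoalesce(combine(c))
  case s: Scan                        => s
}
```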
Once the two are combined, and the task knows how much memory it will need to hold the input and the number of input rows, it should do a high-water-mark estimate of how much GPU memory it will need to complete the operation. This is likely to be tricky, and we should use some of the memory-usage tools being worked on to help us know what the actual usage is. We should also look at adding some scale tests using the same technology. If the technology is not available when we first start to work on this, those tests should be filed as follow-on issues so we don't lose track of them.
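As a very rough sketch, and assuming we can measure a per-operation scratch factor with the tools above, the estimate might look like this (all names and factors are placeholders):

```scala
// High-water-mark estimate: input, output, and kernel scratch space are assumed
// to all be live at the same time at the peak.
def estimateHighWaterMark(
    inputBytes: Long,
    inputRows: Long,
    bytesPerOutputRow: Long,
    scratchFactor: Double): Long = {
  val outputBytes  = inputRows * bytesPerOutputRow  // window output has one row per input row
  val scratchBytes = (inputBytes * scratchFactor).toLong
  inputBytes + outputBytes + scratchBytes
}
```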
If the node needs a larger lease to complete the processing, it should ask for that lease at this point, before going on, since all of the data is already spillable.
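A sketch of that handshake, assuming a GpuMemoryLeaseManager API along the lines of what #7253 proposes (the method names here are invented for illustration):

```scala
// Hypothetical stand-in for the lease manager from #7253.
object GpuMemoryLeaseManager {
  final class Lease(val bytes: Long) extends AutoCloseable {
    override def close(): Unit = () // the real implementation would return memory to the pool
  }
  // The real call would likely block until enough memory can be leased.
  def requestLease(bytes: Long): Lease = new Lease(bytes)
}

def withLease[T](neededBytes: Long, defaultLeaseBytes: Long)(body: => T): T = {
  if (neededBytes <= defaultLeaseBytes) {
    body // the default lease already covers the estimate
  } else {
    // Safe to block here: all of the task's input is already spillable.
    val lease = GpuMemoryLeaseManager.requestLease(neededBytes)
    try body finally lease.close()
  }
}
```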
Once it has the lease it should perform the operation and, ideally, split the output batch into smaller chunks that are about the target batch size.
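A sketch of the chunking, assuming bytes scale roughly linearly with rows; the real code would slice the output ColumnarBatch rather than return row ranges:

```scala
// Split totalRows into [start, end) ranges that each hold about targetBytes.
def splitRowRanges(totalRows: Long, totalBytes: Long, targetBytes: Long): Seq[(Long, Long)] = {
  val bytesPerRow  = math.max(1L, totalBytes / math.max(1L, totalRows))
  val rowsPerChunk = math.max(1L, targetBytes / bytesPerRow)
  (0L until totalRows by rowsPerChunk).map { start =>
    (start, math.min(totalRows, start + rowsPerChunk))
  }
}
```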
For window operations this can get rather complicated, because there are multiple backends that could be used to perform a given window aggregation/function. We should strive to size the data as accurately as possible, but in cases where we just don't know we should err on the side of the worst case. Only if we see serious performance degradation should we rethink this plan.
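To make "err on the side of the worst case" concrete, a per-backend factor with a pessimistic fallback might be enough (the backends and numbers below are purely illustrative):

```scala
// Hypothetical backend sizing: use a measured factor when we have one,
// otherwise fall back to the worst case we are willing to assume.
sealed trait WindowBackend
case object RowsFrame  extends WindowBackend
case object RangeFrame extends WindowBackend
case object Unknown    extends WindowBackend

def scratchFactor(backend: WindowBackend): Double = backend match {
  case RowsFrame  => 1.5 // placeholder for a measured value
  case RangeFrame => 2.0 // placeholder for a measured value
  case Unknown    => 4.0 // pessimistic worst case
}
```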
This is intended to be the first operator that we tackle, because the input to a window operation can be unbounded, and there are a number of cases where we do not currently support out-of-core algorithms that would let us process arbitrary amounts of data.