Add cpu oom retry split handling to InternalRowToColumnarBatchIterator #10011
Conversation
…rator Signed-off-by: Jim Brennan <jimb@nvidia.com>
build
@@ -144,7 +144,7 @@ private class HostAlloc(nonPinnedLimit: Long) extends HostMemoryAllocator with L
 logDebug(s"Targeting host store size of $targetSize bytes")
 // We could not make it work so try and spill enough to make it work
 val maybeAmountSpilled =
-  RapidsBufferCatalog.synchronousSpill(RapidsBufferCatalog.getHostStorage, allocSize)
+  RapidsBufferCatalog.synchronousSpill(RapidsBufferCatalog.getHostStorage, targetSize)
nice catch
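For context, a hedged sketch of why the argument matters, with made-up sizes and a hypothetical helper rather than the plugin's actual HostAlloc code, and assuming (as the "Targeting host store size" log line suggests) that synchronousSpill receives the size the host store should shrink to:

```java
// Illustrative arithmetic only (hypothetical names and sizes, not the plugin's code):
// if synchronousSpill is given the size the store should shrink *to*, passing the
// small allocSize instead of the computed targetSize spills far more than needed.
public final class SpillTargetSketch {
    // Bytes spilled if we ask the store to shrink down to shrinkTo.
    static long bytesSpilled(long currentStoreBytes, long shrinkTo) {
        return Math.max(0L, currentStoreBytes - shrinkTo);
    }

    public static void main(String[] args) {
        long storeLimit = 4L << 30;               // 4 GiB host store limit
        long currentStoreBytes = storeLimit;      // store is currently full
        long allocSize = 64L << 20;               // the allocation only needs 64 MiB
        long targetSize = storeLimit - allocSize; // shrink just enough to fit it

        // Correct argument spills ~64 MiB; the wrong one spills nearly the whole store.
        System.out.println("spill targeting targetSize: " + bytesSpilled(currentStoreBytes, targetSize));
        System.out.println("spill targeting allocSize:  " + bytesSpilled(currentStoreBytes, allocSize));
    }
}
```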
SpillPriorities$.MODULE$.ACTIVE_ON_DECK_PRIORITY(),
RapidsBufferCatalog$.MODULE$.singleton());
hBufs[0] = null; // Was closed by spillable
hBufs[1] = HostAlloc$.MODULE$.alloc(offsetBytes, true);
do we need to wrap this in a try/catch so we can close sBufs[0] if alloc throws?
oh nm, I think allocBuffersWithRestore takes care of it.
I think there may be a problem here actually. If we throw with something other than a retry exception, the withRestoreOnRetry won't handle it.
yeah that would be an issue
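A minimal sketch of that failure mode, with a hypothetical helper and flow rather than the PR's actual code: if the second host allocation throws something other than a retry exception, the first buffer still has to be closed before the exception propagates.

```java
import ai.rapids.cudf.HostMemoryBuffer;

// Minimal sketch of the concern above (hypothetical flow, not the PR's code):
// close the already-allocated buffer when a later allocation fails with an
// exception the retry/restore machinery will not handle.
public final class CloseOnFailureSketch {
    static HostMemoryBuffer[] allocPair(long dataBytes, long offsetBytes) {
        HostMemoryBuffer dataBuf = HostMemoryBuffer.allocate(dataBytes);
        try {
            HostMemoryBuffer offsetsBuf = HostMemoryBuffer.allocate(offsetBytes);
            return new HostMemoryBuffer[] { dataBuf, offsetsBuf };
        } catch (Throwable t) {
            // Not a retry exception we can recover from here, so release what we hold.
            dataBuf.close();
            throw t;
        }
    }
}
```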
LGTM
return (int) Math.max(1,
    Math.min(Integer.MAX_VALUE - 1, targetBytes / sizePerRowEstimate));
nit: casting is forced by targetBytes / sizePerRowEstimate? I'd prefer pushing it down to the origin:
-    return (int) Math.max(1,
-        Math.min(Integer.MAX_VALUE - 1, targetBytes / sizePerRowEstimate));
+    return Math.max(1,
+        Math.min(Integer.MAX_VALUE - 1, (int) (targetBytes / sizePerRowEstimate)));
sql-plugin/src/main/java/com/nvidia/spark/rapids/InternalRowToColumnarBatchIterator.java
Thanks for the review comments @abellina and @gerashegalov! I believe I have addressed them in this latest commit.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
build
This is a follow-up to #9929 and is part of addressing issue #8887.

This adds handling for CpuSplitAndRetryOOM in InternalRowToColumnarBatchIterator. For splits, we divide the target number of rows in half via splitTargetSizeInHalfCpu and recalculate the buffer sizes from that. It will keep trying that until we hit 1 row.

I also included a workaround for #10004 by combining the withHostBufferWriteLock and withHostBufferReadOnly blocks. In this case we hold the write lock for the full operation, but this should let us proceed without introducing data corruption until we have a fix for #10004.

One other fix is in HostAlloc, which was passing the wrong argument to RapidsBufferCatalog.synchronousSpill: it was passing allocSize instead of targetSize, causing us to spill a lot more than needed in some cases.
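As a rough illustration of the split handling described above, here is a hedged sketch using generic stand-in types and helpers, not the plugin's actual retry framework or the real CpuSplitAndRetryOOM class: on a split-and-retry OOM the target row count is halved and the batch is rebuilt, bottoming out at a single row.

```java
// Hedged sketch with stand-in types, not the plugin's retry framework:
// halve the target row count on a split-and-retry OOM and rebuild,
// giving up only once the target is down to a single row.
public final class SplitRetrySketch {
    /** Stand-in for an exception like CpuSplitAndRetryOOM. */
    static final class SplitAndRetryOOM extends RuntimeException {}

    interface BatchBuilder<T> {
        T build(int targetRows);
    }

    static <T> T buildWithSplitRetry(int initialTargetRows, BatchBuilder<T> builder) {
        int targetRows = Math.max(1, initialTargetRows);
        while (true) {
            try {
                // Buffer sizes would be recomputed from targetRows inside build().
                return builder.build(targetRows);
            } catch (SplitAndRetryOOM oom) {
                if (targetRows <= 1) {
                    throw oom; // cannot split below one row
                }
                targetRows = Math.max(1, targetRows / 2);
            }
        }
    }
}
```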