[DOC] Multi-Threaded shuffle documentation is not accurate on the read side #9512
Labels: documentation (Improvements or additions to documentation), shuffle (things that impact the shuffle plugin)
The Multi-Threaded shuffle documentation says:
Unfortunately, that isn't true for the read side. The write side of the shuffle does follow this, since Spark has different shuffle writer algorithms (bypass merge and merge sort). The read side is a single implementation in Spark, so there is no "bypass merge" reader or "merge sort" reader; it is just the reader. The documentation should state this, as it is currently incorrect.
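To make that asymmetry concrete, here is a short, self-contained Scala sketch. It is an illustration only, not Spark's actual code: the handle and writer names loosely mirror Spark's sort-based shuffle (bypass merge vs. merge sort), and the single `readerFor` path stands in for the one reader implementation described above.

```scala
// Simplified sketch (NOT Spark's actual code) of the write/read asymmetry:
// the sort-based shuffle picks between several writer algorithms, but every
// read goes through a single reader implementation.
object ShuffleDispatchSketch {
  sealed trait ShuffleHandle
  case object BypassMergeSortHandle extends ShuffleHandle // small partition counts
  case object BaseSortHandle        extends ShuffleHandle // general merge-sort path

  // Write side: the handle type selects the writer algorithm.
  def writerFor(handle: ShuffleHandle): String = handle match {
    case BypassMergeSortHandle => "bypass-merge writer"
    case BaseSortHandle        => "merge-sort writer"
  }

  // Read side: there is only one reader, regardless of how the data was written.
  def readerFor(handle: ShuffleHandle): String = "the shuffle reader"

  def main(args: Array[String]): Unit =
    Seq(BypassMergeSortHandle, BaseSortHandle).foreach { h =>
      println(s"$h -> write via ${writerFor(h)}, read via ${readerFor(h)}")
    }
}
```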
Note that we recently reduced `spark.rapids.shuffle.multiThreaded.maxBytesInFlight` from 2GB to 128MB because of memory constraints (#9153), and this is an ideal knob to control the number of bytes we allow to be in flight in the decompression/decode threads. Another option, to disable only the MT reader side entirely, is to set `spark.rapids.shuffle.multiThreaded.reader.threads=0`. This is another tool if a user is having issues at shuffle read time only.
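As a quick illustration, here is a minimal Scala sketch of setting these two knobs when building a SparkSession. It assumes the spark-rapids plugin and its multi-threaded shuffle manager are already configured for the cluster (not shown), and the literal value format is an assumption; check the current config docs for defaults and accepted units.

```scala
import org.apache.spark.sql.SparkSession

// Minimal sketch, assuming the spark-rapids plugin and its multi-threaded
// shuffle manager are already set up for this cluster (configuration not shown).
val spark = SparkSession.builder()
  .appName("mt-shuffle-read-tuning")
  // Cap the bytes allowed in flight in the MT reader's decompression/decode
  // threads. 134217728 bytes = 128MB, the reduced default mentioned above;
  // the accepted value format is an assumption, verify against the docs.
  .config("spark.rapids.shuffle.multiThreaded.maxBytesInFlight", "134217728")
  // Alternatively, disable only the multi-threaded read side entirely by
  // giving the reader zero threads, as described above:
  // .config("spark.rapids.shuffle.multiThreaded.reader.threads", "0")
  .getOrCreate()
```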