[GLUTEN-8492][CH] Offload RangeExec #8518

Open
wants to merge 12 commits into
base: main

Conversation

taiyang-li
Contributor

@taiyang-li taiyang-li commented Jan 13, 2025

What changes were proposed in this pull request?

(Please fill in changes proposed in this fix)

(Fixes: #8492)

How was this patch tested?

All existing UTs that use the range function. The UT below is excluded because its assertion that the plan contains a RangeExec operator fails once RangeExec is offloaded to Gluten.

  • "SPARK-27439: Explain result should match collected result after view change"
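
To illustrate why that kind of assertion breaks, here is a minimal sketch using a toy plan tree rather than Spark's or Gluten's actual classes. `PlanNode`, `offload_range`, and the operator name `OffloadedRange` are all hypothetical placeholders chosen for this example; they are not the names used in this PR.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PlanNode:
    """Toy physical-plan node; `name` stands in for the operator class name."""
    name: str
    children: List["PlanNode"] = field(default_factory=list)

    def collect(self, name: str) -> List["PlanNode"]:
        """Return every node in this subtree whose operator name matches."""
        found = [self] if self.name == name else []
        for child in self.children:
            found.extend(child.collect(name))
        return found


def offload_range(plan: PlanNode) -> PlanNode:
    """Rewrite rule: swap each RangeExec for a native replacement operator.

    "OffloadedRange" is a placeholder name, not the operator this PR adds.
    """
    new_name = "OffloadedRange" if plan.name == "RangeExec" else plan.name
    return PlanNode(new_name, [offload_range(c) for c in plan.children])


vanilla = PlanNode("ProjectExec", [PlanNode("RangeExec")])
offloaded = offload_range(vanilla)

# The vanilla plan satisfies a "contains RangeExec" check...
print(len(vanilla.collect("RangeExec")))
# ...but the rewritten plan no longer does, so the same assertion would fail.
print(len(offloaded.collect("RangeExec")))
```

A test written against the vanilla operator name passes only on the first plan, which is the situation the excluded UT appears to be in.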

@github-actions bot added the CORE (works for Gluten Core), VELOX, and CLICKHOUSE labels on Jan 13, 2025

Thanks for opening a pull request!

Could you open an issue for this pull request on GitHub Issues?

https://github.com/apache/incubator-gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:


Run Gluten Clickhouse CI on x86


@taiyang-li taiyang-li changed the title [WIP] Support range functions [CH] Support range functions Jan 13, 2025

@taiyang-li taiyang-li changed the title [CH] Support range functions [GLUTEN-8492][CH] Support range functions Jan 14, 2025
@taiyang-li taiyang-li marked this pull request as ready for review January 14, 2025 04:14

#8492


@taiyang-li taiyang-li changed the title [GLUTEN-8492][CH] Support range functions [GLUTEN-8492][CH] Offload RangeExec Jan 14, 2025

@taiyang-li taiyang-li requested review from zhztheplayer and PHILO-HE and removed request for PHILO-HE February 11, 2025 04:18
@taiyang-li
Contributor Author

For performance comparison refer to #8492 (comment)


@CodiumAI-Agent

Persistent review updated to latest commit febd663

Contributor

@PHILO-HE PHILO-HE left a comment

Mainly reviewed the common code. Basically looks good!

@PHILO-HE
Contributor

Maybe we can move the existing GlutenSQLRangeExecSuite.scala from the spark33 UT folder to the gluten-ut/test folder, so that it is tested for all Spark versions and both backends.


@taiyang-li
Contributor Author

> Maybe we can move the existing GlutenSQLRangeExecSuite.scala from the spark33 UT folder to the gluten-ut/test folder, so that it is tested for all Spark versions and both backends.

Done.


Labels
CLICKHOUSE CORE works for Gluten Core VELOX
Development

Successfully merging this pull request may close these issues.

[CH] sum with filter bad performance compared to vanilla spark
3 participants