Refactor(optimizer): Optimize USING expansion #4115

VaggelisD · 2024-09-13T13:29:04Z

This PR is an optimization follow-up on #4113

When processing multiple joins such as A JOIN B JOIN C USING(col), we gather the columns from the previous sources (A, B etc) to expand USING (col) to ON c.col = <source>.col. Previously, this operation would happen from scratch on each iteration, which can be avoided. A few key observations:

The ordered list is constructed as a subset of scope.selected_sources; It can have [0, N] elements before the join loop
At each iteration, we'll add the current join_table to ordered; It is implied that join_table is a member of scope.selected_sources as it participates in the scope's JOIN clause(s) so (1) continues to hold

The combination of (1) and (2) means that we can avoid checking if source_name is both in scope.selected_sources and ordered; Besides that, (2) drives incremental construction of columns at each iteration.

This PR also adds one more test case of a longer JOIN chain; This is to ensure that columns from all intermediate join tables are gathered, even if there is no USING clause.

sqlglot/optimizer/qualify_columns.py

VaggelisD force-pushed the vaggelisd/qualify_cols branch from ce1e48a to 1f01944 Compare September 13, 2024 13:36

Refactor expand_using

2b5c8d0

VaggelisD force-pushed the vaggelisd/qualify_cols branch from 1f01944 to 2b5c8d0 Compare September 13, 2024 13:45

georgesittas reviewed Sep 13, 2024

View reviewed changes

sqlglot/optimizer/qualify_columns.py Outdated Show resolved Hide resolved

georgesittas approved these changes Sep 13, 2024

View reviewed changes

Set return type

7bc3bc4

georgesittas merged commit 7cf1d70 into main Sep 13, 2024
6 checks passed

georgesittas deleted the vaggelisd/qualify_cols branch September 13, 2024 15:17

georgesittas mentioned this pull request Sep 13, 2024

Chore(optimizer): rename helper function in expand_using #4117

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor(optimizer): Optimize USING expansion #4115

Refactor(optimizer): Optimize USING expansion #4115

VaggelisD commented Sep 13, 2024 •

edited

Loading

Refactor(optimizer): Optimize USING expansion #4115

Refactor(optimizer): Optimize USING expansion #4115

Conversation

VaggelisD commented Sep 13, 2024 • edited Loading

VaggelisD commented Sep 13, 2024 •

edited

Loading