
[SPIKE] Testing PR #6322 (build performance update) #6562

Open · iknox-fa opened this issue Jan 10, 2023 · 4 comments

Comments

@iknox-fa (Contributor) commented Jan 10, 2023

This ticket encompasses the work necessary to review PR #6322 (related to #6073) and determine the following:

  • Does this actually improve performance? The contributor claims some level of performance improvement, but we need proper benchmarks, and the core team should be responsible for building them, especially since initial testing on smaller projects and synthetic benchmarking projects doesn't seem to show much of an improvement.
  • Is this algorithm the right choice to use across the board? Does it have performance characteristics that fit everyone's needs?

Once this spike determines that this work is suitable for what we need, we can work with the contributor to correct some code issues and add some testing. If it isn't suitable, we should probably prioritize putting dbt engineering hours into this, since it has clearly been an issue for some time and, judging by Slack conversations, there's a real pain point to be solved here.

This ticket should also drive us to set up better performance monitoring around the build command.

Slack refs:

@github-actions github-actions bot changed the title [SPIKE] Testing [CT-1781] [SPIKE] Testing Jan 10, 2023
@iknox-fa iknox-fa changed the title [CT-1781] [SPIKE] Testing [SPIKE] Testing PR #6322 (build performance update) Jan 10, 2023
@jtcohen6 (Contributor)

Prior art for performance benchmarking + regression testing: https://github.com/dbt-labs/dbt-core/tree/main/performance

With partial parsing enabled, I believe we could use this top-level command to reliably isolate the time associated with compiling a DAG with "extra test edges": `dbt build --exclude fqn:*`
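For reference, here is a minimal sketch of how that command could be timed against a baseline install and one with #6322 applied. The virtualenv paths and run count below are assumptions for illustration, not an established benchmark setup:

```python
import statistics
import subprocess
import time

# Hypothetical comparison: time `dbt build --exclude fqn:*` (partial parsing
# enabled) under a baseline install and under one with the #6322 patch applied.
# The virtualenv paths here are placeholders.
ENVS = {
    "baseline-1.3.2": ".venv-baseline/bin/dbt",
    "with-pr-6322": ".venv-pr6322/bin/dbt",
}
RUNS = 5

def time_build(dbt_bin: str) -> float:
    """Run the DAG-only build and return its wall-clock duration in seconds."""
    start = time.perf_counter()
    subprocess.run([dbt_bin, "build", "--exclude", "fqn:*"], check=True)
    return time.perf_counter() - start

for name, dbt_bin in ENVS.items():
    # Note: the first run in each environment pays the full-parse cost that
    # seeds the partial-parsing cache, so a warm-up pass may be worth adding.
    samples = [time_build(dbt_bin) for _ in range(RUNS)]
    print(f"{name}: mean {statistics.mean(samples):.1f}s, "
          f"stdev {statistics.stdev(samples):.1f}s over {RUNS} runs")
```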

@nathaniel-may (Contributor)

The scope of this ticket is just to decide whether or not we want to merge this PR. Additional foundational performance improvements are out of scope for now.

@boxysean (Contributor)

Using dbt's built-in timing info (thanks @dbeatty10!), I ran `dbt --no-partial-parse compile` on a gnarly DAG of 8,468 models and 17,103 tests. It took 14m4s on dbt-core#tag/v1.3.2, and 16m3s on dbt-core#tag/v1.3.2 with the #6322 patch applied to dbt/graph/graph.py.

Looking at the charts below, the changed code in #6322 does indeed seem to have added time.

[snakeviz profiling charts]
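For anyone reproducing this, here is a minimal sketch of inspecting such a profile without snakeviz, using only the standard library. It assumes the cProfile dump was produced with dbt's `--record-timing-info` flag and written to a hypothetical timing.prof:

```python
import pstats

# Assumption: the profile was recorded with something like
#   dbt --record-timing-info timing.prof --no-partial-parse compile
# and "timing.prof" is a placeholder filename.
stats = pstats.Stats("timing.prof")
stats.sort_stats("cumulative")

# Print the 20 most expensive call sites; graph-related frames
# (e.g. dbt/graph/graph.py) are where the #6322 changes would surface.
stats.print_stats(20)
```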

@github-actions (bot)

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please comment on the issue or else it will be closed in 7 days.

@github-actions github-actions bot added the stale Issues that have gone stale label Feb 16, 2024
@github-actions github-actions bot removed the stale Issues that have gone stale label Feb 17, 2024