Add `PrefectDbtRunner` #16971

kevingrismore · 2025-02-04T23:52:39Z

This is a new primary interface for dbt Core.

Example usage:

from prefect import flow
from prefect_dbt import PrefectDbtRunner, PrefectDbtSettings


@flow
def run_dbt():
    runner = PrefectDbtRunner(
        settings=PrefectDbtSettings(project_dir="test", profiles_dir="examples/run_dbt")
    )
    runner.invoke(["run"])


if __name__ == "__main__":
    run_dbt()

Because this replaces the functionality of everything in prefect_dbt.cli, any import from that path raises the following warning:

"prefect_dbt.cli is deprecated and will be removed in a future release. Please use prefect_dbt.core instead."

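One way a package can emit a warning like this when a name is imported from a deprecated path is a module-level `__getattr__` (PEP 562). The sketch below is illustrative only: `make_deprecated_module`, `legacy_cli`, and `legacy_name` are hypothetical, and the use of `DeprecationWarning` as the category is an assumption rather than something this PR confirms.

```python
import types
import warnings

MESSAGE = (
    "prefect_dbt.cli is deprecated and will be removed in a future release. "
    "Please use prefect_dbt.core instead."
)


def make_deprecated_module(name: str, exports: dict) -> types.ModuleType:
    """Build a stand-in module that warns whenever one of its exports is accessed."""
    module = types.ModuleType(name)

    def _getattr(attr: str):
        # Called only when normal attribute lookup on the module fails (PEP 562).
        if attr in exports:
            warnings.warn(MESSAGE, DeprecationWarning, stacklevel=2)
            return exports[attr]
        raise AttributeError(f"module {name!r} has no attribute {attr!r}")

    module.__getattr__ = _getattr
    return module


# Hypothetical deprecated module exposing one hypothetical name.
legacy_cli = make_deprecated_module("legacy_cli", {"legacy_name": object})

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    _ = legacy_cli.legacy_name  # triggers the warning

print(caught[0].message)
```
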
Improved

Logging

The previous implementation assumed the presence of a Prefect run logger, ignored debug logs, and treated all other log levels as INFO.

The PrefectDbtRunner translates all log levels from dbt to standard Python logging levels, and logs exclusively to a Prefect run logger or directly to the terminal depending on the context in which it was invoked.
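As a rough illustration of the level translation described above, here is a minimal sketch using only the standard library. The dbt level names in the mapping are an assumption (check the `EventLevel` enum in your dbt version), and `to_python_level` and `log_dbt_message` are hypothetical helpers, not part of prefect-dbt.

```python
import logging

# Assumed dbt level names mapped to standard Python logging levels.
DBT_TO_PYTHON_LEVEL = {
    "debug": logging.DEBUG,
    "test": logging.DEBUG,
    "info": logging.INFO,
    "warn": logging.WARNING,
    "error": logging.ERROR,
}


def to_python_level(dbt_level: str) -> int:
    """Translate a dbt level name, falling back to INFO for unknown names."""
    return DBT_TO_PYTHON_LEVEL.get(dbt_level.lower(), logging.INFO)


def log_dbt_message(logger: logging.Logger, dbt_level: str, message: str) -> None:
    """Forward a dbt message to whichever logger the caller selected."""
    logger.log(to_python_level(dbt_level), message)


logging.basicConfig(level=logging.DEBUG)
log_dbt_message(logging.getLogger("dbt"), "warn", "deprecated config key")
```

The logger is passed in so the same helper works whether the caller holds a run logger or a plain terminal logger.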

Failure handling

In the previous implementation, a failed dbt run was handled by returning a Failed state with the dbt exception in the state message. However, exceptions from RunResults in dbt are primarily an indication of failure, whereas the RunExecutionResult contains more detailed messages about status and failure reason.

The PrefectDbtRunner raises an exception when a dbt run fails, with a standardized message containing detailed failure information. The raise_on_failure option controls whether that exception is raised, since a failing test does not always mean the project as a whole has failed.
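To make the raise_on_failure behavior concrete, here is a self-contained sketch of the decision it controls. `NodeResult`, `DbtRunFailed`, `check_results`, and the status strings are all hypothetical stand-ins for illustration; they are not the runner's actual types or message format.

```python
from dataclasses import dataclass

FAILED_STATUSES = {"error", "fail"}  # assumed status strings


@dataclass
class NodeResult:
    """Hypothetical stand-in for one node's outcome in a dbt run."""
    name: str
    status: str
    message: str = ""


class DbtRunFailed(RuntimeError):
    """Hypothetical exception carrying a summary of failed nodes."""


def check_results(results: list[NodeResult], raise_on_failure: bool = True) -> list[NodeResult]:
    """Return failed nodes, raising a summarized error first if requested."""
    failures = [r for r in results if r.status in FAILED_STATUSES]
    if failures and raise_on_failure:
        details = "; ".join(f"{r.name}: {r.status} ({r.message})" for r in failures)
        raise DbtRunFailed(f"{len(failures)} dbt node(s) failed: {details}")
    return failures


results = [
    NodeResult("model.orders", "success"),
    NodeResult("test.not_null_orders_id", "fail", "Got 3 results"),
]

# With raise_on_failure=False the caller inspects failures instead of catching.
failures = check_results(results, raise_on_failure=False)
print([f.name for f in failures])
```
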

New

`PrefectDbtSettings`

Added in #16834

Using Pydantic's BaseSettings, this class automatically detects a common set of DBT_-prefixed env vars upon instantiation. Creating a PrefectDbtRunner without passing in a PrefectDbtSettings instance will construct one by default, detecting dbt config from the environment.
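The detection idea can be sketched without Pydantic. The example below is a standard-library stand-in, not the PrefectDbtSettings implementation: `DbtEnvSettings` and its defaults are assumptions, while `DBT_PROJECT_DIR` and `DBT_PROFILES_DIR` are env vars dbt itself recognizes.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping, Optional


@dataclass
class DbtEnvSettings:
    """Hypothetical stand-in that reads DBT_-prefixed env vars."""
    project_dir: Path
    profiles_dir: Path

    @classmethod
    def from_env(cls, env: Optional[Mapping[str, str]] = None) -> "DbtEnvSettings":
        env = os.environ if env is None else env
        return cls(
            # Defaults here are assumptions chosen for the sketch.
            project_dir=Path(env.get("DBT_PROJECT_DIR", ".")),
            profiles_dir=Path(env.get("DBT_PROFILES_DIR", str(Path.home() / ".dbt"))),
        )


settings = DbtEnvSettings.from_env(
    {"DBT_PROJECT_DIR": "test", "DBT_PROFILES_DIR": "examples/run_dbt"}
)
print(settings)
```
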

`profiles.yml` Templating

Added in #16889

When dbt commands are invoked through the PrefectDbtRunner, templated references to Prefect blocks and Prefect variables in your workspace are resolved at runtime. This lets users commit their profiles.yml to their dbt project repositories without exposing secrets, and allows dbt target configs to be mapped to workspaces.

For example, set a dbt_target Prefect variable to "stg" in a staging workspace and to "prod" in a production workspace. A dbt project executed with the PrefectDbtRunner then targets the staging database when run in the staging workspace and the production database when run in the production workspace, with no env vars or other environment-specific configuration to manage.

test:
  outputs:
    stg:
      type: duckdb
      path: dev.duckdb
      threads: 1

    prod:
      type: duckdb
      path: prod.duckdb
      threads: 4
      password: "{{ prefect.blocks.secret.my-password }}"

  target: "{{ prefect.variables.dbt_target }}"
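To show what resolving the placeholders in the profile above amounts to, here is a minimal sketch that substitutes them from plain dictionaries. It is not the runner's implementation: `resolve_templates` is hypothetical, and real resolution would look the values up in the Prefect workspace rather than in local dicts.

```python
import re

# Matches {{ prefect.variables.<name> }} and {{ prefect.blocks.<type>.<name> }}.
PLACEHOLDER = re.compile(r"\{\{\s*prefect\.(variables|blocks)\.([\w.\-]+)\s*\}\}")


def resolve_templates(text: str, variables: dict, blocks: dict) -> str:
    """Replace each placeholder with its value, failing loudly if one is missing."""

    def _lookup(match: re.Match) -> str:
        kind, key = match.group(1), match.group(2)
        source = variables if kind == "variables" else blocks
        if key not in source:
            raise KeyError(f"unresolved placeholder prefect.{kind}.{key}")
        return str(source[key])

    return PLACEHOLDER.sub(_lookup, text)


profile_line = 'target: "{{ prefect.variables.dbt_target }}"'
resolved = resolve_templates(profile_line, variables={"dbt_target": "stg"}, blocks={})
print(resolved)  # target: "stg"
```
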

Events

A Prefect event is emitted each time a Node Finished event occurs in dbt. The emitted event contains a dbt.node.status field on its primary resource, recording the final state of the node's execution. This enables prefect-dbt users to create automations that fire conditionally on the state of individual nodes in their dbt DAGs, kicking off downstream jobs after particular table updates or alerting on specific test failures.

Node Finished event emission can be disabled for a resource in your dbt project by setting

prefect:
  emit_node_events: False

in your dbt resource's config.
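The opt-out check and the shape of the primary resource can be sketched as follows. This assumes the node's config is available as a plain dict and that emission defaults to on; the helper names and the `prefect.resource.id` scheme are hypothetical, while the `dbt.node.status` field comes from the description above.

```python
from typing import Optional


def node_events_enabled(node_config: dict) -> bool:
    """Emission is assumed to be on unless the node's config opts out."""
    prefect_config = node_config.get("prefect") or {}
    return bool(prefect_config.get("emit_node_events", True))


def maybe_build_resource(unique_id: str, status: str, node_config: dict) -> Optional[dict]:
    """Return the event's primary resource, or None if the node opted out."""
    if not node_events_enabled(node_config):
        return None
    return {
        "prefect.resource.id": f"dbt.{unique_id}",  # hypothetical id scheme
        "dbt.node.status": status,
    }


print(maybe_build_resource("model.orders", "success", {}))
print(maybe_build_resource("model.scratch", "success", {"prefect": {"emit_node_events": False}}))
```
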

Lineage (Prefect Cloud only)

The PrefectDbtRunner will automatically emit events describing the lineage graph of your dbt DAG by default.

The dbt resource types that constitute Prefect lineage resources are limited to NODE_TYPES_TO_EMIT_LINEAGE, which currently includes models, seeds, and snapshots.

If invoked in a flow run, each node executed during the run will appear as a resource on the flow run page in Prefect Cloud. Clicking "View graph" on a resource will display a lineage graph with several degrees of upstream and downstream resources pre-populated.

A resource in your dbt project can be omitted from the lineage graph by setting

prefect:
  emit_lineage_events: False

in your dbt resource's config.
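A sketch of the filtering this implies: keep only nodes whose type is in NODE_TYPES_TO_EMIT_LINEAGE and that have not opted out, and drop upstream references to anything that was filtered away. The string values of the constant, the node dict layout, and `lineage_graph` are assumptions made for illustration.

```python
# Assumed string values; the real constant may hold dbt enum members instead.
NODE_TYPES_TO_EMIT_LINEAGE = {"model", "seed", "snapshot"}


def lineage_graph(nodes: dict) -> dict:
    """Map each kept node id to the list of its kept upstream node ids."""
    kept = {
        uid
        for uid, node in nodes.items()
        if node["resource_type"] in NODE_TYPES_TO_EMIT_LINEAGE
        and node.get("config", {}).get("prefect", {}).get("emit_lineage_events", True)
    }
    return {
        uid: [up for up in nodes[uid].get("depends_on", []) if up in kept]
        for uid in sorted(kept)
    }


nodes = {
    "model.orders": {
        "resource_type": "model",
        "depends_on": ["seed.raw_orders", "source.shop.orders"],
    },
    "seed.raw_orders": {"resource_type": "seed"},
    "source.shop.orders": {"resource_type": "source"},
    "model.scratch": {
        "resource_type": "model",
        "config": {"prefect": {"emit_lineage_events": False}},
        "depends_on": ["model.orders"],
    },
}

graph = lineage_graph(nodes)
print(graph)
```
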

Checklist

- This pull request references any related issue by including "closes <link to issue>"
  - If no issue exists and your change is not a small fix, please create an issue first.
- If this pull request adds new functionality, it includes unit tests that cover the changes
- If this pull request removes docs files, it includes redirect settings in mint.json.
- If this pull request adds functions or classes, it includes helpful docstrings.

aaazzam · 2025-02-05T04:37:51Z

src/integrations/prefect-dbt/prefect_dbt/core/runner.py

+            payload=event_data,
+        )
+
+    def _get_dbt_event_msg(self, event: EventMsg) -> str:


kinda question the need for a full method for this however private

zzstoatzz · 2025-02-05

i think i had suggested this pattern earlier to isolate upstream type warts

kevingrismore · 2025-02-05

yep that's exactly what I'm doing

aaazzam · 2025-02-05T04:38:58Z

src/integrations/prefect-dbt/prefect_dbt/core/runner.py

+
+            self.manifest = res.result
+
+    async def ainvoke(self, args: list[str], **kwargs: Any):


double checking this accepts a single positional argument called args that's a list of str and not several positional args that are each a string?

kevingrismore · 2025-02-05

It's a direct args passthrough to the underlying invoke from dbt. It may not feel the best, but it is at least decently documented, so we can refer people there.

zzstoatzz · 2025-02-05

> new primary interface

looking good @kevingrismore! can we add some color here on how this sits in relation to today's prefect-dbt? i.e. what's entirely new vs. similar but different

kevingrismore · 2025-02-05T16:41:23Z

> > new primary interface
>
> looking good @kevingrismore! can we add some color here on how this sits in relation to today's prefect-dbt? i.e. what's entirely new vs. similar but different

Updated the description @zzstoatzz

kevingrismore · 2025-02-07T20:04:48Z

I did something stupid with rebase 😭 hang on

kevingrismore added the integrations label (Related to integrations with other services) Feb 4, 2025

kevingrismore marked this pull request as ready for review February 5, 2025 00:07

kevingrismore requested review from cicdw, desertaxle and zzstoatzz as code owners February 5, 2025 00:07

kevingrismore changed the title ~~add PrefectDbtRunner~~ Add PrefectDbtRunner Feb 5, 2025

aaazzam approved these changes Feb 5, 2025

zzstoatzz reviewed Feb 5, 2025

kevingrismore requested review from devinvillarosa, devangrose, pleek91, chrisguidry, daniel-prefect, seanpwlms and znicholasbrown as code owners February 7, 2025 20:03

github-actions bot added the migration label Feb 7, 2025

kevingrismore force-pushed the add-dbt-core-runner branch from cb28ba7 to 5733156 February 7, 2025 20:11

kevingrismore removed request for chrisguidry, pleek91, znicholasbrown, devangrose, seanpwlms, devinvillarosa and daniel-prefect February 7, 2025 20:12

kevingrismore added 4 commits February 7, 2025 14:17

- add PrefectDbtRunner (dbfecd8)
- remove debug prints (2ad960a)
- omit skipped lineage nodes from upstream (59835be)
- add more tests (4ef276d)

kevingrismore added 4 commits February 7, 2025 14:17

- dynamic node finished event names (8ebd990)
- fix node finished tests (be5e1e8)
- add deprecation warning (063e889)
- handle dynamic imports with deprecation warning (e62767b)

kevingrismore force-pushed the add-dbt-core-runner branch from 5733156 to e62767b February 7, 2025 20:50

kevingrismore added 4 commits February 7, 2025 14:58

- test dynamic imports (a6fe23f)
- fix import test (75be56b)
- fix import test (7643d28)
- really fix import test (ce014e0)

kevingrismore merged commit b5012c4 into main Feb 7, 2025
15 checks passed

kevingrismore deleted the add-dbt-core-runner branch February 7, 2025 21:46
