Enable local test runner for Kubeflow Pipelines #1138

neuromage · 2019-04-11T16:37:30Z

Today, there is no way to run a KFP pipeline on a local machine (outside of running a mini Kubernetes cluster locally). This makes testing pipelines for user difficult as they always have to go through the steps of uploading to a cluster to run even small tests on the correctness of the pipeline.

To solve this problem, it would be nice if KFP offers a way to run pipelines locally. Such a mode does not have to have full fidelity with what's possible in a cluster run with Argo to begin with. A simple approach of using docker run to run each step sequentially, with a mounted local volume for passing parameters, will go a long way towards satisfying most local-run use-cases.

As a suggestion, we can modify the pipeline decorator to enable this behaviour when the user adds a keyword like test = True, i.e.

@dsl.Pipeline(
   name = "..",
   test = True,
   ...

The text was updated successfully, but these errors were encountered:

issue-label-bot · 2019-04-11T16:37:32Z

Issue-Label Bot is automatically applying the label feature_request to this issue, with a confidence of 0.98. Please mark this comment with 👍 or 👎 to give our bot feedback!

Links: dashboard, app homepage and code for this bot.

vicaire · 2019-04-11T19:40:32Z

Some ideal requirements to consider:

The local executor should support all the same features as execution within a GKE cluster.
The local executor should not duplicate the code of the controllers used to execute the pipeline.
The local executor should execute the yaml, not Python, so that both yaml and Python can be tested.

Ark-kun · 2019-04-12T02:37:06Z

The local executor should support all the same features as execution within a GKE cluster.

I guess only the proper local kubernetes cluster can satisfy this requirement. The test environment needs Minikube installed.

Ark-kun · 2019-04-12T02:41:05Z

As a suggestion, we can modify the pipeline decorator to enable this behaviour when the user adds a keyword like test = True, i.e.

I think it's better have a function to perform this instead of overloading the decorator:

kfp.run_pipeline_locally(my_pipeline, arguments={...})

If we want to differentiate, we can have

run_pipeline_locally
run_pipeline_using_docker
run_pipeline_on_kubernetes
run_pipeline_on_kfp

vicaire · 2019-04-12T02:41:21Z

BTW, what about providing users with a VM image that has all the tools installed (minikube and etc.) to make local execution easy?

ucdmkt · 2019-04-12T17:21:21Z

Thank you so much for tracking this issue.

Another thing to consider is that we may want to have a way to inject a stub to a component inside the pipeline, if a component is making call to foreign services such as Dataflow.

kevinbache · 2019-04-16T20:49:56Z

related: #1104

neuromage · 2020-02-14T01:37:50Z

Closing as infeasible/obsolete for now.

neuromage · 2020-05-26T17:33:48Z

@Ark-kun not sure if you intentionally meant to re-open this issue? Are you working on local runner for KFP?

Ark-kun · 2020-05-28T22:42:00Z

@Ark-kun not sure if you intentionally meant to re-open this issue? Are you working on local runner for KFP?

I think that your idea is still pretty valid and useful. It would be nice to have that feature for component testing and experimentation. I've recently started creating components in my free time and testing them was not that easy without having a Kubernetes cluster.
Our SDK already runs some tests in a local environment, but it would be less hacky to just use docker.
So I just wanted to keep this feature request in mind.
The priority of this is not P0 or P1 of course.

What do you think?

rmgogogo · 2020-06-16T16:12:54Z

FYI, I'm thinking on bring a lightweight local runner in a new design. It may rely on TFX SDK's docker launcher. It's still early investigation phase, together with IR work.

numerology · 2020-06-16T16:21:57Z

FYI, for testing TFX has this tensorflow/tfx#1986 WIP.

That said it's pretty different than local dev.

rmgogogo · 2020-06-22T07:25:56Z

(in IR based impl, we may handle it with Docker runner)

stale · 2020-09-20T07:46:29Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

Bobgy · 2020-09-23T13:06:52Z

/lifecycle frozen
we'd still want this open

NikeNano · 2020-11-10T21:12:55Z

Are we interested in adding instructions on how to deploy locally to Minikube based upon the local code for development @Bobgy? Will not be a big change but would require the user to build all the images and add an overlay that uses these images instead of the official released images.

Bobgy · 2020-11-15T10:00:20Z

@NikeNano I think that's a different topic, this issue is about creating a local runner that helps experiment with KFP. Rather than developing KFP.

lynnmatrix · 2021-01-13T04:00:16Z

@Bobgy @Ark-kun @numerology I have create a PR(#4983) trying to provide a local runner which will run Kubeflow pipeline on docker or locally.
As @Ark-kun suggested, kfp.run_pipeline_func_locally(my_pipeline, arguments={...}) is picked.

…ixes #1138 (#4983) * add local runner which will run ops in docker or locally * use str.format rather than f-string * add some brief doc string in local client * comment the unittest about running op in docker, which is not supported in CI env for now * Add some brief docstring about DAG used in local client * make graph/reverse_graph of DAG as property to keep them in sync * make some methods of LocalClient static * remove circular reference in local client * Incapsulate artifact storage root in the constuctor of LocalClient * Add Alpha notice for kfp.run_pipeline_func_locally * Support list of local images in kfp.run_pipeline_func_locally * make staticmethod to module level private method * Trivial modification according to code review, some renaming or docstring * local runner support components without '--' as argument prefix * make output file of op in loop unique * Local runner decides whether run component in docker or in local process base on ExecutionMode

neuromage added the feature_request label Apr 11, 2019

vicaire added area/sdk/dsl kind/feature priority/p0 and removed feature_request labels Apr 11, 2019

vicaire assigned neuromage Apr 11, 2019

Ark-kun self-assigned this Oct 19, 2019

neuromage closed this as completed Feb 14, 2020

Ark-kun unassigned neuromage May 26, 2020

Ark-kun reopened this May 26, 2020

Ark-kun added priority/p2 and removed priority/p0 labels May 28, 2020

rmgogogo self-assigned this Jun 22, 2020

rmgogogo added the needs investigation label Jun 22, 2020

stale bot added the lifecycle/stale The issue / pull request is stale, any activities remove this label. label Sep 20, 2020

k8s-ci-robot added lifecycle/frozen and removed lifecycle/stale The issue / pull request is stale, any activities remove this label. labels Sep 23, 2020

lynnmatrix mentioned this issue Jan 13, 2021

feat(sdk): Add local runner which will run ops in docker or locally. Fixes #1138 #4983

Merged

google-oss-robot closed this as completed in #4983 Feb 24, 2021

This was referenced Oct 14, 2024

[Snyk] Fix for 22 vulnerabilities VaniHaripriya/data-science-pipelines#331

Closed

[Snyk] Fix for 22 vulnerabilities VaniHaripriya/data-science-pipelines#332

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable local test runner for Kubeflow Pipelines #1138

Enable local test runner for Kubeflow Pipelines #1138

neuromage commented Apr 11, 2019

issue-label-bot bot commented Apr 11, 2019

vicaire commented Apr 11, 2019

Ark-kun commented Apr 12, 2019

Ark-kun commented Apr 12, 2019

vicaire commented Apr 12, 2019

ucdmkt commented Apr 12, 2019

kevinbache commented Apr 16, 2019

neuromage commented Feb 14, 2020

neuromage commented May 26, 2020

Ark-kun commented May 28, 2020

rmgogogo commented Jun 16, 2020

numerology commented Jun 16, 2020

rmgogogo commented Jun 22, 2020

stale bot commented Sep 20, 2020

Bobgy commented Sep 23, 2020

NikeNano commented Nov 10, 2020

Bobgy commented Nov 15, 2020 •

edited

Loading

lynnmatrix commented Jan 13, 2021 •

edited

Loading

Enable local test runner for Kubeflow Pipelines #1138

Enable local test runner for Kubeflow Pipelines #1138

Comments

neuromage commented Apr 11, 2019

issue-label-bot bot commented Apr 11, 2019

vicaire commented Apr 11, 2019

Ark-kun commented Apr 12, 2019

Ark-kun commented Apr 12, 2019

vicaire commented Apr 12, 2019

ucdmkt commented Apr 12, 2019

kevinbache commented Apr 16, 2019

neuromage commented Feb 14, 2020

neuromage commented May 26, 2020

Ark-kun commented May 28, 2020

rmgogogo commented Jun 16, 2020

numerology commented Jun 16, 2020

rmgogogo commented Jun 22, 2020

stale bot commented Sep 20, 2020

Bobgy commented Sep 23, 2020

NikeNano commented Nov 10, 2020

Bobgy commented Nov 15, 2020 • edited Loading

lynnmatrix commented Jan 13, 2021 • edited Loading

Bobgy commented Nov 15, 2020 •

edited

Loading

lynnmatrix commented Jan 13, 2021 •

edited

Loading