Commit

Updated tensorboard example

stefannica committed Mar 7, 2022
1 parent 2bbe2bf commit 2806dc5
Showing 4 changed files with 31 additions and 165 deletions.
145 changes: 13 additions & 132 deletions examples/tensorboard/README.md
@@ -1,158 +1,39 @@
# Deploy pipelines to production using Kubeflow Pipelines
# Visualize model statistics with Tensorboard

When developing ML models, you probably develop your pipelines on your local machine initially, as this allows for
quicker iteration and debugging. However, at a certain point, when you are finished with their design, you might want
to transition to a more production-ready setting and deploy your pipelines to a more robust environment.

You can also watch a video of this example [here](https://www.youtube.com/watch?v=b5TXRYkdL3w).
This example features a pipeline that trains a Keras model and logs Tensorboard
information that can be visualized after the pipeline run is over.
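
For orientation, a trainer step that emits Tensorboard logs might look roughly like the sketch below. This is a
minimal illustration rather than this repository's actual `run.py`: the step signature, the
`get_output_artifact_uri()` helper, and the model architecture are assumptions about the ZenML and Keras APIs in use.

```python
# A minimal sketch (not the example's actual code) of a trainer step
# that writes Tensorboard logs next to its output artifact.
import numpy as np
import tensorflow as tf

from zenml.steps import StepContext, step


@step
def trainer(
    context: StepContext,
    x_train: np.ndarray,
    y_train: np.ndarray,
) -> tf.keras.Model:
    """Train a simple Keras classifier and record Tensorboard logs."""
    model = tf.keras.Sequential(
        [
            tf.keras.layers.Flatten(input_shape=(28, 28)),
            tf.keras.layers.Dense(10),
        ]
    )
    model.compile(
        optimizer=tf.keras.optimizers.Adam(0.001),
        loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
        metrics=["accuracy"],
    )
    # Assumption: the step context exposes the output artifact URI; writing
    # the logs there is what lets them be located after the run is over.
    log_dir = context.get_output_artifact_uri()
    model.fit(
        x_train,
        y_train,
        epochs=5,
        callbacks=[tf.keras.callbacks.TensorBoard(log_dir=log_dir)],
    )
    return model
```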

## Pre-requisites

In order to run this example, we have to install a few tools that allow ZenML to spin up a local Kubeflow Pipelines
setup:

* [K3D](https://k3d.io/v5.2.1/#installation) to spin up a local Kubernetes cluster
* The Kubernetes command-line tool [Kubectl](https://kubernetes.io/docs/tasks/tools/#kubectl) to deploy
Kubeflow Pipelines
* [Docker](https://docs.docker.com/get-docker/) to build docker images that run your pipeline in Kubernetes
pods (**Note**: the local Kubeflow Pipelines deployment requires more than 2 GB of RAM, so if you're using
Docker Desktop make sure to update the resource limits in the preferences)


## Installation

Next, we will install ZenML, get the code for this example and initialize a ZenML repository:
In order to run this example, you need to install and initialize ZenML:

```bash
# Install python dependencies
pip install zenml

# Install ZenML integrations
zenml integration install kubeflow
zenml integration install sklearn
zenml integration install tensorflow

# Pull the kubeflow example
zenml example pull kubeflow
cd zenml_examples/kubeflow
# Pull the tensorboard example
zenml example pull tensorboard
cd zenml_examples/tensorboard

# Initialize a ZenML repository
zenml init
```

## Run on a local Kubeflow Pipelines deployment

### Create a local Kubeflow Pipelines Stack

Now, with all the installation and initialization out of the way, all that's left to do is configure our
ZenML [stack](https://docs.zenml.io/core-concepts). For this example, the stack we create consists of the
following four parts:
* The **local artifact store** stores step outputs on your hard disk.
* The **local metadata store** stores metadata like the pipeline name and step parameters inside a local SQLite database.
* The docker images that are created to run your pipeline are stored in a local docker **container registry**.
* The **Kubeflow orchestrator** is responsible for running your ZenML pipeline in Kubeflow Pipelines.

```bash
# Make sure to create the local registry on port 5000 for it to work
zenml container-registry register local_registry --type=default --uri=localhost:5000
zenml orchestrator register kubeflow_orchestrator --type=kubeflow
zenml stack register local_kubeflow_stack \
-m local_metadata_store \
-a local_artifact_store \
-o kubeflow_orchestrator \
-c local_registry

# Activate the newly created stack
zenml stack set local_kubeflow_stack
```

### Start up Kubeflow Pipelines locally
ZenML takes care of setting up and configuring the local Kubeflow Pipelines deployment. All we need to do is run:
```bash
zenml stack up
```
When the setup is finished, you should see a local URL which you can access in your browser and take a look at the
Kubeflow Pipelines UI.


### Run the pipeline
We can now run the pipeline by simply executing the python script:
### Run the project
Now we're ready. Execute:

```bash
python run.py
```

This will build a docker image containing all the necessary python packages and files, push it to the local container
registry and schedule a pipeline run in Kubeflow Pipelines.
Once the script is finished, you should be able to see the pipeline run [here](http://localhost:8080/#/runs).

### Clean up
Once you're done experimenting, you can delete the local Kubernetes cluster and all associated resources by calling:
In order to clean up, delete the remaining ZenML references.

```bash
zenml stack down
```

## Run the same pipeline on Kubeflow Pipelines deployed to GCP

We will now run the same pipeline in Kubeflow Pipelines deployed to a Google Kubernetes Engine cluster.
As you can see from the long list of additional pre-requisites, this currently requires a lot of external setup
steps. In future releases, ZenML will be able to automate most of these steps for you, so make sure to revisit this
guide if this is something you're interested in!

### Additional pre-requisites

* An existing [GCP container registry](https://cloud.google.com/container-registry/docs).
* An existing [GCP bucket](https://cloud.google.com/storage/docs/creating-buckets).
* [Kubeflow Pipelines](https://www.kubeflow.org/docs/distributions/gke/deploy/overview/) deployed to a Google
Kubernetes Engine cluster.
* The local docker client has to be [authorized](https://cloud.google.com/container-registry/docs/advanced-authentication)
to access the GCP container registry.
* Kubectl can [access](https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-access-for-kubectl) your GCP
Kubernetes cluster.
* The [current context](https://kubernetes.io/docs/reference/kubectl/cheatsheet/#kubectl-context-and-configuration)
configured in Kubectl points to your GCP cluster.

### Create a GCP Kubeflow Pipelines stack

To run our pipeline on Kubeflow Pipelines deployed to GCP, we will create a new stack with these components:
* The **artifact store** stores step outputs in a GCP Bucket.
* The **metadata store** stores metadata inside the Kubeflow Pipelines internal MySQL database.
* The docker images that are created to run your pipeline are stored in GCP **container registry**.
* The **Kubeflow orchestrator** is the same as in the local Kubeflow Pipelines example.

When running the upcoming commands, make sure to replace `$PATH_TO_YOUR_CONTAINER_REGISTRY` and
`$PATH_TO_YOUR_GCP_BUCKET` with the actual URIs of your container registry and bucket.

```bash
# In order to create the GCP artifact store, we need to install one additional ZenML integration:
zenml integration install gcp

# Create the stack and its components
zenml container-registry register gcp_registry --uri=$PATH_TO_YOUR_CONTAINER_REGISTRY
zenml metadata-store register kubeflow_metadata_store --type=kubeflow
zenml artifact-store register gcp_artifact_store --type=gcp --path=$PATH_TO_YOUR_GCP_BUCKET
zenml stack register gcp_kubeflow_stack \
-m kubeflow_metadata_store \
-a gcp_artifact_store \
-o kubeflow_orchestrator \
-c gcp_registry

# Activate the newly created stack
zenml stack set gcp_kubeflow_stack
```

### Run the pipeline

Configuring and activating the new stack is all that's necessary to switch from running your pipelines locally
to running them on GCP:

```bash
python run.py
```

```shell
rm -rf zenml_examples
```

That's it! If everything went as planned, this pipeline should now be running in the cloud, and we are one step
closer to a production pipeline!

## SuperQuick `tensorboard` run

If you're really in a hurry and you want just to see this example pipeline run, you can skip the individual
installation and configuration steps above.
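Assuming this release ships the `zenml example run` convenience command that other ZenML examples reference, the
whole example can be executed with a single line:

```shell
zenml example run tensorboard
```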
13 changes: 0 additions & 13 deletions examples/tensorboard/__init__.py

This file was deleted.

17 changes: 16 additions & 1 deletion examples/tensorboard/run.py
@@ -17,8 +17,11 @@
import numpy as np
import tensorflow as tf

from rich import print

from zenml.integrations.constants import AWS, TENSORFLOW
from zenml.pipelines import pipeline
from zenml.repository import Repository
from zenml.steps import BaseStepConfig, Output, StepContext, step


@@ -104,7 +107,7 @@ def tf_evaluator(


# Define the pipeline
@pipeline(required_integrations=[TENSORFLOW, AWS])
@pipeline(required_integrations=[TENSORFLOW, AWS], enable_cache=False)
def mnist_pipeline(
importer,
normalizer,
@@ -129,3 +132,15 @@ def mnist_pipeline(

# Run the pipeline
tf_p.run()

# Post-execution flow: fetch the run that was just executed and point the
# user at the Tensorboard logs written by the trainer step.
repo = Repository()
tf_pipeline = repo.get_pipeline(tf_p.name)
run = tf_pipeline.runs[-1]
trainer_step = run.get_step("trainer")
if trainer_step.outputs:
    print(
        "You can run:\n"
        f"[italic green] tensorboard --logdir={trainer_step.output.uri}[/italic green]\n"
        "...to visualize the Tensorboard logs for your trained model."
    )
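
Note that the pipeline decorator now passes `enable_cache=False`; presumably this ensures the trainer step
re-executes on every run and produces fresh Tensorboard logs rather than reusing a cached output.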
21 changes: 2 additions & 19 deletions examples/tensorboard/setup.sh
@@ -2,27 +2,10 @@

set -Eeo pipefail

setup_stack () {
  zenml container-registry register local_registry --type=default --uri=localhost:5000 || \
    msg "${WARNING}Reusing preexisting container registry ${NOFORMAT}local_registry, this might fail if port 5000 is blocked by another process."
  zenml orchestrator register kubeflow_orchestrator --type=kubeflow || \
    msg "${WARNING}Reusing preexisting orchestrator ${NOFORMAT}kubeflow_orchestrator"
  zenml stack register local_kubeflow_stack \
      -m local_metadata_store \
      -a local_artifact_store \
      -o kubeflow_orchestrator \
      -c local_registry || \
    msg "${WARNING}Reusing preexisting stack ${NOFORMAT}local_kubeflow_stack"

  zenml stack set local_kubeflow_stack

  zenml stack up
}

pre_run () {
  zenml integration install kubeflow tensorflow aws
  zenml integration install tensorflow
}

pre_run_forced () {
  zenml integration install kubeflow tensorflow aws -f
  zenml integration install tensorflow -f
}
