-
Notifications
You must be signed in to change notification settings - Fork 467
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Kubeflow tensorboard viz and fix tensorflow file IO for cloud back-ends #447
Conversation
* use fixed tensorflow version * reorganize code structure
* when tensorflow is installed, TFX activates its file IO plugin with a priority higher than the ZenML IO plugin * the 2.6 tensorflow release moves the IO support into a separate python package (tensorflow_io) that must be installed and imported explicitly for the S3/GCP/Azure FS schemes to be supported
5cbcf49
to
a2ff7cf
Compare
a2ff7cf
to
dfebffa
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for taking this over Stefan! Really like it!
I would have some suggestions:
- Potentially adding this example as a file in the
kubeflow
example to showcase the tensorboard integration. I dont believe this deserves an extra example unless we have a native tensorboard integration that works in a magical way for different configurations - Adding a picture of the 'Launch Tensorboard' button on the Kubeflow UI for users to understand the true value is when this is done on Kubeflow. This can be put on the
examples/kubeflow/README.md
in a subheading where we explain the kubeflow-tensorboard integration we just enabled - Adding in the limitations of Kubeflow (e.g. with S3) as you decribed in the JIRA ticket
- Potentially adding a 'TensorboardVisualizer' and showcasing launching tensorboard in a post-execution workflow. This should be trivial but exemplrary
I can also implement some of the above if you give me the go-ahead. Otherwise, its its clear I would be glad to review these changes! Thanks again for doing the deep dive!
* use Kubeflow volume metadata support [1] to mount the local artifact store as a hostpath volume in the Kubeflow UI and Tensorboard pods [1] https://github.com/kubeflow/pipelines/blob/master/docs/config/volume-support.md
Thanks @htahir1 , I hope I've addressed all your suggestions with the last commit (see inline).
I merged the Tensorboard and Kubeflow examples. This basically meant switching the existing Kubeflow example from sklearn to tensorflow and adding some post-execution visualizations.
Done. There are new pics and new information regarding Tensorboard in that
No need. I fixed all the limitations :)
You're going to love this: I implemented a TensorboardService to wrap around the Tensorboard local server, and a visualizer that makes it easy to start/stop Tensorboard services. This goes along with the idea that Services are for more than just model serving. The visualizer also works with Jupyter notebooks, which reminds me: I also updated the notebook in the kubeflow example to reflect this. |
402cd86
to
86653c0
Compare
…rd examples * Tensorboard service to track Tensorboard daemons running locally * Tensorboard visualizer to start Tensorboard service for a pipeline step * switched Kubeflow example to use Tensorflow instead of sklearn * add more pics and info about the Tensorboard service and the Kubeflow Tensorboard UI
86653c0
to
7010889
Compare
61fbeec
to
03f04f1
Compare
src/zenml/integrations/kubeflow/orchestrators/local_deployment_utils.py
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice! Just found a few typos and a tiny amount of duplicated code, other than that good to go
src/zenml/integrations/kubeflow/orchestrators/local_deployment_utils.py
Outdated
Show resolved
Hide resolved
src/zenml/integrations/tensorflow/visualizers/tensorboard_visualizer.py
Outdated
Show resolved
Hide resolved
* removed duplicated code in Tensorboard visualizer
03f04f1
to
2e772cb
Compare
…_utils.py Co-authored-by: Michael Schuster <schustmi@users.noreply.github.com>
…_utils.py Co-authored-by: Michael Schuster <schustmi@users.noreply.github.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are you kidding here!!!! Whatttt the heck just happened. AMAZING WORK! I absolutely love the new Tensorboard service and can only be astonished at the effort put in here. Kudos to you and LGTM!!!!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah wait I just noticed one TINY problem. No updates to the docs. Would really love to have some of this stuff on the docs, including the service you implemented
I have made the following changes to this branch.
@stefannica and @schustmi please do take a look |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice job with the docs, I like it.
@htahir1 The only thing i'd change is to not use an |
Describe changes
Enable the tensorboard visualization for Kubeflow and fixes the tensorflow file IO support when shared cloud storage like AWS S3 is used.
tensorflow_io
that must be installed and imported explicitly for the S3/GCP/Azure FS filesystem schemes to be supportedPre-requisites
Please ensure you have done the following:
Types of changes