Update xgboost_synthetic to 0.7 #655

jlewi · 2019-10-09T00:04:10Z

In 0.7 we will use workload identity.

As such notebooks should no longer need to use/set GOOGLE_APPLICATION_CREDENTIALS

The notebook
https://github.com/kubeflow/examples/blob/master/xgboost_synthetic/build-train-deploy.ipynb

Is currently checking GOOGLE_APPLICATION_CREDENTIALS we will need to update that code to work with workload identity.

P0 because this is part of our demo script for 0.7.

issue-label-bot · 2019-10-09T00:04:12Z

Issue-Label Bot is automatically applying the label kind/feature to this issue, with a confidence of 0.92. Please mark this comment with 👍 or 👎 to give our bot feedback!

Links: app homepage, dashboard and code for this bot.

kunmingg · 2019-10-17T19:07:52Z

Saw following error with image: gcr.io/kubeflow-images-public/tensorflow-1.14.0-notebook-cpu:v0.7.0
Seems some dependency are missing?

NameError Traceback (most recent call last)
in
----> 1 model = ModelServe(model_file="mockup-model.dat")
2 model.train()

in init(self, model_file)
17 self.model = None
18 self._workspace = None
---> 19 self.exec = self.create_execution()
20
21 def train(self):

in create_execution(self)
88
89 def create_execution(self):
---> 90 r = metadata.Run(
91 workspace=self.workspace,
92 name="xgboost-synthetic-faring-run" + datetime.utcnow().isoformat("T"),

NameError: name 'metadata' is not defined

* Related to kubeflow#655 update xgboost_synthetic to use workload identity * Related to to kubeflow#665 no signal about xgboost_synthetic * We need to update the xgboost_synthetic example to work with 0.7.0; e.g. workload identity * This PR focuses on updating the test infra and some preliminary updates the notebook * More fixes to the test and the notebook are probably needed in order to get it to actually pass * Update job spec for 0.7; remove the secret and set the default service account. * This is to make it work with workload identity * Instead of using kustomize to define the job to run the notebook we can just modify the YAML spec using python. * Use the python API for K8s to create the job rather than shelling out. * Notebook should do a 0.7 compatible check for credentials * We don't want to assume GOOGLE_APPLICATION_CREDENTIALS is set because we will be using workload identity. * Take in repos as an argument akin to what checkout_repos.sh requires * Convert xgboost_test.py to a pytest. * This allows us to mark it as expected to fail so we can start to get signal without blocking * We also need to emit junit files to show up in test grid. * Convert the jsonnet workflow for the E2E test to a python function to define the workflow. * Remove the old jsonnet workflow.

krishnadurai · 2019-10-24T17:53:32Z

Just another point to note:
I was testing with Istio 1.3.1 and ran into an issue installing the python package retrying (Step 2) saying the user does not have permissions to install it.

… 0.7.0 (#666) * Update xgboost_synthetic test infra to use pytest and pyfunc. * Related to #655 update xgboost_synthetic to use workload identity * Related to to #665 no signal about xgboost_synthetic * We need to update the xgboost_synthetic example to work with 0.7.0; e.g. workload identity * This PR focuses on updating the test infra and some preliminary updates the notebook * More fixes to the test and the notebook are probably needed in order to get it to actually pass * Update job spec for 0.7; remove the secret and set the default service account. * This is to make it work with workload identity * Instead of using kustomize to define the job to run the notebook we can just modify the YAML spec using python. * Use the python API for K8s to create the job rather than shelling out. * Notebook should do a 0.7 compatible check for credentials * We don't want to assume GOOGLE_APPLICATION_CREDENTIALS is set because we will be using workload identity. * Take in repos as an argument akin to what checkout_repos.sh requires * Convert xgboost_test.py to a pytest. * This allows us to mark it as expected to fail so we can start to get signal without blocking * We also need to emit junit files to show up in test grid. * Convert the jsonnet workflow for the E2E test to a python function to define the workflow. * Remove the old jsonnet workflow. * Address comments. * Fix issues with the notebook * Install pip packages in user space * 0.7.0 images are based on TF images and they have different permissions * Install a newer version of fairing sdk that works with workload identity * Split pip installing dependencies out of util.py and into notebook_setup.py * That's because util.py could depend on the packages being installed by notebook_setup.py * After pip installing the modules into user space; we need to add the local path for pip packages to the python otherwise we get import not found errors.

jlewi · 2019-11-08T04:36:54Z

#676 should hopefully fix the test

There is however one more issue with model deployment #673

* install newer version of fairing * modify preprocessor to use custom dockerfile * use newer 0.7 base image. * Fix endpoint. Related to: kubeflow#673 model doesn't deploy its crash looping Related to kubeflow#655 update example to work with 0.7

#682) * Fix issues with the xgboost_synthetic example and deploying the model. * install newer version of fairing * modify preprocessor to use custom dockerfile * use newer 0.7 base image. * Fix endpoint. Related to: #673 model doesn't deploy its crash looping Related to #655 update example to work with 0.7 * Add some comments to the notebook.

stale · 2020-02-06T04:41:27Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

issue-label-bot bot added the kind/feature label Oct 9, 2019

jlewi added priority/p0 area/examples platform/gcp labels Oct 9, 2019

kunmingg mentioned this issue Oct 17, 2019

quick fix xgboost_synthetic #662

Closed

zhenghuiwang mentioned this issue Oct 18, 2019

Update xgboost notebook example for workload identity #663

Closed

jlewi mentioned this issue Oct 24, 2019

Update xgboost_synthetic test infra; preliminary updates to work with 0.7.0 #666

Merged

jlewi changed the title ~~Update xgboost_synthetic to use workload identiy~~ Update xgboost_synthetic to 0.7 Oct 25, 2019

jlewi mentioned this issue Oct 25, 2019

ClusterBuilder doesn't allow setting custom Dockerfile - Needed for xgboost_synthetic kubeflow/fairing#404

Closed

This was referenced Nov 25, 2019

Fix issues with the xgboost_synthetic example and deploying the model. #682

Merged

Best way to collect output from papermill in tests? kubeflow/testing#530

Closed

stale bot added the lifecycle/stale label Feb 6, 2020

stale bot closed this as completed Feb 13, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update xgboost_synthetic to 0.7 #655

Update xgboost_synthetic to 0.7 #655

jlewi commented Oct 9, 2019

issue-label-bot bot commented Oct 9, 2019

kunmingg commented Oct 17, 2019 •

edited

Loading

krishnadurai commented Oct 24, 2019 •

edited

Loading

jlewi commented Nov 8, 2019

stale bot commented Feb 6, 2020

Update xgboost_synthetic to 0.7 #655

Update xgboost_synthetic to 0.7 #655

Comments

jlewi commented Oct 9, 2019

issue-label-bot bot commented Oct 9, 2019

kunmingg commented Oct 17, 2019 • edited Loading

krishnadurai commented Oct 24, 2019 • edited Loading

jlewi commented Nov 8, 2019

stale bot commented Feb 6, 2020

kunmingg commented Oct 17, 2019 •

edited

Loading

krishnadurai commented Oct 24, 2019 •

edited

Loading