Block spark-submit call until job is complete #46
Comments
It's controlled by a flag. We can use a similar one to match it.
+1
Yes, looking at the description for
We should probably try to unify these settings in Spark; Mesos and standalone cluster mode can potentially both use this flag.
+1. Printing the status every N seconds seems like a good idea.
I don't think we should tail the logs unless the user asks for it; it's quite verbose.
When running spark-submit in YARN cluster mode, the spark-submit script stays running until the Spark job completes, printing out the application status every second until it eventually finishes.
We should have spark-submit with k8s cluster mode (the only supported mode now) do the same: block the call and poll for pod status until the pod terminates.
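As a rough illustration of the block-and-poll behavior proposed here, the following Python sketch loops until a pod reaches a terminal phase. It is not the spark-submit implementation: `wait_for_pod` and `get_phase` are hypothetical names, and `get_phase` stands in for a real Kubernetes API lookup of the driver pod's `status.phase`. The only fact taken from Kubernetes itself is that `Succeeded` and `Failed` are the terminal pod phases.

```python
import time
from typing import Callable

# Kubernetes pod phases after which the pod will not run any further.
TERMINAL_PHASES = {"Succeeded", "Failed"}


def wait_for_pod(get_phase: Callable[[], str],
                 interval_seconds: float = 1.0,
                 sleep: Callable[[float], None] = time.sleep,
                 report: Callable[[str], None] = print) -> str:
    """Block until get_phase() returns a terminal phase, then return it.

    get_phase is a hypothetical callable standing in for a real
    Kubernetes API lookup of the driver pod's status.phase.
    """
    while True:
        phase = get_phase()
        # Report the status on every poll, as YARN cluster mode does.
        report(f"Driver pod phase: {phase}")
        if phase in TERMINAL_PHASES:
            return phase
        sleep(interval_seconds)


# Usage with a canned sequence of phases instead of a live cluster.
phases = iter(["Pending", "Running", "Running", "Succeeded"])
lines = []
final_phase = wait_for_pod(lambda: next(phases),
                           sleep=lambda _: None,
                           report=lines.append)
```

Passing `sleep` and `report` as parameters is only a convenience for this sketch: it lets the polling interval be changed to "every N seconds" and lets the loop be exercised without a cluster.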
The blocking call seems required to match the YARN feature set, though as a possible extension we could stream the driver logs instead of polling the status every second, along the lines of the example below. I for one would find that a great usability improvement over YARN cluster mode's behavior.
https://github.com/fabric8io/kubernetes-client/blob/master/kubernetes-examples/src/main/java/io/fabric8/kubernetes/examples/PodLogExample.java
P.S. As a side note, I'm interested in making this call blocking so I can more accurately perform perf benchmarks of the same job running in YARN vs Kubernetes by running time spark-submit ... aimed at both clusters.
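For the benchmarking use case, a wall-clock measurement is only meaningful if the submitted command blocks until the job finishes. A minimal Python sketch of such a timing wrapper follows; `timed_run` is a hypothetical helper, and the command in the usage lines is a stand-in (the Python interpreter itself), not a real spark-submit invocation.

```python
import subprocess
import sys
import time
from typing import Sequence, Tuple


def timed_run(command: Sequence[str]) -> Tuple[int, float]:
    """Run command to completion; return (exit code, elapsed seconds)."""
    start = time.perf_counter()
    # subprocess.run blocks until the child process exits.
    completed = subprocess.run(command)
    elapsed = time.perf_counter() - start
    return completed.returncode, elapsed


# Stand-in command: a real benchmark would pass a spark-submit invocation.
exit_code, seconds = timed_run([sys.executable, "-c", "pass"])
print(f"exit code {exit_code}, wall-clock {seconds:.2f}s")
```

If spark-submit returned as soon as the job was accepted, this would measure only the submission time, which is why the blocking behavior matters for comparing the two clusters.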