Support custom YAML for the driver pod spec #38

mccheah · 2017-01-23T21:01:54Z

We started with our base implementation, which hard-codes the pod spec to have specific fields. We then thought of the idea of supporting custom labels. However, there are plenty of things in an arbitrary user's application that could be useful to customize, such as ports and mounted volumes.

This issue therefore proposes that we support users providing arbitrary YAML files that describe the pod, or at least any modifications and augmentations they would like to make. We need to be careful when considering the API and expectations here. One mode of operation could be that the user specifies a custom file for the pod via --driver-pod-spec-file or an equivalent SparkConf setting. We can then take the user's pod spec and augment it with whatever is missing - for example, adding the Spark UI and REST submission server ports, which are required for Spark.
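
To make the "augment with whatever is missing" idea concrete, here is a minimal sketch, not the project's implementation. It assumes the user's YAML has already been parsed into plain dicts, and the function name `augment_ports`, the port names, and the REST submission server port value (6066) are placeholders chosen for illustration; 4040 is Spark's default UI port.

```python
import copy

# Illustrative values only: 4040 is Spark's default UI port; the REST
# submission server port and both port names are placeholders.
REQUIRED_PORTS = [
    {"name": "spark-ui", "containerPort": 4040},
    {"name": "submit-server", "containerPort": 6066},
]


def augment_ports(container, required_ports=REQUIRED_PORTS):
    """Return a copy of the container with any missing required ports added.

    Ports the user already declared are left untouched.
    """
    result = copy.deepcopy(container)
    ports = result.setdefault("ports", [])
    declared = {p.get("containerPort") for p in ports}
    for port in required_ports:
        if port["containerPort"] not in declared:
            ports.append(dict(port))
    return result


# A user-supplied container that already exposes the UI port.
user_container = {
    "name": "driver",
    "image": "my-spark-driver:latest",
    "ports": [{"name": "spark-ui", "containerPort": 4040}],
}

augmented = augment_ports(user_container)
print([p["containerPort"] for p in augmented["ports"]])
```

Working on a copy keeps the user's original spec intact, which would make it easier to report exactly what the submission client added.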

The tricky part is that the user has to specify which container is running the Spark driver submission server, since that container is the one that needs the custom ports open. Thus we should probably add a --driver-container option that must be set if the custom pod spec is set, so that we know which container we need to adjust to add the missing ports, etc. I don't know if there is any way we can make this easy to use, but in a sense usability seems to be a secondary issue here - I anticipate this will primarily be used for specific off-roading "power-user" scenarios.
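
As a rough sketch of how the container lookup might behave, assuming the pod spec has been parsed into dicts and that the value of the proposed --driver-container option is passed in as a string (the function name `find_driver_container` and the example pod are hypothetical):

```python
def find_driver_container(pod, driver_container_name):
    """Return the container dict that the driver container option names.

    Raises ValueError if the name is missing or does not match exactly one
    container in the pod spec.
    """
    if not driver_container_name:
        raise ValueError(
            "a driver container name must be set when a custom pod spec is used"
        )
    containers = pod.get("spec", {}).get("containers", [])
    matches = [c for c in containers if c.get("name") == driver_container_name]
    if len(matches) != 1:
        raise ValueError(
            "expected exactly one container named %r, found %d"
            % (driver_container_name, len(matches))
        )
    return matches[0]


# Example pod with a driver plus a metrics side-car.
example_pod = {
    "metadata": {"name": "spark-driver-pod"},
    "spec": {
        "containers": [
            {"name": "metrics-sidecar", "image": "example/metrics:latest"},
            {"name": "spark-driver", "image": "example/spark-driver:latest"},
        ]
    },
}

driver = find_driver_container(example_pod, "spark-driver")
print(driver["image"])
```

Failing fast on a missing or ambiguous name seems preferable to guessing, since opening ports on the wrong container would be hard to diagnose.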

mccheah · 2017-01-23T22:44:19Z

@foxish @erikerlandson curious as to your thoughts on this.

Could also be worthwhile for executors to support side-car containers - think custom metrics and reporting, etc.

foxish · 2017-01-24T00:12:14Z

This is something Eric Tune and I had discussed earlier. It is a use case we want to support, but I think we should defer implementing this till we have the "default" specifications of driver and executor pods nailed down.

mccheah · 2017-01-24T00:15:10Z

+1 - probably not for phase 1 then but for down the line.

erikerlandson · 2017-01-24T01:47:28Z

We've done prototype work with side-car containers in a master pod for supporting carbon & graphite sinks for spark's metrics. I'm not sure if something similar is needed for executors but I can see how it might be.

I agree it's not a high priority.

On that topic, does the ability to customize yaml on the driver imply the potential to add containers in the driver pod?

mccheah · 2017-01-24T01:48:17Z

Yeah side-cars in the driver would be supported as well. We'd need to de-duplicate which is the actual driver container, hence the suggestion for a second config option to denote that.

liyinan926 · 2017-07-27T23:07:26Z

I want to bring this issue up again, as I feel that now that we have more requirements for customizing the driver/executor pods, with issues like #393, #397, #299, etc., it's the right time to rethink this. I have some thoughts below on the use of YAML pod templates specifically. Whether to use YAML templates, PodPresets, or even something else remains a question.

1. We still support the individual configuration properties that, when set, override the same aspect set in the template. So individual configuration properties are considered more specific and should always take precedence.
2. We can limit what can be set in the template to those aspects that have corresponding configuration properties, i.e., aspects that are currently already configurable by the users, e.g., name, labels, annotations, memory, cpu cores, etc.
3. We can add validation logic in the submission client to make sure that templates don't contain aspects that are not allowed to be set in the template. We may relax this in the future if we have a good notion of a default specification.
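
The three points above could be sketched roughly as follows. This is only an illustration under simplifying assumptions: the template is reduced to a flat dict of aspects, the aspect keys and the names `validate_template` and `resolve_pod_settings` are hypothetical, and an explicitly set property replaces the template value wholesale (labels are not merged key by key).

```python
# Hypothetical set of aspects that already have configuration properties.
ALLOWED_ASPECTS = {"name", "labels", "annotations", "memory", "cores"}


def validate_template(template):
    """Reject templates that set aspects outside the allowed set."""
    disallowed = sorted(set(template) - ALLOWED_ASPECTS)
    if disallowed:
        raise ValueError(
            "template sets aspects that are not allowed: " + ", ".join(disallowed)
        )


def resolve_pod_settings(template, properties):
    """Combine a template with individual properties.

    Properties that are explicitly set (not None) take precedence over the
    same aspect in the template.
    """
    validate_template(template)
    resolved = dict(template)
    resolved.update({k: v for k, v in properties.items() if v is not None})
    return resolved


template = {"name": "etl-driver", "memory": "2g", "labels": {"team": "data"}}
properties = {"memory": "4g", "cores": None}

settings = resolve_pod_settings(template, properties)
print(settings["memory"])
```

Here the template supplies the name and labels once, while the per-submission memory property overrides the template's value.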

With those, we really just use YAML templates for things that can be overridden by individual configuration properties. However, having the option to use templates saves users a lot of effort in setting individual properties repeatedly, while still offering flexibility through overriding by the individual properties. Thoughts on this?

mccheah · 2017-07-27T23:15:52Z

I think @foxish suggested using Pod Presets for this instead, in which case there would actually be no work to do on our part.

liyinan926 · 2017-07-27T23:26:36Z

Yes, we are also considering PodPresets and having discussions on that. But they're not quite ready yet (currently alpha, so not guaranteed to be available on a cluster) and need certain things to be enabled before they can be used. That's why I brought this up again, just to start discussions on the feasibility of using YAML templates as a potential solution.

mccheah · 2017-07-27T23:28:03Z

What's the timeline for Pod Presets to move from alpha to beta status?

erikerlandson · 2017-07-27T23:35:15Z

Doing it with YAML (as opposed to PodPresets, which of course are also yaml) feels like it will be an unguarded chainsaw kind of tool. Trying to make it both safe and easy to explain would be a hard needle to thread. From the POV that it's for power users anyway, that isn't necessarily a deal breaker, but at the very least it would have to be documented as a dangerous-power-tool category of feature.

My inclination is to wait for pod presets, although others might be feeling varying levels of urgency.

mccheah changed the title ~~Support custom YAML for the pod spec~~ Support custom YAML for the driver pod spec Jan 23, 2017

ash211 mentioned this issue Jan 25, 2017

Support setting ImagePullSecrets on the Spark pods #42

Closed

This was referenced Jan 27, 2017

Move executor and driver commands from dockerfile to scheduler #60

Closed

Allow opening additional ports on the driver #78

Closed

foxish added the in-beta label Feb 14, 2017

This was referenced Jun 22, 2017

Add node selectors for driver and executor pods #355

Merged

Config for hard cpu limit on pods; default unlimited #356

Merged

mccheah mentioned this issue Jun 27, 2017

User-specified node selector on pods #358

Closed

mccheah mentioned this issue Aug 14, 2017

Submission client should provide the driver with a pod template to use for executors #434

Closed

luck02 mentioned this issue Aug 20, 2017

Allow injection of arbitrary application secrets into driver/executor Pods #397

Closed

mccheah mentioned this issue Aug 22, 2017

documentation on resource staging server #386

Open

sumansomasundar mentioned this issue Feb 13, 2018

Specify hostpath volume and mount the volume in Spark driver and executor pods #614

Open

ifilonenko pushed a commit to ifilonenko/spark that referenced this issue Feb 25, 2019

Merge pull request apache-spark-on-k8s#38 from palantir/pw/bintrayRes…

95679c9

…olve Add revision to maven resolver
