Add ToolchainContext to ConfiguredTarget. #7969

katre · 2019-04-08T13:24:47Z

Part of work on execution transitions, #7935.

Part of work on execution transitions, bazelbuild#7935.

lberki · 2019-04-08T13:43:25Z

src/main/java/com/google/devtools/build/lib/analysis/ConfiguredTarget.java

@@ -75,4 +75,7 @@ default String getConfigurationChecksum() {
  default SourceArtifact getSourceArtifact() {
    return null;
  }
+
+  @Nullable


What is this needed for? Looking at ToolchainContext, it's not a particularly lightweight object and this change would make it so that it's kept in RAM permanently as opposed to being discarded after the rule in question is analyzed. It's also called a "Context" which makes it doubly surprising that it's kept in RAM after analysis.

Can you run a memory benchmark on a larg-ish target with C++ and Java toolchain resolution enabled?

Reason: the execution transition and toolchain transition both need to know about the toolchain context, and this is part of plumbing that through. The full change is here: https://github.com/katre/bazel/tree/et-05-tc-in-atd

I did the memory benchmarking last week, actually. The short version is that adding this increases memory use for analyzing //gws:gws_and_dev_fileset by slightly less than 1%:

Before: 6036MB (blaze memory reported by used-hea-size-after-gc)
After: 6086MB

One option is to store the UnloadedToolchainContext, not the ResolvedToolchainContext. UTC also implements the ToolchainContext interface, but doesn't have the map from toolchain type to ToolchainInfo (or the list of TemplateVariableInfo).

(/me acting as the Guardian Of Memory this time)

I looked at the above fork and it looks like that the the new field in ConfiguredTarget is used for the sole purpose of passing to dependentNodeMap() in production code? If that's the case, that method in itself does a lot of dependency resolution already, so I was wondering if it would be possible to make callers also do toolchain resolution: technically, both toolchains and dependencies are, well, prerequisites and it feels weird that we cache the resolved values for one of them, but not for the other.

Shouldn't the main application be in ConfiguredTargetFunction, which already has access to the context?

I see calls in PostConfiguredTargetFunction and TransitionsOutputFormatterCallback (for query). Are these the motivating dependencies?

Part of my motivation was also to remove a lot of hacky /* toolchainsLabels= */ ImmutableSet.of() style code from around the codebase.

I can change PostConfiguredTargetFunction to re-compute the toolchain resolution process, but TransitionsOutputFormatterCallback can't do that (it doesn't have a skyframe Environment handy). That means that cquery will end up potentially reporting the wrong transitions (as it won't be able to properly set up the execution and toolchain transitions).

The change to use UnloadedToolchainContext instead of the ToolchainContext in ConfiguredTarget gets the correct functionality and reduces the unbounded part of the memory usage (UnloadedToolchainContext has a String, two PlatformInfo objects, a set of ToolchainTypeInfo objects, and a map from ToolchainTypeInfo to Label). Is that acceptable, memory-wise?

Looking further, I don't think $resolved_toolchains_internal is kept in the ConfiguredTarget instance, so that is definitely a memory regression to be avoided.

Is it not kept in the ConfiguredTarget instance to supply

bazel/src/main/java/com/google/devtools/build/lib/query2/TransitionsOutputFormatterCallback.java

Lines 124 to 128 in b3b3e8b

// Note: Being able to pull the $resolved_toolchain_internal attr unconditionally from the

// mapper relies on the fact that {@link PlatformSemantics.RESOLVED_TOOLCHAINS_ATTR} exists

// in every rule. Also, we don't actually use fromOptions in our implementation of

// DependencyResolver but passing to avoid passing a null and since we have the information

// anyway.

? Or you mean it would no longer be necessary if this is refactored to avoid that dependency?

I like the idea of exploring a refactoring of how toolchains relate to DependencyResolver / dependency resolution before following through on this change. But it's not clear to me how easy it is to un-special-case them since there are some fundamental ways (maybe?) that toolchain dependencies really are special.

Today in DependencyResolver I think the only integration is to set a special TOOLCHAIN_DEPENDENCY on the toolchain labels with a host configuration. We could easily set that directly outside DependencyResolver, or as a special case at the beginning of dependentNodeMap without getting further littered through the class.. But then, @lberki would that get in the way of your intentions of creating distinct phases for partially resolved vs. fully resolve dependencies?

And with John's current changes there's a new role for the toolchain context: providing the execution platform. It's possible to just pass the execution platform directly similar to how we're currently passing in the host config. So I think it's still possible to completely disentangle the toolchain context and DependencyResolver. But I'm not sure that puts us in a better state, nor makes the broader issue of special-casing toolchain dependencies less deep?

Anyway, this is all getting complicated. To recap, at core:

PostConfiguredTarget can just re-call toolchain resolution if it has the unloaded context, which it would have to get from the ConfiguredTarget. This should be quick since that should already be cached in Skyframe. Maybe there's an opportunity to consolidate that same lookup code with other places that resolve toolchains so we're not duplicating a bunch of logic?

TransitionsOutputFormatterCallback just fits poorly here. It has Skyframe access but through a different interface, which is already an awkward compromise since the current SkyframeExecutor.getConfiguration() is replicating similar logic called from ConfiguredTargetFunction but in a "different" way. Note this is also used for getting the configurations for top-level targets. This is a necessary evil and expanding this with yet another variation for toolchain resolution would be an unfortunate design direction.

But... does TransitionsOutputFormatterCallback actually need to resolve toolchains? If we just pass it the exec platform label via a ConfiguredTarget, I think that's all it needs to function the rest of the way. It doesn't actually apply transitions - it just spits out the transition objects. So it should be possible to avoid any extra new Skyframe evaluation needs here.

I am going to explore the change where I revert most of this, and just pass the execution platform label directly down the call chain. I'll send it when I have some time.

Add ToolchainContext to ConfiguredTarget.

fabe0cb

Part of work on execution transitions, bazelbuild#7935.

katre requested a review from gregestren April 8, 2019 13:24

katre requested a review from lberki as a code owner April 8, 2019 13:24

googlebot added the cla: yes label Apr 8, 2019

lberki reviewed Apr 8, 2019

View reviewed changes

katre closed this Apr 18, 2019

katre deleted the et-03-tc-in-ct branch April 19, 2019 15:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ToolchainContext to ConfiguredTarget. #7969

Add ToolchainContext to ConfiguredTarget. #7969

katre commented Apr 8, 2019

lberki Apr 8, 2019

katre Apr 8, 2019

lberki Apr 8, 2019

gregestren Apr 8, 2019

katre Apr 9, 2019

katre Apr 9, 2019

gregestren Apr 10, 2019

gregestren Apr 10, 2019

gregestren Apr 10, 2019

katre Apr 10, 2019

	// Note: Being able to pull the $resolved_toolchain_internal attr unconditionally from the
	// mapper relies on the fact that {@link PlatformSemantics.RESOLVED_TOOLCHAINS_ATTR} exists
	// in every rule. Also, we don't actually use fromOptions in our implementation of
	// DependencyResolver but passing to avoid passing a null and since we have the information
	// anyway.

Add ToolchainContext to ConfiguredTarget. #7969

Add ToolchainContext to ConfiguredTarget. #7969

Conversation

katre commented Apr 8, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment