Enzyme hessian with closure: "You may be using a constant variable as temporary storage for active memory" #307

jbrea · 2024-06-06T07:47:34Z

julia> g(x, y) = sum((x .* y).^2)
g (generic function with 1 method)

julia> f = let y = rand(10)
           x -> g(x, y)
       end
#57 (generic function with 1 method)

julia> hessian(f, AutoEnzyme(), rand(10))
ERROR: Enzyme execution failed.
Mismatched activity for:   store {} addrspace(10)* %.fca.0.0.extract, {} addrspace(10)* addrspace(10)* %memcpy_refined_dst.i, align 8, !dbg !17, !tbaa !21, !alias.scope !25, !noalias !28 const val:   %.fca.0.0.extract = extractvalue { [1 x {} addrspace(10)*] } %0, 0, 0, !dbg !8
Type tree: {[-1]:Pointer, [-1,0]:Pointer, [-1,0,-1]:Float@double, [-1,8]:Integer, [-1,9]:Integer, [-1,10]:Integer, [-1,11]:Integer, [-1,12]:Integer, [-1,13]:Integer, [-1,14]:Integer, [-1,15]:Integer, [-1,16]:Integer, [-1,17]:Integer, [-1,18]:Integer, [-1,19]:Integer, [-1,20]:Integer, [-1,21]:Integer, [-1,22]:Integer, [-1,23]:Integer, [-1,24]:Integer, [-1,25]:Integer, [-1,26]:Integer, [-1,27]:Integer, [-1,28]:Integer, [-1,29]:Integer, [-1,30]:Integer, [-1,31]:Integer, [-1,32]:Integer, [-1,33]:Integer, [-1,34]:Integer, [-1,35]:Integer, [-1,36]:Integer, [-1,37]:Integer, [-1,38]:Integer, [-1,39]:Integer}
 llvalue=  %.fca.0.0.extract = extractvalue { [1 x {} addrspace(10)*] } %0, 0, 0, !dbg !8
You may be using a constant variable as temporary storage for active memory (https://enzyme.mit.edu/julia/stable/faq/#Activity-of-temporary-storage). If not, please open an issue, and either rewrite this variable to not be conditionally active or use Enzyme.API.runtimeActivity!(true) as a workaround for now

Stacktrace:
 [1] gradient
   @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/first_order/gradient.jl:84
 [2] inner_gradient_closure
   @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:99
 [3] inner_gradient_closure
   @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:0

Stacktrace:
  [1] throwerr(cstr::Cstring)
    @ Enzyme.Compiler ~/.julia/packages/Enzyme/F71IJ/src/compiler.jl:1338
  [2] gradient
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/first_order/gradient.jl:84 [inlined]
  [3] inner_gradient_closure
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:99 [inlined]
  [4] inner_gradient_closure
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:0 [inlined]
  [5] fwddiffejulia_inner_gradient_closure_9648_inner_1wrap
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:0
  [6] macro expansion
    @ ~/.julia/packages/Enzyme/F71IJ/src/compiler.jl:5916 [inlined]
  [7] enzyme_call
    @ ~/.julia/packages/Enzyme/F71IJ/src/compiler.jl:5566 [inlined]
  [8] ForwardModeThunk
    @ ~/.julia/packages/Enzyme/F71IJ/src/compiler.jl:5446 [inlined]
  [9] autodiff
    @ ~/.julia/packages/Enzyme/F71IJ/src/Enzyme.jl:399 [inlined]
 [10] pushforward
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/ext/DifferentiationInterfaceEnzymeExt/forward_onearg.jl:28 [inlined]
 [11] hvp_aux
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:166 [inlined]
 [12] hvp
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hvp.jl:154 [inlined]
 [13] #31
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hessian.jl:67 [inlined]
 [14] #186
    @ ./none:0 [inlined]
 [15] iterate
    @ ./generator.jl:47 [inlined]
 [16] _typed_stack(::Colon, ::Type{…}, ::Type{…}, A::Base.Generator{…}, Aax::Tuple{…})
    @ Base ./abstractarray.jl:2817
 [17] _typed_stack
    @ ./abstractarray.jl:2817 [inlined]
 [18] _stack
    @ ./abstractarray.jl:2807 [inlined]
 [19] _stack
    @ ./abstractarray.jl:2799 [inlined]
 [20] stack
    @ ./abstractarray.jl:2796 [inlined]
 [21] hessian
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hessian.jl:66 [inlined]
 [22] hessian
    @ ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hessian.jl:57 [inlined]
 [23] hessian(f::var"#57#58"{Vector{Float64}}, backend::AutoEnzyme{Nothing}, x::Vector{Float64})
    @ DifferentiationInterface ~/.julia/packages/DifferentiationInterface/IH8L9/src/second_order/hessian.jl:57
 [24] top-level scope
    @ REPL[123]:1
Some type information was truncated. Use `show(err)` to see complete types.

gdalle · 2024-06-06T07:58:41Z

Hi!
I think it's a problem with the type-instability of the function definition. A closure creates type-instability, and Enzyme does not like that.
For instance, the following works:

julia> h(x) = sum((x .* x) .^ 2)
h (generic function with 1 method)

julia> hessian(h, AutoEnzyme(), rand(10))
10×10 Matrix{Float64}:
 7.93735  0.0      0.0      0.0      …  0.0      0.0     0.0      0.0
 0.0      2.85219  0.0      0.0         0.0      0.0     0.0      0.0
 0.0      0.0      6.47397  0.0         0.0      0.0     0.0      0.0
 0.0      0.0      0.0      4.03136     0.0      0.0     0.0      0.0
 0.0      0.0      0.0      0.0         0.0      0.0     0.0      0.0
 0.0      0.0      0.0      0.0      …  0.0      0.0     0.0      0.0
 0.0      0.0      0.0      0.0         4.34344  0.0     0.0      0.0
 0.0      0.0      0.0      0.0         0.0      9.0601  0.0      0.0
 0.0      0.0      0.0      0.0         0.0      0.0     8.02508  0.0
 0.0      0.0      0.0      0.0         0.0      0.0     0.0      6.5375

gdalle · 2024-06-06T08:04:34Z

I suggest you open the same issue on the Enzyme side

jbrea · 2024-06-06T09:41:48Z

I think it's a problem with the type-instability of the function definition. A closure creates type-instability

Are you sure? I don't see why this should be type unstable, and JET doesn't report any type-instability.

Also, the error message "You may be using a constant variable as temporary storage for active memory" rather suggests that a Const is used instead of a Duplicated (or BatchDuplicated) at some point, no?

gdalle · 2024-06-06T10:03:56Z

Oh you're right, the let probably makes it type-stable.

I think #309 might fix it, can you try out the PR branch with your toy problem or your actual problem and see?

gdalle · 2024-06-06T10:19:30Z

Oops it breaks more than it fixes 😱

gdalle · 2024-06-06T10:27:18Z

@wsmoses any clues here?

wsmoses · 2024-06-06T11:04:37Z

I believe the problem here is the closure.

What if you use enzyme autodiff rather than DI (and actually pass multiple arguments, y marked const)

wsmoses · 2024-06-06T11:09:40Z

Given that the backtrace comes from DI.inner_gradient_closure, my hypothesis is indeed that the closure contains both active memory and constant memory -- and that the store of constant memory into the active closure here is the problem described in the linked FAQ.

-> recommending you just call autodiff rather than DI, as DI doesn't support multiple arguments [and thus requires the closure]

jbrea · 2024-06-06T11:15:24Z

What if you use enzyme autodiff rather than DI

Yes, sure, I know how to do it with Enzyme directly, but I hoped I could simplify my code with DifferentiationInterface.

wsmoses · 2024-06-06T11:24:31Z

Yeah this is a known issue for users of DI, where it presently doesn't support effective usage of Enzyme -- specifically multi arg / multi activity.

I think it would be prudent for DI to add that, but until then DI will add unnecessary limitations on what Enzyme can presently correctly and efficiently autodiff.

I wrote some comments about this tpapp/LogDensityProblemsAD.jl#29 (review) / tpapp/LogDensityProblemsAD.jl#29 (comment)

Unfortunately when I commented "Closures were one of the reasons imo that AD.jl didn't take off, so long term I think DI.jl would be well served by both not introducing them itself (my understanding is that it does this successfully), but also not forcing users to create the same closures user-side (which would create similar issues)." but the response was "DI is unlikely to support either multiple arguments or activity specification anytime soon"

@gdalle given the number of users who need this, do you think it is possible to higher prioritize?

wsmoses · 2024-06-06T11:28:59Z

@jbrea if you'd like I can also add some nice syntax sugar like Enzyme.hessian (with multiple arguments). Would that be useful to you?

jbrea · 2024-06-06T11:57:20Z

I can also add some nice syntax sugar like Enzyme.hessian

Oh, that's kind, thanks. If it is a super quick thing to do, why not. But it doesn't bother me much. ForwardDiff is usually fast enough for the Hessians I need. Failed type analysis with Enzyme bothers me more (e.g. EnzymeAD/Enzyme.jl#1410, but I also ran into a failed type analysis error for a gradient calculation, which I couldn't condense to a MWE yet.)

gdalle · 2024-06-06T12:00:20Z

Would that be useful to you?

It would certainly be useful to me! Right now DI is the only simple way to compute many operators with Enzyme (most notably the hessian), it would be awesome to see some of those operators taken care of. Feel free to ping me on the PR for review if you end up doing it!

wsmoses · 2024-06-06T12:02:39Z

I'm not sure only way is accurate. We describe how to make hessians in our basics guide: https://enzyme.mit.edu/index.fcgi/julia/stable/generated/autodiff/

gdalle · 2024-06-06T12:04:03Z

Which is why I said only simple way ;)

wsmoses · 2024-06-06T12:04:18Z

Ah missed that, my b

gdalle · 2024-06-06T12:07:46Z

No worries! Enzyme is amazing, its only flaw is a rather steep entrance cost, but that's why I'm working on lowering it :)

I opened an issue on the Enzyme side to discuss:

API for Hessian EnzymeAD/Enzyme.jl#1495

wsmoses · 2024-06-06T20:49:26Z

Also I stand corrected, I forgot https://github.com/SciML/OptimizationBase.jl/blob/b2251ec6831c3631fe6db74e8f1b7bdefeb9c0b8/ext/OptimizationEnzymeExt.jl#L38 served as an easy way to use Enzyme for hessians as well.

gdalle · 2024-06-06T21:34:15Z

The issue with multiple arguments or activities is not adding it, it's testing it properly in all possible cases. The reason DI is so reliable is because we have thousands of LOCs to test every operator in every situation. Adding activities multiplies the number of possible cases by at least 2, so I shudder to think of the effort necessary

gdalle · 2024-06-06T21:37:14Z

I have opened #311 to discuss this specifically

gdalle · 2024-09-24T09:11:10Z

Thanks to #489, this now works on the main branch (soon to be released):

julia> using DifferentiationInterface

julia> import Enzyme

julia> g(x, y) = sum((x .* y).^2)
g (generic function with 1 method)

julia> x, y = rand(10), rand(10);

julia> hessian(g, AutoEnzyme(), x, Constant(y))
freeing without malloc   %arrayref37.3_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %43, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37.2_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %47, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37.1_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %51, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %55, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37.epil_cache.i.0.i = phi double* [ undef, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i ], [ %105, %L177.i.i.loopexit ]
freeing without malloc   %arrayref37.3_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %43, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37.2_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %47, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37.1_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %51, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37_cache.i.0.i = phi double* [ undef, %L131.lr.ph.i.i ], [ %55, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i.loopexit ]
freeing without malloc   %arrayref37.epil_cache.i.0.i = phi double* [ undef, %L128.L177.loopexit_crit_edge.unr-lcssa.i.i ], [ %105, %L177.i.i.loopexit ]
10×10 Matrix{Float64}:
 0.350807  0.0       0.0       0.0      0.0      0.0        0.0       0.0       0.0       0.0
 0.0       0.353255  0.0       0.0      0.0      0.0        0.0       0.0       0.0       0.0
 0.0       0.0       0.519147  0.0      0.0      0.0        0.0       0.0       0.0       0.0
 0.0       0.0       0.0       1.23933  0.0      0.0        0.0       0.0       0.0       0.0
 0.0       0.0       0.0       0.0      1.50778  0.0        0.0       0.0       0.0       0.0
 0.0       0.0       0.0       0.0      0.0      0.0331721  0.0       0.0       0.0       0.0
 0.0       0.0       0.0       0.0      0.0      0.0        0.179644  0.0       0.0       0.0
 0.0       0.0       0.0       0.0      0.0      0.0        0.0       0.409008  0.0       0.0
 0.0       0.0       0.0       0.0      0.0      0.0        0.0       0.0       0.130913  0.0
 0.0       0.0       0.0       0.0      0.0      0.0        0.0       0.0       0.0       0.0760922

gdalle added bug backend Related to one or more autodiff backends labels Jun 6, 2024

gdalle linked a pull request Jun 6, 2024 that will close this issue

Non-constant f in Enzyme #309

Closed

2 tasks

wsmoses mentioned this issue Jun 6, 2024

Re-test enzyme SciML/OptimizationBase.jl#53

Closed

5 tasks

wsmoses mentioned this issue Jun 6, 2024

[WIP] Fresh attempt at DI integration SciML/OptimizationBase.jl#54

Merged

5 tasks

gdalle mentioned this issue Jun 6, 2024

Multiple arguments / activities? #311

Closed

gdalle closed this as completed Sep 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enzyme hessian with closure: "You may be using a constant variable as temporary storage for active memory" #307

Enzyme hessian with closure: "You may be using a constant variable as temporary storage for active memory" #307

jbrea commented Jun 6, 2024

gdalle commented Jun 6, 2024

gdalle commented Jun 6, 2024

jbrea commented Jun 6, 2024

gdalle commented Jun 6, 2024

gdalle commented Jun 6, 2024

gdalle commented Jun 6, 2024

wsmoses commented Jun 6, 2024

wsmoses commented Jun 6, 2024 •

edited

Loading

jbrea commented Jun 6, 2024

wsmoses commented Jun 6, 2024 •

edited

Loading

wsmoses commented Jun 6, 2024

jbrea commented Jun 6, 2024

gdalle commented Jun 6, 2024

wsmoses commented Jun 6, 2024

gdalle commented Jun 6, 2024

wsmoses commented Jun 6, 2024

gdalle commented Jun 6, 2024 •

edited

Loading

wsmoses commented Jun 6, 2024

gdalle commented Jun 6, 2024 •

edited

Loading

gdalle commented Jun 6, 2024

gdalle commented Sep 24, 2024 •

edited

Loading

Enzyme hessian with closure: "You may be using a constant variable as temporary storage for active memory" #307

Enzyme hessian with closure: "You may be using a constant variable as temporary storage for active memory" #307

Comments

jbrea commented Jun 6, 2024

gdalle commented Jun 6, 2024

gdalle commented Jun 6, 2024

jbrea commented Jun 6, 2024

gdalle commented Jun 6, 2024

gdalle commented Jun 6, 2024

gdalle commented Jun 6, 2024

wsmoses commented Jun 6, 2024

wsmoses commented Jun 6, 2024 • edited Loading

jbrea commented Jun 6, 2024

wsmoses commented Jun 6, 2024 • edited Loading

wsmoses commented Jun 6, 2024

jbrea commented Jun 6, 2024

gdalle commented Jun 6, 2024

wsmoses commented Jun 6, 2024

gdalle commented Jun 6, 2024

wsmoses commented Jun 6, 2024

gdalle commented Jun 6, 2024 • edited Loading

wsmoses commented Jun 6, 2024

gdalle commented Jun 6, 2024 • edited Loading

gdalle commented Jun 6, 2024

gdalle commented Sep 24, 2024 • edited Loading

wsmoses commented Jun 6, 2024 •

edited

Loading

wsmoses commented Jun 6, 2024 •

edited

Loading

gdalle commented Jun 6, 2024 •

edited

Loading

gdalle commented Jun 6, 2024 •

edited

Loading

gdalle commented Sep 24, 2024 •

edited

Loading