Support Distributions 0.22 #23

mohamed82008 · 2020-01-15T10:09:28Z

No description provided.

devmotion · 2020-01-15T10:29:21Z

Unfortunately, due to lines such as

Lines 89 to 91 in ab14b91

    
           MvNormal(m::TrackedVector{<:Real}, A::UniformScaling{<:TrackedReal}) = MvNormal(m, A.λ) 
        
           MvNormal(m::AbstractVector{<:Real}, A::UniformScaling{<:TrackedReal}) = MvNormal(m, A.λ) 
        
           MvNormal(m::TrackedVector{<:Real}, A::UniformScaling{<:Real}) = MvNormal(m, A.λ)

the breaking change introduced in Distributions 0.22 will lead to different behaviour of MvNormal constructors in Distributions and DistributionsAD. I don't really see how one could support both Distributions 0.21 and 0.22, so I guess the way forward would be to fix the implementation of MvNormal in DistributionsAD and drop support for Distributions 0.21.

mohamed82008 · 2020-01-15T23:43:42Z

Sounds good.

mohamed82008 · 2020-01-15T23:52:12Z

Btw IIUC, the different behavior is not really an issue in this case. For one, we are not committing any additional type piracy now than we were earlier. It's just going to use a diagonal covariance for UniformScaling here when Distributions is using a ScalMat, is this about right? I can fix it to use a ScalMat-like implementation here instead but just checking that this is the only "problem".

mohamed82008 · 2020-01-15T23:56:07Z

Oh I see the problem, it's the squaring, sorry.

src/multivariate.jl

mohamed82008 · 2020-01-16T03:19:37Z

So while testing the matrix case, I found some other bugs and fixed them including a performance bug that was probably the reason why Multi of MvNormal was slow, (CC: @xukai92). I also found 2 failed tests cases for MvNormal which I don't know why yet. You can find them in broken_mult_cont_dists in runtests.jl.

mohamed82008 · 2020-01-16T03:27:21Z

Here is a MWE of a failed test case if you want to investigate:

using Distributions, DistributionsAD, ForwardDiff, Tracker
function f(x)
	@assert length(x) == 8
	C = reshape(x[1:4], 2, 2)
	val = reshape(x[5:8], 2, 2)
	return sum(logpdf(MvNormal(C), val))
end
x = [1.0, 0.0, 0.0, 1.0]; x = [x; ones(4)];
Tracker.data(Tracker.gradient(f, x)[1])
ForwardDiff.gradient(f, x)

CC: @willtebbutt

mohamed82008 · 2020-01-16T05:13:24Z

I will add more tests to raise the coverage, then if there are no objections after that, I will merge. We can look into the test failures later if we can't figure it out soon.

devmotion · 2020-01-16T09:07:35Z

I don't have time right now to look into these test failures, but since the main motivation for this PR initially was to support Distributions 0.22, I think it can be merged.

As a general comment, I'm not sure if one should actually define three different Tracker-compatible Gaussian distributions, it somehow doesn't feel "right". It feels like they differ just in the type/structure of the covariance matrix, and hence IMO the implementation should also handle this on the level of the covariance matrices, similar to how MvNormal supports ScalMat, PDiagMat, and PDMat as covariance matrices and FillArray as mean vector. But also that could/should be part of a separate PR IMO.

mohamed82008 · 2020-01-16T09:23:01Z

As a general comment, I'm not sure if one should actually define three different Tracker-compatible Gaussian distributions, it somehow doesn't feel "right".

I agree, let's do that in another PR.

devmotion · 2020-01-17T15:26:36Z

Regarding the incorrect gradients @mohamed82008 mentioned in his comment above, the problem with ForwardDiff seems to be that basically all linear algebra implementations that exploit the sparsity of matrices (by, e.g., calling convert(UpperTriangular, ...)) are AD unsafe (see, e.g., JuliaDiff/ForwardDiff.jl#197 (comment) and the other comments in this issue). For instance you can observe this in the following small example, which could be related to the problem mentioned above:

using ForwardDiff
using LinearAlgebra

@show ForwardDiff.gradient(x -> sum(x \ ones(2)), Matrix(I, 2, 2))
@show ForwardDiff.gradient(x -> sum(cholesky(x) \ ones(2)), Matrix(I, 2, 2))

support Distributions 0.22

bb45b8f

define TuringScalMvNormal and fix UniformScaling

686148d

mohamed82008 requested a review from devmotion January 16, 2020 01:01

mohamed82008 added 2 commits January 16, 2020 12:06

remove Manifest file

b123503

gitignore Manifest

69ba53e