
Fix incorrect sparse add behavior when the sparse tensor has non-contiguous values #18179

Closed · wants to merge 6 commits

Conversation

@yf225 (Contributor) commented Mar 19, 2019

Currently, this code gives an incorrect result:

```python
import torch
indices = torch.tensor([[7, 1, 3]])
values = torch.tensor([[1., 1., 1.],
                       [1., 1., 1.],
                       [1., 1., 1.]])
x = torch.sparse_coo_tensor(indices, values, size=(10, 3))
values = torch.tensor(1.).expand(3, 3)  # non-contiguous values
y = torch.sparse_coo_tensor(indices, values, size=(10, 3))
z = x + y
```

`z` should have been all 2's in `values`, but instead we get:

```
tensor(indices=tensor([[7, 1, 3]]),
       values=tensor([[2., 1., 1.],
                      [1., 1., 1.],
                      [1., 1., 1.]]),
       size=(10, 3), nnz=3, layout=torch.sparse_coo)
```

This PR fixes the bug by adding special handling for sparse tensors with non-contiguous values in the addition function (specifically, by cat'ing the indices and values together).

This PR closes #17950 and #17919.
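For intuition, the cat-then-coalesce idea can be sketched in plain Python (a toy model of 1-D COO addition, not the actual ATen code; all names here are made up for illustration): concatenating the two tensors' indices and values is always correct regardless of memory layout, because no element-wise pointer walk over the values is needed; summing duplicates afterwards recovers the coalesced result.

```python
# Toy model (not the ATen implementation): a 1-D COO tensor is a pair
# of Python lists (indices, values).

def sparse_add_by_cat(t_indices, t_values, s_indices, s_values):
    # "cat" the indices and values; the result is valid but uncoalesced.
    return t_indices + s_indices, t_values + s_values

def coalesce(indices, values):
    # Sum values that share an index, then sort by index.
    acc = {}
    for i, v in zip(indices, values):
        acc[i] = acc.get(i, 0) + v
    keys = sorted(acc)
    return keys, [acc[k] for k in keys]

# x and y both hold 1.0 at indices 7, 1, 3 (cf. the repro above).
r_idx, r_val = sparse_add_by_cat([7, 1, 3], [1., 1., 1.],
                                 [7, 1, 3], [1., 1., 1.])
print(coalesce(r_idx, r_val))  # ([1, 3, 7], [2.0, 2.0, 2.0])
```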

@yf225 yf225 requested review from gchanan and zou3519 March 19, 2019 16:02
@zou3519 (Contributor) commented Mar 19, 2019

```
======================================================================
ERROR: test_sparse_ctor_getter_backward (__main__.TestAutograd)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/var/lib/jenkins/workspace/test/common_utils.py", line 120, in wrapper
    fn(*args, **kwargs)
  File "test_autograd.py", line 671, in test_sparse_ctor_getter_backward
    test(sparse_size + dense_size, len(sparse_size), nnz, device)
  File "test_autograd.py", line 653, in test
    gradcheck(fn, (inp,))
  File "/opt/conda/lib/python3.6/site-packages/torch/autograd/gradcheck.py", line 247, in gradcheck
    'numerical:%s\nanalytical:%s\n' % (i, j, n, a))
  File "/opt/conda/lib/python3.6/site-packages/torch/autograd/gradcheck.py", line 202, in fail_test
    raise RuntimeError(msg)
RuntimeError: Jacobian mismatch for output 0 with respect to input 0,
numerical:tensor([[0.0145, 0.0000],
        [0.0000, 0.0000],
        [0.0145, 0.0108],
        [0.0000, 0.0000],
        [0.0145, 0.0108],
        [0.0000, 0.0000],
        [0.0145, 0.0108],
        [0.0000, 0.0000],
        [0.0145, 0.0108],
        [0.0000, 0.0000]])
analytical:tensor([[0.0145, 0.0000],
        [0.0000, 0.0108],
        [0.0145, 0.0000],
        [0.0000, 0.0108],
        [0.0145, 0.0000],
        [0.0000, 0.0108],
        [0.0145, 0.0000],
        [0.0000, 0.0108],
        [0.0145, 0.0000],
        [0.0000, 0.0108]])
```

I don't know if this test failure is legitimate.

@yf225 yf225 force-pushed the fix_sparse_add_noncontiguous branch from 01dd5ba to 6bff499 Compare March 19, 2019 20:43
@yf225 yf225 force-pushed the fix_sparse_add_noncontiguous branch 2 times, most recently from 37577e3 to a445698 Compare March 20, 2019 01:11
@yf225 yf225 force-pushed the fix_sparse_add_noncontiguous branch from a445698 to 17ec4cb Compare March 20, 2019 01:12
@ezyang (Contributor) commented Mar 21, 2019

It would be really helpful for review if the PR message explained exactly how the problem was solved.

```cpp
return r._coalesced_(t_coalesced && s_coalesced);
LongTensor r_indices = at::cat({t_indices, s_indices}, 1);
Tensor r_values = at::cat({t_values, s_values}, 0);
alias_into_sparse(r, r_indices, r_values);
```
Contributor:
If you cat'ed, don't you have to specify that the output is not coalesced?

Contributor:
IMO we should make this a parameter on alias_into_sparse so people have to consider it.

Author:
`alias_into_sparse(...)` calls `set_indices_and_values_unsafe(...)` internally, which always sets `coalesced_ = false`, and we expect callers to call `sparse_tensor._coalesced_(...)` afterwards if they want to change the coalesced-ness of the sparse tensor. For example:

```cpp
alias_into_sparse(r, mask_indices.clone(), r_values);
r._coalesced_(mask.is_coalesced());
```

To simplify this API, we can add an `is_coalesced` parameter to `alias_into_sparse`, possibly in a separate PR.
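The flag semantics described above can be modeled with a toy Python class (illustrative only, not the ATen code; names mirror the C++ but everything here is a stand-in): the unsafe setter always resets the coalesced flag, and the caller restores it explicitly.

```python
# Toy model of the coalesced-flag contract (not the ATen code).

class SparseTensor:
    def __init__(self):
        self.indices, self.values = [], []
        self.coalesced = False

    def set_indices_and_values_unsafe(self, indices, values):
        self.indices, self.values = indices, values
        self.coalesced = False  # always reset, as described above

    def _coalesced_(self, flag):
        self.coalesced = flag
        return self

def alias_into_sparse(r, indices, values):
    r.set_indices_and_values_unsafe(indices, values)

r = SparseTensor()
alias_into_sparse(r, [1, 3, 7], [2.0, 2.0, 2.0])
assert r.coalesced is False  # reset by the setter
r._coalesced_(True)          # caller restores coalesced-ness
assert r.coalesced is True
```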

@ezyang (Contributor) commented Mar 21, 2019

Can we get some benchmark numbers? I'm not sure whether any of our embedding examples exercise sparse-sparse addition, but if one does, that would be the most representative benchmark.

I don't think it's necessarily wrong to switch to cat'ing the indices and values together, but you could also have fixed the problem by simply switching the values access to use an accessor (which respects strides) rather than raw pointer arithmetic (which doesn't). So the algorithm change should be justified.
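The layout issue behind this point can be sketched in plain Python (an illustrative model, not the ATen code): a tensor produced by `expand()` stores a single element with zero strides, so flat pointer arithmetic that assumes a dense row-major layout reads the wrong memory, while stride-aware (accessor-style) indexing stays correct.

```python
# torch.tensor(1.).expand(3, 3) stores ONE element and reports zero
# strides; this models that layout with plain lists (illustration only).
storage = [1.0]
shape, strides = (3, 3), (0, 0)

def strided_read(i, j):
    # Accessor-style access: respects strides, correct for any layout.
    return storage[i * strides[0] + j * strides[1]]

def contiguous_read(i, j):
    # Pointer-arithmetic-style access: assumes dense row-major storage.
    return storage[i * shape[1] + j]  # walks past the 1-element storage

assert strided_read(2, 2) == 1.0
try:
    contiguous_read(2, 2)
except IndexError:
    print("flat pointer walk ran past the 1-element storage")
```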

@yf225 yf225 changed the title Fix incorrect sparse add behavior when the sparse tensor has non-contiguous values [WIP] Fix incorrect sparse add behavior when the sparse tensor has non-contiguous values Mar 21, 2019
@gchanan (Contributor) commented Mar 21, 2019

How about only cat'ing if the tensors aren't contiguous? That way we only (potentially) slow down paths that were broken anyway.

@yf225 yf225 changed the title [WIP] Fix incorrect sparse add behavior when the sparse tensor has non-contiguous values Fix incorrect sparse add behavior when the sparse tensor has non-contiguous values Mar 21, 2019
@yf225 (Contributor, Author) commented Mar 21, 2019

@ezyang @gchanan I haven't figured out a way to make `THBlas_axpy` work with non-contiguous values, so I opted for cat'ing only when the tensors aren't contiguous. This shouldn't hurt performance, because the path with non-contiguous values was broken anyway.
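The dispatch described above can be sketched in plain Python (hypothetical names, not the ATen code): take the fast element-wise path only when both value tensors are contiguous, and fall back to the layout-safe cat path otherwise.

```python
# Toy model of the contiguity dispatch (illustration only): values are
# a list plus a contiguity flag standing in for Tensor.is_contiguous().

class Values:
    def __init__(self, data, contiguous=True):
        self.data, self.contiguous = list(data), contiguous

    def is_contiguous(self):
        return self.contiguous

def add_values(t, s):
    if t.is_contiguous() and s.is_contiguous():
        # Fast path: element-wise add (stands in for THBlas_axpy).
        return Values([a + b for a, b in zip(t.data, s.data)])
    # Slow but always-correct path: concatenate; the result is
    # uncoalesced and the caller coalesces (sums duplicates) later.
    return Values(t.data + s.data, contiguous=False)

fast = add_values(Values([1., 1.]), Values([1., 1.]))
slow = add_values(Values([1., 1.]), Values([1., 1.], contiguous=False))
print(fast.data, slow.data)  # [2.0, 2.0] [1.0, 1.0, 1.0, 1.0]
```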

```cpp
int64_t blockSize = r_values.stride(0);
int64_t cmp, d;
int64_t r_i = 0, t_i = 0, s_i = 0;
if (s_values.is_contiguous() && t_values.is_contiguous()) {
```
Author:
There is no change in this if-branch compared to the original code - I only indented it.

```cpp
  // index goes backwards) which may be more precise than using the
  // coalesced flag here. But this is easy.
  return r._coalesced_(t_coalesced && s_coalesced);
} else {
```
Author:
This if-branch is the actual addition.

@facebook-github-bot left a comment:
@yf225 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot left a comment:
@yf225 is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Mar 23, 2019
…iguous values (#18179)

Pull Request resolved: pytorch/pytorch#18179

Reviewed By: ezyang

Differential Revision: D14569591

Pulled By: yf225

fbshipit-source-id: f5a14c4a31337fc95eab64596212066b4fb18b1a
Successfully merging this pull request may close these issues.

Incorrect sparse add when tensor has non-contiguous values
6 participants