
Computing var/stddev and mean at the same time #18731

Closed
wants to merge 46 commits

Conversation

ifedan
Contributor

@ifedan ifedan commented Apr 2, 2019

The current variance kernels compute mean at the same time. Many times we want both statistics together, so it seems reasonable to have a kwarg/function that allows us to get both values without launching an extra kernel.
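For reference, a minimal usage sketch of the fused API this PR adds (the torch.std_mean/torch.var_mean names, return order, and keyword arguments follow the test snippet further down in this thread; the asserts are only a sanity check, not part of the PR):

import torch

x = torch.randn(4, 8)

# One fused reduction returns both statistics.
std, mean = torch.std_mean(x, dim=1, unbiased=True, keepdim=False)
var, mean2 = torch.var_mean(x, dim=1, unbiased=True, keepdim=False)

# Numerically the same as two separate reductions, without the extra kernel launch.
assert torch.allclose(std, x.std(dim=1))
assert torch.allclose(mean, x.mean(dim=1))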

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ifedan
Contributor Author

ifedan commented Apr 2, 2019

@pytorchbot retest this please

@vadimkantorov
Contributor

(About naming: a similar function in TensorFlow is called tf.nn.moments and computes mean+variance, but mean+std seems useful as well, so name matching wouldn't be perfect anyway.)
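For comparison, a rough sketch of both calls (assuming TF 2.x-style tf.nn.moments; note the return order differs from the std_mean proposed here):

import tensorflow as tf
import torch

# TensorFlow: mean first, then variance.
mean_tf, var_tf = tf.nn.moments(tf.random.normal([4, 8]), axes=[1])

# This PR: std (or var) first, then mean.
std, mean = torch.std_mean(torch.randn(4, 8), dim=1)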

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ifedan ifedan requested a review from gchanan April 2, 2019 22:36
@ifedan
Contributor Author

ifedan commented Apr 4, 2019

@pytorchbot retest this please

@t-vi
Collaborator

t-vi commented Apr 5, 2019

This looks nice. Do you have a rough indication of the relative performance between mean_std/mean_var and batch_norm_gather_stats?

@ifedan
Contributor Author

ifedan commented Apr 5, 2019

@pytorchbot retest this please

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ifedan ifedan requested a review from umanwizard May 9, 2019 14:28
@pytorchbot pytorchbot added module: autograd, module: cpu, module: cuda, module: docs, module: internals, module: operators, module: tests labels May 10, 2019
@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@umanwizard
Contributor

@ifedan I think there is an issue from previous reviews that was never fixed: Can we get rid of the c10::optional<IntArrayRef> stuff in make_dim_mask and make_reduction?

Then we can have the no-dim functions work the same way as sum and prod, by calling into the other ones with an empty list {} of dims, e.g.:

Tensor sum(const Tensor &self, ScalarType dtype) {
  return at::native::sum(self, {}, false, optional<ScalarType>(dtype));
}

@ifedan
Contributor Author

ifedan commented May 14, 2019

@umanwizard I ran some regression performance tests comparing the previous and the current implementation:

Results from master:

import timeit
SETUP_CODE = '''import torch; a = torch.randn(10000, 10);'''
TEST_CODE = '''torch.std(a, dim=1);'''
timeit.repeat(setup=SETUP_CODE, stmt=TEST_CODE, repeat=3, number=10)
[0.04082131403265521, 0.02319741202518344, 0.024329718959052116]

SETUP_CODE = '''import torch; a = torch.randn(10000, 10).cuda();'''
TEST_CODE = '''d1=torch.std(a, dim=1);print(d1[0])'''
timeit.repeat(setup=SETUP_CODE, stmt=TEST_CODE, repeat=3, number=10)
[0.013087920029647648, 0.015553846955299377, 0.015307638968806714]

Results from new implementation:

import timeit
SETUP_CODE = '''import torch; a = torch.randn(10000, 10);'''
TEST_CODE = '''torch.std(a, dim=1);'''
timeit.repeat(setup=SETUP_CODE, stmt=TEST_CODE, repeat=3, number=10)
[0.029049404023680836, 0.023604059999343008, 0.02448320999974385]

SETUP_CODE = '''import torch; a = torch.randn(10000, 10).cuda();'''
TEST_CODE = '''d1=torch.std(a, dim=1);print(d1[0])'''
timeit.repeat(setup=SETUP_CODE, stmt=TEST_CODE, repeat=3, number=10)
[0.01722334197256714, 0.015372609021142125, 0.01548504998208955]
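A sketch of an alternative CUDA timing that synchronizes explicitly instead of relying on print() to force a sync, and that also times the fused call against the two separate reductions (torch.cuda.synchronize() waits for queued kernels to finish; numbers would of course vary by machine):

import timeit
import torch

a = torch.randn(10000, 10).cuda()

def fused():
    torch.std_mean(a, dim=1)
    torch.cuda.synchronize()

def separate():
    torch.std(a, dim=1)
    torch.mean(a, dim=1)
    torch.cuda.synchronize()

print(timeit.repeat(fused, repeat=3, number=10))
print(timeit.repeat(separate, repeat=3, number=10))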

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@umanwizard umanwizard left a comment

LGTM, but can you check to see if Greg wants to look at it again before landing.

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot facebook-github-bot left a comment

@ifedan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

zdevito pushed a commit to zdevito/ATen that referenced this pull request May 16, 2019
Summary:
The current variance kernels compute mean at the same time. Many times we want both statistics together, so it seems reasonable to have a kwarg/function that allows us to get both values without launching an extra kernel.
Pull Request resolved: pytorch/pytorch#18731

Differential Revision: D14726082

Pulled By: ifedan

fbshipit-source-id: 473cba0227b69eb2240dca5e61a8f4366df0e029
@facebook-github-bot
Contributor

@ifedan merged this pull request in 4c23c34.

for dim in range(x.dim()):
    for unbiased in [False, True]:
        for keepdim in [False, True]:
            std1, mean1 = torch.std_mean(x, dim=dim, unbiased=unbiased, keepdim=keepdim)

do you ever check multiple dims that aren't all dims? the constituent functions support that, right?
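For context, a sketch of what such a multi-dim check could look like, assuming the reduction accepts a tuple of dims (the follow-up tests in #20650 exercise this):

import torch

x = torch.randn(3, 4, 5)
std, mean = torch.std_mean(x, dim=(0, 2), unbiased=True, keepdim=False)
assert torch.allclose(std, x.std(dim=(0, 2)))
assert torch.allclose(mean, x.mean(dim=(0, 2)))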


- name: std_mean(Tensor self, IntArrayRef dim, bool unbiased, bool keepdim)
  self: var_std_mean_backward(grads, self, result0, result1, dim, unbiased, keepdim, true)


I thought you were going to remove these?

Repeating my previous comment:

but that's very different -- __rpow__ is a real Python function (see https://docs.python.org/2.0/ref/numeric-types.html), not a made-up function for testing.

I think the issue here is actually that common_methods_invocations.py doesn't work on methods. We should fix that! Can you take a look?

For the purposes of this PR, I'm okay with making a fake method, i.e. _std_mean, that just calls your function std_mean; you can write a comment about the situation.
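A sketch of the kind of shim being suggested; _std_mean here is hypothetical test-only naming, and it simply delegates to the real function:

import torch

# Hypothetical test-only shim: lets the method-based test harness exercise
# the free function torch.std_mean as if it were a Tensor method.
def _std_mean(self, *args, **kwargs):
    return torch.std_mean(self, *args, **kwargs)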

facebook-github-bot pushed a commit that referenced this pull request Jun 19, 2019
…20650)

Summary:
Added some extra tests for std_mean and var_mean for multiple dims.
Some refactoring of previously created tests based on PR comments: #18731
Pull Request resolved: #20650

Differential Revision: D15396101

Pulled By: ifedan

fbshipit-source-id: d15c3c2c7084a24d6cfea4018173552fcc9c03a9