[WIP] Use tensor random generator #1094
Conversation
Thanks a lot! This is long-needed. I have a few comments though.
dynet/nodes-dropout.cc
Outdated
std::uniform_int_distribution<> seed_dist(1, 2147483647);
Eigen::internal::UniformRandomGenerator<float> uni_rg(seed_dist(*rndeng));
m.tvec().device(*dev.edevice) = m.tvec().random(uni_rg);
m.tvec().device(*dev.edevice) = (m.tvec() < m.tvec().constant((1.f-p))).cast<float>() * 1.f / (1.f-p);
Do we need the *1.f here?
This is fixed now.
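For reference, a sketch of what the corrected expression presumably looks like once the redundant factor is dropped (reconstructed from the diff above, not copied verbatim from the updated PR code):

m.tvec().device(*dev.edevice) = (m.tvec() < m.tvec().constant(1.f - p)).cast<float>() / (1.f - p);  // the cast already yields a float mask, so * 1.f added nothing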
dynet/nodes-dropout.cc
Outdated
@@ -30,7 +30,10 @@ size_t Dropout::aux_storage_size() const {
 template<class MyDevice>
 void Dropout::forward_dev_impl(const MyDevice & dev, const vector<const Tensor*>& xs, Tensor& fx) const {
   Tensor m(dim, (float*)aux_mem, fx.device, DeviceMempool::FXS);
-  TensorTools::randomize_bernoulli(m, (1.f-p), 1.f / (1.f-p));
+  std::uniform_int_distribution<> seed_dist(1, 2147483647);
I'm not a huge fan of this code being copied all the time. If this is something that we'll want to use frequently, perhaps it can be made a global variable in the DyNet namespace or something?
Also, tangentially, there are some random number generation functions in tensor.h/tensor.cc that it might be reasonable to move to their own rand.h/rand.cc files, along with this one. What do you think?
That sounds good, I moved the random helpers to their own files and added a draw_random_seed() helper to avoid code duplication.
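A rough sketch of what such a helper could look like, assuming DyNet's existing global std::mt19937* rndeng engine (the exact signature and placement in the PR may differ):

// rand.h (hypothetical sketch)
#pragma once
#include <random>

namespace dynet {
extern std::mt19937* rndeng;  // DyNet's global RNG engine

// Draw a 31-bit seed for device-side generators, replacing the
// uniform_int_distribution boilerplate copied at each call site.
inline int draw_random_seed() {
  std::uniform_int_distribution<> seed_dist(1, 2147483647);
  return seed_dist(*rndeng);
}
}  // namespace dynet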
dynet/nodes-dropout.cc
Outdated
@@ -30,7 +30,10 @@ size_t Dropout::aux_storage_size() const {
 template<class MyDevice>
 void Dropout::forward_dev_impl(const MyDevice & dev, const vector<const Tensor*>& xs, Tensor& fx) const {
   Tensor m(dim, (float*)aux_mem, fx.device, DeviceMempool::FXS);
-  TensorTools::randomize_bernoulli(m, (1.f-p), 1.f / (1.f-p));
+  std::uniform_int_distribution<> seed_dist(1, 2147483647);
Also, is there a reason why you didn't just implement all of this in TensorTools? It seems like that would work as well. If there's a reason that's no problem though.
I didn't do that partly because TensorTools is also used for parameter initialization, which I didn't want to touch, and partly because I didn't know how to properly move over the device-related code.
OK, I tested this on CPU and GPU with the current master branch and this PR using the attached script. It looks like master CPU/GPU and PR CPU are consistent, but the PR on GPU gives weird results. Could you try running the script and seeing what's up?
Thanks for taking a look, that is strange indeed. random_normal() and random_uniform() look fine to me but bernoulli and dropout do not, which would hint at this line being broken on GPU:
The result of operator< is a
OK, I'm getting very strange behavior even on RandomNormal. It looks like almost exactly half the time it's sampling the exact same value, while half the time it's sampling from the appropriate distribution. This looks like a relatively major bug in the upstream Eigen implementation. I'll try to reproduce it only in Eigen and see if they're interested in fixing it.
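For reference, a minimal Eigen-only skeleton along the lines of such a reproduction (hypothetical; shown CPU-side for brevity, whereas the misbehavior described here was observed when the same .random() expression is evaluated through Eigen's GPU device):

#include <unsupported/Eigen/CXX11/Tensor>
#include <iostream>

int main() {
  Eigen::Tensor<float, 1> t(16);
  // Seeded generator from Eigen's tensor module, analogous to what a RandomNormal node would use.
  Eigen::internal::NormalRandomGenerator<float> gen(42);
  t = t.random(gen);  // fill the tensor with (supposedly independent) normal samples
  for (int i = 0; i < t.size(); ++i)
    std::cout << t(i) << " ";  // the reported symptom: roughly half the samples come out identical on GPU
  std::cout << std::endl;
  return 0;
}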
FYI, we're having some discussions offline about how to implement this directly in cuRAND, which will presumably be more stable than relying on Eigen and potentially easier.
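A hypothetical host-side sketch of the cuRAND direction being discussed (assumed usage, not DyNet code; error checking omitted):

#include <curand.h>

// Fill n floats in device memory d_out with uniform samples generated directly on the GPU.
void fill_uniform_on_gpu(float* d_out, size_t n, unsigned long long seed) {
  curandGenerator_t gen;
  curandCreateGenerator(&gen, CURAND_RNG_PSEUDO_DEFAULT);  // host handle to a device-side generator
  curandSetPseudoRandomGeneratorSeed(gen, seed);           // seed once; in practice the generator would be reused
  curandGenerateUniform(gen, d_out, n);                    // samples land in GPU memory, no host round-trip
  curandDestroyGenerator(gen);
}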
Closed in favor of #1154.
Use the tensor random generator instead of the standard one. This should fix speed issues on GPU caused by generating random numbers on the CPU and then transferring them to the GPU, as was done previously.
I implemented this carefully, but it would be good if someone could review the code, as this involves core functionality and is hard to unit-test.
Some numbers on a CPU machine and a GPU machine:
I suspect that most of the 3.48s was DyNet overhead, because in my particular use case dropout went from consuming 90% of computation time to <1%.
fixes #542, #438
related to #1059