Platform-independent random numbers [using stdlib copy] #469

halflearned · 2019-08-04T09:56:26Z

Third -- and hopefully final -- attempt at #379.

In our last meeting, we decided that we should try once again to copy from boost in a way that 1) would not bloat the third-party dependencies with hundreds of boost files and 2) would be easily reproducible. Unfortunately, I wasn't able to achieve both goals simultaneously. Boost has a very complex type system, and I couldn't extricate the necessary functions without bringing a few hundred boost files along.

Instead, I decided to take a final look at the standard library's <random> and noticed that most of the dependencies came from operator<<. Once those methods were removed, it was not hard to pull the necessary classes from the standard library.

Major changes

Two new files in third_party: grfstd/algorithm.hpp and grfstd/random.hpp, which are nearly exact copies of the relevant parts of the algorithm and random headers, with minor differences such as adding the std:: prefix to some of the function calls.
Calls to std::shuffle and std::*_distribution are now calls to grfstd::shuffle and grfstd::*_distribution.
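
For context on why these calls need a fixed implementation: the C++ standard specifies the exact output of engines such as std::mt19937_64, but it leaves the algorithms behind std::shuffle and std::*_distribution to each standard library, so the same seed can give different results on different platforms. The sketch below is not the code copied into third_party. It is a minimal hand-written stand-in (the `sketch` namespace, `bounded_draw`, and `shuffle` are hypothetical names) showing how a shuffle built only on raw engine output stays reproducible.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <random>
#include <utility>
#include <vector>

namespace sketch {  // hypothetical namespace, for illustration only

// Uniform integer in [0, bound) built only from raw engine output.
// Rejection sampling avoids modulo bias and does not rely on any
// standard-library distribution class.
inline std::uint64_t bounded_draw(std::mt19937_64& engine, std::uint64_t bound) {
  assert(bound > 0);
  const std::uint64_t max = std::mt19937_64::max();
  const std::uint64_t limit = max - (max % bound);  // a multiple of bound
  std::uint64_t value = engine();
  while (value >= limit) {
    value = engine();  // redraw values that would bias the result
  }
  return value % bound;
}

// Fisher-Yates shuffle driven by bounded_draw, so the permutation depends
// only on the engine's (standardized) output sequence.
template <typename T>
void shuffle(std::vector<T>& values, std::mt19937_64& engine) {
  for (std::size_t i = values.size(); i > 1; --i) {
    const std::size_t j = static_cast<std::size_t>(bounded_draw(engine, i));
    std::swap(values[i - 1], values[j]);
  }
}

}  // namespace sketch
```

Two engines constructed with the same seed and passed to `sketch::shuffle` produce the same permutation, because no implementation-defined distribution code is involved.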

jtibshirani

The PR is looking great, the approach is very clean. A couple small comments, then I think it is ready to merge:

I think the common naming convention is nonstd for something from the standard library that's been copied or modified. This is the namespace for the implementation of 'optional' we copied in, for example. We could rename the grfstd directory to random to distinguish it from optional.
It would be good to add a README.md to the directory stating how the code was pulled together and why it exists. Important information would include which compiler's standard library was used and what modifications were made to the files (I think it is just removing the methods related to operator<< as well as uses of the _LIBCPP macros?)

halflearned · 2019-08-06T07:10:15Z

Thanks for the review, @jtibshirani! I'm also pretty happy with how this turned out.

On nonstd: that's a good point, I should have checked what was done for optional.

On the README: will do.

I'm a little tied up today and tomorrow, but by Friday I should have this done.

halflearned · 2019-08-10T19:53:39Z

Thanks for the review, @jtibshirani! It should be ready for a second round whenever you have the time.

jtibshirani

Looks good to me!

After this merges, I plan to enable the characterization tests in CI. Having them enabled in CI is nice on its own, and will also serve as a sort of test for this PR.

core/third_party/random/README.md

* master:
  Fix a typo in the `@keywords` roxygen directive. (grf-labs#481)
  Ensuring DiceKriging does not break forest training (grf-labs#455)
  CI: Disable lintr (grf-labs#480)
  Platform-independent random numbers [using stdlib copy] (grf-labs#469)
  test_regression_forest.R: fix argument name (grf-labs#477)
  CI: Only warn on lint error (grf-labs#474)

Previously, the C++ characterization tests only passed when using clang because of differences in the way random numbers are generated across platforms. Because we build on a couple different platforms, we had to disable these tests in CI. Now that we've added platform-independent random number generation in #469 and #492, we can enable the characterization tests.
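
As a rough illustration of what a characterization test checks: it records the output produced for a fixed seed and fails if that output ever changes. The sketch below is not the project's test code; the function names are hypothetical. The only expected value it uses is the one the C++ standard itself guarantees for a default-constructed std::mt19937.

```cpp
#include <cassert>
#include <cstdint>
#include <random>

// Returns the 10000th value produced by a default-constructed std::mt19937.
inline std::uint32_t ten_thousandth_mt19937_value() {
  std::mt19937 engine;   // default-constructed, i.e. the standard default seed
  engine.discard(9999);  // skip the first 9999 outputs
  return static_cast<std::uint32_t>(engine());
}

// Characterization-style check: compare against a recorded value.
// The C++ standard requires this value to be 4123659995 on every platform.
inline void check_engine_characterization() {
  assert(ten_thousandth_mt19937_value() == 4123659995u);
}
```

A check like this passes on any conforming compiler because only the engine is involved. The equivalent check on a std::*_distribution has no such guarantee, which is why those tests could not run on every platform before.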

erikcs · 2021-07-09T05:36:46Z

Making a note of that:

After #1006, core GRF is tested to produce the same result with three different compilers (latest versions), using the same seed and number of threads:

GCC
Clang
MSVC

Using Intel's C++ compiler (ICC) does not give exactly the same results, though they are close. Evidently its compiler optimizations are more "aggressive": with the optimization level set to -O0 or -O1, the ForestCharacterizationTest matches exactly on the Linux machine where I tried ICC 19.0.0.117.
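
One plausible explanation, which is an assumption and not something diagnosed in this thread, is that more aggressive optimization settings can reorder floating-point arithmetic. Floating-point addition is not associative, so a reordered sum can differ in the last bits even when the random numbers are identical. A minimal sketch (the function names are hypothetical):

```cpp
#include <vector>

// Sums the values from first to last.
inline double sum_forward(const std::vector<double>& values) {
  double total = 0.0;
  for (auto it = values.begin(); it != values.end(); ++it) {
    total += *it;
  }
  return total;
}

// Sums the same values from last to first.
inline double sum_backward(const std::vector<double>& values) {
  double total = 0.0;
  for (auto it = values.rbegin(); it != values.rend(); ++it) {
    total += *it;
  }
  return total;
}
```

Under strict IEEE 754 double arithmetic, the two functions return different results for the input {0.1, 0.2, 0.3}, even though they add the same three numbers.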

halflearned added 3 commits August 4, 2019 02:45

Replacing std -> grfstd

a36e1a6

adding symlink

b1bc63e

Removing compiler-dependent flags

eaa05e4

halflearned changed the title ~~Replacing std -> grfstd~~ Platform-independent random numbers [using stdlib copy] Aug 4, 2019

halflearned requested review from jtibshirani and erikcs August 4, 2019 23:29

jtibshirani reviewed Aug 6, 2019

This was referenced Aug 6, 2019

Predictable random number seeding across platforms #448

Closed

Predictable random numbers seeding across platforms [using boost] #454

Closed

halflearned added 3 commits August 9, 2019 22:08

grfstd->nonstd

827dfbc

Merge branch 'master' into random-grfstd

85bdc2c

README revision

8970ee2

jtibshirani approved these changes Aug 13, 2019

dotting the i's

d3bbcc9

halflearned merged commit 93ce8d3 into grf-labs:master Aug 14, 2019

halflearned deleted the random-grfstd branch August 14, 2019 01:27

jtibshirani mentioned this pull request Aug 18, 2019

Run the characterization tests in CI. #485

Merged

davidahirshberg pushed a commit to davidahirshberg/grf that referenced this pull request Dec 6, 2019

Platform-independent random numbers [using stdlib copy] (grf-labs#469)

8ead643

erikcs mentioned this pull request Jul 8, 2021

Add MSVC to Azure pipelines/update random.hpp #1006

Merged

jtibshirani mentioned this pull request Sep 27, 2024

Invariance test fails on Apple arm+clang #1452

Closed
