[Crypto] improve shuffle test #4844

tarakby · 2023-10-19T22:13:24Z

Improve a statistical test that evaluates the uniformity of a random shuffling.
The previous and new versions are both sanity checks of uniformity that can detect bugs in the implementation, but are not extended statistical tests.

The previous version chooses a random element of the original array, and tracks the distribution of indices where this element falls in the shuffled array. If the shuffling is uniform, the distribution of indices is uniform too. Uniformity is simply evaluated by comparing the standard deviation to a low threshold.
The new version uses a known bijection between the set of all permutations (shuffles) and a set of integers, built from the Lehmer code and the factorial number system. If the shuffling is uniform, the distribution of the permutation encodings is uniform too. Uniformity is still evaluated with the same basic standard deviation check.

The new version evaluates the permutation as a whole instead of isolating one element only.
The drawback is that the evaluated distribution in the new version is of size n! (compared to n in the previous version), so the tested shuffled array has to remain small.

Side change:
Export the EncodePermutation function so that other permutation tests can use the tool.


codecov-commenter · 2023-10-19T22:24:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (6591e54) 55.76% compared to head (9ce23e0) 55.76%.
Report is 2 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #4844      +/-   ##
==========================================
- Coverage   55.76%   55.76%   -0.01%     
==========================================
  Files         955      955              
  Lines       88867    88867              
==========================================
- Hits        49555    49554       -1     
- Misses      35574    35578       +4     
+ Partials     3738     3735       -3

Flag	Coverage Δ
unittests	`55.76% <ø> (-0.01%)`	⬇️

Flags with carried forward coverage won't be shown.

see 10 files with indirect coverage changes


jordanschalm · 2023-10-20T19:19:43Z

crypto/random/rand_utils.go

+// input `perm` is assumed to be a correct permutation of the set [0,n-1]
+// (not checked in this function).
+func EncodePermutation(perm []int) int {
+	r := make([]int, len(perm))


Suggested change

-	r := make([]int, len(perm))
+	// We first construct r, which contains elements with the following constraints:
+	// r = { r(0) ∈ [0,n-1], r(1) ∈ [0,n-2], ... , r(n-1) ∈ [0,1], r(n) ∈ [0] }
+	r := make([]int, len(perm))

The suggested comment starts to explain what a Lehmer code is. However, it doesn't fully explain how r is obtained, or why there is a bijection between all permutations and the set [0,n-1] x [0,n-2] x ... x [0,1] x [0], so IMO it doesn't add much information.

In the original comment I only mentioned "compute Lehmer code". Maybe I can add a link to a description of the Lehmer code that contains all the details.

jordanschalm · 2023-10-20T19:23:06Z

crypto/random/rand_utils.go

+		}
+	}
+	// Convert to an integer following the factorial number system
+	m := 0


Suggested change

-	m := 0
+	// We construct `m` by taking the sum of all elements of `r`, each multiplied by the factorial of their inverted index:
+	// m = ∑ r(i) * (n-i)!, i ∈ [0,n-1]
+	m := 0

This comment is more explanatory, but it doesn't explain why it makes sense to compute m (why there is a bijection between [0,n-1] x [0,n-2] x ... x [0,1] x [0] and [0, n!-1]).
I suggest adding a link to the full explanation instead.

jordanschalm · 2023-10-20T19:25:28Z

crypto/random/rand_test.go

+				BasicDistributionTest(t, factN, 1, permEncoding)
+			})
+
+			t.Run("shuffle a same permutation", func(t *testing.T) {


I don't understand why we have this test, which looks identical to the previous one.

The first test shuffles an array multiple times. The array resulting from the previous shuffle is shuffled again. The distribution is computed from all intermediate arrays of each loop.

The second test shuffles the same array [0, 1, ..., n-1] multiple times. The resulting array after each loop is counted in the distribution, but is then discarded. The next loop shuffles the same starting array again.

The shuffling should be uniform in both scenarios. Some buggy implementations may be uniform in one scenario and not the other.

Tarak Ben Youssef and others added 5 commits October 16, 2023 17:29

fix typos and add a constant for better clarity

bf0ab86

add function to encode a permutation into an integer

8e5e693

update shuffle uniformity test to be based on the permutation encodin…

3f48bd0

…g uniformity

improve docs

6c80934

Merge branch 'master' into tarak/improve-shuffle-test

c45d625

tarakby added the Improvement label Oct 19, 2023

tarakby requested review from durkmurder, jordanschalm, gomisha and AlexHentschel October 19, 2023 22:16

jordanschalm reviewed Oct 20, 2023


add links to computation details

03b9532

tarakby requested a review from jordanschalm October 31, 2023 19:27

tarakby mentioned this pull request Oct 31, 2023

Pseudo-random generator statistical tests onflow/random-coin-toss#4

Merged

jordanschalm approved these changes Oct 31, 2023


sisyphusSmiling approved these changes Nov 7, 2023


Merge branch 'master' into tarak/improve-shuffle-test

7c272fe

tarakby enabled auto-merge November 7, 2023 01:18

Merge branch 'master' into tarak/improve-shuffle-test

9ce23e0

tarakby added this pull request to the merge queue Nov 7, 2023

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Nov 7, 2023

tarakby added this pull request to the merge queue Nov 7, 2023

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Nov 7, 2023

tarakby added this pull request to the merge queue Nov 7, 2023

tarakby removed this pull request from the merge queue due to a manual request Nov 7, 2023

tarakby added this pull request to the merge queue Nov 7, 2023

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Nov 7, 2023

gomisha added this pull request to the merge queue Nov 7, 2023

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Nov 7, 2023

tarakby added this pull request to the merge queue Nov 8, 2023

Merged via the queue into master with commit fe6714e Nov 8, 2023
36 checks passed

tarakby deleted the tarak/improve-shuffle-test branch November 8, 2023 01:34
