Iterative generic math changes to match API review #69391

tannergooding · 2022-05-16T12:48:30Z

This updates GetShortestBitLength to return an int, exposes the WriteBigEndian, WriteExponentBigEndian, WriteSignificandBigEndian APIs, and updates System.Char to explicitly implement the numeric interfaces.

dotnet-issue-labeler · 2022-05-16T12:48:35Z

Note regarding the new-api-needs-documentation label:

This serves as a reminder for when your PR is modifying a ref *.cs file and adding/modifying public APIs, to please make sure the API implementation in the src *.cs file is documented with triple slash comments, so the PR reviewers can sign off that change.

dotnet-issue-labeler · 2022-05-16T12:48:36Z

I couldn't figure out the best area label to add to this PR. If you have write-permissions please help me learn by adding exactly one area label.

stephentoub · 2022-05-16T16:26:59Z

src/libraries/System.Private.CoreLib/src/System/Byte.cs

+            if (destination.Length >= sizeof(byte))
+            {
+                byte value = m_value;
+                Unsafe.WriteUnaligned(ref MemoryMarshal.GetReference(destination), value);


Does this result in better codegen than just destination[0] = m_value;? I'd have expected the guard above (which could also be if (!destination.IsEmpty)) to remove the bounds check.

Probably not, but its simpler to implement and compare the code when they are all roughly the same size/shape so I've mostly just been doing write once, copy/paste, small tweak.

I can update if you'd prefer it do the simpler thing here.

src/libraries/System.Private.CoreLib/src/System/Decimal.cs

src/libraries/System.Private.CoreLib/src/System/Double.cs

src/libraries/System.Private.CoreLib/src/System/Int64.cs

bartonjs · 2022-05-16T19:34:18Z

src/libraries/System.Private.CoreLib/src/System/Numerics/IBinaryInteger.cs

+        /// <summary>Tries to write the current value, in big-endian format, to a given span.</summary>
+        /// <param name="destination">The span to which the current value should be written.</param>
+        /// <param name="bytesWritten">The number of bytes written to <paramref name="destination" />.</param>
+        /// <returns><c>true</c> if the value was succesfully written to <paramref name="destination" />; otherwise, <c>false</c>.</returns>


By the by; my understanding is that <c>true</c> and <c>false</c> should actually be <see langword="true" /> and <see langword="false"/> so that they capitalize when changing to the VB version. (Same with null => Nothing)

Carlos' tool probably handles those as special-cased; but thought I'd share for future reference.

@carlossanlop is this handled or should I be explicitly using see langword ?

bartonjs · 2022-05-16T22:49:32Z

src/libraries/System.Runtime.Numerics/src/System/Numerics/BigInteger.cs

+
+                        i++;
+                    }
+                    while ((part == 0) && (i < bits.Length));


What spins here, and do we have boundary tests for it? (e.g. something does the do and exits to the other loop, something else runs the body twice, something runs thrice, and maybe 4 isn't possible?)

(A comment in the code to help future maintainers would be helpful).

No sure what you're asking for here? This handles the conversion from one's complement to two's complement.

The first loop handles 0x0000_0000 as the one's complement of that is 0x0000_0000. The second loop handles the remaining bits in the value.

0x0000_0000_0000_0000_0000_0000 becomes 0x0000_0000_0000_0000_0000_0000

0x0000_0000_0000_0000_0000_0001 becomes 0xFFFF_FFFF_FFFF_FFFF_FFFF_FFFF

0x0000_0000_0000_0001_0000_0000 becomes 0x0000_0000_FFFF_FFFF_0000_0000

This handles what is logically ~x + 1

I'm trying to understand what values of a BigInteger might cause the do/while to run more than once. Given that there's all of 1s complement, 2s complement, little endian, and big endian present, I can't follow the data flow in my head.

Negative values -1 .. int.MinValue + 1 all fit in _sign and _bits is null, so none of those are relevant.

int.MinValue produces a one element _bits value, so doesn't loop.

int.MinValue - 1 (0xFFFFFFFF_7FFFFFFF) is, I believe, stored as { 0x00000000, 0x80000000 }

The first time through the loop we load the 0. So now do ~0 + 1, which I guess goes back to 0.

Yay, this value looped.

The second time through we load the 0x8000_0000 and produce, uh, 0x8000_0000?

It's not zero, so we'd exit, but we're also out of data, so we'd exit anyways.

So perhaps the answer is "anything in the tests ending in multiples of 4 zero-byte values (BE) went through the top loop (trailing_zero_bytes_count / 4) + 1 times".

If that's the case, I'd expect to see tests that

Don't end in 4 zeros, but are at least 8 bytes (do+break+second_loop)

End in 4 zeros and are 8 bytes (do+do+return)

End in 4 zeros and are more than 8 bytes (do+do+break+second_loop)

End in 8 zeros and are 12 bytes (do+do+do+return)

End in 8 zeros and are more than 12 bytes (do+do+do+break+second_loop)

Ideally with each of the second_loop cases having a one/more_than_one case (and super ideally as one/two/more_than_two, since sometimes "two" is special)

If I understand the states now, I don't see any tests that produce 12 or more bytes, so there's nothing that runs that top loop 3 or more times.

The thing I'd hope for here is a comment talking about some of the boundary cases. i.e. a particular value that will run the top loop twice, then the bottom loop twice, but one more (or less?) would have a different execution characteristic (which I'm not feeling knowledgeable enough to predict if that means 1/3 or 3/1, or a black hole, or what). And those same values being codified in tests.

For this method in particular, we have tests covering everything but the 12 + byte case because it requires significant more setup to do and represent.

I can certainly add it, but its not interesting IMO. This logic is the same as we have in WriteLittleEndian and that is used in DangerousMakeTwosComplement, etc. The only difference is that it writes from the end of the buffer to the beginning, rather than from the beginning to the end and the current tests ensure that we are computing the correct results with both loops executing.

This logic is the same as we have in WriteLittleEndian

Which looks to be equally under-tested.

and that is used in DangerousMakeTwosComplement, etc.

To me, the fact that there are three copies of the code means there need to be more tests, not fewer, so that someone who changes something about only two of the three is likely to see failures from the remaining one.

Having read the comments in DangerousMakeTwosComplement I now better understand the reason for two distinct pieces of logic (the first loop is the +1 of the overall (~x + 1), and then also any carry values from the lower significance segments), but I genuinely didn't get that impression when reading the TryWriteLittleEndian and TryWriteBigEndian code blocks... it just came across as magic.

I still think that tests in the 12-16 byte range are useful for both TryWriteBigEndian and TryWriteLittleEndian, and even a comment as sparse as what DangerousMakeTwosComplement has about the carry bit would go a long way toward maintainability.

Fixed. There was indeed an issue with a specific edge case value due to it not accounting for needing an extra "part" to hold the sign.

In particular, signed.MinValue - 1 is represented in one's complement as 0x8000_0000, 0x0000_0001, swapped to two's complement this is 0x7FFF_FFFF, 0xFFFF_FFFF which is of course signed.MaxValue instead. We need it to be 0xFFFF_FFFF, 0x7FFF_FFFF, 0xFFFF_FFFF so that the sign is still tracked instead.

bartonjs · 2022-05-16T22:54:24Z

src/libraries/System.Runtime/tests/System/DoubleTests.GenericMath.cs

+
+            Assert.True(FloatingPointHelper<double>.TryWriteSignificandBigEndian(double.NegativeInfinity, destination, out bytesWritten));
+            Assert.Equal(8, bytesWritten);
+            Assert.Equal(new byte[] { 0x00, 0x10, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00 }, destination.ToArray());


FWIW: We have an AssertExtensions.SequenceEquals<T>(ROSpan<T>, ROSPan<T>) for spans, if you don't like all the ToArray calls in your tests.

https://github.com/dotnet/runtime/blob/main/src/libraries/Common/tests/TestUtilities/System/AssertExtensions.cs#L435-L476

Good to know for future changes. I might do that as a later cleanup. As for now, the current logic works and allows easily copying and tweaking logic between the 15-20 types that need it.

…r `GetShortestBitLength`

…e the one's complement format

…e correct

tannergooding added 3 commits May 15, 2022 06:21

Change GetShortestBitLength to return int and add WriteBigEndian APIs

eab08e6

Add WriteExponentBigEndian and WriteSignificandBigEndian APIs

0118da2

Update System.Char to explicitly implement the numeric interfaces

154c181

ghost assigned tannergooding May 16, 2022

dotnet-issue-labeler bot added the new-api-needs-documentation label May 16, 2022

tannergooding mentioned this pull request May 16, 2022

Updating BigInteger to implement IBinaryInteger and ISignedNumber #68964

Merged

stephentoub reviewed May 16, 2022

View reviewed changes

Ensure BigInteger.TryWriteBigEndian correctly offsets the address

89f2784

stephentoub reviewed May 16, 2022

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Decimal.cs Outdated Show resolved Hide resolved

tannergooding added 2 commits May 16, 2022 09:50

Fixing two usages of var in decimal

ccb84f1

Ensure lo64 is ulong and hi32 is uint

d4f17c1

bartonjs reviewed May 16, 2022

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Decimal.cs Show resolved Hide resolved

bartonjs reviewed May 16, 2022

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Double.cs Show resolved Hide resolved

bartonjs reviewed May 16, 2022

View reviewed changes

src/libraries/System.Private.CoreLib/src/System/Int64.cs Show resolved Hide resolved

bartonjs reviewed May 16, 2022

View reviewed changes

tannergooding added 5 commits May 18, 2022 21:47

Merge remote-tracking branch 'dotnet/main' into generic-math-2

5d8473c

Ensure Int128/UInt128 implement TryWriteBigEndian and return int fo…

5baf74d

…r `GetShortestBitLength`

Ensure that GetByteCount and GetShortestBitLength correctly handl…

8eb47d4

…e the one's complement format

Ensure the ReverseEndianness calls in TryWrite* for Int128/UInt128 ar…

2d54b7d

…e correct

Ensure BigInteger.TryWriteLittleEndian has a correct assert

c55e8e6

bartonjs approved these changes May 19, 2022

View reviewed changes

tannergooding closed this May 19, 2022

tannergooding reopened this May 19, 2022

This was referenced May 19, 2022

jit.1 work item failing on mono #67888

Closed

Test failure JIT/Performance/CodeQuality/BenchmarksGame/regex-redux/regex-redux-5/regex-redux-5.sh #66625

Closed

tannergooding merged commit 0c6d412 into dotnet:main May 19, 2022

ghost locked as resolved and limited conversation to collaborators Jun 19, 2022

tannergooding deleted the generic-math-2 branch November 11, 2022 15:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Iterative generic math changes to match API review #69391

Iterative generic math changes to match API review #69391

tannergooding commented May 16, 2022

dotnet-issue-labeler bot commented May 16, 2022

dotnet-issue-labeler bot commented May 16, 2022

stephentoub May 16, 2022 •

edited

Loading

tannergooding May 16, 2022

bartonjs May 16, 2022

tannergooding May 16, 2022

bartonjs May 16, 2022

tannergooding May 17, 2022

bartonjs May 17, 2022

tannergooding May 17, 2022 •

edited

Loading

bartonjs May 17, 2022

tannergooding May 19, 2022

bartonjs May 16, 2022

tannergooding May 17, 2022

Iterative generic math changes to match API review #69391

Iterative generic math changes to match API review #69391

Conversation

tannergooding commented May 16, 2022

dotnet-issue-labeler bot commented May 16, 2022

dotnet-issue-labeler bot commented May 16, 2022

stephentoub May 16, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tannergooding May 17, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stephentoub May 16, 2022 •

edited

Loading

tannergooding May 17, 2022 •

edited

Loading