Serializing a double with JsonSerializer results in JSON with lower precision #45341

dgiddins · 2020-11-26T14:21:32Z

Issue Title

Serializing a double with JsonSerializer results in JSON with lower precision

General

While upgrading from dotnet core 2.2 to dotnet 5.0 we experienced failing tests that depend on consistent serilaization of objects to generate a hash. I have tracked this down to the way the double type is serialized when passed to the JsonSerializer.Serialize method. During serialization we seem to be losing precision where a double is rounded to the next decimal point, but not every number is rounded.

For example

 var d = 50.494329039350461;

var dAsText = System.Text.Json.JsonSerializer.Serialize(d);

//Value of dAsText is 50.49432903935046

When we deserialize we get the origial number back but we need to act on serialized data to generate our hash. Why has this behaviour changed between frameworks, is it a bug or is it intended? Are there any settings we can change to restore the previous implementation (while remaining on .net 5 of course). The same behaviour can be seen with Newtonsoft Json (I have raised an issue with them as well)

The behavior seem consistent on Windows 10x64 and in a Lunix Docker image. This is running in Visual Studio 2019 (latest update)

The problem can also be seen with these number:

50.494328391915907
30.316339899700989
50.494128852095287

danmoseley · 2020-11-30T06:05:23Z

@dgiddins this is a consequence of the precision of double math. Both the before and after numbers in your example have exactly the same bit representation when stored in a double:

        Console.WriteLine("{0:X16}", BitConverter.DoubleToInt64Bits(50.494329039350461));
        Console.WriteLine("{0:X16}", BitConverter.DoubleToInt64Bits(50.49432903935046));

40493F462C88BC96
40493F462C88BC96

https://sharplab.io/#v2:EYLgtghgzgLgpgJwDQxAgrgOwD4AEBMAjALABQuAzAAQFUDCVA3mVazfvky2z9z67kIBOABQAiRgAYQADUIA2AL5ikVAEIBLGHQD2mAG6J4CAHQARHemAAbOABUdASUwx5AFk0woIgKySTbkJuFPhCkhRCFH5u8oQAlHEA3Hz8gqIS0nJKKupaugZGiOaWNvZOLu6e3n4BQSFhEVGSMQnJpCmKZIpAA=

So in your program d actually contains value 0x40493F462C88BC96 = 100000001001001001111110100011000101100100010001011110010010110. A floating point representation like 50.49432903935046 is often not the only floating point number that is stored as that bit pattern, so the routine that converts the bit pattern to a string must somehow choose which to use.

The previous implementation happened to translate 40493F462C88BC96 into 50.494329039350461 (which happens to be what it "came from") instead of 50.49432903935046; the current implementation chooses the latter, because it is the shortest possible representation that is stored as that bit pattern (ie., that is round trippable to itself when parsed from a string). Usually this is what you want, as it's more compact. This work was done to conform to standard IEEE-754.

Do you need numbers of this sort to deserialize to exactly the same representation? If so, you need to either serialize them as a larger type (like a 128 bit float, which .NET does not natively support) or as a string. Otherwise, perhaps you are satisfied with the explanation above.

dgiddins · 2020-11-30T09:14:45Z

Thank you @danmosemsft

My understanding is the behaviour I saw in .net core 2.2 was considered a bug so I am happy with the behaviour as it is in .net 5. The data represents a lat or long in our system so as long as we don't lose precision then the way the serialization works isn't citically important, we where simply serializing and object that contained a lat and long and generating a has from the Json representation to detect changes between different systems with the same data.

danmoseley transferred this issue from dotnet/core Nov 30, 2020

Dotnet-GitSync-Bot added area-System.Text.Json untriaged New issue has not been triaged by the area owner labels Nov 30, 2020

dgiddins closed this as completed Nov 30, 2020

layomia removed the untriaged New issue has not been triaged by the area owner label Dec 1, 2020

briansull mentioned this issue Dec 9, 2020

CI failure in JIT.Regression Runtime_40444.cmd #44831

Closed

ghost locked as resolved and limited conversation to collaborators Dec 31, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Serializing a double with JsonSerializer results in JSON with lower precision #45341

Serializing a double with JsonSerializer results in JSON with lower precision #45341

dgiddins commented Nov 26, 2020

danmoseley commented Nov 30, 2020 •

edited

Loading

dgiddins commented Nov 30, 2020

Serializing a double with JsonSerializer results in JSON with lower precision #45341

Serializing a double with JsonSerializer results in JSON with lower precision #45341

Comments

dgiddins commented Nov 26, 2020

Issue Title

General

danmoseley commented Nov 30, 2020 • edited Loading

dgiddins commented Nov 30, 2020

danmoseley commented Nov 30, 2020 •

edited

Loading