One character (byte?) is always missed after U+10000. It leads to more problems with subsequent multi-byte codepoints, but I'm sure the first one is the actual cause. Does not happen with U+FFFF or U+10001. Seems like an off-by-one error, since U+10000 is the first 4-byte codepoint in UTF-8.
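To make the boundary concrete, here is a minimal sketch that prints the UTF-8 bytes of the three code points mentioned above. It is a standalone Python illustration, not Haxe or HashLink code, and the helper name `utf8_hex` is hypothetical. Note that U+10000 and U+10001 both encode to four bytes, so byte length alone does not distinguish them.

```python
# Standalone illustration (Python, not Haxe/HashLink).
# utf8_hex is a hypothetical helper name introduced for this sketch.
def utf8_hex(code_point: int) -> str:
    """Return the UTF-8 encoding of a code point as space-separated hex bytes."""
    return " ".join(f"{b:02X}" for b in chr(code_point).encode("utf-8"))


# The three code points discussed in the report.
for cp in (0xFFFF, 0x10000, 0x10001):
    encoded = utf8_hex(cp)
    print(f"U+{cp:04X}: {encoded} ({len(encoded.split())} bytes)")

# Prints:
# U+FFFF: EF BF BF (3 bytes)
# U+10000: F0 90 80 80 (4 bytes)
# U+10001: F0 90 80 81 (4 bytes)
```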
Just trace on the string seems to produce the correct output (F0 90 80 80):
$ hl out.hl | xxd
00000000: 4d61 696e 2e68 783a 343a 20f0 9080 8061  Main.hx:4: ....a
...
Expected output:
On HL: