URL/Encoding: change query state parsing #10915

annevk · 2018-05-09T10:08:04Z

See whatwg/encoding#139 for rationale and whatwg/url#386 for the change to the URL Standard.

(I found all these resources in part due to @rakuco's work on trying to align Chrome with the earlier iteration of the specification.)

annevk · 2018-05-09T10:09:29Z

Note: I looked into removing normalizeStr completely, but that broke too many tests. So I only removed it where it was a no-op or caused only one browser (Firefox) to start failing tests.

Perhaps further cleanup here can be done as part of #10636.

annevk · 2018-05-09T10:24:44Z

Bugs:

annevk · 2018-05-09T13:09:46Z

This needs to change #10891 as well. I haven't corrected it there to keep the state of the test suite consistent.

annevk · 2018-05-09T14:40:43Z

That's now done. This as well the URL Standard change are ready for review.

annevk · 2018-05-09T14:54:21Z

Note to self: once this is reviewed and landed, do some work on #4934 again.

inexorabletash

lgtm

annevk · 2018-05-10T05:28:24Z

(I merged the commits together so that "rebase and merge" can be used by an owner. This needs an override due to build timeouts. However, please do not merge this before I remove that label.)

annevk · 2018-05-10T05:28:49Z

Also, thanks @inexorabletash for the speedy review!

If the input to the URL parser contains code points outside the non-UTF-8 encoding's value space and the URL parser was invoked using a non-UTF-8 encoding, then those code points end up as &#...;. The problem is that &, #, and ; are also URL separators, but the previous algorithm would only encode #. This ensures that & and ; are also encoded, as some browsers already do, but only if they came about as the result of the encode operation. Tests: web-platform-tests/wpt#10915.

annevk · 2018-05-22T09:32:58Z

I rebased this to address a merge conflict in url/README.md.

If the input to the URL parser contains code points outside the non-UTF-8 encoding's value space and the URL parser was invoked using a non-UTF-8 encoding, then those code points end up as &#...;. The problem is that &, #, and ; are also URL separators, but the previous algorithm would only encode #. This ensures that & and ; are also encoded, as some browsers already do, but only if they came about as the result of the encode operation. Tests: web-platform-tests/wpt#10915. Fixes whatwg/encoding#139.

@rakuco

See whatwg/encoding#139 for rationale and whatwg/url#386 for the change to the URL Standard. (I found all these resources in part due to @rakuco's work on trying to align Chrome with the earlier iteration of the specification.)

wpt-pr-bot added encoding html url xhr labels May 9, 2018

wpt-pr-bot requested review from ayg, caitp, domenic, emilio, GPHemsley, hallvors, ibelem, inexorabletash, jdm, jgraham, jungkees, kangxu, Manishearth, mathiasbynens and mikewest May 9, 2018 10:08

annevk mentioned this pull request May 9, 2018

Extract Location object tests from query-encoding/ #10891

Merged

annevk force-pushed the annevk/query-state-encoding branch from 39db988 to 361b63f Compare May 9, 2018 14:39

inexorabletash approved these changes May 9, 2018

View reviewed changes

annevk force-pushed the annevk/query-state-encoding branch from 0551797 to bd7ea53 Compare May 10, 2018 05:27

wpt-pr-bot requested a review from ronkorving May 10, 2018 05:27

annevk added the status:needs-spec-decision label May 10, 2018

annevk added the do not merge yet label May 10, 2018

annevk force-pushed the annevk/query-state-encoding branch from bd7ea53 to d5b8565 Compare May 22, 2018 09:32

URL/Encoding: change query state parsing

6dc57cd

See whatwg/encoding#139 for rationale and whatwg/url#386 for the change to the URL Standard. (I found all these resources in part due to @rakuco's work on trying to align Chrome with the earlier iteration of the specification.)

sideshowbarker force-pushed the annevk/query-state-encoding branch from d5b8565 to 6dc57cd Compare May 23, 2018 07:27

jgraham merged commit e399a2c into master May 23, 2018

annevk deleted the annevk/query-state-encoding branch May 23, 2018 08:47

GPHemsley removed do not merge yet status:needs-spec-decision labels Jun 24, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

URL/Encoding: change query state parsing #10915

URL/Encoding: change query state parsing #10915

annevk commented May 9, 2018 •

edited by wpt-pr-bot

Loading

annevk commented May 9, 2018

annevk commented May 9, 2018

annevk commented May 9, 2018

annevk commented May 9, 2018

annevk commented May 9, 2018

inexorabletash left a comment

annevk commented May 10, 2018

annevk commented May 10, 2018

annevk commented May 22, 2018

URL/Encoding: change query state parsing #10915

URL/Encoding: change query state parsing #10915

Conversation

annevk commented May 9, 2018 • edited by wpt-pr-bot Loading

annevk commented May 9, 2018

annevk commented May 9, 2018

annevk commented May 9, 2018

annevk commented May 9, 2018

annevk commented May 9, 2018

inexorabletash left a comment

Choose a reason for hiding this comment

annevk commented May 10, 2018

annevk commented May 10, 2018

annevk commented May 22, 2018

annevk commented May 9, 2018 •

edited by wpt-pr-bot

Loading