In commit deed711, CharsetDetector was refactored and the two steps of the WHATWG recommendation that were missing (overriding the META tag and use of the BOM) were added. Bug fixes are in deed711; additional unit tests were added in 9524744. I suggest that RotatingCharsetDetector now be deprecated.
CharsetDetector fails to detect the correct character encoding when the `META` tag says `charset=UTF-16` but the content is in fact in `UTF-8`. This is because CharsetDetector gives the `META` tag higher priority than the charset detected from the content. Reimplement CharsetDetector with reference to the WHATWG recommendation: http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#encoding-sniffing-algorithm

Known internally as ARI-3933.
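To illustrate the precedence this issue asks for, here is a minimal sketch of the two steps mentioned (BOM first, then a `META` declaration whose UTF-16 value is overridden to UTF-8). It is not the project's actual CharsetDetector: the class `EncodingSniffSketch`, its method names, and the regex-based `META` scan are hypothetical simplifications, and the other WHATWG steps (transport-layer charset, content-based detection) are collapsed into a caller-supplied fallback.

```java
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical sketch of WHATWG-style precedence; not the project's CharsetDetector. */
class EncodingSniffSketch {

    // Simplified stand-in for the WHATWG META prescan (assumption: a regex is
    // good enough for illustration; the real prescan is a byte-level parser).
    private static final Pattern META_CHARSET = Pattern.compile(
            "<meta[^>]*charset\\s*=\\s*[\"']?([A-Za-z0-9_\\-]+)",
            Pattern.CASE_INSENSITIVE);

    /** Step 1: a byte order mark, when present, takes precedence. */
    public static String fromBom(byte[] b) {
        if (b.length >= 3 && (b[0] & 0xFF) == 0xEF
                && (b[1] & 0xFF) == 0xBB && (b[2] & 0xFF) == 0xBF) {
            return "UTF-8";
        }
        if (b.length >= 2 && (b[0] & 0xFF) == 0xFE && (b[1] & 0xFF) == 0xFF) {
            return "UTF-16BE";
        }
        if (b.length >= 2 && (b[0] & 0xFF) == 0xFF && (b[1] & 0xFF) == 0xFE) {
            return "UTF-16LE";
        }
        return null;
    }

    /** Step 2: read the META declaration from the first 1024 bytes. */
    public static String fromMeta(byte[] b) {
        int n = Math.min(b.length, 1024);
        // ISO-8859-1 maps every byte to one char, so decoding never fails.
        String head = new String(b, 0, n, StandardCharsets.ISO_8859_1);
        Matcher m = META_CHARSET.matcher(head);
        if (!m.find()) {
            return null;
        }
        String name = m.group(1).toUpperCase(Locale.ROOT);
        // If the META tag was readable as ASCII-compatible bytes, the document
        // cannot really be UTF-16, so the declaration is overridden to UTF-8.
        if (name.startsWith("UTF-16")) {
            return "UTF-8";
        }
        return name;
    }

    /** BOM, then META (with override), then the caller's fallback. */
    public static String sniff(byte[] content, String fallback) {
        String cs = fromBom(content);
        if (cs != null) {
            return cs;
        }
        cs = fromMeta(content);
        if (cs != null) {
            return cs;
        }
        return fallback;
    }

    public static void main(String[] args) {
        byte[] page = "<html><head><meta charset=\"UTF-16\"></head></html>"
                .getBytes(StandardCharsets.UTF_8);
        System.out.println(sniff(page, "windows-1252"));
    }
}
```

With this ordering, a UTF-8 page that wrongly declares `charset=UTF-16` and has no BOM resolves to `UTF-8` rather than trusting the `META` tag, which is the failure case described above.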