Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AO3-6087 Stop removing non-printable Unicode characters #4798

Merged
merged 2 commits into from
May 17, 2024

Conversation

potpotkettle
Copy link
Contributor

@potpotkettle potpotkettle commented Apr 25, 2024

Issue

https://otwarchive.atlassian.net/browse/AO3-6087

Purpose

This will retain non-printable characters entered by the user in various places including the chapter text, comment, etc. They are necessary for representing characters in some languages.

Apparently the special characters caused problems to the parser in the past. While the details are unclear, we tentatively believe that is not the case any more.

Testing Instructions

Try entering text with zero-width joiner, zero-width non-joiner, and word joiner into the main text of a work, or into a comment. They should remain.

References

I believe this will also affect emojis with gender variants (and maybe other variants), as reported in https://old.reddit.com/r/AO3/comments/1cbpoh8/commenting_issue/ .

Credit

Potpotkettle (they/them)

especially, zero-width joiner, zero-width non-joiner and word joiner
spec/lib/html_cleaner_spec.rb Outdated Show resolved Hide resolved
@brianjaustin brianjaustin merged commit 3e683fc into otwcode:master May 17, 2024
26 checks passed
sarken added a commit that referenced this pull request May 31, 2024
sarken added a commit that referenced this pull request May 31, 2024
Revert "AO3-6087 Stop removing non-printable Unicode characters (#4798)"

This reverts commit 3e683fc.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants