Update reserved.ts to include hexpeak and permutations of slurs #1326

Closed

simonblack

  • Added dynamic slur generation so that slurs obfuscated with hexspeak are not missed or bypassed
  • Removed the trailing space after 'nazi'
  • Reintroduced 'pajeet' and added 'jeet'; it's a bastardization of the traditional name Paaji, and jeet/pajeet is used almost exclusively with racist undertones
  • Reintroduced 'retard' for what should be clear reasons
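
The dynamic generation described in the list above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the substitution table, the function names `generateVariants` and `buildReservedSet`, and the harmless placeholder word `test` are all assumptions made for the example.

```typescript
// Hypothetical substitution table: each letter maps to the characters that
// may stand in for it. The table actually used in the PR is not shown here.
const SUBSTITUTIONS: Record<string, string[]> = {
  a: ['a', '4'],
  e: ['e', '3'],
  i: ['i', '1'],
  o: ['o', '0'],
  s: ['s', '5'],
  t: ['t', '7'],
};

// Returns every spelling of `word` obtained by independently swapping each
// letter for one of its substitutes (the original spelling is included).
function generateVariants(word: string): string[] {
  let variants: string[] = [''];
  for (const ch of word.toLowerCase().split('')) {
    const options = SUBSTITUTIONS[ch] ?? [ch];
    const next: string[] = [];
    for (const prefix of variants) {
      for (const option of options) {
        next.push(prefix + option);
      }
    }
    variants = next;
  }
  return variants;
}

// Builds a lookup set covering all variants of every base word.
function buildReservedSet(baseWords: string[]): Set<string> {
  const reserved = new Set<string>();
  for (const word of baseWords) {
    for (const variant of generateVariants(word)) {
      reserved.add(variant);
    }
  }
  return reserved;
}

// Placeholder word only; a real list would hold the reserved terms.
const reserved = buildReservedSet(['test']);
console.log(reserved.has('7357')); // true
console.log(reserved.size); // 16
```

The number of variants grows multiplicatively with the number of substitutable letters, so a long word with many options per letter can produce a large set. Matching against a normalized form of the input is a possible alternative to enumerating every spelling.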

apologies in advance if this breaks your code style; implementation speed was more important!

cheers

@simonblack simonblack marked this pull request as draft July 13, 2023 17:22
- added more cases to hexspeak generator
@simonblack simonblack marked this pull request as ready for review July 13, 2023 17:26
@ixtli

ixtli commented Jul 13, 2023

this is a good pr because it is a template for others to use to add more checks for obfuscated slurs.

one thing to note, though: it's been shown elsewhere that, given the size and complexity of unicode rendering, it's not really possible to detect all permutations of characters. ultimately there may be a need to render the text into a buffer and run some computer vision on it to see whether its form resembles that of a slur :(
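
To illustrate the limitation described above, here is a minimal sketch of Unicode folding before matching. It is not part of this PR; the function name `foldForMatching` is hypothetical. It uses the standard `String.prototype.normalize` with the NFKD form and strips combining marks in the U+0300 to U+036F range, which handles fullwidth and accented letters but leaves look-alike characters from other scripts untouched.

```typescript
// Folds compatibility forms (e.g. fullwidth letters) and strips combining
// diacritics so that visually decorated input compares equal to plain ASCII.
function foldForMatching(input: string): string {
  return input
    .normalize('NFKD') // decompose compatibility and accented characters
    .replace(/[\u0300-\u036f]/g, '') // drop combining diacritical marks
    .toLowerCase();
}

// Fullwidth "t", accented "e", fullwidth "s", fullwidth "t".
const decorated = '\uFF54\u00E9\uFF53\uFF54';
console.log(foldForMatching(decorated) === 'test'); // true

// Cyrillic "е" (U+0435) looks like Latin "e" but is a different letter,
// so normalization alone does not map it back.
const homoglyph = 't\u0435st';
console.log(foldForMatching(homoglyph) === 'test'); // false
```

The second case is the kind of input that normalization cannot catch on its own, which is why the comment above suggests that comparing rendered forms may eventually be needed.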

Comment on lines +1129 to +1130
hex_words.push(hex_word + 's');
hex_words.push(hex_word + '5');

to whomever reviews this: this is likely the most important part that really should have been there before.
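
For context, the two `push` calls under review extend each generated word with a plural ending, once with a plain `s` and once with the digit `5`. A minimal sketch of that step is below; the function name `withPlurals` and the assumption that the unsuffixed word is also kept are illustrative, since the surrounding code is not shown in this thread.

```typescript
// Returns each generated word together with its two plural spellings,
// mirroring the `hex_word + 's'` and `hex_word + '5'` pushes in the diff.
// Keeping the unsuffixed word is an assumption made for this sketch.
function withPlurals(hexWords: string[]): string[] {
  const result: string[] = [];
  for (const hexWord of hexWords) {
    result.push(hexWord);
    result.push(hexWord + 's');
    result.push(hexWord + '5');
  }
  return result;
}

// Placeholder input only.
console.log(withPlurals(['t3st'])); // [ 't3st', 't3sts', 't3st5' ]
```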

@simonblack
Author

this is a good pr because it is a template for others to use to add more checks for obfuscated slurs.

one thing to note, though: it's been shown elsewhere that, given the size and complexity of unicode rendering, it's not really possible to detect all permutations of characters. ultimately there may be a need to render the text into a buffer and run some computer vision on it to see whether its form resembles that of a slur :(

totally! my thinking behind this implementation was to patch it quickly the way they did, but to make it far more robust than what they did initially. I am certain that if this is adopted, a better design decision will have to be made when integrating. imo, anything involving machine learning is something to consider for the long term; right now we just need a patch so people feel as safe as they did a week ago (hopefully a bit safer) while a far more robust solution is implemented.

@dholms
Collaborator

dholms commented Jul 14, 2023

Hey, thanks for the PR! We added support for hexspeak & permutations of slurs here: #1336

@dholms dholms closed this Jul 14, 2023
3 participants