Update reserved.ts to include hexpeak and permutations of slurs #1326

simonblack · 2023-07-13T16:00:38Z

Added dynamic slur generation to ensure no slurs are missed/bypassed using hexspeak
removed trailing space after 'nazi'
reintroduced pajeet, added jeet, its a bastardization of the traditional name Paaji; jeet/pajeet is used almost exclusively to exhibit racist undertones
reintroduced 'retard' for what should be clear reasons

apologies in advance if this breaks your code style, implementation speed was more important!

cheers

- Added dynamic slur generation to ensure no slurs are missed/bypassed using hexspeak - removed trailing space after 'nazi' - reintroduced pajeet, added jeet, its a bastardization of the traditional name Paaji; jeet/pajeet is used almost exclusively to exhibit racist undertones - reintroduced 'retard' for what should be clear reasons

- added more cases to hexspeak generator

ixtli · 2023-07-13T19:07:43Z

this is a good pr because it is a template for others to use to add more checks for obfuscated slurs.

one thing to note, though, its been shown elsewhere that given the size and complexity of unicode rendering its not really possible to detect all permutations of characters. ultimately there may be a need to render into a buffer and do some computer vision on it to see if the form resembles the form of a slur :(

ixtli · 2023-07-13T19:11:13Z

packages/identifier/src/reserved.ts

+            hex_words.push(hex_word + 's');
+            hex_words.push(hex_word + '5');


to whomever reviews this: this is likely the most important part that really should have been there before.

simonblack · 2023-07-13T19:20:01Z

this is a good pr because it is a template for others to use to add more checks for obfuscated slurs.

one thing to note, though, its been shown elsewhere that given the size and complexity of unicode rendering its not really possible to detect all permutations of characters. ultimately there may be a need to render into a buffer and do some computer vision on it to see if the form resembles the form of a slur :(

totally! my thoughts behind this implementation was to quickly patch it the way they did but to make it far more robust that what they did initially. I am certain that if this is adopted there will have to be a better design decision made when integrating though, imo, anything involving machine learning would be something to consider for the longterm, right now we just need a patch for people to feel as safe as they did a week ago (hopefully a bit better) while a far more robust solution is implemented

dholms · 2023-07-14T03:37:09Z

Hey thanks for the PR! We added support for hexpeak & permutations of slurs here: #1336

HarryGogonis mentioned this pull request Jul 13, 2023

Block slurs as handles #1317

Closed

simonblack marked this pull request as draft July 13, 2023 17:22

Improved permutation generation

7efee0a

- added more cases to hexspeak generator

simonblack marked this pull request as ready for review July 13, 2023 17:26

ixtli reviewed Jul 13, 2023

View reviewed changes

WilliamRoyNelson mentioned this pull request Jul 13, 2023

Testing framework needed to assist slur filtering efforts #1337

Closed

dholms closed this Jul 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update reserved.ts to include hexpeak and permutations of slurs #1326

Update reserved.ts to include hexpeak and permutations of slurs #1326

simonblack commented Jul 13, 2023

ixtli commented Jul 13, 2023

ixtli Jul 13, 2023

simonblack commented Jul 13, 2023

dholms commented Jul 14, 2023

		hex_words.push(hex_word + 's');
		hex_words.push(hex_word + '5');

Update reserved.ts to include hexpeak and permutations of slurs #1326

Update reserved.ts to include hexpeak and permutations of slurs #1326

Conversation

simonblack commented Jul 13, 2023

ixtli commented Jul 13, 2023

ixtli Jul 13, 2023

Choose a reason for hiding this comment

simonblack commented Jul 13, 2023

dholms commented Jul 14, 2023