Best approach to censor bad words in form input

Geremy_de_Melo · June 4, 2023, 10:27pm

I don’t want my users to abuse my virtual postcards creation app, filling the form with words that are not allowed (mostly bad words).

I already developed a system that can identify separate words from a phrase and compare them to an array of censored words and give me a TRUE or FALSE state that I can use to program the app response.

But what if someone tries to write 1 single word that is a combination of 2 bad words (or 1 normal word and a bad word)?

I can’t figure out how to “extract” the bad word from that invented word that combines the text.

Here is an example:

Let’s say that I don’t want anyone to use the word “BLUE”. If someone writes “You BLUE man” in the text input, I can detect it with my system.

But if someone writes “combiningBLUEhere” I can’t.

Any suggestions? Thanks in advance!

Joe_Gabriele · June 4, 2023, 11:13pm

Take a look at this thread here. It worked for me.

Geremy_de_Melo · June 4, 2023, 11:38pm

Thanks Joe for that!

If I understood that correctly, it is still a way to detect separate words. But I think it wont work if the “bad word” is part of a larger single word. Like, if “Blue” was a bad word in the list, this system would detect BLUE but not BLUEMAN. Maybe I can try something with the OpenAI integration. A bad words detector prompt that returns TRUE or FALSE…

Other suggestions will be very welcome!

ThinhDinh · June 4, 2023, 11:40pm

This has two sides to it though. I think there will be cases where you want to ban BLUEabc but not DefBLUE.

Darren_Murphy · June 5, 2023, 2:42am

If I was going to do this, then I think I would definitely turn to OpenAI.
But don’t just look for words, because blindly filtering keywords will never work and is easy to get around. Words on their own are not inherently good or bad. It’s the way that they are used and how they are combined that can cause offence. So ask the AI to do a bit of sentiment analysis and pick out phrases that could be considered offensive or harmful.

I think what I would do is flag for moderation, rather than just rejecting outright. But that’s up to you of course.

Geremy_de_Melo · June 6, 2023, 2:38am

After trying different approaches, I am convinced that the best one is using OpenAI to do the job.

It’s very easy to setup and with the right prompt it can do everything.

My App is in 3 languages and I’m being able to filter any bad word from all the 3 languages with just this one setting.

The previous approach was almost impossible. Very time consuming and it was only able to find isolated words from a giant list, that was not enough to prevent all the possible combinations in 3 languages.

For now, I will leave it to OpenAI.

Robert_Petitto · June 6, 2023, 2:51am

Agreed. OpenAI is a very straightforward option.

system · June 7, 2023, 2:52am

This topic was automatically closed 24 hours after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Filter innapropriate username / comment Ask for Help	1	122	April 21, 2024
Flag inappropriate words Ask for Help	9	297	April 12, 2023
Prevent certain words from being used Ask for Help	7	378	March 26, 2022
Prohibit profanity in usernames Ask for Help	6	189	June 9, 2024
"Banned User" feature Feature Requests	3	452	May 22, 2020

Best approach to censor bad words in form input

Related topics