The Times Australia
The Times World News


A new tool helps catch nasty comments – even when they’re disguised

  • Written by Johnny Chan, Lecturer, Business School, University of Auckland, Waipapa Taumata Rau



People determined to spread toxic messages online have taken to masking their words to bypass automated moderation filters.

A user might replace letters with numbers or symbols, for example, writing “Y0u’re st00pid” instead of “You’re stupid”.

Another tactic involves combining words, such as “IdiotFace”. Doing this masks the harmful intent from systems that look for individual toxic words.

Similarly, harmful terms can be altered with spaces or additional characters, such as “h a t e” or “h@te”, so that they slip through keyword-based filters.

While the intent remains harmful, traditional moderation tools often overlook such messages. This leaves users — particularly vulnerable groups — exposed to their negative impact.
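
To see why, consider a minimal sketch of a keyword-based filter in Python. The blocklist and the function name `naive_filter` are hypothetical, chosen only for illustration, and do not describe any particular platform's system.

```python
import re

# Hypothetical blocklist, for illustration only.
BLOCKLIST = {"stupid", "idiot", "hate"}

def naive_filter(message: str) -> bool:
    """Return True if any run of letters in the message is a blocklisted word."""
    words = re.findall(r"[a-z]+", message.lower())
    return any(word in BLOCKLIST for word in words)

print(naive_filter("You're stupid"))   # True: the plain word is caught
print(naive_filter("Y0u're st00pid"))  # False: digits break the word apart
print(naive_filter("IdiotFace"))       # False: the compound is not on the list
print(naive_filter("h a t e"))         # False: each letter is its own "word"
print(naive_filter("h@te"))            # False: the symbol splits the word
```

Every disguised message from the examples above passes this filter, even though a human reader understands each one immediately.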

To address this, we have developed a novel pre-processing technique[1] designed to help moderation tools more effectively handle the subtle complexities of hidden toxicity.

An intelligent assistant

Our tool works in conjunction with existing moderation. It acts as an intelligent assistant, preparing content for deeper and more accurate evaluation by restructuring and refining input text.

By addressing common tricks users employ to disguise harmful intent, it ensures moderation systems are more effective. The tool performs three key functions.

  1. It first simplifies the text. Irrelevant elements, such as excessive punctuation or extraneous characters, are removed to make text straightforward and ready for evaluation.

  2. It then standardises what is written. Variations in spelling, phrasing and grammar are resolved. This includes interpreting deliberate misspellings (“h8te” for “hate”).

  3. Finally, it looks for patterns. Recurring strategies such as breaking up toxic words (“I d i o t”), or embedding them within benign phrases, are identified and normalised to reveal the underlying intent.

These steps can break apart compound words like “IdiotFace” or normalise modified phrases like “Y0u’re st00pid”. This makes harmful content visible to traditional filters.
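
As a rough illustration, the three steps might be sketched in Python as follows. This is not the published implementation: the function names, the substitution table `LEET` and the small `VARIANTS` dictionary are assumptions made for this example, and a real system would need much larger word lists and more careful rules.

```python
import re

# Assumed character substitutions and spelling variants (illustrative only).
LEET = str.maketrans({"0": "o", "1": "i", "3": "e", "4": "a",
                      "5": "s", "7": "t", "@": "a", "$": "s"})
VARIANTS = {"stoopid": "stupid", "h8te": "hate"}

def simplify(text: str) -> str:
    """Step 1: collapse repeated punctuation and extra whitespace."""
    text = re.sub(r"([!?.,*_~-])\1+", r"\1", text)
    return re.sub(r"\s+", " ", text).strip()

def standardise(text: str) -> str:
    """Step 2: undo character substitutions and known misspellings."""
    def fix(match: re.Match) -> str:
        token = match.group(0)
        if not any(ch.isalpha() for ch in token):
            return token  # leave pure numbers such as "2024" alone
        token = token.translate(LEET)
        return VARIANTS.get(token.lower(), token)
    return re.sub(r"[A-Za-z0-9@$]+", fix, text)

def find_patterns(text: str) -> str:
    """Step 3: split compounds ("IdiotFace") and rejoin spaced letters."""
    text = re.sub(r"(?<=[a-z])(?=[A-Z])", " ", text)
    return re.sub(r"\b(?:[A-Za-z] ){2,}[A-Za-z]\b",
                  lambda m: m.group(0).replace(" ", ""), text)

def preprocess(text: str) -> str:
    """Run the three steps in order, then lowercase for the downstream filter."""
    return find_patterns(standardise(simplify(text))).lower()

print(preprocess("Y0u're st00pid"))  # you're stupid
print(preprocess("IdiotFace"))       # idiot face
print(preprocess("I d i o t"))       # idiot
print(preprocess("h@te"))            # hate
```

The output is ordinary text, so it can be handed unchanged to whatever keyword filter or classifier a platform already runs. A sketch this simple will also produce false positives, for example by rewriting an email address or joining initials, which is one reason a production tool needs more context than a few regular expressions.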

Importantly, our work is not about reinventing the wheel but ensuring the existing wheel functions as effectively as it should, even when faced with disguised toxic messages.

Our new tool cleans up toxic comments that have been hidden behind misspellings and extra characters. ClarkandCompany/Getty Images[2]

Catching subtle forms of toxicity

The applications of this tool extend across a wide range of online environments. For social media platforms, it enhances the ability to detect harmful messages, creating a safer space for users. This is particularly important for protecting younger audiences, who may be more vulnerable to online abuse.

By catching subtle forms of toxicity, the tool helps to prevent harmful behaviours like bullying from persisting unchecked.

Businesses can also use this technology to safeguard their online presence. Negative campaigns or covert attacks on brands often employ subtle and disguised messaging to avoid detection. By processing such content before it is moderated, the tool ensures that businesses can respond swiftly to any reputational threats.

Additionally, policymakers and organisations that monitor public discourse can benefit from this system. Hidden toxicity, particularly in polarised discussions, can undermine efforts to maintain constructive dialogue.

The tool provides a more robust way of identifying problematic content and of helping debates remain respectful and productive.

Better moderation

Our tool marks an important advance in content moderation. By addressing the limitations of traditional keyword-based filters, it offers a practical solution to the persistent issue of hidden toxicity.

Importantly, it demonstrates how small but focused improvements can make a big difference in creating safer and more inclusive online environments. As digital communication continues to evolve, tools like ours will play an increasingly vital role in protecting users and fostering positive interactions.

While this research addresses the challenges of detecting hidden toxicity within text, the journey is far from over.

Future advances will likely delve deeper into the complexities of context, analysing how meaning shifts depending on conversational dynamics, cultural nuances and intent.

By building on this foundation, the next generation of content moderation systems could uncover not just what is being said but also the circumstances in which it is said, paving the way for safer and more inclusive online spaces.

References

  1. ^ novel pre-processing technique (methods-x.com)
  2. ^ ClarkandCompany/Getty Images (www.gettyimages.com.au)

Read more https://theconversation.com/unmasking-hidden-online-hate-a-new-tool-helps-catch-nasty-comments-even-when-theyre-disguised-244636
