Google AI
The Times Australia
The Times World News

.

How do ‘AI detection’ tools actually work? And are they effective?

  • Written by: T.J. Thomson, Senior Lecturer in Visual Communication & Digital Media, RMIT University

As nearly half of all Australians say[1] they have recently used artificial intelligence (AI) tools, knowing when and how they’re being used is becoming more important.

Consultancy firm Deloitte recently partially refunded the Australian government after a report they published had AI-generated errors[2] in it.

A lawyer also recently faced disciplinary action after false AI-generated citations[3] were discovered in a formal court document. And many universities are concerned[4] about how their students use AI.

Amid these examples, a range of “AI detection” tools have emerged to try to address people’s need for identifying accurate, trustworthy and verified content.

But how do these tools actually work? And are they effective at spotting AI-generated material?

How do AI detectors work?

Several approaches exist, and their effectiveness can depend on which types of content are involved.

Detectors for text often try to infer AI involvement by looking for “signature” patterns in sentence structure, writing style, and the predictability of certain words or phrases being used. For example, the use of “delves” and “showcasing” has skyrocketed[5] since AI writing tools became more available.

However the difference between AI and human patterns is getting smaller and smaller. This means signature-based tools can be highly unreliable.

Detectors for images sometimes work by analysing embedded metadata which some AI tools add to the image file.

For example, the Content Credentials inspect tool[6] allows people to view how a user has edited a piece of content, provided it was created and edited with compatible software. Like text, images can also be compared against verified datasets of AI-generated content (such as deepfakes).

Finally, some AI developers have started adding watermarks[7] to the outputs of their AI systems. These are hidden patterns in any kind of content which are imperceptible to humans but can be detected by the AI developer. None of the large developers have shared their detection tools with the public yet, though.

Each of these methods has its drawbacks and limitations.

How effective are AI detectors?

The effectiveness of AI detectors can depend on several factors. These include which tools were used to make the content and whether the content was edited or modified after generation.

The tools’ training data can also affect results.

For example, key datasets used to detect AI-generated pictures do not have enough full-body pictures of people or images from people of certain cultures. This means successful detection is already limited in many ways.

Watermark-based detection can be quite good at detecting content made by AI tools from the same company. For example, if you use one of Google’s AI models such as Imagen, Google’s SynthID watermark tool[8] claims to be able to spot the resulting outputs.

But SynthID is not publicly available yet. It also doesn’t work if, for example, you generate content using ChatGPT, which isn’t made by Google. Interoperability across AI developers is a major issue.

AI detectors can also be fooled when the output is edited. For example, if you use a voice cloning app and then add noise or reduce the quality (by making it smaller), this can trip up voice AI detectors. The same is true with AI image detectors.

Explainability is another major issue. Many AI detectors will give the user a “confidence estimate” of how certain it is that something is AI-generated. But they usually don’t explain their reasoning or why they think something is AI-generated.

It is important to realise that it is still early days for AI detection, especially when it comes to automatic detection.

A good example of this can be seen in recent attempts to detect deepfakes. The winner of Meta’s Deepfake Detection Challenge[9] identified four out of five deepfakes. However, the model was trained on the same data it was tested on – a bit like having seen the answers before it took the quiz.

When tested against new content, the model’s success rate dropped. It only correctly identified three out of five deepfakes in the new dataset.

All this means AI detectors can and do get things wrong. They can result in false positives (claiming something is AI generated when it’s not) and false negatives (claiming something is human-generated when it’s not).

For the users involved, these mistakes can be devastating – such as a student whose essay is dismissed as AI-generated when they wrote it themselves, or someone who mistakenly believes an AI-written email came from a real human.

It’s an arms race as new technologies are developed or refined, and detectors are struggling to keep up.

Where to from here?

Relying on a single tool is problematic and risky. It’s generally safer and better to use a variety of methods to assess the authenticity of a piece of content.

You can do so by cross-referencing sources and double-checking facts in written content. Or for visual content, you might compare suspect images to other images purported to be taken during the same time or place. You might also ask for additional evidence or explanation if something looks or sounds dodgy.

But ultimately, trusted relationships with individuals and institutions will remain one of the most important factors when detection tools fall short or other options aren’t available.

References

  1. ^ say (digitalinclusionindex.org.au)
  2. ^ AI-generated errors (www.theguardian.com)
  3. ^ AI-generated citations (www.theguardian.com)
  4. ^ are concerned (www.abc.net.au)
  5. ^ skyrocketed (www.forbes.com)
  6. ^ tool (verify.contentauthenticity.org)
  7. ^ watermarks (theconversation.com)
  8. ^ SynthID watermark tool (deepmind.google)
  9. ^ Deepfake Detection Challenge (ai.meta.com)

Read more https://theconversation.com/how-do-ai-detection-tools-actually-work-and-are-they-effective-269390

Times Magazine

Australian Wine Guide

A Quick but Informed Guide to the Varieties and Popular Brands of Australian WinesDon’t let a wine...

What next from Apple

The question of what comes next for Apple Inc. is no longer theoretical. With leadership transitio...

Leapmotor Hybrid EV Review

The Leapmotor hybrid EV—most notably the Leapmotor C10 REEV (range-extended electric vehicle)—has ...

Navman Gets Even Smarter with 2026 MiVue™ Dash Cams

Introducing NEW Integrated Smart Parking and Australia-First Extended Recording Mode Navman to...

Why Interactive Panels Are Replacing Traditional Whiteboards in Perth

Whiteboards have been part of classrooms and meeting rooms for decades. They’re familiar, flexible...

The Engineering Innovations Transforming the Australian Heavy Transport Fleet

Australia is a massive continent, and its national supply chain relies almost entirely on the road...

The Times Features

Australian Wine Guide

A Quick but Informed Guide to the Varieties and Popular Brands of Australian WinesDon’t let a wine...

Chef knives: Setting up a home or upgrading, does price…

For anyone serious about cooking—whether setting up a first kitchen or upgrading an existing one—t...

Solo Travel: why? Do as you like, when you like, anywhe…

There was a time when travel was almost always a shared experience—family holidays, group tours, c...

Moving to Cairns? These are the suburbs offering a seas…

For Australians looking to trade congestion, cold winters and rising property costs for sunshine a...

GINA WILLIAMS & GUY GHOUSE LIVE AT THE ELLINGTON’ D…

After 15 years of performing around the world, recording studio albums and unveiling two opera works...

The Quiet Luxury of Ink: Rediscovering the Joy of Writi…

In an age dominated by screens, taps and instant communication, the simple act of writing by hand ...

Owning a Restaurant: Buying One or Braving the Challeng…

Owning a restaurant has long been one of the most alluring—and misunderstood—paths in small busine...

Supermarket Prices Are Up — and So Is Dinner at a Modes…

For many Australians, the weekly grocery shop and a simple night out for dinner have quietly becom...

In 2006, The Devil Wears Prada Became One of the First …

When The Devil Wears Prada premiered in 2006, it was marketed as a sharp, entertaining adaptation ...