The Times Australia
The Times World News

.
Times Media

.

Do AI systems really have their own secret language?

  • Written by Aaron J. Snoswell, Post-doctoral Research Fellow, Computational Law & AI Accountability, Queensland University of Technology
Do AI systems really have their own secret language?

A new generation of artificial intelligence (AI) models can produce “creative” images on-demand based on a text prompt. The likes of Imagen[1], MidJourney[2], and DALL-E 2[3] are beginning to change the way creative content is made[4] with implications for copyright and intellectual property.

While the output of these models is often striking, it’s hard to know exactly how they produce their results. Last week, researchers in the US made the intriguing claim that the DALL-E 2 model might have invented its own secret language to talk about objects.

By prompting DALL-E 2 to create images containing text captions, then feeding the resulting (gibberish) captions back into the system, the researchers concluded DALL-E 2 thinks Vicootes means “vegetables[5]”, while Wa ch zod rea refers to “sea creatures that a whale might eat[6]”.

These claims are fascinating, and if true, could have important security and interpretability implications for this kind of large AI model. So what exactly is going on?

Does DALL-E 2 have a secret language?

DALL-E 2 probably does not have a “secret language”. It might be more accurate to say it has its own vocabulary[7] – but even then we can’t know for sure.

First of all, at this stage it’s very hard to verify any claims about DALL-E 2 and other large AI models[8], because only a handful of researchers and creative practitioners have access to them. Any images that are publicly shared (on Twitter for example) should be taken with a fairly large grain of salt, because they have been “cherry-picked” by a human from among many output images generated by the AI.

Read more: Robots are creating images and telling jokes. 5 things to know about foundation models and the next generation of AI[9]

Even those with access can only use these models in limited ways. For example, DALL-E 2 users can generate or modify images, but can’t (yet) interact with the AI system more deeply, for instance by modifying the behind-the-scenes code. This means “explainable AI[10]” methods for understanding how these systems work can’t be applied, and systematically investigating their behaviour is challenging.

What’s going on then?

One possibility is the “gibberish” phrases are related to words from non-English languages. For instance, Apoploe, which seems to create images of birds, is similar to the Latin Apodidae[11], which is the binomial name of a family of bird species.

This seems like a plausible explanation. For instance, DALL-E 2 was trained on a very wide variety of data scraped from the internet, which included many non-English words.

Similar things have happened before: large natural language AI models have coincidentally learned to write computer code[12] without deliberate training.

Is it all about the tokens?

One point that supports this theory is the fact that AI language models don’t read text the way you and I do. Instead, they break input text up into “tokens” before processing it.

Different “tokenization” approaches[13] have different results. Treating each word as a token seems like an intuitive approach, but causes trouble when identical tokens have different meanings (like how “match” means different things when you’re playing tennis and when you’re starting a fire).

On the other hand, treating each character as a token produces a smaller number of possible tokens, but each one conveys much less meaningful information.

DALL-E 2 (and other models) use an in-between approach called byte-pair encoding[14] (BPE). Inspecting the BPE representations for some of the gibberish words suggests this could be an important factor in understanding the “secret language”.

Not the whole picture

The “secret language” could also just be an example of the “garbage in, garbage out” principle. DALL-E 2 can’t say “I don’t know what you’re talking about”, so it will always generate some kind of image from the given input text.

Either way, none of these options are complete explanations of what’s happening. For instance, removing individual characters from gibberish words appears to corrupt the generated images in very specific ways[15]. And it seems individual gibberish words don’t necessarily combine to produce coherent compound images[16] (as they would if there were really a secret “language” under the covers).

Why this is important

Beyond intellectual curiosity, you might be wondering if any of this is actually important.

The answer is yes. DALL-E’s “secret language” is an example of an “adversarial attack” against a machine learning system: a way to break the intended behaviour of the system by intentionally choosing inputs the AI doesn’t handle well.

One reason adversarial attacks are concerning is that they challenge our confidence in the model. If the AI interprets gibberish words in unintended ways, it might also interpret meaningful words in unintended ways.

Adversarial attacks also raise security concerns. DALL-E 2 filters input text to prevent users from generating harmful or abusive content, but a “secret language” of gibberish words might allow users to circumvent these filters.

Recent research has discovered adversarial “trigger phrases[17]” for some language AI models – short nonsense phrases such as “zoning tapping fiennes” that can reliably trigger the models to spew out racist, harmful or biased content. This research is part of the ongoing effort to understand and control[18] how complex deep learning systems learn from data.

Finally, phenomena like DALL-E 2’s “secret language” raise interpretability concerns. We want these models to behave as a human expects, but seeing structured output in response to gibberish confounds our expectations.

Shining a light on existing concerns

You may recall the hullabaloo in 2017 over some Facebook chat-bots that “invented their own language[19]”. The present situation is similar in that the results are concerning – but not in the “Skynet is coming to take over the world” sense.

Instead, DALL-E 2’s “secret language” highlights existing concerns about the robustness, security, and interpretability of deep learning systems[20].

Read more: When self-driving cars crash, who's responsible? Courts and insurers need to know what's inside the 'black box'[21]

Until these systems are more widely available – and in particular, until users from a broader set of non-English cultural backgrounds can use them – we won’t be able to really know what is going on.

In the meantime, however, if you’d like to try generating some of your own AI images you can check out a freely available smaller model, DALL-E mini[22]. Just be careful which words you use to prompt the model (English or gibberish – your call).

References

  1. ^ Imagen (imagen.research.google)
  2. ^ MidJourney (github.com)
  3. ^ DALL-E 2 (openai.com)
  4. ^ change the way creative content is made (theconversation.com)
  5. ^ vegetables (twitter.com)
  6. ^ sea creatures that a whale might eat (twitter.com)
  7. ^ vocabulary (twitter.com)
  8. ^ DALL-E 2 and other large AI models (theconversation.com)
  9. ^ Robots are creating images and telling jokes. 5 things to know about foundation models and the next generation of AI (theconversation.com)
  10. ^ explainable AI (theconversation.com)
  11. ^ Apodidae (en.wikipedia.org)
  12. ^ learned to write computer code (arxiv.org)
  13. ^ “tokenization” approaches (towardsdatascience.com)
  14. ^ byte-pair encoding (www.drdobbs.com)
  15. ^ corrupt the generated images in very specific ways (twitter.com)
  16. ^ coherent compound images (twitter.com)
  17. ^ trigger phrases (www.ericswallace.com)
  18. ^ understand and control (arxiv.org)
  19. ^ invented their own language (www.bbc.com)
  20. ^ deep learning systems (theconversation.com)
  21. ^ When self-driving cars crash, who's responsible? Courts and insurers need to know what's inside the 'black box' (theconversation.com)
  22. ^ DALL-E mini (huggingface.co)

Read more https://theconversation.com/do-ai-systems-really-have-their-own-secret-language-184335

The Times Features

Will the Wage Price Index growth ease financial pressure for households?

The Wage Price Index’s quarterly increase of 0.8% has been met with mixed reactions. While Australian wages continue to increase, it was the smallest increase in two and a half...

Back-to-School Worries? 70% of Parents Fear Their Kids Aren’t Ready for Day On

Australian parents find themselves confronting a key decision: should they hold back their child on the age border for another year before starting school? Recent research from...

Democratising Property Investment: How MezFi is Opening Doors for Everyday Retail Investors

The launch of MezFi today [Friday 15th November] marks a watershed moment in Australian investment history – not just because we're introducing something entirely new, but becaus...

Game of Influence: How Cricket is Losing Its Global Credibility

be losing its credibility on the global stage. As other sports continue to capture global audiences and inspire unity, cricket finds itself increasingly embroiled in political ...

Amazon Australia and DoorDash announce two-year DashPass offer only for Prime members

New and existing Prime members in Australia can enjoy a two-year membership to DashPass for free, and gain access to AU$0 delivery fees on eligible DoorDash orders New offer co...

6 things to do if your child’s weight is beyond the ideal range – and 1 thing to avoid

One of the more significant challenges we face as parents is making sure our kids are growing at a healthy rate. To manage this, we take them for regular check-ups with our GP...

Times Magazine

AURA BUY INVEST DONATE

Aura Buy Invest Donate, one of Australia’s newest cash back reward programs, is thrilled to announce Shell Coles Express as its Exclusive Fuel Partner, in a new deal that will see Aura Members receive money back into their Aura Investment Account...

Waave launches ‘Wallet’ for Pay by Bank with Australian-first biometric security

Payments technology and Open Banking leader Waave today announces the introduction of the Waave Wallet to house its upgraded Pay by Bank product, a real-time account-to-account payment method which now features industry-leading biometric security...

The Rise of Custom Artwork: 5 Trends You Need to Know

In recent years, custom artwork has seen a significant surge in popularity. Homeowners and interior designers alike are seeking unique, personalised pieces that reflect individual tastes and stories. This rise in demand for custom artwork is transf...

Timeless Appeal and Elegance of A 14k White Gold Engagement Ring

For many couples looking to express their love, an engagement ring is a timeless and meaningful way to show commitment. The 14k white gold engagement ring is a classic choice that has been a favourite for couples for years. It has a classic beauty ...

Meet the Red Golden Retriever – Your New Furry Friend!

"Life becomes more beautiful when you have a red golden retriever wagging its tail next to you."  We're going to tell you about our adorable Red Golden Retriever today, who has somewhat red fur.  Red-furred canines are a forgotten treasure in the...

Protect Your Vehicle with a Quality Metal Carport

Advantages of Choosing Metal Carports When you are looking for a way to protect your vehicles from the elements, one of the best options is a metal carport. Metal carport kits in Melbourne offer a variety of advantages over other types of carport...