The Times Australia
The Times World News

‘Godfather of AI’ now fears it’s unsafe. He has a plan to rein it in

  • Written by Armin Chitizadeh, Lecturer, School of Computer Science, University of Sydney

This week the US Federal Bureau of Investigation revealed that two men suspected[1] of bombing a fertility clinic in California last month allegedly used artificial intelligence (AI) to obtain bomb-making instructions. The FBI did not disclose the name of the AI program in question.

This brings into sharp focus the urgent need to make AI safer. Currently we are living in the “wild west” era of AI, where companies are fiercely competing to develop the fastest and most entertaining AI systems. Each company wants to outdo competitors and claim the top spot. This intense competition often leads to intentional or unintentional shortcuts[2] – especially when it comes to safety.

Coincidentally, at around the same time as the FBI’s revelation, one of the godfathers of modern AI, Canadian computer science professor Yoshua Bengio, launched a new nonprofit organisation[3] dedicated to developing a new AI model specifically designed to be safer than other AI models – and to target those that cause social harm.

So what is Bengio’s new AI model? And will it actually protect the world from AI-facilitated harm?

An ‘honest’ AI

In 2018, Bengio, alongside his colleagues Yann LeCun and Geoffrey Hinton, won the Turing Award for groundbreaking research they had published three years earlier on deep learning[4]. A branch of machine learning, deep learning attempts to mimic the processes of the human brain by using artificial neural networks to learn from computational data and make predictions.

Bengio’s new nonprofit organisation, LawZero[5], is developing “Scientist AI”. Bengio has said[6] this model will be “honest and not deceptive”, and incorporate safety-by-design principles.

According to a preprint paper[7] released online earlier this year, Scientist AI will differ from current AI systems in two key ways.

First, it can assess and communicate its confidence level in its answers, helping to reduce the problem of AI giving overly confident and incorrect responses.

Second, it can explain its reasoning to humans, allowing its conclusions to be evaluated and tested for accuracy.

Interestingly, older AI systems had this feature[8]. But in the rush for speed and new approaches, many modern AI models[9] can’t explain their decisions. Their developers have sacrificed explainability for speed.
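To make these two properties concrete, here is a minimal sketch in Python. It is not LawZero's design: the `Answer` structure, the `present` function and the 0.8 threshold are hypothetical names and values chosen purely for illustration. The idea is simply that an answer carries a confidence estimate and a list of reasoning steps, and the system declines to assert a claim when its confidence is too low.

```python
from dataclasses import dataclass, field


@dataclass
class Answer:
    """A claim plus a confidence estimate and inspectable reasoning (hypothetical structure)."""
    claim: str
    confidence: float  # estimated probability the claim is true, from 0.0 to 1.0
    reasoning: list[str] = field(default_factory=list)  # steps a human can check


def present(answer: Answer, min_confidence: float = 0.8) -> str:
    """Report the claim with its confidence, or decline when confidence is too low.

    The 0.8 default is an arbitrary illustrative threshold, not a recommended value.
    """
    if not 0.0 <= answer.confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    steps = "; ".join(answer.reasoning) or "no reasoning given"
    if answer.confidence < min_confidence:
        return f"Not sure ({answer.confidence:.0%} confident). Reasoning so far: {steps}"
    return f"{answer.claim} ({answer.confidence:.0%} confident). Reasoning: {steps}"
```

In a real system the hard part is producing a confidence number that is actually well calibrated; this sketch only shows what could be done with such a number once it exists.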

Bengio also intends “Scientist AI” to act as a guardrail against unsafe AI. It could monitor other, less reliable or potentially harmful AI systems – essentially fighting fire with fire.

This may be the only viable solution to improve AI safety. Humans cannot properly monitor systems such as ChatGPT, which handle over a billion queries daily. Only another AI can manage this scale.

Using an AI system against other AI systems is not just a sci-fi concept – it’s a common practice in research to compare and test different levels of intelligence in AI systems[10].
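As a rough illustration of one AI system acting as a guardrail for another, the Python sketch below places a monitor between a system's proposed reply and the user. Everything in it is an assumption made for the example: the function names, the 0.05 risk threshold and especially the keyword-based estimator, which is a toy stand-in for whatever trained model a real monitor would use.

```python
from typing import Callable

# Hypothetical interface: a monitor maps a proposed reply to an estimated probability of harm.
HarmEstimator = Callable[[str], float]


def keyword_harm_estimator(text: str) -> float:
    """Toy stand-in for a trained monitor; a real system would not rely on keywords."""
    risky_terms = ("build a bomb", "make explosives")
    return 0.99 if any(term in text.lower() for term in risky_terms) else 0.01


def guarded_reply(proposed: str, estimate_harm: HarmEstimator, max_risk: float = 0.05) -> str:
    """Release the other system's reply only if the estimated risk is at or below max_risk."""
    risk = estimate_harm(proposed)
    if risk > max_risk:
        return "[blocked by monitor]"
    return proposed
```

Because the check is automatic, it can in principle run on every reply, which is the scale argument made above. How well it works depends entirely on the quality of the harm estimate.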

Adding a ‘world model’

Large language models and machine learning are just small parts of today’s AI landscape.

Another key feature Bengio’s team is adding to Scientist AI is a “world model[11]”, which brings certainty and explainability. Just as humans make decisions based on their understanding of the world, AI needs a similar model to function effectively.

The absence of a world model in current AI models is clear.

One well-known example is the “hand problem[12]”: most of today’s AI models can imitate the appearance of hands but cannot replicate natural hand movements, because they lack an understanding of the physics — a world model — behind them.

Another example is how models such as ChatGPT struggle with chess, failing to win and even making illegal moves[13].

This is despite simpler AI systems, which do contain a model of the “world” of chess, beating even the best human players[14].

These issues stem from the lack of a foundational world model in these systems, which are not inherently designed to model the dynamics of the real world[15].
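The chess example can be sketched in a few lines of Python. The code below is a deliberately tiny fragment of a chess "world model": it knows only the geometry of a knight's move on an empty board and ignores other pieces, checks and every other rule. The function names are hypothetical. The point is that an explicit rule, however small, can reject illegal suggestions that a purely pattern-based system might produce.

```python
def parse_square(square: str) -> tuple[int, int]:
    """Convert algebraic notation such as 'g1' to zero-based (file, rank)."""
    if len(square) != 2 or square[0] not in "abcdefgh" or square[1] not in "12345678":
        raise ValueError(f"not a chess square: {square!r}")
    return ord(square[0]) - ord("a"), int(square[1]) - 1


def is_knight_move(start: str, end: str) -> bool:
    """True if a knight could jump from start to end on an empty board."""
    (f1, r1), (f2, r2) = parse_square(start), parse_square(end)
    # A knight moves one square along one axis and two along the other.
    return sorted((abs(f1 - f2), abs(r1 - r2))) == [1, 2]


def filter_moves(start: str, proposed: list[str]) -> list[str]:
    """Keep only the proposed destinations that the rule model accepts."""
    return [end for end in proposed if is_knight_move(start, end)]
```

For a knight on g1, `filter_moves("g1", ["f3", "g3", "h3", "e2"])` keeps f3, h3 and e2 and drops g3, which no knight could reach from g1.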

Yoshua Bengio is recognised as one of the godfathers of AI. Alex Wong/Getty Images

On the right track – but it will be bumpy

Bengio is on the right track, aiming to build safer, more trustworthy AI by combining large language models with other AI technologies.

However, his journey isn’t going to be easy. LawZero’s US$30 million in funding[16] is small compared to efforts such as the US$500 billion project[17] announced by US President Donald Trump earlier this year to accelerate the development of AI.

Making LawZero’s task harder is the fact that Scientist AI – like any other AI project – needs huge amounts of data to be powerful, and most data are controlled by major tech companies[18].

There’s also an outstanding question. Even if Bengio can build an AI system that does everything he says it can, how is it going to be able to control other systems that might be causing harm?

Still, this project, with talented researchers behind it, could spark a movement toward a future where AI truly helps humans thrive. If successful, it could set new expectations for safe AI, motivating researchers, developers, and policymakers to prioritise safety.

Perhaps if we had taken similar action when social media first emerged, we would have a safer online environment for young people’s mental health. And maybe, if Scientist AI had already been in place, it could have prevented people with harmful intentions from accessing dangerous information with the help of AI systems.

References

  1. revealed two men suspected (www.cnbc.com)
  2. often leads to intentional or unintentional shortcuts (theconversation.com)
  3. launched a new nonprofit organisation (www.theguardian.com)
  4. had published three years earlier on deep learning (scholar.google.com)
  5. LawZero (lawzero.org)
  6. Bengio has said (www.theguardian.com)
  7. preprint paper (arxiv.org)
  8. older AI systems had this feature (journals.sagepub.com)
  9. modern AI models (seon.io)
  10. compare and test different levels of intelligence in AI systems (link.springer.com)
  11. world model (medium.com)
  12. hand problem (www.britannica.com)
  13. ChatGPT struggle with chess, failing to win and even making illegal moves (www.chess.com)
  14. beating even the best human players (www.sciencefocus.com)
  15. are not inherently designed to model the dynamics of the real world (arxiv.org)
  16. US$30 million in funding (time.com)
  17. US$500 billion project (theconversation.com)
  18. most data are controlled by major tech companies (www.theguardian.com)

Read more https://theconversation.com/godfather-of-ai-now-fears-its-unsafe-he-has-a-plan-to-rein-it-in-258288
