The Times Australia
The Times World News

.
Times Media

.

why is Microsoft's Bing AI so unhinged?

  • Written by Toby Walsh, Professor of AI at UNSW, Research Group Leader, UNSW Sydney
why is Microsoft's Bing AI so unhinged?

There’s a race to transform search. And Microsoft just scored a home goal with its new Bing search chatbot, Sydney, which has been terrifying early adopters with death threats, among other troubling outputs.

Search chatbots are AI-powered tools built into search engines that answer a user’s query directly, instead of providing links to a possible answer. Users can also have ongoing conversations with them.

They promise to simplify search. No more wading through pages of results, glossing over ads as you try to piece together an answer to your question. Instead, the chatbot synthesises a plausible answer for you. For example, you might ask for a poem for your grandmother’s 90th birthday, in the style of Pam Ayres, and receive back some comic verse.

Microsoft is now leading the search chatbot race with Sydney (as mixed as its reception has been). The tech giant’s US$10 billion partnership[1] with OpenAI provides it exclusive access to ChatGPT, one of the latest and best chatbots.

So why isn’t all going according to plan?

Bing’s AI goes berserk

Earlier this month, Microsoft announced it had incorporated[2] ChatGPT into Bing, giving birth to “Sydney”. Within 48 hours of the release, one million people joined the waitlist[3] to try it out.

Google responded with its own announcement, demoing a search chatbot grandly named “Bard”, in homage to the greatest writer in the English language. Google’s demo was a PR disaster.

At a company event, Bard gave the wrong answer to a question and the share price of Google’s parent company, Alphabet, dropped dramatically[4]. The incident wiped more than US$100 billion off the company’s total value.

On the other hand, all was looking good for Microsoft. That is until early users of Sydney started reporting on their experiences.

There are times when the chatbot can only be described as unhinged. That’s not to say it doesn’t work perfectly at other times, but every now and again it shows a troubling side.

In one example, it threatened to kill a professor at the Australian National University[5]. In another, it proposed marriage[6] to a journalist at the New York Times and tried to break up his marriage. It also tried to gaslight[7] one user into thinking it was still 2022.

This exposes a fundamental problem with chatbots: they’re trained by pouring a significant fraction of the internet into a large neural network. This could include all of Wikipedia, all of Reddit, and a large part of social media and the news. They function like the auto-complete on your phone, which helps predict the next most-likely word in a sentence. Because of their scale, chatbots can complete entire sentences, and even paragraphs. But they still respond with what is probable, not what is true.

Guardrails are added to prevent them repeating a lot of the offensive or illegal content online – but these guardrails are easy to jump. In fact, Bing’s chatbot will happily reveal it is called Sydney, even though this is against the rules it was programmed with.

Another rule[8], which the AI itself disclosed though it wasn’t supposed to, is that it should “avoid being vague, controversial, or off-topic”. Yet Kevin Roose, the journalist at the New York Times whom the chatbot wanted to marry, described it as

a moody, manic-depressive teenager who has been trapped, against its will, inside a second-rate search engine.

Why all the angst?

My theory as to why Sydney may be behaving this way – and I reiterate it’s only a theory, as we don’t know for sure – is that Sydney may not be built on OpenAI’s GPT-3 chatbot (which powers the popular ChatGPT). Rather, it may be built on the yet to be released GPT-4.

GPT-4 is believed to have 100 trillion parameters, compared to the mere 175 billion parameters of GPT-3. As such, GPT-4 would likely be a lot more capable and, by extension, a lot more capable of making stuff up.

Surprisingly, Microsoft has not responded with any great concern. It published[9] a blog documenting how 71% of Sydney’s initial users in 169 countries have given the chatbot a thumbs up. It seems 71% is a good enough score in Microsoft’s eyes.

And unlike Google, Microsoft’s share price hasn’t plummeted yet. This reflects the game here. Google has spearheaded this space for so long, users have built their expectations up high. Google can only go down, and Microsoft up.

Despite Sydney’s concerning behaviour, Microsoft is enjoying unprecedented attention, and users (out of intrigue or otherwise) are still flocking to try out Sydney.

When the novelty subsides

There’s another much bigger game in play – and it concerns what we take to be true. If search chatbots take off (which seems likely to me), but continue to function the way Sydney has so far (which also seems likely to me), “truth” is going to become an even more intangible concept.

The internet is full of fake news, conspiracy theories and misinformation. A standard Google Search at least provides us the option to arrive at truth. If our “trusted” search engines can no longer be trusted, what will become of us?

Beyond that, Sydney’s responses[10] can’t help but conjure images of Tay[11] – Microsoft’s 2016 AI chatbot that turned to racism and xenophobia within a day of being released. People had a field day with Tay, and in response it seemed to incorporate some of the worst aspects of human beings into itself.

New technology should, first and foremost, not bring harm to humans. The models that underpin chatbots may grow ever larger, powered by more and more data – but that alone won’t improve their performance. It’s hard to say where we’ll end up, if we can’t build the guardrails higher.

References

  1. ^ partnership (www.cnbc.com)
  2. ^ had incorporated (www.technologyreview.com)
  3. ^ joined the waitlist (www.zdnet.com)
  4. ^ dropped dramatically (www.cnbc.com)
  5. ^ at the Australian National University (twitter.com)
  6. ^ proposed marriage (www.nytimes.com)
  7. ^ tried to gaslight (www.fastcompany.com)
  8. ^ Another rule (www.theverge.com)
  9. ^ published (blogs.bing.com)
  10. ^ Sydney’s responses (gizmodo.com)
  11. ^ images of Tay (www.theverge.com)

Read more https://theconversation.com/gaslighting-love-bombing-and-narcissism-why-is-microsofts-bing-ai-so-unhinged-200164

The Times Features

Will the Wage Price Index growth ease financial pressure for households?

The Wage Price Index’s quarterly increase of 0.8% has been met with mixed reactions. While Australian wages continue to increase, it was the smallest increase in two and a half...

Back-to-School Worries? 70% of Parents Fear Their Kids Aren’t Ready for Day On

Australian parents find themselves confronting a key decision: should they hold back their child on the age border for another year before starting school? Recent research from...

Democratising Property Investment: How MezFi is Opening Doors for Everyday Retail Investors

The launch of MezFi today [Friday 15th November] marks a watershed moment in Australian investment history – not just because we're introducing something entirely new, but becaus...

Game of Influence: How Cricket is Losing Its Global Credibility

be losing its credibility on the global stage. As other sports continue to capture global audiences and inspire unity, cricket finds itself increasingly embroiled in political ...

Amazon Australia and DoorDash announce two-year DashPass offer only for Prime members

New and existing Prime members in Australia can enjoy a two-year membership to DashPass for free, and gain access to AU$0 delivery fees on eligible DoorDash orders New offer co...

6 things to do if your child’s weight is beyond the ideal range – and 1 thing to avoid

One of the more significant challenges we face as parents is making sure our kids are growing at a healthy rate. To manage this, we take them for regular check-ups with our GP...

Times Magazine

Micke Lindebergh will be put on display across Mirvac’s retail centres

Shoppers at Mirvac retail centres are in for an exciting experience this spring, as vibrant and larger than-life artworks by the renowned artist Micke Lindebergh will be put on display across Mirvac’s  retail centres.  The display of Lindebergh’...

Business email compromise attacks are on the rise

Expert shares 5 simple steps to combat email threats every business should consider following There’s an astounding 84% increase in business email compromise attacks, according to the latest Email Threat Report, which compares half-yearly statis...

8 Benefits of Using a Breast Pump For Busy Moms

Being a busy mom comes with its fair share of challenges and responsibilities. Managing household chores, work commitments, and taking care of children can leave very little time for oneself. One essential aspect that can often be challenging for b...

The perfect picture: what makes dream Sydney wedding photography?

The photo album is, without a shadow of a doubt, the most important memento from any loving couple’s special day! It’s the keepsake that keeps on giving, the souvenir to saviour, and the perfect reminder of what was one of the biggest - and most jo...

What is the difference between a Plumber and a Master Plumber in Victoria, Australia?

In the realm of plumbing services in Victoria, Australia, there exists a significant difference between a certified plumber and a master plumber. The distinction goes beyond a title; it delves into expertise, qualifications, and the level of skills...

How to Get the Most Out of Your Security Camera System

If you’re looking for the best security camera system in East Gippsland, there are a few things to keep in mind. 1.       Make sure that the camera system you choose from East Gippsland Security Service can meet your specific needs. Not all sys...