The Times Australia

ChatGPT is a data privacy nightmare. If you’ve ever posted online, you ought to be concerned

Written by Uri Gal, Professor in Business Information Systems, University of Sydney

ChatGPT has taken the world by storm. Within two months of its release it reached 100 million active users[1], making it the fastest-growing consumer application ever launched[2]. Users are attracted to the tool’s advanced capabilities[3] – and concerned by its potential to cause disruption in various sectors[4].

A much less discussed implication is the privacy risks ChatGPT poses to each and every one of us. Just yesterday, Google unveiled[5] its own conversational AI called Bard, and others will surely follow. Technology companies working on AI have well and truly entered an arms race.

The problem is it’s fuelled by our personal data.

Read more: Everyone's having a field day with ChatGPT – but nobody knows how it actually works[6]

300 billion words. How many are yours?

ChatGPT is underpinned by a large language model that requires massive amounts of data to function and improve. The more data the model is trained on, the better it gets at detecting patterns, anticipating what will come next and generating plausible text.

OpenAI, the company behind ChatGPT, fed the tool some 300 billion words[7] systematically scraped from the internet: books, articles, websites and posts – including personal information obtained without consent.

If you’ve ever written a blog post or product review, or commented on an article online, there’s a good chance this information was consumed by ChatGPT.

So why is that an issue?

The data collection used to train ChatGPT is problematic for several reasons.

First, none of us were asked whether OpenAI could use our data. This is a clear violation of privacy, especially when data are sensitive and can be used to identify us, our family members, or our location.

Even when data are publicly available, their use can breach what we call contextual integrity[8]. This is a fundamental principle in legal discussions of privacy. It requires that individuals’ information is not revealed outside of the context in which it was originally produced.

Also, OpenAI offers no procedures for individuals to check whether the company stores their personal information, or to request it be deleted. This right is guaranteed under the European General Data Protection Regulation (GDPR[9]) – although it’s still under debate whether ChatGPT is compliant with GDPR requirements[10].

This “right to be forgotten” is particularly important in cases where the information is inaccurate or misleading, which seems to be a regular occurrence[11] with ChatGPT.

Moreover, the scraped data ChatGPT was trained on can be proprietary or copyrighted. For instance, when I prompted it, the tool produced the first few paragraphs of Peter Carey’s novel “True History of the Kelly Gang” – a copyrighted text.

Image caption: ChatGPT doesn’t consider copyright protection when generating outputs. Anyone using the outputs elsewhere could be inadvertently plagiarising. (Image: ChatGPT, author provided)

Finally, OpenAI did not pay for the data it scraped from the internet. The individuals, website owners and companies that produced it were not compensated. This is particularly noteworthy considering OpenAI was recently valued at US$29 billion[12], more than double its value in 2021[13].

OpenAI has also just announced ChatGPT Plus[14], a paid subscription plan that will offer customers ongoing access to the tool, faster response times and priority access to new features. This plan will contribute to expected revenue of US$1 billion by 2024[15].

None of this would have been possible without data – our data – collected and used without our permission.

A flimsy privacy policy

Another privacy risk involves the data provided to ChatGPT in the form of user prompts. When we ask the tool to answer questions or perform tasks, we may inadvertently hand over sensitive information[16] and put it in the public domain.

For instance, an attorney may prompt the tool to review a draft divorce agreement, or a programmer may ask it to check a piece of code. The agreement and code, in addition to the outputted essays, are now part of ChatGPT’s database. This means they can be used to further train the tool, and be included in responses to other people’s prompts.

Beyond this, OpenAI gathers a broad scope of other user information. According to the company’s privacy policy[17], it collects users’ IP address, browser type and settings, and data on users’ interactions with the site – including the type of content users engage with, features they use and actions they take.

It also collects information about users’ browsing activities over time and across websites. Alarmingly, OpenAI states it may share users’ personal information[18] with unspecified third parties, without informing users, to meet its business objectives.

Time to rein it in?

Some experts believe ChatGPT is a tipping point for AI[20] – a realisation of technological development that can revolutionise the way we work, learn, write and even think. Its potential benefits notwithstanding, we must remember OpenAI is a private, for-profit company whose interests and commercial imperatives do not necessarily align with greater societal needs.

The privacy risks that come attached to ChatGPT should sound a warning. And as consumers of a growing number of AI technologies, we should be extremely careful about what information we share with such tools.

The Conversation reached out to OpenAI for comment, but they didn’t respond by deadline.

References

  1. ^ active users (news.yahoo.com)
  2. ^ application ever launched (www.reuters.com)
  3. ^ advanced capabilities (oneusefulthing.substack.com)
  4. ^ various sectors (theconversation.com)
  5. ^ Google unveiled (blog.google)
  6. ^ Everyone's having a field day with ChatGPT – but nobody knows how it actually works (theconversation.com)
  7. ^ 300 billion words (www.sciencefocus.com)
  8. ^ contextual integrity (digitalcommons.law.uw.edu)
  9. ^ GDPR (gdpr-info.eu)
  10. ^ with GDPR requirements (blog.avast.com)
  11. ^ regular occurrence (www.fastcompany.com)
  12. ^ valued at US$29 billion (www.nasdaq.com)
  13. ^ value in 2021 (www.forbes.com)
  14. ^ announced ChatGPT Plus (openai.com)
  15. ^ revenue of $1 billion by 2024 (www.reuters.com)
  16. ^ sensitive information (www.forbes.com)
  17. ^ privacy policy (openai.com)
  18. ^ share users’ personal information (openai.com)
  19. ^ Everyone's having a field day with ChatGPT – but nobody knows how it actually works (theconversation.com)
  20. ^ a tipping point for AI (hbr.org)

Read more https://theconversation.com/chatgpt-is-a-data-privacy-nightmare-if-youve-ever-posted-online-you-ought-to-be-concerned-199283
