Fri Sep 29

Books 3 has revealed thousands of pirated Australian books. In the age of AI, is copyright law still fit for purpose?

Written by Dilan Thampapillai, Dean of Law, University of Wollongong, University of Wollongong

Thousands of Australian books have been found ^[1] on a pirated dataset of ebooks, known as Books3, used to train generative AI. Richard Flanagan, Helen Garner, Tim Winton and Tim Flannery are among the leading local authors affected – along, of course, with writers from around the world.

A search tool ^[2] published by the Atlantic ^[3] makes it possible for authors to find out whether their books are among the nearly 200,000 in the Books3 dataset.

Many of these writers have reacted angrily about their works being included in these datasets without their knowledge or consent. Flanagan told the Guardian ^[4], “I felt as if my soul had been strip mined and I was powerless to stop it”.

“Turning a blind eye to the legitimate rights of copyright owners threatens to diminish already-precarious creative careers,” said Olivia Lanchester, chief executive of the Australian Society of Authors, in an official response ^[5] this week.

AI moving at speed

Authors have turned to copyright law because it is the body of law that has traditionally protected authors and other creators from the appropriation of their works.

However, laws designed for the pre-AI era have little meaning in the post-OpenAI world.

Just last year, the issue of AI was only faintly on the cultural radar. But while AI technology is moving at high speed, the law moves slowly.

It took a very significant amount of time for copyright law to first appear. The first copyright law, the Statute of Anne ^[6], emerged in 1710 after protracted lobbying by stationers (publishers).

In a more modern context, it took 20 years from the time Australian courts first recognised a system of Aboriginal law existed, with the Milirrpum decision ^[7] in 1971 – meaning terra nullius was implausible – to the High Court handing down the landmark Mabo decision ^[8] that erased terra nullius, in June 1992. In the interim, injustice reigned.

The question that now confronts us is whether we can wait for the law to catch up with the rapid advances of technology – or whether we must jumpstart the process.

A spate of copyright disputes

There has been a spate of copyright disputes around AI datasets and copyright-protected works.

Earlier this month, the US Authors Guild filed a class action ^[10], with 17 authors including Jonathan Franzen and Jodi Picoult, against OpenAI for copyright infringement.

This followed the first copyright lawsuit ^[11] against OpenAI in July. It was filed by authors Mona Awad and Paul Tremblay, for using their books to train its AI, ChatGPT, without their consent.

And in August, Benji Smith was forced to take down ^[12] his website Prosecraft, which used an algorithm to trawl through more than 25,000 books (again, without authors’ consent) to produce analysis designed to give writing advice.

Copyright is not the answer

While it’s true that the uploading of works into a dataset is an act of copyright infringement, that only pertains to a one-off act of infringement.

No doubt, the liability would be large if thousands of works were involved and thousands of authors were to sue (as with the US Authors Guild class action), but the damages obtained by an individual author would be relatively small, making it not worth suing. The large commercial interests driving the development of the datasets and related AI tools are likely to withstand these lawsuits even if they are found liable.

Likewise, copyright law’s rules on fair dealing ^[14] in Australia and fair use in the United States would likely protect some uses.

Further, the outputs from AI that have been trained on these datasets are not likely to result in works that satisfy the substantial similarity threshold (which means that when the two works are compared side by side, they must be similar) for copyright infringement in most jurisdictions, including Australia.

‘A type of market failure’

This happened when the photocopier was invented, when video cassette recorders were developed, when blank tapes became widely available and when peer-to-peer copyright infringement took off during the digital era.

The difference then was that these technologies did not fundamentally threaten artistic and creative labour in the way AI does.

To appropriate a part of someone’s market is a radically different thing to producing a product that could entirely displace them in that market.

Yet this is the direction we’re heading in. And it requires a very significant rethink about the regulation of technology.

A type of market failure is occurring here, because authors are not being compensated even though their works, collectively, are the basis for new and commercially viable AI products.

When the sale of blank tapes began, the government responded ^[16] with a levy on every blank tape sale, which sent money back to copyright owners.

Something like the blank tape levy might need to be considered for AI. This would mean every time somebody uses an OpenAI-type tool for which they pay a fee, some small portion of the fee would revert to copyright owners.

References

^{^} have been found (www.abc.net.au)
^{^} search tool (full-stack-search-prod.vercel.app)
^{^} the Atlantic (www.theatlantic.com)
^{^} told the Guardian (www.theguardian.com)
^{^} an official response (www.asauthors.org.au)
^{^} Statute of Anne (www.historyofinformation.com)
^{^} Milirrpum decision (en.wikipedia.org)
^{^} landmark Mabo decision (theconversation.com)
^{^} Authors are resisting AI with petitions and lawsuits. But they have an advantage: we read to form relationships with writers (theconversation.com)
^{^} filed a class action (authorsguild.org)
^{^} the first copyright lawsuit (theconversation.com)
^{^} forced to take down (theconversation.com)
^{^} Two authors are suing OpenAI for training ChatGPT with their books. Could they win? (theconversation.com)
^{^} fair dealing (theconversation.com)
^{^} Prosecraft has infuriated authors by using their books without consent – but what does copyright law say? (theconversation.com)
^{^} the government responded (classic.austlii.edu.au)

Books 3 has revealed thousands of pirated Australian books. In the age of AI, is copyright law still fit for purpose?

AI moving at speed

A spate of copyright disputes

Copyright is not the answer

‘A type of market failure’

References

Marketers: Forget the Black Box. If You Aren't Moving the Needle, What Are You Doing?

Extreme weather growing threat to Australian businesses in storm and fire season

Australia’s food labelling system isn’t working – here’s how we can fix it

Why Australia’s trade deal with Europe hinges on a forgotten promise

How delays in Australia’s switch to clean energy are hurting workers

Rejuvenate Your Look with Dermal Fillers

Times Magazine

Governance Models for Headless CMS in Large Organizations

Narwal Freo Z Ultra Robotic Vacuum and Mop Cleaner

Shark launches SteamSpot - the shortcut for everyday floor mess

Game Together, Stay Together: Logitech G Reveals Gaming Couples Enjoy Higher Relationship Satisfaction

AI threatens to eat business software – and it could change the way we work

Worried AI means you won’t get a job when you graduate? Here’s what the research says

The Times Features

Oztent RV tent range. Buy with caution

Essential Upgrades for a Smarter, Safer Australian Home

How To Modernise Your Home Without Overcapitalising

The Art of the Big Trip: Planning a Seamless Multi-Generational Getaway in Tropical North Queensland

Love Without Borders: ‘Second Marriage At First Sight’ Opens Casting Call for Melbourne Singles Willing to Relocate for Romance

Macca’s is bringing pub-style vibes to the menu with the new Bistro Béarnaise Angus range

What are your options if you can’t afford to repay your mortgage?

Small, realistic increases in physical activity shown to significantly reduce risk of early death

Inside One Global resorts: The Sydney Stay Hosting This Season of MAFS Australia