The Times Australia

An academic publisher has struck an AI data deal with Microsoft – without their authors’ knowledge

  • Written by Wellett Potter, Lecturer in Law, University of New England

In May, a multibillion-dollar UK-based multinational called Informa announced in a trading update[1] that it had signed a deal with Microsoft involving “access to advanced learning content and data, and a partnership to explore AI expert applications”. Informa is the parent company of Taylor & Francis[2], which publishes a wide range of academic and technical books and journals, so the data in question may include the content of these books and journals.

According to reports published last week[3], the authors of the content do not appear to have been asked or even informed about the deal. What’s more, they say they had no opportunity to opt out of the deal, and will not see any money from it.

Academics are only the latest of several groups of what we might call content creators to take umbrage at having their work ingested by the generative AI models currently racing to hoover up the products of human culture. Newspapers[4], visual artists[5] and record labels[6] are already taking AI companies to court.

While it’s unclear how Informa will react to the rumblings of discontent, the deal is a reminder to authors to be aware of the contractual terms of the publishing agreements they sign.

What’s in the Informa deal?

Informa’s update stated four focus areas of the Microsoft deal:

  • increasing Informa’s own productivity
  • developing an automated citation tool
  • developing AI-powered research assistant software (perhaps like a system being tested by online academic library JSTOR[7])
  • giving Microsoft data access to “help improve relevance and performance of AI systems”.

Informa will be paid more than £8 million (A$15.5 million) for initial access to the data, followed by recurring payments of an unspecified amount for the next three years.

We don’t know exactly what Microsoft plans to do with its data access, but a likely scenario is that the content of academic books and articles would be added to the training data of ChatGPT-like generative AI models. In principle this should make the output of the AI systems more accurate, though existing AI models have faced heavy criticism, not only for regurgitating training data[8] without citation (which can be viewed as a kind of plagiarism[9]), but also for inventing false information[10] and attributing[11] it to real sources.

However, the update also says “the agreement protects intellectual property rights, including limits on verbatim text extracts and alignment on the importance of detailed citation references”.

The “limits on verbatim text extracts” likely pertain to the US doctrine of fair use[12], which permits certain uses of copyright-protected material.

Many generative AI companies are currently facing copyright infringement lawsuits[13] over their use of training data, and their defences are likely to rely on claiming fair use.

The “importance of detailed citation references” may pertain to the concept of attribution in copyright. This is a moral right[14] possessed by authors. It provides that the creator of the work should be known and attributed as the author when their work is reproduced.

How does scholarly publishing usually work?

Most academics do not receive payment or make any profit from most of their scholarly publishing. Rather, writing journal and conference papers is usually considered part of the scope of work within a full-time, tenured position. Publication builds an academic’s credibility and promotes their research.

The basic process often goes like this: an author researches and writes an original article and submits it to a journal publisher for peer review. Most peer reviewers and editorial board members also receive no payment for their work.

In fact, some journals may require authors to pay an “article processing charge[15]” to cover editing and other costs. This can be thousands of dollars for an open access[16] publication. Generally speaking, the more prestigious the publication, the higher the charge.

If an article passes peer review, the author will be asked to sign a publishing agreement[17]. The terms may cover logistical arrangements such as when the article will be published, the format (print, online or both), and the division of royalties (if applicable). There will also be arrangements regarding copyright and ownership of the article.

An author usually must also grant exclusive rights[18] to the publisher to distribute and publish the article. This may mean the author cannot publish the article elsewhere, and the publisher may also be able to sub-license the article to a third party, such as an AI company.

Sometimes publishers require an author to assign copyright in the article to them via a permanent copyright transfer agreement[19].

Essentially, this means the author grants all of their authorial rights as copyright holder in the work to the publisher. The publisher can then reproduce, communicate, distribute or license the work to others as they wish.

It is possible to assign only limited rights, rather than all rights, and this is something authors should consider.

Content mining

It is vital that authors understand the implications of licensing and assignment and consider precisely what they are agreeing to when they sign a contract. In light of the recent trend of publishers entering into agreements with generative AI companies[20], publishers’ AI policies should also be closely scrutinised.

In the US, a standard collective licensing solution for content use in internal AI systems[21] has recently been released, which sets out rights and remuneration for copyright holders. Similar licences for the use of content for AI systems will likely enter the Australian market very soon.

The types of agreements being reached between academic publishers and AI companies have sparked bigger-picture concerns for many academics. Do we want scholarly research to be reduced to content for AI knowledge mining[22]? There are no clear answers about the ethics and morals of such practices.

References

  1. trading update (www.informa.com)
  2. Taylor & Francis (taylorandfrancis.com)
  3. reports published last week (www.thebookseller.com)
  4. Newspapers (www.nytimes.com)
  5. visual artists (www.cbsnews.com)
  6. record labels (time.com)
  7. online academic library JSTOR (www.about.jstor.org)
  8. regurgitating training data (www.nytimes.com)
  9. plagiarism (spectrum.ieee.org)
  10. inventing false information (www.cnet.com)
  11. attributing (www.nature.com)
  12. US doctrine of fair use (www.alrc.gov.au)
  13. facing copyright infringement lawsuits (theconversation.com)
  14. moral right (www.artslaw.com.au)
  15. article processing charge (akjournals.com)
  16. open access (www.openaccess.nl)
  17. publishing agreement (copyright.unimelb.edu.au)
  18. exclusive rights (copyrightalliance.org)
  19. copyright transfer agreement (authorservices.wiley.com)
  20. publishers entering into agreements with generative AI companies (techcrunch.com)
  21. collective licensing solution for content use in internal AI systems (www.copyright.com)
  22. AI knowledge mining (azure.microsoft.com)

Read more https://theconversation.com/an-academic-publisher-has-struck-an-ai-data-deal-with-microsoft-without-their-authors-knowledge-235203
