The Times Australia
The Times World News

.

An academic publisher has struck an AI data deal with Microsoft – without their authors’ knowledge

  • Written by Wellett Potter, Lecturer in Law, University of New England
An academic publisher has struck an AI data deal with Microsoft – without their authors’ knowledge

In May, a multibillion-dollar UK-based multinational called Informa announced in a trading update[1] that it had signed a deal with Microsoft involving “access to advanced learning content and data, and a partnership to explore AI expert applications”. Informa is the parent company of Taylor & Francis[2], which publishes a wide range of academic and technical books and journals, so the data in question may include the content of these books and journals.

According to reports published last week[3], the authors of the content do not appear to have been asked or even informed about the deal. What’s more, they say they had no opportunity to opt out of the deal, and will not see any money from it.

Academics are only the latest of several groups of what we might call content creators to take umbrage at having their work ingested by the generative AI models currently racing to hoover up the products of human culture. Newspapers[4], visual artists[5] and record labels[6] are already taking AI companies to court.

While it’s unclear how Informa will react to the rumblings of discontent, the deal is a reminder to authors to be aware of the contractual terms of the publishing agreements they sign.

What’s in the Informa deal?

Informa’s update stated four focus areas of the Microsoft deal:

  • increasing Informa’s own productivity
  • developing an automated citation tool
  • developing AI-powered research assistant software (perhaps like a system being tested by online academic library JSTOR[7])
  • giving Microsoft data access to “help improve relevance and performance of AI systems”.

Informa will be paid more than £8 million (A$15.5 million) for initial access to the data, followed by recurring payments of an unspecified amount for the next three years.

We don’t know exactly what Microsoft plans to do with its data access, but a likely scenario is that the content of academic books and articles would be added to the training data of ChatGPT-like generative AI models. In principle this should make the output of the AI systems more accurate, though existing AI models have faced heavy criticism, not only for regurgitating training data[8] without citation (which can be viewed as a kind of plagiarism[9]), but also for inventing false information[10] and attributing[11] it to real sources.

However, the update also says “the agreement protects intellectual property rights, including limits on verbatim text extracts and alignment on the importance of detailed citation references”.

The “limits on verbatim text extracts” mentioned likely pertains to the US doctrine of fair use[12], which permits certain uses of copyright-protected material.

Many generative AI companies are currently facing copyright infringement lawsuits[13] over their use of training data, and their defences are likely to rely on claiming fair use.

The “importance of detailed citation references” may pertain to the concept of attribution in copyright. This is a moral right[14] possessed by authors. It provides that the creator of the work should be known and attributed as the author when their work is reproduced.

How does scholarly publishing usually work?

Most academics do not receive payment or make any profit from most of their scholarly publishing. Rather, writing journal and conference papers is usually considered part of the scope of work within a full-time, tenured position. Publication builds an academic’s credibility and promotes their research.

The basic process often goes like this: an author researches and writes an original article and submits it to a journal publisher for peer review. Most peer reviewers and editorial board members also receive no payment for their work.

In fact, some journals may require authors to pay an “article processing charge[15]” to cover editing and other costs. This can be thousands of dollars for an open access[16] publication. Generally speaking, the more prestigious the publication, the higher the charge.

If an article passes peer review, the author will be asked to sign a publishing agreement[17]. The terms may cover logistical arrangements such as when the article will be published, the format (print, online or both), and the division of royalties (if applicable). There will also be arrangements regarding copyright and ownership of the article.

An author usually must also grant exclusive rights[18] to the publisher to distribute and publish the article. This may mean the author cannot publish the article elsewhere, and the publisher may also be able to sub-licence the article to a third party, such as an AI company.

Sometimes publishers require an author to assign copyright in the article to them via a permanent copyright transfer agreement[19].

Essentially, this means the author grants all of their authorial rights as copyright holder in the work to the publisher. The publisher can then reproduce, communicate, distribute or license the work to others as they wish.

It is possible to only assign limited rights, rather than all rights, and this is something authors should consider.

Content mining

It is vital that authors understand the implications of licensing and assignment and to contemplate precisely what they are agreeing to when they sign a contract. In light of the recent trend of publishers entering into agreements with generative AI companies[20], publishers’ AI policies should also be closely scrutinised.

In the US, a standard collective licensing solution for content use in internal AI systems[21] has recently been released, which sets out rights and remuneration for copyright holders. Similar licences for the use of content for AI systems will likely enter the Australian market very soon.

The types of agreements being reached between academic publishers and AI companies have sparked bigger-picture concerns for many academics. Do we want scholarly research to be reduced to content for AI knowledge mining[22]? There are no clear answers about the ethics and morals of such practices.

References

  1. ^ trading update (www.informa.com)
  2. ^ Taylor & Francis (taylorandfrancis.com)
  3. ^ reports published last week (www.thebookseller.com)
  4. ^ Newspapers (www.nytimes.com)
  5. ^ visual artists (www.cbsnews.com)
  6. ^ record labels (time.com)
  7. ^ online academic library JSTOR (www.about.jstor.org)
  8. ^ regurgitating training data (www.nytimes.com)
  9. ^ plagiarism (spectrum.ieee.org)
  10. ^ inventing false information (www.cnet.com)
  11. ^ attributing (www.nature.com)
  12. ^ US doctrine of fair use (www.alrc.gov.au)
  13. ^ facing copyright infringement lawsuits (theconversation.com)
  14. ^ moral right (www.artslaw.com.au)
  15. ^ article processing charge (akjournals.com)
  16. ^ open access (www.openaccess.nl)
  17. ^ publishing agreement (copyright.unimelb.edu.au)
  18. ^ exclusive rights (copyrightalliance.org)
  19. ^ copyright transfer agreement (authorservices.wiley.com)
  20. ^ publishers entering into agreements with generative AI companies (techcrunch.com)
  21. ^ collective licensing solution for content use in internal AI systems (www.copyright.com)
  22. ^ AI knowledge mining (azure.microsoft.com)

Read more https://theconversation.com/an-academic-publisher-has-struck-an-ai-data-deal-with-microsoft-without-their-authors-knowledge-235203

Times Magazine

Building a Strong Online Presence with Katoomba Web Design

Katoomba web design is more than just creating a website that looks good—it’s about building an online presence that reflects your brand, engages your audience, and drives results. For local businesses in the Blue Mountains, a well-designed website a...

September Sunset Polo

International Polo Tour To Bridge Historic Sport, Life-Changing Philanthropy, and Breath-Taking Beauty On Saturday, September 6th, history will be made as the International Polo Tour (IPT), a sports leader headquartered here in South Florida...

5 Ways Microsoft Fabric Simplifies Your Data Analytics Workflow

In today's data-driven world, businesses are constantly seeking ways to streamline their data analytics processes. The sheer volume and complexity of data can be overwhelming, often leading to bottlenecks and inefficiencies. Enter the innovative da...

7 Questions to Ask Before You Sign IT Support Companies in Sydney

Choosing an IT partner can feel like buying an insurance policy you hope you never need. The right choice keeps your team productive, your data safe, and your budget predictable. The wrong choice shows up as slow tickets, surprise bills, and risky sh...

Choosing the Right Legal Aid Lawyer in Sutherland Shire: Key Considerations

Legal aid services play an essential role in ensuring access to justice for all. For people in the Sutherland Shire who may not have the financial means to pay for private legal assistance, legal aid ensures that everyone has access to representa...

Watercolor vs. Oil vs. Digital: Which Medium Fits Your Pet's Personality?

When it comes to immortalizing your pet’s unique personality in art, choosing the right medium is essential. Each artistic medium, whether watercolor, oil, or digital, has distinct qualities that can bring out the spirit of your furry friend in dif...

The Times Features

How much money do you need to be happy? Here’s what the research says

Over the next decade, Elon Musk could become the world’s first trillionaire[1]. The Tesla board recently proposed a US$1 trillion (A$1.5 trillion) compensation plan, if Musk ca...

NSW has a new fashion sector strategy – but a sustainable industry needs a federally legislated response

The New South Wales government recently announced the launch of the NSW Fashion Sector Strategy, 2025–28[1]. The strategy, developed in partnership with the Australian Fashion ...

From Garden to Gift: Why Roses Make the Perfect Present

Think back to the last time you gave or received flowers. Chances are, roses were part of the bunch, or maybe they were the whole bunch.   Roses tend to leave an impression. Even ...

Do I have insomnia? 5 reasons why you might not

Even a single night of sleep trouble can feel distressing and lonely. You toss and turn, stare at the ceiling, and wonder how you’ll cope tomorrow. No wonder many people star...

Wedding Photography Trends You Need to Know (Before You Regret Your Album)

Your wedding album should be a timeless keepsake, not something you cringe at years later. Trends may come and go, but choosing the right wedding photography approach ensures your ...

Can you say no to your doctor using an AI scribe?

Doctors’ offices were once private. But increasingly, artificial intelligence (AI) scribes (also known as digital scribes) are listening in. These tools can record and trans...