Mon May 13

AI-assisted writing is quietly booming in academic journals. Here’s why that’s OK

Written by Julian Koplin, Lecturer in Bioethics, Monash University & Honorary fellow, Melbourne Law School, Monash University

If you search Google Scholar for the phrase “as an AI language model ^[1]”, you’ll find plenty of AI research literature and also some rather suspicious results. For example, one paper ^[2] on agricultural technology says:

As an AI language model, I don’t have direct access to current research articles or studies. However, I can provide you with an overview of some recent trends and advancements …

Obvious gaffes like this aren’t the only signs that researchers are increasingly turning to generative AI tools when writing up their research. A recent study ^[3] examined the frequency of certain words in academic writing (such as “commendable”, “meticulously” and “intricate”), and found they became far more common after the launch of ChatGPT – so much so that 1% of all journal articles published in 2023 may have contained AI-generated text.

(Why do AI models overuse these words? There is speculation ^[4] it’s because they are more common in English as spoken in Nigeria, where key elements of model training often occur.)

The aforementioned study also looks at preliminary data from 2024, which indicates that AI writing assistance is only becoming more common. Is this a crisis for modern scholarship, or a boon for academic productivity?

Who should take credit for AI writing?

Many people are worried by the use of AI in academic papers. Indeed, the practice has been described as “contaminating ^[5]” scholarly literature.

Some argue that using AI output amounts to plagiarism. If your ideas are copy-pasted from ChatGPT, it is questionable whether you really deserve credit for them.

But there are important differences between “plagiarising” text authored by humans and text authored by AI. Those who plagiarise humans’ work receive credit for ideas that ought to have gone to the original author.

By contrast, it is debatable whether AI systems like ChatGPT can have ideas, let alone deserve credit for them. An AI tool is more like ^[6] your phone’s autocomplete function than a human researcher.

The question of bias

Another worry is that AI outputs might be biased in ways that could seep into the scholarly record. Infamously, older language models tended to portray ^[7] people who are female, black and/or gay in distinctly unflattering ways, compared with people who are male, white and/or straight.

This kind of bias is less pronounced ^[8] in the current version of ChatGPT.

However, other studies have found a different kind ^[9] of bias ^[10] in ChatGPT and other large language models ^[11]: a tendency to reflect a left-liberal political ideology.

Any such bias could subtly distort scholarly writing produced using these tools.

The hallucination problem

The most serious worry relates to a well-known limitation of generative AI systems: that they often make serious mistakes.

For example, when I asked ChatGPT-4 to generate an ASCII image of a mushroom, it provided me with the following output.

   .--'|
   /___^ |     .--.
       ) |    /    \
      / |   |      |
     |   `-._\    /
     \        `~~`
      `-..._____.-`

It then confidently told me I could use this image of a “mushroom” for my own purposes.

These kinds of overconfident mistakes have been referred to as “AI hallucinations ^[12]” and “AI bullshit ^[13]”. While it is easy to spot that the above ASCII image looks nothing like a mushroom (and quite a bit like a snail), it may be much harder to identify any mistakes ChatGPT makes when surveying scientific literature ^[14] or describing the state of a philosophical debate.

Unlike (most) humans, AI systems are fundamentally unconcerned with the truth of what they say. If used carelessly, their hallucinations could corrupt the scholarly record.

Should AI-produced text be banned?

One response to the rise of text generators has been to ban them outright. For example, Science – one of the world’s most influential academic journals – disallows any use of AI-generated text ^[15].

I see two problems with this approach.

The first problem is a practical one: current tools for detecting AI-generated text are highly unreliable. This includes the detector created by ChatGPT’s own developers, which was taken offline ^[16] after it was found to have only a 26% accuracy rate (and a 9% false positive rate ^[17]). Humans also make mistakes ^[18] when assessing whether something was written by AI.

It is also possible to circumvent AI text detectors. Online communities are actively exploring ^[19] how to prompt ChatGPT in ways that allow the user to evade detection. Human users can also superficially rewrite AI outputs, effectively scrubbing away the traces of AI (like its overuse of the words “commendable”, “meticulously” and “intricate”).

The second problem is that banning generative AI outright prevents us from realising these technologies’ benefits. Used well, generative AI can boost academic productivity ^[20] by streamlining the writing process. In this way, it could help further human knowledge. Ideally, we should try to reap these benefits while avoiding the problems.

The problem is poor quality control, not AI

The most serious problem with AI is the risk of introducing unnoticed errors, leading to sloppy scholarship. Instead of banning AI, we should try to ensure that mistaken, implausible or biased claims cannot make it onto the academic record.

After all, humans can also produce writing with serious errors, and mechanisms such as peer review often fail ^[21] to prevent its publication.

We need to get better at ensuring academic papers are free from serious mistakes, regardless of whether these mistakes are caused by careless use of AI or sloppy human scholarship. Not only is this more achievable than policing AI usage, it will improve the standards of academic research as a whole.

This would be (as ChatGPT might say) a commendable and meticulously intricate solution.

References

^{^} as an AI language model (twitter.com)
^{^} paper (journals.ekb.eg)
^{^} recent study (arxiv.org)
^{^} speculation (www.theguardian.com)
^{^} contaminating (arxiv.org)
^{^} more like (theconversation.com)
^{^} tended to portray (arxiv.org)
^{^} less pronounced (www.nature.com)
^{^} kind (arxiv.org)
^{^} bias (www.mdpi.com)
^{^} other large language models (www.maximumtruth.org)
^{^} AI hallucinations (theconversation.com)
^{^} AI bullshit (blog.practicalethics.ox.ac.uk)
^{^} surveying scientific literature (time.com)
^{^} any use of AI-generated text (www.science.org)
^{^} taken offline (arstechnica.com)
^{^} 9% false positive rate (openai.com)
^{^} make mistakes (www.nature.com)
^{^} actively exploring (www.youtube.com)
^{^} boost academic productivity (www.nature.com)
^{^} often fail (www.routledge.com)

AI-assisted writing is quietly booming in academic journals. Here’s why that’s OK

Who should take credit for AI writing?

The question of bias

The hallucination problem

Should AI-produced text be banned?

The problem is poor quality control, not AI

References

YepAI Emerges as AI Dark Horse, Launches V3 SuperAgent to Revolutionize E-commerce

What SMEs Should Look For When Choosing a Shared Office in 2026

Why so many students are applying for early offers to uni

Thrive Early Learning Unveils Australian-First Immersive Experience for Pre-Schoolers

How ‘build-to-rent-to-own’ could help more renters get a toehold in the housing market

A Modern Twist on Traditional Cafe Fare at Cafe Thornbury

Times Magazine

Australia’s electric vehicle surge — EVs and hybrids hit record levels

Tim Ayres on the AI rollout’s looming ‘bumps and glitches’

Seven in Ten Australian Workers Say Employers Are Failing to Prepare Them for AI Future

Mapping for Trucks: More Than Directions, It’s Optimisation

Can bigger-is-better ‘scaling laws’ keep AI improving forever? History says we can’t be too sure

A backlash against AI imagery in ads may have begun as brands promote ‘human-made’

The Times Features

The way Australia produces food is unique. Our updated dietary guidelines have to recognise this

Why a Holiday or Short Break in the Noosa Region Is an Ideal Getaway

How Dynamic Pricing in Accommodation — From Caravan Parks to Hotels — Affects Holiday Affordability

The rise of chatbot therapists: Why AI cannot replace human care

Australians Can Now Experience The World of Wicked Across Universal Studios Singapore and Resorts World Sentosa

Mineral vs chemical sunscreens? Science shows the difference is smaller than you think

Here’s what new debt-to-income home loan caps mean for banks and borrowers

Why the Mortgage Industry Needs More Women (And What We're Actually Doing About It)

Inflation jumps in October, adding to pressure on government to make budget savings