The Times Australia
The Times World News

.
Times Media

.

A new ‘AI scientist’ can write science papers without any human input. Here’s why that’s a problem

  • Written by Karin Verspoor, Dean, School of Computing Technologies, RMIT University, RMIT University

Scientific discovery is one of the most sophisticated human activities. First, scientists must understand the existing knowledge and identify a significant gap. Next, they must formulate a research question and design and conduct an experiment in pursuit of an answer. Then, they must analyse and interpret the results of the experiment, which may raise yet another research question.

Can a process this complex be automated? Last week, Sakana AI Labs announced[1] the creation of an “AI scientist” – an artificial intelligence system they claim can make scientific discoveries in the area of machine learning in a fully automated way.

Using generative large language models (LLMs) like those behind ChatGPT and other AI chatbots, the system can brainstorm, select a promising idea, code new algorithms, plot results, and write a paper summarising the experiment and its findings, complete with references. Sakana claims the AI tool can undertake the complete lifecycle of a scientific experiment at a cost of just US$15 per paper – less than the cost of a scientist’s lunch.

These are some big claims. Do they stack up? And even if they do, would an army of AI scientists churning out research papers with inhuman speed really be good news for science?

How a computer can ‘do science’

A lot of science is done in the open, and almost all scientific knowledge has been written down somewhere (or we wouldn’t have a way to “know” it). Millions of scientific papers are freely available online in repositories such as arXiv[2] and PubMed[3].

LLMs trained with this data capture the language of science and its patterns. It is therefore perhaps not at all surprising that a generative LLM can produce something that looks like a good scientific paper – it has ingested many examples that it can copy.

What is less clear is whether an AI system can produce an interesting scientific paper. Crucially, good science requires novelty.

But is it interesting?

Scientists don’t want to be told about things that are already known. Rather, they want to learn new things, especially new things that are significantly different from what is already known. This requires judgement about the scope and value of a contribution.

The Sakana system tries to address interestingness in two ways. First, it “scores” new paper ideas for similarity to existing research (indexed in the Semantic Scholar[4] repository). Anything too similar is discarded.

Second, Sakana’s system introduces a “peer review” step – using another LLM to judge the quality and novelty of the generated paper. Here again, there are plenty of examples of peer review online on sites such as openreview.net[5] that can guide how to critique a paper. LLMs have ingested these, too.

AI may be a poor judge of AI output

Feedback is mixed on Sakana AI’s output. Some have described it as producing “endless scientific slop[6]”.

Even the system’s own review of its outputs judges the papers weak at best. This is likely to improve as the technology evolves, but the question of whether automated scientific papers are valuable remains.

The ability of LLMs to judge the quality of research is also an open question. My own work (soon to be published in Research Synthesis Methods[7]) shows LLMs are not great at judging the risk of bias in medical research studies, though this too may improve over time.

Sakana’s system automates discoveries in computational research, which is much easier than in other types of science that require physical experiments. Sakana’s experiments are done with code, which is also structured text that LLMs can be trained to generate.

AI tools to support scientists, not replace them

AI researchers have been developing systems to support science for decades. Given the huge volumes of published research, even finding publications relevant to a specific scientific question can be challenging.

Specialised search tools make use of AI to help scientists find and synthesise existing work. These include the above-mentioned Semantic Scholar, but also newer systems such as Elicit[8], Research Rabbit[9], scite[10] and Consensus[11].

Text mining tools such as PubTator[12] dig deeper into papers to identify key points of focus, such as specific genetic mutations and diseases, and their established relationships. This is especially useful for curating and organising scientific information.

Machine learning has also been used to support the synthesis and analysis of medical evidence, in tools such as Robot Reviewer[13]. Summaries that compare and contrast claims in papers from Scholarcy[14] help to perform literature reviews.

All these tools aim to help scientists do their jobs more effectively, not to replace them.

AI research may exacerbate existing problems

While Sakana AI states[15] it doesn’t see the role of human scientists diminishing, the company’s vision of “a fully AI-driven scientific ecosystem” would have major implications for science.

One concern is that, if AI-generated papers flood the scientific literature, future AI systems may be trained on AI output and undergo model collapse[16]. This means they may become increasingly ineffectual at innovating.

However, the implications for science go well beyond impacts on AI science systems themselves.

There are already bad actors in science, including “paper mills” churning out fake papers[17]. This problem will only get worse[18] when a scientific paper can be produced with US$15 and a vague initial prompt.

The need to check for errors in a mountain of automatically generated research could rapidly overwhelm the capacity of actual scientists. The peer review system is arguably already broken[19], and dumping more research of questionable quality into the system won’t fix it.

Science is fundamentally based on trust. Scientists emphasise the integrity of the scientific process so we can be confident our understanding of the world (and now, the world’s machines) is valid and improving.

A scientific ecosystem where AI systems are key players raises fundamental questions about the meaning and value of this process, and what level of trust we should have in AI scientists. Is this the kind of scientific ecosystem we want?

References

  1. ^ Sakana AI Labs announced (sakana.ai)
  2. ^ arXiv (arxiv.org)
  3. ^ PubMed (pubmed.ncbi.nlm.nih.gov)
  4. ^ Semantic Scholar (www.semanticscholar.org)
  5. ^ openreview.net (openreview.net)
  6. ^ endless scientific slop (arstechnica.com)
  7. ^ Research Synthesis Methods (onlinelibrary.wiley.com)
  8. ^ Elicit (elicit.com)
  9. ^ Research Rabbit (www.researchrabbit.ai)
  10. ^ scite (scite.ai)
  11. ^ Consensus (consensus.app)
  12. ^ PubTator (www.ncbi.nlm.nih.gov)
  13. ^ Robot Reviewer (www.robotreviewer.net)
  14. ^ Scholarcy (www.scholarcy.com)
  15. ^ states (sakana.ai)
  16. ^ model collapse (www.nature.com)
  17. ^ fake papers (www.nature.com)
  18. ^ get worse (www.nature.com)
  19. ^ already broken (theconversation.com)

Read more https://theconversation.com/a-new-ai-scientist-can-write-science-papers-without-any-human-input-heres-why-thats-a-problem-237029

The Times Features

HCF’s Healthy Hearts Roadshow Wraps Up 2024 with a Final Regional Sprint

Next week marks the final leg of the HCF Healthy Hearts Roadshow for 2024, bringing free heart health checks to some of NSW’s most vibrant regional communities. As Australia’s ...

The Budget-Friendly Traveler: How Off-Airport Car Hire Can Save You Money

When planning a trip, transportation is one of the most crucial considerations. For many, the go-to option is renting a car at the airport for convenience. But what if we told ...

Air is an overlooked source of nutrients – evidence shows we can inhale some vitamins

You know that feeling you get when you take a breath of fresh air in nature? There may be more to it than a simple lack of pollution. When we think of nutrients, we think of t...

FedEx Australia Announces Christmas Shipping Cut-Off Dates To Help Beat the Holiday Rush

With Christmas just around the corner, FedEx is advising Australian shoppers to get their presents sorted early to ensure they arrive on time for the big day. FedEx has reveale...

Will the Wage Price Index growth ease financial pressure for households?

The Wage Price Index’s quarterly increase of 0.8% has been met with mixed reactions. While Australian wages continue to increase, it was the smallest increase in two and a half...

Back-to-School Worries? 70% of Parents Fear Their Kids Aren’t Ready for Day On

Australian parents find themselves confronting a key decision: should they hold back their child on the age border for another year before starting school? Recent research from...

Times Magazine

Telstra Launches 2 Hour Delivery Service

Telstra today announced the launch of a 2 hour delivery service from participating Telstra Stores to coincide with the latest handset launches. The service, offered in partnership with Zoom2u, will begin with a limited offer for Telstra customers...

Opportunities in the Blue Carbon Space through Khory Hancock’s Lens

Restoring and protecting our marine ecosystems has never been more pressing. As our oceans face numerous threats from pollution, overfishing, and climate change, we must take action to safeguard these vital ecosystems. Many initiatives have been ...

Take Extra Care Through the Help of iPhone Camera Repairs

As technology continues to advance at a breakneck pace, it's becoming increasingly important to know how to repair your Apple iPhone camera. With the rise of social media and the importance of capturing life's moments, having a functioning camera on ...

5 signs your partner might be cheating on you

Suspecting your partner might be cheating on you is not an easy feeling to have. The mistrust, anxieties and sadness are enough to paralyse anyone. But you shouldn’t be living in doubt. It’s not fair for you and your peace of mind, and it’s not f...

oOh!media puts Neon up in lights

oOh!media has transformed its high-impact Panorama sites across the country for a campaign to mark the merger of Neon and Lightbox under the Neon brand. Sky’s ‘Get it on Neon’ campaign went live on street furniture assets on 17 August in Chris...

Elevate Your Gift-Giving Experience with Magnetic Gift Boxes

Gift-giving is an art form, and just like any form of art, presentation plays a crucial role in its impact. Whether it's for birthdays, weddings, holidays, or any other special occasion, the way you Make Your Own Gift Box and a gift is packaged can...